myTAI

Evolutionary Transcriptomics with R

https://github.com/drostlab/mytai

Science Score: 59.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
    Found 3 DOI reference(s) in README
  • Academic publication links
    Links to: sciencedirect.com
  • Committers with academic emails
    1 of 8 committers (12.5%) from academic institutions
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (11.5%) to scientific vocabulary

Keywords

biological-processes conservation evo-devo evolution evolutionary-transcriptomics expression-profiles gene-expression gene-sets mytai r studies-transcriptomes transcriptome

Keywords from Contributors

genome
Last synced: 6 months ago · JSON representation

Repository

Evolutionary Transcriptomics with R

Basic Info
Statistics
  • Stars: 45
  • Watchers: 5
  • Forks: 18
  • Open Issues: 4
  • Releases: 3
Topics
biological-processes conservation evo-devo evolution evolutionary-transcriptomics expression-profiles gene-expression gene-sets mytai r studies-transcriptomes transcriptome
Created over 11 years ago · Last pushed 6 months ago
Metadata Files
Readme Changelog License

README.md

myTAI

Paper link install with bioconda Visitors R-CMD-check GitHub release

Evolutionary Transcriptomics with R

```r library(myTAI)

obtain an example phylo-expression object

data("examplephyexset")

plot away!

myTAI::plotsignature(examplephyexset)
myTAI::plot
contribution(examplephyexset) myTAI::plotgenespace(examplephyexset) ```

plot_signature function output plot_contribution function output plot_gene_space function output

Detailed documentation provided here

Package summary

Using myTAI, any existing or newly generated transcriptome dataset can be combined with evolutionary information (find details here) to retrieve novel insights about the evolutionary conservation of the transcriptome at hand.

For the purpose of performing large scale evolutionary transcriptomics studies, the myTAI package implements the quantification, statistical assessment, and analytics functionality to allow researchers to study the evolution of biological processes by determining stages or periods of evolutionary conservation or variability in transcriptome data.

We hope that myTAI will become the community standard tool to perform evolutionary transcriptomics studies and we are happy to add required functionality upon request.

Detailed background In the past years, a variety of studies aimed to uncover the molecular basis of morphological innovation and variation from the evolutionary developmental perspective. These studies often rely on transcriptomic data to establish the molecular patterns driving the complex biological processes underlying phenotypic plasticity. Although transcriptome information is a useful start to study the molecular mechanisms underlying a biological process of interest (molecular phenotype), they rarely capture __how__ these expression patterns emerged in the first place or to __what extent__ they are possibly constrained, thereby neglecting the evolutionary history and developmental constraints of genes contributing to the overall pool of expressed transcripts. To overcome this limitation, the `myTAI` package introduces procedures summarized under the term `evolutionary transcriptomics` to integrate gene age information into classical gene expression analysis. `Gene age inference` can be performed with [various existing software](https://drostlab.github.io/myTAI/articles/phylostratigraphy.html), but we recommend using [GenEra](https://github.com/josuebarrera/GenEra) or [orthomap](https://github.com/kullrich/orthomap), since they address published shortcomings of `gene age inference` ([see detailed discussion here](https://drostlab.github.io/myTAI/articles/phylostratigraphy.html)). In addition, users can easily retrieve [previously precomputed gene age information](https://github.com/drostlab/published_phylomaps) via our data package [phylomapr](https://github.com/LotharukpongJS/phylomapr). Evolutionary transcriptomics studies can serve as a first approach to screen _in silico_ for the potential existence of evolutionary and developmental constraints within a biological process of interest. This is achieved by quantifying transcriptome conservation patterns and their underlying gene sets in biological processes. The exploratory analysis functions implemented in `myTAI` provide users with a standardized, automated and statistically sound framework to detect and analyze patterns of evolutionary constraints in any transcriptome dataset of interest.
Scientific background Today, phenomena such as morphological mutations, diseases or developmental processes are primarily investigated on the molecular level using transcriptomics approaches. Transcriptomes denote the total number of quantifiable transcripts present at a specific stage in a biological process. In disease or developmental (defect) studies, transcriptomes are usually measured over several time points. In treatment studies aiming to quantify differences in the transcriptome due to biotic stimuli, abiotic stimuli, or diseases usually treatment / disease versus non-treatment / non-disease transcriptomes are compared. In either case, comparing changes in transcriptomes over time or between treatments allows us to identify genes and gene regulatory mechanisms that might be involved in governing the biological process of investigation. Although classic transcriptomics studies are based on an established methodology, little is known about the evolution and conservation mechanisms underlying such transcriptomes. Understanding the evolutionary mechanism that change transcriptomes over time, however, might give us a new perspective on how diseases emerge in the first place or how morphological changes are triggered by changes of developmental transcriptomes. Evolutionary transcriptomics aims to capture and quantify the evolutionary conservation of genes that contribute to the transcriptome during a specific stage of the biological process of interest. The resulting temporal conservation pattern then enables to detect stages of development or other biological processes that are evolutionarily conserved ([Drost et al., 2018](https://academic.oup.com/bioinformatics/advance-article/doi/10.1093/bioinformatics/btx835/4772684)). This quantification on the highest level is achieved through transcriptome indices (e.g. [Transcriptome Age Index](https://drostlab.github.io/myTAI/articles/phylostratigraphy.html) or [Transcriptome Divergence Index](https://drostlab.github.io/myTAI/articles/other-strata.html)) which aim to quantify the average evolutionary age [Barrera-Redondo et al., 2023](https://genomebiology.biomedcentral.com/articles/10.1186/s13059-023-02895-z) or sequence conservation [Drost et al., 2015](https://academic.oup.com/mbe/article/32/5/1221/1125964) of genes that contribute to the transcriptome at a particular stage. In general, evolutionary transcriptomics can be used as a method to quantify the evolutionary conservation of transcriptomes at particular developmental stages and to investigate how transcriptomes underlying biological processes are constrained or channeled due to events in evolutionary history (Dollo's law) ([Drost et al., 2017](https://www.sciencedirect.com/science/article/pii/S0959437X16302040)). __Please note, since myTAI relies on gene age inference and there has been an extensive debate about the best approaches for gene age inference in the last years, please follow my [updated discussion about the gene age inference literature](https://drostlab.github.io/myTAI/articles/phylostratigraphy.html). With [GenEra](https://github.com/josuebarrera/GenEra), we addressed all previously raised issues and we encourage users to run `GenEra` when aiming to infer gene ages for further `myTAI` analyses.__

Installation

myTAIv2 is still work in progress. To install the development version, do:

r devtools::install_github("drostlab/myTAI")

To install the previous version of myTAI, and access the old vignettes, do:

r devtools::install_github("drostlab/myTAI@v1.0")

Soon, users will be able to install myTAI from CRAN:

```r

install myTAIv2 from CRAN

install.packages("myTAI", dependencies = TRUE) ```

Citation

Please cite the following paper when using myTAI for your own research. This will allow us to continue working on this software tool and will motivate us to extend its functionality and usability in the next years. Many thanks in advance!

Drost et al. myTAI: evolutionary transcriptomics with R. Bioinformatics 2018, 34 (9), 1589-1590. doi:10.1093

Studies that successfully used `myTAI` to quantify transcriptome novelty and conservation > > - _Evolutionary trends in the emergence of skeletal cell types_ A Damatac, II , S Koska , K K Ullrich , T Domazet-Lošo , A Klimovich , M Kaucká… - __Evolution Letters__, 2025 > > - _Phylostratigraphic analysis revealed that ancient ohnologue PtoWRKY53 innovated a vascular transcription regulatory network in Populus_ W Huang, M Quan, W Qi, L Xiao, Y Fang, J Zhou… - __New Phytologist__, 2025 > > - _Pra-GE-ATLAS: Empowering Pinus radiata stress and breeding research through a multi-omics database_ V Roces, MJ Cañal, JL Mateo, L Valledor… - __Journal of Integrative Plant Biology__, 2025 > > - _Developmental phylotranscriptomics in grapevine suggests an ancestral role of somatic embryogenesis_ S Koska, D Leljak-Levanic, N Malenica, K Bigovic Villi… - __Communications Biology__, 2025 > > - _Proteomic analyses reveal the key role of gene co-option in the evolution of the scaly-foot snail scleritome_ WC Wong, YH Kwan, X He, C Chen, S Xiang, Y Xiao… - __Communications Biology__, 2025 > > - _Genome assembly of Stewartia sinensis reveals origin and evolution of orphan genes in Theaceae_ L Cheng, Q Han, Y Hao, Z Qiao, M Li, D Liu… - __Communications Biology__, 2025 > > - _A transcriptomic hourglass in brown algae_ JS Lotharukpong, M Zheng, R Luthringer, D Liesner, H-G Drost, SM Coelho - __Nature__, 2024 > > - _Genome assemblies of 11 bamboo species highlight diversification induced by dynamic subgenome dominance_ PF Ma, YL Liu, C Guo, G Jin, ZH Guo, L Mao, YZ Yang… - __Nature Genetics__, 2024 > > - _Hemichordate cis-regulatory genomics and the gene expression dynamics of deuterostomes_ A Pérez-Posada, CY Lin, TP Fan, CY Lin, YC Chen… - __Nature Ecology & Evolution__, 2024 > > - _Comparison between 16S rRNA and shotgun sequencing in colorectal cancer, advanced colorectal lesions, and healthy human gut microbiota_ D Bars-Cortina, E Ramon, B Rius-Sansalvador… - __BMC genomics__, 2024 > > - _Heat stress reprograms herbivory-induced defense responses in potato plants_ J Zhong, J Zhang, Y Zhang, Y Ge, W He, C Liang… - __BMC Plant Biology__, 2024 > > - _The transcriptomic signature of adaptations associated with perfume collection in orchid bees_ K Darragh, SR Ramírez - __Journal of Evolutionary Biology__, 2024 > > - _Proteomic dynamics revealed sex‐biased responses to combined heat‐drought stress in Marchantia_ S Guerrero, V Roces, L García‐Campa, L Valledor… - __Journal of Integrative Plant Biology__, 2024 > > - _Evolution of gene networks underlying adaptation to drought stress in the wild tomato Solanum chilense_ K Wei, S Sharifova, X Zhao, N Sinha, H Nakayama… - __Molecular Ecology__, 2024 > > - _Conserved and specific gene expression patterns in the embryonic development of tardigrades_ C Li, Z Yang, X Xu, L Meng, S Liu, D Yang - __Evolution & Development__, 2024 > > - _The functions and factors governing fungal communities and diversity in agricultural waters: insights into the ecosystem services aquatic mycobiota provide_ P Pham, Y Shi, I Khan, M Sumarah, J Renaud… - __Frontiers in Microbiology__, 2024 > > - _An evolutionary timeline of the oxytocin signaling pathway_ AM Sartorius, J Rokicki, S Birkeland, F Bettella, C Barth… - __Communications Biology__, 2024 > > - _The Evolution of Foraging Webs is Associated with Young Genes in Araneoidea Spiders_ A Jia, T Yang, W Hu, S Ma, Z Zhang, Y Wang - __Available at SSRN 4383994__ > > - _Multiplexed transcriptomic analyses of the plant embryonic hourglass_ H Wu, R Zhang, KJ Niklas, MJ Scanlon - __BioRxiv__, 2024 > > - _Brachiopod genome unveils the evolution of the BMP–Chordin network in bilaterian body patterning_ TD Lewin, K Shimizu, IJY Liao, ME Chen, K Endo… - __BioRxiv__, 2024 > > - _The angiosperm seed life cycle follows a developmental reverse hourglass_ AA Sami, L Bentsink, MAS Artur - __BioRxiv__, 2024 > > - _Evolutionary trends in the emergence of skeletal cell types_ A Damatac, S Koska, KK Ullrich, T Domazet-Loso… - __BioRxiv__, 2024 > > - _Transcriptome age of individual cell types in Caenorhabditis elegans_ F Ma, C Zheng - __Proceedings of the National Academy of Sciences__, 2023 > > - _Single-cell atlases of two lophotrochozoan larvae highlight their complex evolutionary histories_ L Piovani, DJ Leite, LA Yañez Guerra, F Simpson… - __Science Advances__, 2023 > > - _oggmap: a Python package to extract gene ages per orthogroup and link them with single-cell RNA data_ KK Ullrich, NE Glytnasi - __Bioinformatics__, 2023 > > - _Discovery of putative long non-coding RNAs expressed in the eyes of Astyanax mexicanus (Actinopterygii: Characidae)_ I Batista da Silva, D Aciole Barbosa, KF Kavalco… - __Scientific Reports__, 2023 > > - _An ancient split of germline and somatic stem cell lineages in Hydra_ C Nishimiya-Fujisawa, H Petersen, TC-T Koubková-Yu, C Noda, S Shigenobu, J Bageritz, T Fujisawa, O Simakov, S Kobayashi, TW Holstein - __BioRxiv__, 2023 > > - _Oxytocin receptor expression patterns in the human brain across development_ J Rokicki, T Kaufmann, A-MG de Lange, D van der Meer, S Bahrami, AM Sartorius, UK Haukvik, NE Steen, E Schwarz, DJ Stein, T Nærland, OA Andreassen, LT Westlye, DS Quintana - __Neuropsychopharmacology__, 2022 > > - _The Phylotranscriptomic Hourglass Pattern in Fungi: An Updated Model_ Y Xie, HS Kwan, PL Chan, WJ Wu, J Chiou, J Chang __BioRxiv__, 2022 > > - _Embryo-Like Features in Developing Bacillus subtilis Biofilms_ M Futo, L Opašić, S Koska, N Čorak, T Široki, V Ravikumar, A Thorsell, M Lenuzzi, D Kifer, M Domazet-Lošo, K Vlahoviček, I Mijakovic, T Domazet-Lošo - __Molecular Biology and Evolution__, 2021 > > - _New Genes Interacted With Recent Whole-Genome Duplicates in the Fast Stem Growth of Bamboos_ G Jin, P-F Ma, X Wu, L Gu, M Long, C Zhang, DZ Li - __Molecular Biology and Evolution__, 2021 > > - _Evolutionary transcriptomics of metazoan biphasic life cycle supports a single intercalation origin of metazoan larvae_ J Wang, L Zhang, S Lian, Z Qin, X Zhu, X Dai, Z Huang et al. - __Nature Ecology & Evolution__, 2020 > > - _Pervasive convergent evolution and extreme phenotypes define chaperone requirements of protein homeostasis_ Y Draceni, S Pechmann - __Proceedings of the National Academy of Sciences__, 2019 > > - _Reconstructing the transcriptional ontogeny of maize and sorghum supports an inverse hourglass model of inflorescence development_ S Leiboff, S Hake - __Current Biology__, 2019 > > - _The Transcriptional Landscape of Polyploid Wheats and their Diploid Ancestors during Embryogenesis and Grain Development_ D Xiang, TD Quilichini, Z Liu, P Gao, Y Pan et al. - __The Plant Cell__, 2019 > > - _Pervasive convergent evolution and extreme phenotypes define chaperone requirements of protein homeostasis_ Y Draceni, S Pechmann - __Proceedings of the National Academy of Sciences__, 2019 > > - _A unicellular relative of animals generates a layer of polarized cells by actomyosin-dependent cellularization_ O Dudin, A Ondracka, X Grau-Bové, AAB Haraldsen et al. - __eLife__, 2019 > > - _Gene Expression Does Not Support the Developmental Hourglass Model in Three Animals with Spiralian Development_ L Wu, KE Ferger, JD Lambert - __Molecular Biology and Evolution__, 2019 > > - _Phylostratr: a framework for phylostratigraphy_ Z Arendsee, J Li, U Singh, A Seetharam et al. - __Bioinformatics__, 2019 > > - _Algorithms for synteny-based phylostratigraphy and gene origin classification_ Z Arendsee - 2019 > > - _Elucidating the endogenous synovial fluid proteome and peptidome of inflammatory arthritis using label-free mass spectrometry_ SM Mahendran, EC Keystone, RJ Krawetz et al. - __Clinical proteomics__, 2019 > > - _Environmental DNA reveals landscape mosaic of wetland plant communities_ ME Shackleton, GN Rees, G Watson et al. - __Global Ecology and Conservation__, 2019 > > - _Developmental constraints on genome evolution in four bilaterian model species_ J Liu, M Robinson-Rechavi - __Genome Biology and Evolution__, 2018 > > - _Mapping selection within Drosophila melanogaster embryo's anatomy_ I Salvador-Martínez et al. - __Molecular Biology and Evolution__, 2017 > > - _Distribution and diversity of enzymes for polysaccharide degradation in fungi_ R Berlemont - __Scientific reports__, 2017 > > - _The origins and evolutionary history of human non-coding RNA regulatory networks_ M Sherafatian, SJ Mowla - __Journal of bioinformatics and computational biology__, 2017 > > - _High expression of new genes in trochophore enlightening the ontogeny and evolution of trochozoans_ F Xu, T Domazet-Lošo, D Fan, TL Dunwell, L Li et al. - __Scientific reports__, 2016 > > - _Evidence for active maintenance of phylotranscriptomic hourglass patterns in animal and plant embryogenesis_ >HG Drost, A Gabel, I Grosse, M Quint - __Molecular Biology and Evolution__, 2015 >

NEWS

The current status of the package as well as a detailed history of the functionality of each version of myTAI can be found in the NEWS section.

Tutorials

The following tutorials will provide use cases and detailed explanations of how to quantify transcriptome conservation with myTAI and how to interpret the results generated with this software tool.

Main: - Getting started - Bring your datasets into myTAI - Statistical testing with myTAI - Transforming dataset for myTAI - Break TAI patterns using gaTAI - Beautiful plots made via myTAI

Advanced: - Gene age inference - Other evolutionary and expression indices

Users can also read the tutorials within (RStudio) :

```r

source the myTAI package

library(myTAI)

look for all tutorials (vignettes) available in the myTAI package

this will open your web browser

browseVignettes("myTAI") ```

Object classes in myTAI

Workflow to load your own dataset:

bulk

PhyEx overview

bulk with replicates

PhyEx replicates overview

single cell

scPhyEx overview

Discussions and Bug Reports

We would be very happy to learn more about potential improvements of the concepts and functions provided in this package.

Furthermore, in case you find some bugs or need additional (more flexible) functionality of parts of this package, please let us know:

https://github.com/drostlab/myTAI/issues

Owner

  • Name: Drost Laboratory - Digital Biology Research Group
  • Login: drostlab
  • Kind: organization
  • Location: United Kingdom

School of Life Sciences, University of Dundee

GitHub Events

Total
  • Create event: 14
  • Release event: 1
  • Issues event: 6
  • Watch event: 5
  • Delete event: 9
  • Member event: 4
  • Issue comment event: 39
  • Push event: 90
  • Pull request event: 38
  • Fork event: 2
Last Year
  • Create event: 14
  • Release event: 1
  • Issues event: 6
  • Watch event: 5
  • Delete event: 9
  • Member event: 4
  • Issue comment event: 39
  • Push event: 90
  • Pull request event: 38
  • Fork event: 2

Committers

Last synced: 9 months ago

All Time
  • Total Commits: 1,362
  • Total Committers: 8
  • Avg Commits per committer: 170.25
  • Development Distribution Score (DDS): 0.124
Past Year
  • Commits: 70
  • Committers: 5
  • Avg Commits per committer: 14.0
  • Development Distribution Score (DDS): 0.614
Top Committers
Name Email Commits
HajkD h****t@g****m 1,193
LotharukpongJS s****t@g****m 99
lavakin n****a@k****u 35
Hajk-Georg Drost h****t@a****l 12
nifets s****l@g****m 11
filipamc19 4****9 9
Jialin Liu l****y@g****m 2
Cristóbal Gallardo g****o@i****e 1
Committer Domains (Top 20 + Academic)

Issues and Pull Requests

Last synced: 6 months ago

All Time
  • Total issues: 20
  • Total pull requests: 61
  • Average time to close issues: 6 months
  • Average time to close pull requests: about 20 hours
  • Total issue authors: 15
  • Total pull request authors: 6
  • Average comments per issue: 2.6
  • Average comments per pull request: 1.56
  • Merged pull requests: 53
  • Bot issues: 0
  • Bot pull requests: 0
Past Year
  • Issues: 4
  • Pull requests: 33
  • Average time to close issues: 23 days
  • Average time to close pull requests: about 20 hours
  • Issue authors: 3
  • Pull request authors: 4
  • Average comments per issue: 0.5
  • Average comments per pull request: 1.27
  • Merged pull requests: 26
  • Bot issues: 0
  • Bot pull requests: 0
Top Authors
Issue Authors
  • LotharukpongJS (4)
  • nifets (2)
  • gallardoalba (1)
  • vinaybaranwal (1)
  • TobyBaril (1)
  • dmontecino (1)
  • weikai-320722 (1)
  • YXXEMXL (1)
  • Arthfael (1)
  • Chenleinice (1)
  • abb44 (1)
  • maxnest (1)
  • erinhunter4 (1)
Pull Request Authors
  • LotharukpongJS (28)
  • filipamc19 (17)
  • lavakin (14)
  • nifets (8)
  • ljljolinq1010 (2)
  • gallardoalba (1)
Top Labels
Issue Labels
enhancement (5) bug (4)
Pull Request Labels

Packages

  • Total packages: 1
  • Total downloads:
    • cran 396 last-month
  • Total dependent packages: 0
  • Total dependent repositories: 1
  • Total versions: 15
  • Total maintainers: 1
cran.r-project.org: myTAI

Evolutionary Transcriptomics

  • Versions: 15
  • Dependent Packages: 0
  • Dependent Repositories: 1
  • Downloads: 396 Last month
Rankings
Forks count: 4.4%
Stargazers count: 8.8%
Average: 19.7%
Dependent repos count: 23.8%
Dependent packages count: 28.7%
Downloads: 32.7%
Last synced: 6 months ago

Dependencies

DESCRIPTION cran
  • R >= 3.1.1 depends
  • Matrix * imports
  • RColorBrewer >= 1.1 imports
  • Rcpp * imports
  • RcppThread * imports
  • decor * imports
  • doParallel >= 1.0.8 imports
  • dplyr >= 0.3.0 imports
  • fitdistrplus >= 1.1 imports
  • foreach >= 1.4.2 imports
  • ggplot2 >= 1.0.1 imports
  • grDevices >= 3.1.1 imports
  • graphics >= 3.1.1 imports
  • gridExtra * imports
  • methods >= 3.1.1 imports
  • nortest >= 1.0 imports
  • parallel >= 3.1.1 imports
  • readr >= 0.2.2 imports
  • reshape2 >= 1.4.1 imports
  • scales * imports
  • stats >= 3.1.1 imports
  • tibble * imports
  • utils >= 3.1.1 imports
  • Seurat * suggests
  • devtools >= 1.6.1 suggests
  • knitr >= 1.6 suggests
  • mgcv * suggests
  • rmarkdown >= 0.3.3 suggests
  • taxize >= 0.6.0 suggests
  • testthat >= 0.9.1 suggests