https://github.com/bioconductor-source/timeseriesanalysis
Science Score: 26.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
○CITATION.cff file
-
✓codemeta.json file
Found codemeta.json file -
○.zenodo.json file
-
✓DOI references
Found 1 DOI reference(s) in README -
○Academic publication links
-
○Academic email domains
-
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (17.3%) to scientific vocabulary
Last synced: 6 months ago
·
JSON representation
Repository
Basic Info
- Host: GitHub
- Owner: bioconductor-source
- License: mit
- Language: R
- Default Branch: devel
- Size: 288 MB
Statistics
- Stars: 0
- Watchers: 2
- Forks: 0
- Open Issues: 0
- Releases: 0
Created over 1 year ago
· Last pushed over 1 year ago
Metadata Files
Readme
Changelog
License
README.Rmd
---
output: github_document
---
```{r, include = FALSE}
knitr::opts_chunk$set(
collapse = TRUE,
comment = "#>",
fig.path = "man/figures/README-",
out.width = "100%"
)
```
# TimeSeriesAnalysis
TiSA: TimeSeriesAnalysis - Is a transcriptomic analysis tool for both RNA sequencing and microarray data
## Overview
TimeSeriesAnalysis (TiSA) is an analysis and visualization package for RNAseq and
microarray data. TS extracts significant genes from time course
transcriptomic data by performing differential gene expression on both
the conditional and temporal axes. It then employs partitioning
algorithm based on recursive thresholding (PART) clustering to identify
small genomic clusters of relevance, followed by running the clusters
through gprofiler to reveal the biological relevance of each cluster.
## TS performs:
* Data normalization and processing
* PCA plots
* Differential gene expression (conditional and temporal)
* PART clustering
* Heatmaps for both differential expression summary and clustering results
* Trajectory of identified clusters
* Gprofiler (functional enrichment) analysis of clusters
* Dotplots and MDS plots of Gprofiler results
* Nearest ancestor clustering of GOs
* GO ancestor queries
## Installation
**A Bioconductor release is in progress**
You can install the development version of TimeSeriesAnalysis like so:
``` r
install.packages("devtools")
devtools::install_github("Ylefol/TimeSeriesAnalysis")
```
To instal Devtools, visit the
[Devtools main page](https://www.r-project.org/nosvn/pandoc/devtools.html)
Certain bioconductor packages will have to be installed before
installation of TimeSeriesAnalysis. The code snippet below gives the method along with
some of the packages that should be installed via this method.
``` r
if (!require("BiocManager", quietly = TRUE))
install.packages("BiocManager")
bio_pkgs <- c('DESeq2','GOSemSim','GO.db','limma','ComplexHeatmap',
'SummarizedExperiment','org.Hs.eg.db','org.Mm.eg.db','org.Ce.eg.db',
BiocFileCache)
BiocManager::install(bio_pkgs)
```
## Rmarkdown format
TimeSeriesAnalysis was developed with user-friendliness in mind, the
core script of the package is a Rmarkdown file with explanations of what
each code block performs. Computationally intensive code blocks save
their work in order to avoid loss of computation time.
TimeSeriesAnalysis comes with a test dataset. Users are recommended to
first run the package with the test data to then modify the Rmarkdown
script with the necessary information for their purposes (input files,
organism, genes of interest etc…). The rmarkdown script and the expected
result from an example run can be found and downloaded from this
repository, they are located in the ‘rmarkdown_method’ folder. If users
prefer a script approach, the equivalent of the rmarkdown report is
provided in two scripts, one for the computation tasks and a second for
the analyses. These two scripts are found in the ‘script_method’ folder.
Both the rmarkdown and script methods are found in the ‘supp_to_bioconductor’ branch.
## Data
Three different datasets have been used to test this pipeline, one of which has been
preserved as an example datasets for the vignettes of this pipeline. More information
on the other datasets can be viewed through the publications listed below.
The PBMC dataset is a time series experiment with three time points that explores
and compares three AICDA/AID (Activation Induced Cytidine DeAminase) activation cocktails.
The experiment seeked to identify which activation cocktail properly activated AID
through both the expression of the AID gene and the activation of class switch recombination.
## Tutorial
A tutorial can be found within the [pipelines website](https://ylefol.github.io/TimeSeriesAnalysis/).
Additionally the documentation can be found in the references tab.
## Microarray based data
For microarray data, a streamlined method is in the works. Currently microarray data needs to be inputed
as a Elist, specifically an E list saved as a rds object.
``` r
my_path_data<-'data/micro_arr/my_limma_dta.rds'
my_path_sample_dta<-'data/micro_arr/sample_file.csv'
#Set-up time series object parameters
diff_exp_type<-'limma'
```
The rds file given to 'my_path_data' contains the Elist produced by limma processing
of microarray data. It is also important to set the differential gene expression type (diff_exp_type)
to 'limma'.
## Using the pipeline
NOTE: You must still install the TimeSeriesAnalysis package as defined above
Since the pipeline's source code is available, individuals are free to take the code
and adapt it as needed. However for users who would like the easiest approach to use the
pipeline as is, it is recommended to clone or download this github repository to your local computer
and work from the Rmarkdown method folder within the repository.
Cloning or downloading can be done from the [TimeSeriesAnalysis github page](https://github.com/Ylefol/TimeSeriesAnalysis)
by using the green button called 'code'.
From there, the Rmarkdown file can be edited.
To edit the file, it is recommended to use rstudio, which can be downloaded [here](https://www.rstudio.com/products/rstudio/download/).
Note that it is recommended to download the free version of RStudio Desktop, not RStudio Server.
The only code block of code is the parameter set-up block (2nd) as well as the title
at the very top of the Rmarkdown document.
## Launching the pipeline
The pipeline can be launched from rstudio itself by using the 'knit' button, or it can be launched from a command line using the
following command from the TimeSeriesAnalysis repository:
```{r, engine = 'bash', eval = FALSE}
Rscript -e "rmarkdown::render('rmarkdown_method/TS_analysis.Rmd',output_file='TS_analysis.html')"
```
## Publication
A manuscript detailing the TimeSeriesAnalysis pipeline published with NAR genomics and bioinformatics.
[TiSA: TimeSeriesAnalysis—a pipeline for the analysis of longitudinal transcriptomics data](https://academic.oup.com/nargab/article/5/1/lqad020/7069286)
The version/DOI of this pipeline used within the manuscript is 10.5281/zenodo.7616032.
A ageing related study was performed in part through the use of TiSA.
[Krill oil protects dopaminergic neurons from age-related degeneration through temporal transcriptome rewiring and suppression of several hallmarks of aging](https://www.aging-us.com/article/204375/text)
## Additional information
The 'clusterGenomics' package was no longer maintained on CRAN and therefore the necessary
scripts were brought over to this package for it's implementation without the need
to download the clusterGenomics package from source.
Within this package, in the 'R/clusterGenomics.R' code was written by the authors of
clusterGenomics. It was imported to TimeSeriesAnalysis by the authors of
TimeSeriesAnalysis.
Owner
- Name: (WIP DEV) Bioconductor Packages
- Login: bioconductor-source
- Kind: organization
- Email: maintainer@bioconductor.org
- Website: https://bioconductor.org
- Repositories: 1
- Profile: https://github.com/bioconductor-source
Source code for packages accepted into Bioconductor
GitHub Events
Total
Last Year
Dependencies
DESCRIPTION
cran
- BiocFileCache * depends
- R >= 4.2 depends
- SummarizedExperiment * depends
- AnnotationDbi * imports
- BiocFileCache * imports
- BiocGenerics * imports
- BiocManager * imports
- ComplexHeatmap * imports
- DESeq2 * imports
- GO.db * imports
- GOSemSim * imports
- GenomicRanges * imports
- RColorBrewer * imports
- SummarizedExperiment * imports
- data.table * imports
- dplyr * imports
- dynamicTreeCut * imports
- ggplot2 * imports
- ggrepel * imports
- gprofiler2 * imports
- grDevices * imports
- grid * imports
- htmltools * imports
- htmlwidgets * imports
- limma * imports
- methods * imports
- org.Ce.eg.db * imports
- org.Hs.eg.db * imports
- org.Mm.eg.db * imports
- plotly * imports
- reshape2 * imports
- rstudioapi * imports
- stats * imports
- stringi * imports
- stringr * imports
- tibble * imports
- tictoc * imports
- utils * imports
- knitr * suggests
- rmarkdown * suggests