NormalyzerDE

Tools for normalization, evaluation of outliers, technical biases and batch effects and differential expression analysis.

https://github.com/computationalproteomics/normalyzerde

Science Score: 59.0%

This score indicates how likely this project is to be science-related based on various indicators:

○
CITATION.cff file
✓
codemeta.json file
Found codemeta.json file
✓
.zenodo.json file
Found .zenodo.json file
✓
DOI references
Found 2 DOI reference(s) in README
✓
Academic publication links
Links to: acs.org
✓
Committers with academic emails
1 of 9 committers (11.1%) from academic institutions
○
Institutional organization owner
○
JOSS paper metadata
○
Scientific vocabulary similarity
Low similarity (14.3%) to scientific vocabulary

Keywords

bioconductor bioinformatics limma normalization proteomics

Keywords from Contributors

bioconductor-package rnaseq derfinder mirror similarity-measurement similarity pathway-analysis gene-sets gene functional-similarity

Last synced: 6 months ago · JSON representation

Repository

Tools for normalization, evaluation of outliers, technical biases and batch effects and differential expression analysis.

Basic Info

Host: GitHub
Owner: ComputationalProteomics
Language: R
Default Branch: devel
Homepage: https://quantitativeproteomics.org/normalyzerde
Size: 6.01 MB

Statistics

Stars: 24
Watchers: 4
Forks: 7
Open Issues: 0
Releases: 2

Topics

bioconductor bioinformatics limma normalization proteomics

Created almost 8 years ago · Last pushed 8 months ago

Metadata Files

Readme Changelog

About

An online server running NormalyzerDE can be accessed at the following link:

https://quantitativeproteomics.org/normalyzerde

NormalyzerDE is a software designed to ease the process of selecting an optimal normalization approach for your dataset and to perform subsequent differential expression analysis.

NormalyzerDE includes several normalization approaches, a empirical Bayes-based statistical approach implemented as part of Limma and a newly implemented retention-time segmented normalization approach inspired by previously outlined approaches. The emprical-based based statistics has been shown to increase sensitivity over ANOVA when detecting differentially expressed features.

Cite NormalyzerDE

NormalyzerDE is published here

Willforss, J., Chawade, A., Levander, F. NormalyzerDE: Online tool for improved normalization of omics expression data and high-sensitivity differential expression analysis. Journal of Proteome Research 2018, 10.1021/acs.jproteome.8b00523.

Installation

NormalyzerDE can be installed from Bioconductor, or directly from GitHub:

install.packages("devtools") devtools::install_github("ComputationalProteomics/NormalyzerDE")

Running NormalyzerDE - Minimal example

library(NormalyzerDE)

Generate normalizations and normalization performance report.

normalyzer(jobName="rscript_norm", designPath="test_design.tsv", dataPath="test_data.tsv")

Calculate differential expression between groups 1-2 and 1-3 (defined in the design matrix).

normalyzerDE(jobName="rscript_de", designPath="test_design.tsv", dataPath="test_data.tsv", comparisons=c("1-2", "1-3"))

For more comprehensive documentation, check the Vignette at NormalyzerDE's Bioconductor page. More information about required input formats is available here.

Executing from command line

If you want to run NormalyzerDE directly from the command line this is possible by executing it through the Rscript command.

Rscript -e 'NormalyzerDE::normalyzer(jobName="rscript_norm", designPath="test_design.tsv", dataPath="test_data.tsv")' Rscript -e 'NormalyzerDE::normalyzerDE(jobName="rscript_de", designPath="test_design.tsv", dataPath="test_data.tsv", comparisons=c("1-2", "1-3"))'

References

(1) Bolstad, B. preprocessCore: A collection of pre-processing functions. 2018; https://github.com/bmbolstad/preprocessCore.

(2) Gentleman, R. C. et al. Bioconductor: open software development for computational biology and bioinformatics. Genome Biol. 2004, 5, R80.

(3) Huber, W.; von Heydebreck, A.; Sultmann, H.; Poustka, A.; Vingron, M. Variance stabilization applied to microarray data calibration and to the quantification of differential expression. Bioinformatics 2002, 18, S96–S104.

(4) Kammers, K.; Cole, R. N.; Tiengwe, C.; Ruczinski, I. Detecting significant changes in protein abundance. EuPA Open Proteom. 2015, 7, 11-19.

(5) Lyutvinskiy, Y.; Yang, H.; Rutishauser, D.; Zubarev, R. A. In Silico Instrumental Response Correction Improves Precision of Label-free Proteomics and Accuracy of Proteomics-based Predictive Models. Mol. Cell Proteomics 2013, 12, 2324–2331.

(6) Ritchie, M. E.; Phipson, B.; Wu, D.; Hu, Y.; Law, C. W.; Shi, W.; Smyth, G. K. limma powers differential expression analyses for RNA-sequencing and microarray studies. Nucleic Acids Res. 2015, 43, e47.

(7) van Ooijen, M. P.; Jong, V. L.; Eijkemans, M. J.; Heck, A. J.; Andeweg, A. C.; Binai, N. A.; van den Ham, H.-J. Identification of differentially expressed peptides in high-throughput proteomics data. Brief. Bioinform. 2017, 1–11.

(8) Wolfgang, H. et al. Orchestrating high-throughput genomic analysis with Bioconductor. Nat. Methods 2015, 12, 115–121.

Code organization

NormalyzerDE consists of a number of scripts and classes. They are focused around two separate workflows. One is for normalizing and evaluating the normalizations. The second is for performing differential expression analysis. Classes are contained in scripts with the same name.

NormalyzerDE schematics

The standard workflow for the normalization is the following:

The normalyzer function in the NormalyzerDE.R script is called, starting the process.
If applicable (that is, input is in Proteois or MaxQuant format), the dataset is preprocessed into the standard format using code in preparsers.R.
The input is verified to capture standard errors early on using code in inputVerification.R. This results in an instance of the NormalyzerDataset class.
The data is normalized using several normalization methods present in normMethods.R. This yields an instance of NormalyzerResults which links to the original NormalyzerDataset instance and also contains all the resulting normalized datasets.
If specified (and if a column with retention time values is present) retention-time segmented approaches are performed by applying normalizations from normMethods.R over retention time using functions present in higherOrderNormMethods.R.
The results are analyzed using functions present in analyzeResults.R. This yields an instance of NormalyzerEvaluationResults containing the evaluation results. This instance is attached to the NormalyzerResults object.
The final results are sent to outputUtils.R where the normalizations are written to an output directory, and to generatePlots.R which contains visualizations for the performance measures. It also uses code in printMeta.R and printPlots.R to output the results in a desired format.

When a normalized matrix is selected the analysis proceeds to the statistical analysis.

The normalyzerde function in the NormalyzerDE.R script is called starting the differential expression analysis pipeline.
An instance of NormalyzerStatistics is prepared containing the input data.
Code in the calculateStatistics.R script is used to calculate the statistical contrasts. The results are attached to the NormalyzerStatistics object.
The resulting statistics are used to generate a report and an annotated output matrix where key statistical measures are attached to the original matrix.

Owner

Name: Computational Proteomics
Login: ComputationalProteomics
Kind: organization
Location: Lund University

Repositories: 6
Profile: https://github.com/ComputationalProteomics

Computational Proteomics at department of Immunotechnology, Lund University

GitHub Events

Total

Watch event: 7
Push event: 2
Fork event: 1

Last Year

Watch event: 7
Push event: 2
Fork event: 1

Committers

Last synced: over 2 years ago

All Time

Total Commits: 484
Total Committers: 9
Avg Commits per committer: 53.778
Development Distribution Score (DDS): 0.157

Past Year

Commits: 26
Committers: 3
Avg Commits per committer: 8.667
Development Distribution Score (DDS): 0.154

Top Committers

Name	Email	Commits
Jakob Willforss	j**s@h**m	408
Jakob Willforss	j**s@i**e	49
Nitesh Turaga	n**a@g**m	14
Jakob Willforss	J****7	5
J Wokaty	j**y@s**u	2
vobencha	v**a@g**m	2
J Wokaty	j****y	2
Hervé Pagès	h**b@g**m	1
Fredrik Levander	f**r@n**e	1

Committer Domains (Top 20 + Academic)

nbis.se: 1 sph.cuny.edu: 1 immun.lth.se: 1

Issues and Pull Requests

Last synced: 6 months ago

All Time

Total issues: 37
Total pull requests: 3
Average time to close issues: 12 months
Average time to close pull requests: about 6 hours
Total issue authors: 14
Total pull request authors: 1
Average comments per issue: 1.81
Average comments per pull request: 1.0
Merged pull requests: 3
Bot issues: 0
Bot pull requests: 0

Past Year

Issues: 0
Pull requests: 0
Average time to close issues: N/A
Average time to close pull requests: N/A
Issue authors: 0
Pull request authors: 0
Average comments per issue: 0
Average comments per pull request: 0
Merged pull requests: 0
Bot issues: 0
Bot pull requests: 0

View more stats

Top Authors

Issue Authors

Jakob37 (21)
lisiarend (2)
arnscott (2)
flevander (2)
Bizes (1)
ofleitas (1)
fionamassspec (1)
froz9 (1)
delucalab (1)
Luq16 (1)
Jeff-Gui (1)
L3ft2di3 (1)
IgnatiusPang (1)

Pull Request Authors

Jakob37 (5)

Top Labels

Issue Labels

enhancement (11) bug (8) minor (5)

Pull Request Labels

Packages

Total packages: 1
Total downloads:
- bioconductor 18,987 total

Total dependent packages: 0
Total dependent repositories: 0
Total versions: 5
Total maintainers: 1

bioconductor.org: NormalyzerDE

Evaluation of normalization methods and calculation of differential expression analysis statistics

Homepage: https://github.com/ComputationalProteomics/NormalyzerDE
Documentation: https://bioconductor.org/packages/release/bioc/vignettes/NormalyzerDE/inst/doc/NormalyzerDE.pdf
License: Artistic-2.0
Latest release: 1.26.0
published 10 months ago

Versions: 5
Dependent Packages: 0
Dependent Repositories: 0
Downloads: 18,987 Total

Rankings

Dependent repos count: 0.0%

Dependent packages count: 0.0%

Stargazers count: 7.3%

Forks count: 10.0%

Average: 14.0%

Downloads: 52.6%

Maintainers (1)

jakob.willforss@hotmail.com

Last synced: 6 months ago

Dependencies

DESCRIPTION cran

R >= 3.6 depends
Biobase * imports
MASS * imports
RcmdrMisc * imports
SummarizedExperiment * imports
ape * imports
car * imports
ggforce * imports
ggplot2 * imports
limma * imports
matrixStats * imports
methods * imports
preprocessCore * imports
raster * imports
stats * imports
utils * imports
vsn * imports
BiocStyle * suggests
hexbin * suggests
knitr * suggests
rmarkdown * suggests
roxygen2 * suggests
testthat * suggests

NormalyzerDE

Science Score: 59.0%

Keywords

Keywords from Contributors

Repository

Basic Info

Statistics

Topics

Metadata Files

README.md

About

Cite NormalyzerDE

Installation

Running NormalyzerDE - Minimal example

Executing from command line

References

Code organization

Owner

GitHub Events

Total

Last Year

Committers

All Time

Past Year

Top Committers

Committer Domains (Top 20 + Academic)

Issues and Pull Requests

All Time

Past Year

Top Authors

Issue Authors

Pull Request Authors

Top Labels

Issue Labels

Pull Request Labels

Packages

bioconductor.org: NormalyzerDE

Rankings

Maintainers (1)

Dependencies