https://github.com/compomics/spectrum_similarity
Scoring functions to compare MS/MS spectra
Science Score: 23.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
○CITATION.cff file
-
○codemeta.json file
-
○.zenodo.json file
-
✓DOI references
Found 2 DOI reference(s) in README -
✓Academic publication links
Links to: acs.org -
○Academic email domains
-
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (8.5%) to scientific vocabulary
Keywords
Repository
Scoring functions to compare MS/MS spectra
Basic Info
Statistics
- Stars: 11
- Watchers: 18
- Forks: 7
- Open Issues: 0
- Releases: 0
Topics
Metadata Files
README.md
Project description
The scoring functions to assess spectrum similarity play a crucial role in many computational mass spectrometry algorithms. These scoring functions can be used to compare an acquired MS/MS spectrum against two different types of target spectra: either against a theoretical MS/MS spectrum derived from a peptide from a sequence database, or against another, previously acquired acquired MS/MS spectrum. The former is typically encountered in database searching, while the latter is used in spectrum clustering, spectral library searching, or the detection of unique spectra between different data sets in comparative proteomics studies.
The most commonly used scoring functions in experimental versus theoretical spectrum matching could be divided into two groups:
- non-probabilistic (cross correlations which was used for SEQUEST)
- probabilistic (cumulative binomial probability derived scoring functions in Andromeda and MS Amanda)
Scoring functions for the comparison of two experimental spectra: - Normalized dot product (most commonly used in spectrum library search algorithms such as SpectraST, BiblioSpec) - Pearson’s and Spearman's correlation coefficients
Avaliable scoring functions
This project contains the enlisted scoring functions in Project description. The scoring functions in order to compare an acquired MS/MS spectrum against:
a theoretical spectrum via
- SEQUEST-like scoring function (non-probabilistic)
- Andromeda-like scoring function (probabilistic)
another acquired spectrum via
- Dot product
- Normalized dot product
- Normalized dot product with introducting weights from peak intensities
- Pearson's r
- Spearman's rho
- Mean Squared Error (MSE) (and also root MSE)
- Median Squared Error (MdSE) (and also root MdSE)
- Probabilistic scoring functon (including peak intensities)
Citation
Yılmaz et al: J Proteome Res., 2016, 15 (6), pp 1963–1970 (DOI:10.1021/acs.jproteome.6b00140)
If you use our differential pipeline stand-alone tool or GUI version, please include the reference above.
Download
Differential pipeline
The probabilistic scoring function was succesfully applied on the differential pipeline in order to compare two experimental data sets in a differential analysis.
A stand-alone program and the GUI version of this stand-alone program can be downloaded here.
A pairwise spectrum view GUI
A pairwise spectrum view GUI enables the manual inspection of how spectra actually look alike and can be downloaded here.
Book chapter
The scoring functions enlisted [Avaliable scoring functions] were used to evaluate their ability to assess spectrum similarity by evaluating them on one of the CPTAC data sets. This has been described in a book chapter.
The program to compare spectra with the avaliable scoring functions, against either theoretical or experimental spectra can be downloaded here.
The settings to perform spectrum comparison are in the bookChapter.properties file.
Usage
See the wiki for additional information on how to setup, run and configure spectrum comparison related projects.
Owner
- Name: Computational Omics and Systems Biology Group
- Login: CompOmics
- Kind: organization
- Email: compomics.list@gmail.com
- Website: https://www.compomics.com/
- Twitter: CompOmics
- Repositories: 93
- Profile: https://github.com/CompOmics
The CompOmics group, headed by Prof. Dr. Lennart Martens, specializes in the management, analysis and integration of high-throughput Omics data.
GitHub Events
Total
Last Year
Dependencies
- com.compomics:dbtoolkit 4.2
- com.compomics:pride-asa-pipeline 1.5.0
- com.compomics:utilities 4.3.12
- com.jhlabs:filters 2.0.235
- commons-configuration:commons-configuration 1.7
- log4j:log4j 1.2.17
- org.apache.commons:commons-math 2.0
- org.springframework:spring-core 4.2.4.RELEASE
- org.springframework:spring-web 4.2.4.RELEASE
- junit:junit 3.8.1 test