https://github.com/compomics/spectrum_similarity

Scoring functions to compare MS/MS spectra

Science Score: 23.0%

This score indicates how likely this project is to be science-related based on various indicators:

○ CITATION.cff file
○ codemeta.json file
○ .zenodo.json file
✓ DOI references: Found 2 DOI reference(s) in README
✓ Academic publication links: Links to acs.org
○ Academic email domains
○ Institutional organization owner
○ JOSS paper metadata
○ Scientific vocabulary similarity: Low similarity (8.5%) to scientific vocabulary

Keywords

bioinformatics proteomics scoring-functions spectrum-similarity

Last synced: 5 months ago · JSON representation

Repository

Basic Info

Host: GitHub
Owner: CompOmics
Language: Java
Default Branch: master
Homepage:
Size: 271 KB

Statistics

Stars: 11
Watchers: 18
Forks: 7
Open Issues: 0
Releases: 0

Archived

Topics

bioinformatics proteomics scoring-functions spectrum-similarity

Created over 10 years ago · Last pushed almost 9 years ago

Metadata Files

Readme License

Project description

Scoring functions that assess spectrum similarity play a crucial role in many computational mass spectrometry algorithms. These scoring functions can be used to compare an acquired MS/MS spectrum against two different types of target spectra: either a theoretical MS/MS spectrum derived from a peptide in a sequence database, or another, previously acquired MS/MS spectrum. The former is typically encountered in database searching, while the latter is used in spectrum clustering, spectral library searching, or the detection of unique spectra between different data sets in comparative proteomics studies.

The most commonly used scoring functions in experimental versus theoretical spectrum matching can be divided into two groups:

- non-probabilistic (cross-correlation, as used in SEQUEST)
- probabilistic (scoring functions derived from the cumulative binomial probability, as in Andromeda and MS Amanda)
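
To illustrate the probabilistic group, here is a minimal sketch of a score derived from a cumulative binomial probability. It assumes the common form in which `n` theoretical peaks are considered, `k` of them are matched, and `p` (strictly between 0 and 1) is the probability of a random match; the score is -10 log10 of the probability of observing at least `k` matches by chance. The class and method names are hypothetical and are not taken from this project's code.

```java
// Hypothetical helper, not part of spectrum_similarity: a sketch of a
// cumulative-binomial-derived score under the assumptions stated above.
final class BinomialScore {

    // log of the binomial coefficient C(n, j), accumulated term by term
    static double logChoose(int n, int j) {
        double result = 0.0;
        for (int i = 1; i <= j; i++) {
            result += Math.log(n - j + i) - Math.log(i);
        }
        return result;
    }

    // P(X >= k) for X ~ Binomial(n, p), with 0 < p < 1
    static double upperTail(int n, int k, double p) {
        if (k <= 0) {
            return 1.0;
        }
        if (k > n) {
            return 0.0;
        }
        double sum = 0.0;
        for (int j = k; j <= n; j++) {
            // each term is evaluated in log space to limit overflow for large n
            sum += Math.exp(logChoose(n, j) + j * Math.log(p) + (n - j) * Math.log(1.0 - p));
        }
        return Math.min(1.0, sum);
    }

    // Higher scores mean the observed number of matches is less likely to be random
    static double score(int n, int k, double p) {
        return -10.0 * Math.log10(upperTail(n, k, p));
    }

    public static void main(String[] args) {
        // Illustrative values only: 20 theoretical peaks, 8 matched, p = 0.04
        System.out.println(score(20, 8, 0.04));
    }
}
```

How `p` is chosen (for example, from the number of peaks retained per m/z window) differs between search engines and is left as an input here.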

Scoring functions for the comparison of two experimental spectra:

- Normalized dot product (most commonly used in spectral library search algorithms such as SpectraST and BiblioSpec)
- Pearson's and Spearman's correlation coefficients
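
As a concrete illustration of the normalized dot product, the sketch below bins two peak lists onto a shared m/z grid and computes the cosine of the angle between the resulting intensity vectors. The fixed-width binning, the summing of intensities within a bin, and all names (`NormalizedDotProduct`, `bin`, `score`) are assumptions made for this example; the project's own implementation may preprocess spectra differently.

```java
// Hypothetical helper, not part of spectrum_similarity: a sketch of the
// normalized dot product on fixed-width m/z bins.
final class NormalizedDotProduct {

    // Sum peak intensities into bins of width binWidth covering [minMz, maxMz)
    static double[] bin(double[] mz, double[] intensity,
                        double minMz, double maxMz, double binWidth) {
        int bins = (int) Math.ceil((maxMz - minMz) / binWidth);
        double[] binned = new double[bins];
        for (int i = 0; i < mz.length; i++) {
            if (mz[i] < minMz || mz[i] >= maxMz) {
                continue; // peaks outside the range are ignored
            }
            int index = (int) ((mz[i] - minMz) / binWidth);
            binned[Math.min(index, bins - 1)] += intensity[i];
        }
        return binned;
    }

    // dot(a, b) / (|a| * |b|); returns 0 when either vector is all zeros
    static double score(double[] a, double[] b) {
        if (a.length != b.length) {
            throw new IllegalArgumentException("vectors must have equal length");
        }
        double dot = 0.0;
        double normA = 0.0;
        double normB = 0.0;
        for (int i = 0; i < a.length; i++) {
            dot += a[i] * b[i];
            normA += a[i] * a[i];
            normB += b[i] * b[i];
        }
        if (normA == 0.0 || normB == 0.0) {
            return 0.0;
        }
        return dot / Math.sqrt(normA * normB);
    }

    public static void main(String[] args) {
        // Two toy spectra with made-up peaks, binned at 1 m/z unit
        double[] first = bin(new double[] {100.2, 150.7, 201.1},
                             new double[] {10.0, 50.0, 25.0}, 100.0, 250.0, 1.0);
        double[] second = bin(new double[] {100.4, 150.6, 230.0},
                              new double[] {12.0, 45.0, 5.0}, 100.0, 250.0, 1.0);
        System.out.println(score(first, second));
    }
}
```

For non-negative intensities the score lies between 0 (no shared bins) and 1 (identical relative intensities), which makes it independent of the absolute intensity scale of either spectrum.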

Available scoring functions

This project contains the scoring functions listed in the Project description. They compare an acquired MS/MS spectrum against:

a theoretical spectrum via
- SEQUEST-like scoring function (non-probabilistic)
- Andromeda-like scoring function (probabilistic)
another acquired spectrum via
- Dot product
- Normalized dot product
- Normalized dot product with weights introduced from peak intensities
- Pearson's r
- Spearman's rho
- Mean Squared Error (MSE) (and also root MSE)
- Median Squared Error (MdSE) (and also root MdSE)
- Probabilistic scoring function (including peak intensities)
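
The error-based entries in the list above can be sketched as follows. This is a minimal example, assuming both spectra have already been reduced to intensity vectors of equal length (for instance by binning on a shared m/z grid); the class and method names are hypothetical and do not come from this project's code.

```java
import java.util.Arrays;

// Hypothetical helper, not part of spectrum_similarity: a sketch of the
// mean and median squared error between two aligned intensity vectors.
final class SquaredErrorScores {

    // Element-wise squared differences between the two vectors
    static double[] squaredErrors(double[] a, double[] b) {
        if (a.length != b.length || a.length == 0) {
            throw new IllegalArgumentException("vectors must be non-empty and of equal length");
        }
        double[] errors = new double[a.length];
        for (int i = 0; i < a.length; i++) {
            double diff = a[i] - b[i];
            errors[i] = diff * diff;
        }
        return errors;
    }

    // Mean Squared Error (MSE)
    static double mse(double[] a, double[] b) {
        double sum = 0.0;
        for (double error : squaredErrors(a, b)) {
            sum += error;
        }
        return sum / a.length;
    }

    // Median Squared Error (MdSE); less sensitive to a few large deviations
    static double mdse(double[] a, double[] b) {
        double[] errors = squaredErrors(a, b);
        Arrays.sort(errors);
        int mid = errors.length / 2;
        return errors.length % 2 == 1
                ? errors[mid]
                : (errors[mid - 1] + errors[mid]) / 2.0;
    }

    // Root variants bring the value back to the intensity scale
    static double rootMse(double[] a, double[] b) {
        return Math.sqrt(mse(a, b));
    }

    static double rootMdse(double[] a, double[] b) {
        return Math.sqrt(mdse(a, b));
    }

    public static void main(String[] args) {
        double[] first = {1.0, 2.0, 3.0, 4.0};
        double[] second = {1.0, 2.0, 3.0, 8.0};
        System.out.println("MSE = " + mse(first, second) + ", MdSE = " + mdse(first, second));
    }
}
```

Unlike the dot product and correlation measures, these are distances: lower values indicate more similar spectra, and the result depends on how intensities were scaled beforehand.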

Citation

Yılmaz et al: J Proteome Res., 2016, 15 (6), pp 1963–1970 (DOI:10.1021/acs.jproteome.6b00140)

If you use our differential pipeline stand-alone tool or GUI version, please include the reference above.

Download

Differential pipeline

The probabilistic scoring function was successfully applied in the differential pipeline to compare two experimental data sets in a differential analysis.

A stand-alone program and the GUI version of this stand-alone program can be downloaded here.

A pairwise spectrum view GUI

A pairwise spectrum view GUI enables manual inspection of how similar two spectra actually look, and can be downloaded here.

Book chapter

The scoring functions listed under Available scoring functions were evaluated for their ability to assess spectrum similarity on one of the CPTAC data sets. This evaluation is described in a book chapter.

The program to compare spectra with the available scoring functions, against either theoretical or experimental spectra, can be downloaded here.

The settings to perform spectrum comparison are in the bookChapter.properties file.

Usage

See the wiki for additional information on how to set up, run and configure spectrum comparison related projects.

Owner

Name: Computational Omics and Systems Biology Group
Login: CompOmics
Kind: organization
Email: compomics.list@gmail.com

Website: https://www.compomics.com/
Twitter: CompOmics
Repositories: 93
Profile: https://github.com/CompOmics

The CompOmics group, headed by Prof. Dr. Lennart Martens, specializes in the management, analysis and integration of high-throughput Omics data.

Dependencies

pom.xml maven

com.compomics:dbtoolkit 4.2
com.compomics:pride-asa-pipeline 1.5.0
com.compomics:utilities 4.3.12
com.jhlabs:filters 2.0.235
commons-configuration:commons-configuration 1.7
log4j:log4j 1.2.17
org.apache.commons:commons-math 2.0
org.springframework:spring-core 4.2.4.RELEASE
org.springframework:spring-web 4.2.4.RELEASE
junit:junit 3.8.1 test
