https://github.com/compomics/spectrum_similarity

Scoring functions to compare MS/MS spectra

https://github.com/compomics/spectrum_similarity

Science Score: 23.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
  • codemeta.json file
  • .zenodo.json file
  • DOI references
    Found 2 DOI reference(s) in README
  • Academic publication links
    Links to: acs.org
  • Academic email domains
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (8.5%) to scientific vocabulary

Keywords

bioinformatics proteomics scoring-functions spectrum-similarity
Last synced: 5 months ago · JSON representation

Repository

Scoring functions to compare MS/MS spectra

Basic Info
  • Host: GitHub
  • Owner: CompOmics
  • Language: Java
  • Default Branch: master
  • Homepage:
  • Size: 271 KB
Statistics
  • Stars: 11
  • Watchers: 18
  • Forks: 7
  • Open Issues: 0
  • Releases: 0
Archived
Topics
bioinformatics proteomics scoring-functions spectrum-similarity
Created over 10 years ago · Last pushed almost 9 years ago
Metadata Files
Readme License

README.md

Project description

The scoring functions to assess spectrum similarity play a crucial role in many computational mass spectrometry algorithms. These scoring functions can be used to compare an acquired MS/MS spectrum against two different types of target spectra: either against a theoretical MS/MS spectrum derived from a peptide from a sequence database, or against another, previously acquired acquired MS/MS spectrum. The former is typically encountered in database searching, while the latter is used in spectrum clustering, spectral library searching, or the detection of unique spectra between different data sets in comparative proteomics studies.

The most commonly used scoring functions in experimental versus theoretical spectrum matching could be divided into two groups:

  • non-probabilistic (cross correlations which was used for SEQUEST)
  • probabilistic (cumulative binomial probability derived scoring functions in Andromeda and MS Amanda)

Scoring functions for the comparison of two experimental spectra: - Normalized dot product (most commonly used in spectrum library search algorithms such as SpectraST, BiblioSpec) - Pearson’s and Spearman's correlation coefficients

Avaliable scoring functions

This project contains the enlisted scoring functions in Project description. The scoring functions in order to compare an acquired MS/MS spectrum against:

  • a theoretical spectrum via

    • SEQUEST-like scoring function (non-probabilistic)
    • Andromeda-like scoring function (probabilistic)
  • another acquired spectrum via

    • Dot product
    • Normalized dot product
    • Normalized dot product with introducting weights from peak intensities
    • Pearson's r
    • Spearman's rho
    • Mean Squared Error (MSE) (and also root MSE)
    • Median Squared Error (MdSE) (and also root MdSE)
    • Probabilistic scoring functon (including peak intensities)

Citation

Yılmaz et al: J Proteome Res., 2016, 15 (6), pp 1963–1970 (DOI:10.1021/acs.jproteome.6b00140)

If you use our differential pipeline stand-alone tool or GUI version, please include the reference above.


Download

Differential pipeline

The probabilistic scoring function was succesfully applied on the differential pipeline in order to compare two experimental data sets in a differential analysis.

A stand-alone program and the GUI version of this stand-alone program can be downloaded here.

A pairwise spectrum view GUI

A pairwise spectrum view GUI enables the manual inspection of how spectra actually look alike and can be downloaded here.

Book chapter

The scoring functions enlisted [Avaliable scoring functions] were used to evaluate their ability to assess spectrum similarity by evaluating them on one of the CPTAC data sets. This has been described in a book chapter.

The program to compare spectra with the avaliable scoring functions, against either theoretical or experimental spectra can be downloaded here.

The settings to perform spectrum comparison are in the bookChapter.properties file.


Usage

See the wiki for additional information on how to setup, run and configure spectrum comparison related projects.

Owner

  • Name: Computational Omics and Systems Biology Group
  • Login: CompOmics
  • Kind: organization
  • Email: compomics.list@gmail.com

The CompOmics group, headed by Prof. Dr. Lennart Martens, specializes in the management, analysis and integration of high-throughput Omics data.

GitHub Events

Total
Last Year

Dependencies

pom.xml maven
  • com.compomics:dbtoolkit 4.2
  • com.compomics:pride-asa-pipeline 1.5.0
  • com.compomics:utilities 4.3.12
  • com.jhlabs:filters 2.0.235
  • commons-configuration:commons-configuration 1.7
  • log4j:log4j 1.2.17
  • org.apache.commons:commons-math 2.0
  • org.springframework:spring-core 4.2.4.RELEASE
  • org.springframework:spring-web 4.2.4.RELEASE
  • junit:junit 3.8.1 test