ms2pip

MS²PIP: Fast and accurate peptide spectrum prediction for multiple fragmentation methods, instruments, and labeling techniques.

https://github.com/compomics/ms2pip

Science Score: 57.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
    Found CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
    Found 12 DOI reference(s) in README
  • Academic publication links
  • Academic email domains
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (11.8%) to scientific vocabulary

Keywords

machine-learning mass-spectrometry peptide-identification peptide-spectrum proteomics spectrum-prediction
Last synced: 6 months ago · JSON representation ·

Repository

MS²PIP: Fast and accurate peptide spectrum prediction for multiple fragmentation methods, instruments, and labeling techniques.

Basic Info
Statistics
  • Stars: 43
  • Watchers: 18
  • Forks: 18
  • Open Issues: 9
  • Releases: 40
Topics
machine-learning mass-spectrometry peptide-identification peptide-spectrum proteomics spectrum-prediction
Created almost 9 years ago · Last pushed 12 months ago
Metadata Files
Readme License Citation

README.rst

.. image:: https://github.com/compomics/ms2pip_c/raw/releases/img/ms2pip_logo_1000px.png
   :width: 150px
   :height: 150px

|

.. image:: https://img.shields.io/github/v/release/compomics/ms2pip_c?include_prereleases&style=flat-square
   :target: https://github.com/compomics/ms2pip_c/releases/latest/
.. image:: https://img.shields.io/pypi/v/ms2pip?style=flat-square
   :target: https://pypi.org/project/ms2pip/
.. image:: https://img.shields.io/github/actions/workflow/status/compomics/ms2pip_c/test.yml?branch=releases&label=tests&style=flat-square
   :target: https://github.com/compomics/ms2pip_c/actions/workflows/test.yml
.. image:: https://img.shields.io/github/actions/workflow/status/compomics/ms2pip_c/build_and_publish.yml?style=flat-square
   :target: https://github.com/compomics/ms2pip_c/actions/workflows/build_and_publish.yml
.. image:: https://img.shields.io/github/issues/compomics/ms2pip_c?style=flat-square
   :target: https://github.com/compomics/ms2pip_c/issues/
.. image:: https://img.shields.io/github/last-commit/compomics/ms2pip_c?style=flat-square
   :target: https://github.com/compomics/ms2pip_c/commits/releases/
.. image:: https://img.shields.io/github/license/compomics/ms2pip_c?style=flat-square
   :target: https://www.apache.org/licenses/LICENSE-2.0
.. image:: https://img.shields.io/twitter/follow/compomics?style=social
   :target: https://twitter.com/compomics

---------------------------------------------------------------------------------------------------

MSPIP: MS2 Peak Intensity Prediction - Fast and accurate peptide fragmentation
spectrum prediction for multiple fragmentation methods, instruments and labeling techniques.

---------------------------------------------------------------------------------------------------

About
-----

MSPIP is a tool to predict MS2 peak intensities from peptide sequences. The result is a predicted
peptide fragmentation spectrum that accurately resembles its observed equivalent. These predictions
can be used to validate peptide identifications, generate proteome-wide spectral libraries, or to
select discriminative transitions for targeted proteomics. MSPIP employs the
`XGBoost `_ machine learning algorithm and is written in
Python and C.

.. figure:: https://raw.githubusercontent.com/compomics/ms2pip/v4.0.0/img/mirror-DVAQIFNNILR-2.png

   Mirror plot of an observed (top) and MSPIP-predicted (bottom) spectrum for the peptide
   ``DVAQIFNNILR/2``.

You can install MSPIP on your machine by following the
`installation instructions `_. For a more
user-friendly experience, go to the `MSPIP web server `_. There,
you can easily upload a list of peptide sequences, after which the corresponding predicted MS2
spectra can be downloaded in multiple file formats. The web server can also be contacted through
the `RESTful API `_.

The MSPIP Python application can perform the following tasks:

- ``predict-single``: Predict fragmentation spectrum for a single peptide and optionally visualize
  the spectrum.
- ``predict-batch``: Predict fragmentation spectra for a batch of peptides.
- ``predict-library``: Predict a spectral library from protein FASTA file.
- ``correlate``: Compare predicted and observed intensities and optionally compute correlations.
- ``correlate-single``: Compare predicted and observed intensities for a single peptide spectrum.
- ``get-training-data``: Extract feature vectors and target intensities from observed spectra for
  training.
- ``annotate-spectra``: Annotate peaks in observed spectra.

MSPIP supports a wide range of PSM input formats and spectrum output formats, and includes
pre-trained models for multiple fragmentation methods, instruments and labeling techniques. See
`Usage `_ for more information.

Related projects
----------------

- `MSRescore `_: Use MSPIP and other peptide prediction
  tools to boost peptide identification results.
- `DeepLC `_: Retention time prediction for (modified)
  peptides using deep learning.
- `IM2Deep `_: Ion mobility prediction for (modified)
  peptides using deep learning.
- `psm_utils `_: Common utilities for parsing and handling
  peptide-spectrum matches and search engine results in Python

Citations
---------

If you use MSPIP for your research, please cite the following publication:

- Declercq, A., Bouwmeester, R., Chiva, C., Sabid, E., Hirschler, A., Carapito, C., Martens, L.,
  Degroeve, S., Gabriels, R. (2023). Updated MSPIP web server supports cutting-edge proteomics
  applications. `Nucleic Acids Research` `doi:10.1093/nar/gkad335 `_

Prior MSPIP publications:

- Gabriels, R., Martens, L., & Degroeve, S. (2019). Updated MSPIP web server
  delivers fast and accurate MS2 peak intensity prediction for multiple
  fragmentation methods, instruments and labeling techniques. `Nucleic Acids
  Research` `doi:10.1093/nar/gkz299 `_
- Degroeve, S., Maddelein, D., & Martens, L. (2015). MSPIP prediction server:
  compute and visualize MS2 peak intensity predictions for CID and HCD
  fragmentation. `_Nucleic Acids Research`, 43(W1), W326W330.
  `doi:10.1093/nar/gkv542 `_
- Degroeve, S., & Martens, L. (2013). MSPIP: a tool for MS/MS peak intensity
  prediction. `Bioinformatics (Oxford, England)`, 29(24), 3199203.
  `doi:10.1093/bioinformatics/btt544 `_

Please also take note of, and mention, the MSPIP version you used.

Full documentation
------------------

The full documentation, including installation instructions, usage examples,
and the command-line and Python API reference, can be found at
`ms2pip.readthedocs.io `_.

Contributing
------------

Bugs, questions or suggestions? Feel free to post an issue in the
`issue tracker `_ or to make a pull
request. Any contribution, small or large, is welcome!

Owner

  • Name: Computational Omics and Systems Biology Group
  • Login: CompOmics
  • Kind: organization
  • Email: compomics.list@gmail.com

The CompOmics group, headed by Prof. Dr. Lennart Martens, specializes in the management, analysis and integration of high-throughput Omics data.

Citation (CITATION.cff)

cff-version: 1.2.0
message: "If you use this software, please cite it as below."
authors:
  - family-names: "Gabriels"
    given-names: "Ralf"
    orcid: "https://orcid.org/0000-0002-1679-1711"
  - family-names: "Velghe"
    given-names: "Kevin"
    orcid: "https://orcid.org/0000-0002-9968-6043"
  - family-names: "Martens"
    given-names: "Lennart"
    orcid: "https://orcid.org/0000-0003-4277-658X"
  - family-names: "Degroeve"
    given-names: "Sven"
    orcid: "https://orcid.org/0000-0001-8349-3370"
title: "MS²PIP"
url: "https://github.com/compomics/ms2pip"
license: "Apache-2.0"
preferred-citation:
  type: article
  doi: "10.1093/nar/gkad335"
  journal: "Nucleic Acids Research"
  title: "Updated MS²PIP web server supports cutting-edge proteomics applications"
  year: 2023
  abstract: "MS²PIP is a data-driven tool that accurately predicts peak intensities for a given peptide's fragmentation mass spectrum. Since the release of the MS²PIP web server in 2015, we have brought significant updates to both the tool and the web server. In addition to the original models for CID and HCD fragmentation, we have added specialized models for the TripleTOF 5600+ mass spectrometer, for TMT-labeled peptides, for iTRAQ-labeled peptides, and for iTRAQ-labeled phosphopeptides. Because the fragmentation pattern is heavily altered in each of these cases, these additional models greatly improve the prediction accuracy for their corresponding data types. We have also substantially reduced the computational resources required to run MS²PIP, and have completely rebuilt the web server, which now allows predictions of up to 100 000 peptide sequences in a single request. The MS²PIP web server is freely available at https://iomics.ugent.be/ms2pip/."
  authors:
    - family-names: "Declercq"
      given-names: "Arthur"
      orcid: "https://orcid.org/0000-0002-9376-1399"
    - family-names: "Bouwmeester"
      given-names: "Robbin"
      orcid: "https://orcid.org/0000-0001-6807-7029"
    - family-names: "Cristina"
      given-names: "Chiva"
      orcid: "https://orcid.org/0000-0001-8150-6203"
    - family-names: "Sabidó"
      given-names: "Eduard"
      orcid: "https://orcid.org/0000-0001-6506-7714"
    - family-names: "Hirschler"
      given-names: "Aurélie"
      orcid: "https://orcid.org/0000-0001-5066-6263"
    - family-names: "Carapito"
      given-names: "Christine"
      orcid: "https://orcid.org/0000-0002-0079-319X"
    - family-names: "Martens"
      given-names: "Lennart"
      orcid: "https://orcid.org/0000-0003-4277-658X"
    - family-names: "Degroeve"
      given-names: "Sven"
      orcid: "https://orcid.org/0000-0001-8349-3370"
    - family-names: "Gabriels"
      given-names: "Ralf"
      orcid: "https://orcid.org/0000-0002-1679-1711"

GitHub Events

Total
  • Watch event: 1
  • Issue comment event: 1
  • Fork event: 1
Last Year
  • Watch event: 1
  • Issue comment event: 1
  • Fork event: 1

Issues and Pull Requests

Last synced: 6 months ago

All Time
  • Total issues: 44
  • Total pull requests: 76
  • Average time to close issues: 10 months
  • Average time to close pull requests: about 2 months
  • Total issue authors: 20
  • Total pull request authors: 9
  • Average comments per issue: 1.64
  • Average comments per pull request: 0.46
  • Merged pull requests: 46
  • Bot issues: 1
  • Bot pull requests: 25
Past Year
  • Issues: 2
  • Pull requests: 5
  • Average time to close issues: N/A
  • Average time to close pull requests: 28 days
  • Issue authors: 2
  • Pull request authors: 3
  • Average comments per issue: 0.0
  • Average comments per pull request: 1.2
  • Merged pull requests: 3
  • Bot issues: 0
  • Bot pull requests: 0
Top Authors
Issue Authors
  • RalfG (20)
  • vrkosk (4)
  • JB91451 (4)
  • ypriverol (2)
  • MarcIsak (2)
  • gukongxin (1)
  • edunivers (1)
  • AXXQ (1)
  • shubham1637 (1)
  • comics-asiis (1)
  • Luxxii (1)
  • RobbinBouwmeester (1)
  • MKoesters (1)
  • dependabot-preview[bot] (1)
  • patrick-willems (1)
Pull Request Authors
  • RalfG (40)
  • dependabot-preview[bot] (16)
  • ArthurDeclercq (10)
  • paretje (10)
  • dependabot[bot] (9)
  • Cajac102 (1)
  • Harry-young-0 (1)
  • rodvrees (1)
  • jaspervdh (1)
  • RobbinBouwmeester (1)
  • Jaspervdh (1)
Top Labels
Issue Labels
bug (8) enhancement (5) deployment (3) documentation (3) question (3) feature (1)
Pull Request Labels
dependencies (27) deployment (11) enhancement (11) bug (8) documentation (6) feature (1)

Dependencies

.github/workflows/build_and_publish.yml actions
  • actions/checkout v2 composite
  • actions/download-artifact v2 composite
  • actions/setup-python v2 composite
  • actions/upload-artifact v2 composite
  • pypa/gh-action-pypi-publish master composite
.github/workflows/test.yml actions
  • actions/checkout v2 composite
  • actions/setup-python v2 composite
  • actions/upload-artifact v2 composite
pyproject.toml pypi
  • cython * develop
  • pytest * develop
  • biopython >=1.74,<2
  • click >=7,<9
  • deeplc ^0.1.14
  • lxml ^4
  • matplotlib ^3.0
  • numpy >=1.16,<2
  • pandas >=0.24,<2
  • psycopg2 ^2.8.4
  • pyteomics >=3.5,<5
  • python ^3.7
  • rich >=13
  • spectrum_utils ^0.3.5
  • sqlalchemy ^1.3.13
  • tables >=3.4
  • tomlkit >=0.5.11,<1
  • tqdm >=4,<5
  • xgboost ^1.3