Science Score: 57.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
    Found CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
    Found 3 DOI reference(s) in README
  • Academic publication links
  • Academic email domains
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (9.6%) to scientific vocabulary
Last synced: 6 months ago · JSON representation ·

Repository

Basic Info
  • Host: GitHub
  • Owner: ProteinEngineering-PESB2
  • License: other
  • Language: TypeScript
  • Default Branch: main
  • Homepage: https://www.peptipedia.cl
  • Size: 9.44 MB
Statistics
  • Stars: 4
  • Watchers: 0
  • Forks: 0
  • Open Issues: 1
  • Releases: 0
Created over 2 years ago · Last pushed about 1 year ago
Metadata Files
Readme Citation

README.md

Peptipedia v2.0

This repository contains the source code for Peptipedia, a peptide sequence database and user-friendly web platform, available at app.peptipedia.cl.

Summary

In recent years, peptides have gained significant relevance due to their therapeutic properties. The surge in peptide production and synthesis has generated vast amounts of data, enabling the creation of comprehensive databases and information repositories. Advances in sequencing techniques and artificial intelligence have further accelerated the design of tailor-made peptides. However, leveraging these techniques requires versatile and continuously updated storage systems, along with tools that facilitate peptide research and the implementation of machine learning for predictive systems. This work introduces Peptipedia v2.0, one of the most comprehensive public repositories of peptides, supporting biotechnological research by simplifying peptide study and annotation. Peptipedia v2.0 has expanded its collection by over 45% with peptide sequences that have reported biological activities. The functional biological activity tree has been revised and enhanced, incorporating new categories such as cosmetic and dermatological activities, molecular binding, and antiageing properties. Utilizing protein language models and machine learning, more than 90 binary classification models have been trained, validated, and incorporated into Peptipedia v2.0. These models exhibit average sensitivities and specificities of 0.877±0.0530 and 0.873±0.054, respectively, facilitating the annotation of more than 3.6 million peptide sequences with unknown biological activities, also registered in Peptipedia v2.0. Additionally, Peptipedia v2.0 introduces description tools based on structural and ontological properties and user-friendly machine learning tools to facilitate the application of machine learning strategies to study peptide sequences.

How to cite

If you use Peptipedia in your research, please cite the following article:

Gabriel Cabas-Mora, Anamaría Daza, Nicole Soto-García, Valentina Garrido, Diego Alvarez, Marcelo Navarrete, Lindybeth Sarmiento-Varón, Julieta H Sepúlveda Yañez, Mehdi D Davari, Frederic Cadet, Álvaro Olivera-Nappa, Roberto Uribe-Paredes, David Medina-Ortiz, Peptipedia v2.0: a peptide sequence database and user-friendly web platform. A major update, Database, Volume 2024, 2024, baae113, https://doi.org/10.1093/database/baae113

bibtex @article{cabas2024peptipedia, title = {Peptipedia v2.0: a peptide sequence database and user-friendly web platform. A major update}, author = {Cabas-Mora, Gabriel and Daza, Anamar{\'\i}a and Soto-Garc{\'\i}a, Nicole and Garrido, Valentina and Alvarez, Diego and Navarrete, Marcelo and Sarmiento-Var{\'o}n, Lindybeth and Sep{\'u}lveda Ya{\~n}ez, Julieta H and Davari, Mehdi D and Cadet, Frederic and others}, journal = {Database}, volume = {2024}, pages = {baae113}, year = {2024}, publisher = {Oxford University Press UK}, doi = {10.1093/database/baae113} }

Requirements and instalation

This web application was implemented using a client-server architecture. The frontend and backend directories contain information about requirements and instalation in their own README.md.

You can find Peptipedia database in this Google Drive folder.

Owner

  • Name: ProteinEngineering-ML-group
  • Login: ProteinEngineering-PESB2
  • Kind: organization
  • Email: david.medina@umag.cl
  • Location: Chile

Developing and studying machine learning applications for protein engineering tasks

Citation (CITATION.cff)

cff-version: 1.2.0
message: If you use this software, please cite both the article from preferred-citation and the software itself.
authors:
  - family-names: Cabas-Mora
    given-names: Gabriel
  - family-names: Daza
    given-names: Anamar\'\ia
  - family-names: Soto-Garc\'\ia
    given-names: Nicole
  - family-names: Garrido
    given-names: Valentina
  - family-names: Alvarez
    given-names: Diego
  - family-names: Navarrete
    given-names: Marcelo
  - family-names: Sarmiento-Varón
    given-names: Lindybeth
  - family-names: Sepúlveda Yañez
    given-names: Julieta H
  - family-names: Davari
    given-names: Mehdi D
  - family-names: Cadet
    given-names: Frederic
title: 'Peptipedia v2.0: a peptide sequence database and user-friendly web platform. A major update'
version: 1.0.0
doi: 10.1093/database/baae113
date-released: '2024-11-27'
preferred-citation:
  authors:
    - family-names: Cabas-Mora
      given-names: Gabriel
    - family-names: Daza
      given-names: Anamar\'\ia
    - family-names: Soto-Garc\'\ia
      given-names: Nicole
    - family-names: Garrido
      given-names: Valentina
    - family-names: Alvarez
      given-names: Diego
    - family-names: Navarrete
      given-names: Marcelo
    - family-names: Sarmiento-Varón
      given-names: Lindybeth
    - family-names: Sepúlveda Yañez
      given-names: Julieta H
    - family-names: Davari
      given-names: Mehdi D
    - family-names: Cadet
      given-names: Frederic
  title: 'Peptipedia v2.0: a peptide sequence database and user-friendly web platform. A major update'
  doi: 10.1093/database/baae113
  type: article-journal
  pages: baae113
  year: '2024'
  conference: {}
  publisher:
    name: Oxford University Press UK

GitHub Events

Total
  • Issues event: 1
  • Watch event: 2
  • Delete event: 2
  • Issue comment event: 1
  • Push event: 20
  • Pull request event: 2
  • Create event: 2
Last Year
  • Issues event: 1
  • Watch event: 2
  • Delete event: 2
  • Issue comment event: 1
  • Push event: 20
  • Pull request event: 2
  • Create event: 2

Dependencies

frontend/package-lock.json npm
  • 487 dependencies
frontend/package.json npm
  • @types/mui-datatables ^4.3.12 development
  • @types/react-copy-to-clipboard ^5.0.7 development
  • @types/react-dom ^18.2.18 development
  • @types/react-json-to-csv ^1.2.4 development
  • @types/react-plotly.js ^2.6.3 development
  • 3dmol 1.8.0
  • @emotion/react ^11.11.1
  • @emotion/styled ^11.11.0
  • @mui/lab ^5.0.0-alpha.142
  • @mui/material ^5.14.7
  • @react-spring/web ^9.7.3
  • @vitejs/plugin-react ^4.0.4
  • axios ^1.5.0
  • highcharts ^11.1.0
  • jquery ^3.7.1
  • js-file-download ^0.4.12
  • mui-datatables ^4.3.0
  • proseqviewer ^1.1.9
  • react-copy-to-clipboard ^5.1.0
  • react-hot-toast ^2.4.1
  • react-icons ^4.10.1
  • react-json-to-csv ^1.2.0
  • react-jsx-highcharts ^5.0.1
  • react-plotly.js ^2.6.0
  • react-router-dom ^6.15.0
  • react-router-loading ^1.0.2
  • react-spinners ^0.13.8
  • typescript ^5.3.3
  • vite ^4.4.9
  • vite-plugin-compression ^0.5.1
backend/pyproject.toml pypi
  • biopython ==1.81
  • flask ==2.3.3
  • flask-cors ==4.0.0
  • gunicorn ==21.2.0
  • matplotlib ==3.7.2
  • networkx ==3.1
  • numpy ==1.25.2
  • pandas ==2.0.3
  • psycopg2-binary ==2.9.7
  • scikit-learn ==1.3.0
  • scipy ==1.11.2
  • sqlalchemy ==2.0.20
backend/uv.lock pypi
  • backend 0.1.0
  • biopython 1.81
  • blinker 1.9.0
  • click 8.1.7
  • colorama 0.4.6
  • contourpy 1.3.1
  • cycler 0.12.1
  • flask 2.3.3
  • flask-cors 4.0.0
  • fonttools 4.55.0
  • greenlet 3.1.1
  • gunicorn 21.2.0
  • itsdangerous 2.2.0
  • jinja2 3.1.4
  • joblib 1.4.2
  • kiwisolver 1.4.7
  • markupsafe 3.0.2
  • matplotlib 3.7.2
  • networkx 3.1
  • numpy 1.25.2
  • packaging 24.2
  • pandas 2.0.3
  • pillow 11.0.0
  • psycopg2-binary 2.9.7
  • pyparsing 3.0.9
  • python-dateutil 2.9.0.post0
  • pytz 2024.2
  • scikit-learn 1.3.0
  • scipy 1.11.2
  • six 1.16.0
  • sqlalchemy 2.0.20
  • threadpoolctl 3.5.0
  • typing-extensions 4.12.2
  • tzdata 2024.2
  • werkzeug 3.1.3