plf

Peptide location fingerprinting (PLF) is a technique capable of identifying modified proteins and potential causal mechanisms in complex biological samples. Protein Locational Fingerprinter determines statisticaly significant peptide yields from protein functional domains.

https://github.com/maxozo/plf

Science Score: 49.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
    Found 2 DOI reference(s) in README
  • Academic publication links
    Links to: pubmed.ncbi, ncbi.nlm.nih.gov, sciencedirect.com
  • Academic email domains
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (13.8%) to scientific vocabulary

Keywords

domains mass-spectrometry proteomics
Last synced: 6 months ago · JSON representation

Repository

Peptide location fingerprinting (PLF) is a technique capable of identifying modified proteins and potential causal mechanisms in complex biological samples. Protein Locational Fingerprinter determines statisticaly significant peptide yields from protein functional domains.

Basic Info
Statistics
  • Stars: 1
  • Watchers: 2
  • Forks: 1
  • Open Issues: 0
  • Releases: 2
Topics
domains mass-spectrometry proteomics
Created about 5 years ago · Last pushed 7 months ago
Metadata Files
Readme License Citation

README.md

Peptide location fingerprinting (PLF) is a technique capable of identifying modified proteins and potential causal mechanisms in complex biological samples. In standard proteomics, proteins are trypsinised which generate peptides whose sequence identities and relative abundances are measured by LC-MS/MS. During this process most proteins are only partially digested, due to differing solubilities, stabilities and enzyme susceptibilities related to their higher order structures. By mapping and quantifying LC-MS/MS-detected peptides within specific regions, PLF enables the detection of statistical differences in the regional digestibility along the protein structure due to ageing and disease mechanisms.

Quick start

Instalation

!python version 3.9 is required If you have a different default version and you got miniconda/miniforge/anaconda installed you can create a python3.9 env with:

     conda create -n py39env python=3.9
     conda activate py39env

Then when you have checked that your python is 3.9 you can create a new virtual python enviroment

     python -m venv mplf_venv

Activate the enviroment (Linux and Mac):

       source mplf_venv/bin/activate

Activate the enviroment (Windows)

       .\mplf_venv\Scripts\activate.bat

Clone the PLF repo:

      git clone https://github.com/maxozo/PLF.git

Install requirements:

       cd PLF
       pip install -r requirements.txt          

Test data

To just try to run the analysis on test data please run (thats also available for download on our MPLF website ):

     python PLF.py --test --outname My_Test_Run

Note: MyTestRun can be a path/to/MyTestRun/Filename

Your own data

  1. Prepeare a file that lists Protein name (optional if source protein not determined), Peptide sequence (remove any special characters from these), Sample of protein belionging and spectra (can be multiple columns as per: spectra_1,spectra_2, etc. -- these will be added up): as per this file.

| Protein | Sample | Peptide | spectra | spectra2 | spectra3 | spectra4 | | -------- | -------- | -------- | -------- | -------- | -------- | -------- | | FBLN1HUMAN | 20180601SherrattMMO15.raw (FullSkin15) | CLAFECPENYR | 0 | 1 | 0 | 0 | | FBLN1HUMAN | 20180601SherrattMMO15.raw (FullSkin_15) | CVDVDECAPPAEPCGK | 0 | 1 | 0 | 0 |

  1. Prepeare a tsv file that lists the experimental design - as per this file. If paired make sure that the rows list matching pairs, otherwise any order is ok.

| forearm | buttock |
| -------- | -------- |
| sample1 | sample2 | | sample3 | sample4 | | sample6 | sample7 |

  1. Run the MPLF pipeline:

     python ../../PLF.py --experimental_design Experiment_feed.tsv --peptides Sample_Data_For_Analysis.csv --spiecies HUMAN --domain_types DOMAINS,REGIONS,TOPO_DOM,TRANSMEM,REPEAT,50AA,75AA,100AA --paired True --outname MPLF_RUN --p_threshold 0.05
    

Params:

      Required:

      --experimental_design  This allows to provide the experimental defign file file

      --peptides   This allows to provide your peptides file

      --spiecies   The spiecies of the peptides

      --paired    Is the samples specified in experimental_design paired or unpaired

      --outname  The name of the output files

      Optional:

      --cpus     (default=max available) How many cpus to use for analysis.

      --p_threshold    (dafault=0.05) Only return proteins that has at least one domain with a significance threshold lover or equal to specified
  1. Results will produce two files {outname specified}.tsv and {outname specified}.mplf file. TSV file will list all the domains, their p values, quantified data, normalised data etc. MPLF file can be uploaded to Manchester Proteome Location Fingerprinter (MPLF) to perform visualisations of the data.

Methods

For details please read our publication

References (please cite)

Owner

  • Name: Matiss
  • Login: maxozo
  • Kind: user
  • Location: London
  • Company: Wellcome Sanger Institute | University of Cambridge | University of Manchester

Principal Bioinformatician | Research Associate | Software Engineer

GitHub Events

Total
  • Release event: 1
  • Delete event: 1
  • Member event: 1
  • Push event: 18
  • Fork event: 1
  • Create event: 2
Last Year
  • Release event: 1
  • Delete event: 1
  • Member event: 1
  • Push event: 18
  • Fork event: 1
  • Create event: 2

Issues and Pull Requests

Last synced: 11 months ago

All Time
  • Total issues: 0
  • Total pull requests: 0
  • Average time to close issues: N/A
  • Average time to close pull requests: N/A
  • Total issue authors: 0
  • Total pull request authors: 0
  • Average comments per issue: 0
  • Average comments per pull request: 0
  • Merged pull requests: 0
  • Bot issues: 0
  • Bot pull requests: 0
Past Year
  • Issues: 0
  • Pull requests: 0
  • Average time to close issues: N/A
  • Average time to close pull requests: N/A
  • Issue authors: 0
  • Pull request authors: 0
  • Average comments per issue: 0
  • Average comments per pull request: 0
  • Merged pull requests: 0
  • Bot issues: 0
  • Bot pull requests: 0
Top Authors
Issue Authors
Pull Request Authors
Top Labels
Issue Labels
Pull Request Labels