https://github.com/bioconductor-source/smad

Science Score: 36.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
  • DOI references
    Found 14 DOI reference(s) in README
  • Academic publication links
    Links to: nature.com
  • Academic email domains
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (10.2%) to scientific vocabulary
Last synced: 9 months ago

Repository

Basic Info
  • Host: GitHub
  • Owner: bioconductor-source
  • License: other
  • Language: R
  • Default Branch: devel
  • Size: 197 KB
Statistics
  • Stars: 0
  • Watchers: 2
  • Forks: 0
  • Open Issues: 0
  • Releases: 0
Created almost 2 years ago · Last pushed almost 2 years ago
Metadata Files
Readme Changelog License

README.md

Statistical Modelling of AP-MS Data (SMAD)

This R package implements statistical modelling of affinity purification–mass spectrometry (AP-MS) data to compute confidence scores to identify bona fide protein-protein interactions (PPI).

Installation

The development version can be installed from GitHub:

```{r}
devtools::install_github(repo="zqzneptune/SMAD")
library(SMAD)
```

Input Data

A demo data.frame is provided to show how the input data should be structured in order to run the scoring functions:

```{r}
data(TestDatInput)
colnames(TestDatInput)
# [1] "idRun"     "idBait"    "idPrey"    "countPrey" "lenPrey"
```

| idRun | idBait | idPrey | countPrey | lenPrey |
|-------|:------:|:------:|:---------:|---------|
| Unique ID of one AP-MS run | Bait ID | Prey ID | Prey peptide count | Protein sequence length of the prey |

In the case of duplicates, a suffix or prefix such as "A" or "B" can be added to idRun so that each "idRun-idBait" combination is unique to one replicate.
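As an illustration of the layout described above, the following sketch builds a small input data.frame by hand and appends a replicate label to idRun. All IDs, counts and lengths are made-up values, and `replicate` is a hypothetical helper column that is not part of the package's input format.

```r
# Hypothetical toy data: two replicate purifications of the same bait.
datInput <- data.frame(
  idRun     = c("run1", "run1", "run1", "run1"),
  replicate = c("A", "A", "B", "B"),   # helper column, assumed for this example
  idBait    = c("BaitX", "BaitX", "BaitX", "BaitX"),
  idPrey    = c("Prey1", "Prey2", "Prey1", "Prey3"),
  countPrey = c(12, 3, 10, 1),
  lenPrey   = c(450, 120, 450, 800),
  stringsAsFactors = FALSE
)

# Append the replicate label so that each replicate gets its own idRun.
datInput$idRun <- paste0(datInput$idRun, datInput$replicate)
datInput$replicate <- NULL

unique(datInput$idRun)
```

After this step the data.frame has exactly the five columns listed in the table, and the two replicates can be told apart by their idRun.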

Run scoring

1. CompPASS

Comparative Proteomic Analysis Software Suite (CompPASS) is based on the spoke model. The algorithm was developed by Dr. Mathew Sowa for defining the human deubiquitinating enzyme interaction landscape (Sowa, Mathew E., et al., 2009). The implementation here was inspired by Dr. Sowa's online tutorial. The output includes the Z-score, S-score, D-score and WD-score. In the BioPlex 1.0 (Huttlin, Edward L., et al., 2015) and BioPlex 2.0 (Huttlin, Edward L., et al., 2017) implementations, the CompPASS pipeline also included a naive Bayes classifier that learns to distinguish true interacting proteins from non-specific background and false positive identifications. This function was optimized from the source code.

The input data.frame, datInput, should include: idRun, idBait, idPrey and countPrey.

```{r}
datScore <- CompPASS(datInput)
```
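To give a feel for two of the scores named above, here is a rough base-R sketch on a made-up bait-by-prey count matrix. It assumes the commonly described forms (a per-prey Z-score across baits, and an S-score of the form sqrt(k / f * count), with k the number of baits and f the number of baits in which the prey was detected). It is not the package's implementation: replicate handling and the D- and WD-scores are omitted, and the exact definitions used by `CompPASS()` may differ.

```r
# Toy bait-by-prey count matrix (made-up numbers); 0 means "not detected".
counts <- matrix(
  c(10, 0, 2,
     0, 0, 3,
     0, 8, 2),
  nrow = 3, byrow = TRUE,
  dimnames = list(c("BaitA", "BaitB", "BaitC"), c("Prey1", "Prey2", "Prey3"))
)

# Z-score (assumed form): standardise each prey (column) across all baits.
zScore <- scale(counts, center = TRUE, scale = TRUE)

# S-score (assumed form): sqrt(k / f * count), where k is the number of baits
# and f is the number of baits in which the prey was detected.
k <- nrow(counts)
f <- colSums(counts > 0)
sScore <- sqrt(sweep(counts, 2, k / f, `*`))

round(sScore, 2)
```

In this toy matrix Prey3 shows up with every bait, so k / f is 1 and its S-scores stay small, while Prey1 is seen with a single bait and is up-weighted.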

2. DICE

The Dice coefficient is used to score the interaction between each pair-wise combination of preys, as proposed by Bing Zhang et al. (2008).

The input data.frame, datInput, should include: idRun and idPrey.

```{r}
datScore <- DICE(datInput)
```
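For readers unfamiliar with the Dice coefficient, the sketch below computes it by hand for every prey pair in a toy data.frame, using the standard definition 2 * |A ∩ B| / (|A| + |B|), where A and B are the sets of runs in which each prey was observed. The IDs are made up, `diceCoef` is a hypothetical helper, and the output columns do not necessarily match what `DICE()` returns.

```r
# Toy input with only the two columns DICE needs (made-up IDs).
datInput <- data.frame(
  idRun  = c("r1", "r1", "r2", "r2", "r3", "r3"),
  idPrey = c("P1", "P2", "P1", "P2", "P1", "P3"),
  stringsAsFactors = FALSE
)

# Set of runs in which each prey was observed.
runsByPrey <- lapply(split(datInput$idRun, datInput$idPrey), unique)

# Dice coefficient of two run sets (hypothetical helper).
diceCoef <- function(a, b) {
  2 * length(intersect(a, b)) / (length(a) + length(b))
}

# Score every unordered prey pair.
preyPairs <- t(combn(names(runsByPrey), 2))
datDice <- data.frame(
  preyA = preyPairs[, 1],
  preyB = preyPairs[, 2],
  dice  = mapply(function(x, y) diceCoef(runsByPrey[[x]], runsByPrey[[y]]),
                 preyPairs[, 1], preyPairs[, 2], USE.NAMES = FALSE),
  stringsAsFactors = FALSE
)
datDice
```

P1 and P2 share two of their runs and score 0.8, while P2 and P3 never co-occur and score 0.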

3. Hart

The Hart scoring algorithm is based on a hypergeometric distribution error model (Hart et al., 2007).

The input data.frame, datInput, should include: idRun and idPrey.

```{r}
datScore <- Hart(datInput)
```
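The idea behind a hypergeometric error model can be sketched with base R's `phyper()`. The example assumes a simplified setting in which the evidence for a prey pair is the number of runs where both preys are seen together; the numbers are made up, and the exact parameterisation used by `Hart()` may differ.

```r
# Made-up numbers: 100 runs in total; prey A seen in 10 of them, prey B in 8,
# and both seen together in 5 runs.
nRuns <- 100
nA    <- 10
nB    <- 8
nBoth <- 5

# P(X >= nBoth), where X counts how many of the nB runs containing B also
# contain A if those runs were drawn at random from all runs.
pValue <- phyper(nBoth - 1, m = nA, n = nRuns - nA, k = nB, lower.tail = FALSE)

# Reported on the -log scale so that a larger score means stronger evidence.
score <- -log(pValue)
score
```

By chance one would expect fewer than one shared run here, so five shared runs give a very small p-value and a correspondingly high score.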

4. HGScore

The HGScore algorithm is based on a hypergeometric distribution error model (Hart et al., 2007) and incorporates the normalized spectral abundance factor, NSAF (Zybailov, Boris, et al., 2006). It was first introduced to predict the protein complex network of Drosophila melanogaster (Guruharsha, K. G., et al., 2011). This scoring algorithm is based on the matrix model.

The input data.frame, datInput, should include: idRun, idPrey, countPrey and lenPrey.

```{r}
datScore <- HG(datInput)
```
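The NSAF step explains why this function needs lenPrey. A minimal sketch of the standard NSAF definition (count divided by length, then normalised so that the values sum to 1 within each run) is shown below on made-up data. How `HG()` feeds NSAF into the hypergeometric model is not reproduced here.

```r
# Toy input (made-up values) with the four columns HG needs.
datInput <- data.frame(
  idRun     = c("r1", "r1", "r1", "r2", "r2"),
  idPrey    = c("P1", "P2", "P3", "P1", "P4"),
  countPrey = c(10, 20, 30, 5, 5),
  lenPrey   = c(100, 200, 600, 50, 100),
  stringsAsFactors = FALSE
)

# Spectral abundance factor: peptide count per unit of protein length.
saf <- datInput$countPrey / datInput$lenPrey

# Normalise within each run so that the NSAF values of a run sum to 1.
datInput$NSAF <- saf / ave(saf, datInput$idRun, FUN = sum)
datInput
```

Note that P3 has the highest raw count in run r1 but the lowest NSAF, because it is also the longest protein.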

5. PE

PE incorporates both the spoke and matrix models, as reported in (Sean R. Collins, et al., 2007).

The input data.frame, datInput, should include: idRun, idBait and idPrey.

```{r}
datScore <- PE(datInput)
```
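The difference between the spoke and matrix models mentioned in this README can be illustrated by listing the candidate pairs each one considers for a single run. This sketch uses made-up IDs and treats the matrix model as prey-prey pairs only, which is an assumption for the example; it does not compute PE scores.

```r
# Toy run (made-up IDs): one bait pulling down three preys.
datInput <- data.frame(
  idRun  = c("r1", "r1", "r1"),
  idBait = c("B1", "B1", "B1"),
  idPrey = c("P1", "P2", "P3"),
  stringsAsFactors = FALSE
)

# Spoke model: only bait-prey pairs are candidate interactions.
spokePairs <- unique(datInput[, c("idBait", "idPrey")])

# Matrix model (prey-prey pairs assumed here): every pair of preys found in
# the same run is a candidate interaction.
matrixPairs <- do.call(rbind, lapply(
  split(datInput$idPrey, datInput$idRun),
  function(preys) {
    preys <- sort(unique(preys))
    if (length(preys) < 2) return(NULL)
    pairs <- t(combn(preys, 2))
    data.frame(preyA = pairs[, 1], preyB = pairs[, 2],
               stringsAsFactors = FALSE)
  }
))

spokePairs
matrixPairs
```

For this run the spoke model yields B1-P1, B1-P2 and B1-P3, whereas the matrix model yields P1-P2, P1-P3 and P2-P3.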

License

MIT @ Qingzhou Zhang

Owner

  • Name: (WIP DEV) Bioconductor Packages
  • Login: bioconductor-source
  • Kind: organization
  • Email: maintainer@bioconductor.org

Source code for packages accepted into Bioconductor

Dependencies

DESCRIPTION cran
  • R >= 3.6.0 depends
  • RcppAlgos * depends
  • Rcpp >= 1.0.0 imports
  • dplyr * imports
  • magrittr >= 1.5 imports
  • stats * imports
  • tidyr * imports
  • utils * imports
  • BiocStyle * suggests
  • knitr * suggests
  • rmarkdown * suggests
  • testthat * suggests