PRONE

R Package for preprocessing, normalizing, and analyzing proteomics data

https://github.com/daisybio/prone

Science Score: 26.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
  • Academic publication links
  • Academic email domains
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (16.2%) to scientific vocabulary

Keywords

data-analysis evaluation normalization proteomics
Last synced: 6 months ago · JSON representation

Repository

R Package for preprocessing, normalizing, and analyzing proteomics data

Basic Info
Statistics
  • Stars: 5
  • Watchers: 1
  • Forks: 2
  • Open Issues: 0
  • Releases: 0
Topics
data-analysis evaluation normalization proteomics
Created almost 2 years ago · Last pushed 9 months ago
Metadata Files
Readme Changelog License

README.Rmd

---
output: github_document
---



```{r, include = FALSE}
knitr::opts_chunk$set(
  collapse = TRUE,
  comment = "#>",
  fig.path = "man/figures/",
  out.width = "100%"
)
```

# PRONE - The PROteomics Normalization Evaluator 


R Package for preprocessing, normalizing, and analyzing proteomics data

## Introduction

High-throughput omics data are often affected by systematic biases introduced throughout all the steps of a clinical study, from sample collection to quantification.
Failure to account for these biases can lead to erroneous results and misleading conclusions in downstream analysis.
Normalization methods aim to adjust for these biases to make the actual biological signal more prominent.
However, selecting an appropriate normalization method is challenging due to the wide range of available approaches.
Therefore, a comparative evaluation of unnormalized and normalized data is essential in identifying an appropriate normalization strategy for a specific data set.
This R package provides different functions for preprocessing, normalizing, and evaluating different normalization approaches.
Furthermore, normalization methods can be evaluated on downstream steps, such as differential expression analysis and statistical enrichment analysis.
Spike-in data sets with known ground truth and real-world data sets of biological experiments acquired by either tandem mass tag (TMT) or label-free quantification (LFQ) can be analyzed.

## Installation

To install the package, run:

```{r, eval = FALSE}
  # Official BioC installation instructions
  if (!require("BiocManager", quietly = TRUE))
    install.packages("BiocManager")

  BiocManager::install("PRONE")
  
  # Load and attach PRONE 
  library("PRONE")
```

If you have troubles downloading PRONE from Bioconductor, you still have the option to install PRONE from GitHub. However, the Bioconductor download is recommended!

```{r, eval = FALSE}
  # Install PRONE.R from github and build vignettes
  if (!requireNamespace("devtools", quietly = TRUE)){
    install.packages("devtools")
  } 
  devtools::install_github("daisybio/PRONE", build_vignettes = TRUE, 
                           dependencies = TRUE)
  
  # Load and attach PRONE 
  library("PRONE")

```


## Workflow

A six-step workflow was developed in R version 4.2.2 to evaluate the effectiveness of the previously defined normalization methods on proteomics data.
The workflow incorporates a set of novel functions and also integrates various methods adopted by state-of-the-art tools.



Following the upload of the proteomics data into a SummarizedExperiment object, proteins with too many missing values can be removed, outlier samples identified, and normalization carried out.
Furthermore, an exploratory analysis of the performance of normalization methods can be conducted.
Finally, differential expression analysis can be executed to further evaluate the effectiveness of normalization methods.
For data sets with known ground truth, such as spike-in and simulated data sets, performance metrics, such as true positives (TPs), false positives (FPs), and area under the curve (AUC) values, can be computed.
The evaluation of DE results of real-world experiments is based on visual quality inspection, for instance, using volcano plots, and an intersection analysis of the DE proteins of different normalization methods is available.

## Usage

To get familiar with the functionalities of the R package, check out the article [Getting started with PRONE](https://daisybio.github.io/PRONE/articles/PRONE.html).


## Citation

TODO

Owner

  • Name: Data Science in Systems Biology
  • Login: daisybio
  • Kind: organization
  • Email: markus.list@tum.de
  • Location: Freising, Germany

The research group Data Science in Systems Biology at the School of Life Sciences, Technical University of Munich

GitHub Events

Total
  • Watch event: 3
  • Delete event: 1
  • Push event: 35
  • Pull request event: 6
  • Create event: 3
Last Year
  • Watch event: 3
  • Delete event: 1
  • Push event: 35
  • Pull request event: 6
  • Create event: 3

Issues and Pull Requests

Last synced: 6 months ago

All Time
  • Total issues: 0
  • Total pull requests: 8
  • Average time to close issues: N/A
  • Average time to close pull requests: about 1 hour
  • Total issue authors: 0
  • Total pull request authors: 2
  • Average comments per issue: 0
  • Average comments per pull request: 0.0
  • Merged pull requests: 8
  • Bot issues: 0
  • Bot pull requests: 0
Past Year
  • Issues: 0
  • Pull requests: 8
  • Average time to close issues: N/A
  • Average time to close pull requests: about 1 hour
  • Issue authors: 0
  • Pull request authors: 2
  • Average comments per issue: 0
  • Average comments per pull request: 0.0
  • Merged pull requests: 8
  • Bot issues: 0
  • Bot pull requests: 0
Top Authors
Issue Authors
Pull Request Authors
  • lisiarend (10)
Top Labels
Issue Labels
Pull Request Labels

Packages

  • Total packages: 1
  • Total downloads:
    • bioconductor 2,733 total
  • Total dependent packages: 0
  • Total dependent repositories: 0
  • Total versions: 5
  • Total maintainers: 1
bioconductor.org: PRONE

The PROteomics Normalization Evaluator

  • Versions: 5
  • Dependent Packages: 0
  • Dependent Repositories: 0
  • Downloads: 2,733 Total
Rankings
Dependent repos count: 0.0%
Dependent packages count: 31.5%
Average: 42.4%
Downloads: 95.6%
Maintainers (1)
Last synced: 6 months ago