SurfR

An Rpackage to identify cells membrane marker genes from bulkRNA sequencing data

https://github.com/auroramaurizio/surfr

Science Score: 26.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
  • DOI references
    Found 1 DOI reference(s) in README
  • Academic publication links
  • Committers with academic emails
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (13.0%) to scientific vocabulary

Keywords

dge enrichment-analysis metaanalysis plots proteins public-data rnaseq rpackage surface surfaceome
Last synced: 6 months ago · JSON representation

Repository

An Rpackage to identify cells membrane marker genes from bulkRNA sequencing data

Basic Info
  • Host: GitHub
  • Owner: auroramaurizio
  • License: agpl-3.0
  • Language: R
  • Default Branch: devel
  • Homepage:
  • Size: 6.61 MB
Statistics
  • Stars: 4
  • Watchers: 1
  • Forks: 0
  • Open Issues: 0
  • Releases: 0
Topics
dge enrichment-analysis metaanalysis plots proteins public-data rnaseq rpackage surface surfaceome
Created over 2 years ago · Last pushed 9 months ago
Metadata Files
Readme Changelog License

README.md

SurfR

Proteins at the cell surface connect intracellular and extracellular signaling networks and largely determine a cell’s capacity to communicate and interact with its environment.

Importantly, variations in transcriptomic profiles are often observed between healthy and diseased cells, presenting distinct sets of cell-surface proteins. Indeed, cell surface proteins i) may act as biomarkers for the detection of diseased cells in tissues or body fluids and ii) are the most prevalent target of pharmaceutical agents: 66% of approved human drugs listed in the DrugBank database target a cell-surface protein. The investigation of the cell surfaceome therefore could provide new possibilities for diagnosis, prognosis, treatment development, and therapy response evaluation.

What is SurfR

The SurfR package aims to provide a streamlined end-to-end workflow for identifying surface protein coding genes from expression data using computational prediction.

SurfR :

  • Returns a list of of surface protein coding genes, starting from a list of genes of interest, the raw count matrix of your own RNA-seq experiment, or from bulk transcriptomic data automatically retrieved from public databases. Protein classification is based on a recently developed surfaceome predictor, called SURFY, based on machine learning.
  • Allows automatic data retrieval from public databases such as GEO and TCGA. GEO queries are based on the ArchS4 pipeline. TCGA repository is interrogated through TCGAbiolinks.
  • Provides a function for differential gene expression analysis. For this task it relies on DESeq2 package, starting from counts data.
  • Offers the opportunity to increase the sample size of a cohort by integrating related datasets, therefore enhancing the power to detect differentially expressed genes of interest. Meta-analysis can be performed through metaRNASeq, taking into account inter-study variability that may arise from technical differences among studies (e.g., sample preparation, library protocols, batch effects) as well as additional biological variability.
  • Gene ontology (GO) and pathway annotation can also be performed within SurfR to gain further insights about surface protein candidates.
  • Includes functions to visualize DEG and enrichment results, including BarPlots, Histograms, Venn diagrams, and PCA plots to help users achieve efficient data interpretation.

Installation

To install this package, start R (version "4.4") and enter:

```{r install, eval = FALSE}

if (!require("BiocManager", quietly = TRUE)) install.packages("BiocManager") ```

When the package is available on Bioconductor, use

{r install-Bioconductor, eval = FALSE} BiocManager::install("SurfR")

Development package version can be installed from GitHub using devtools: devtools::install_github("auroramaurizio/SurfR")

Dependencies

This package is supported for macOS, and Linux (Windows not tested). SurfR works with R v4.4 or greater. Dependencies are indicated in the DESCRIPTION file, and can be automatically installed when installing the SurfR pacakge.

Vignettes

A comprehensive vignette provides an introduction to the SurfR package. Examples and use-cases are covered for each function. Additional RMD notebooks containing the use cases code described in the manuscript are available on GitHub: https://github.com/auroramaurizio/SurfR_UseCases.

Documentation

Instructions to run the main functions can be found consulting the vignette or by entering ?FunctionName (e.g. ?Splot) in the console after loading the package.

Citation

Maurizio, A., Tascini, A.S., Morelli, M. SurfR: Riding the wave of RNA-Seq data with a comprehensive Bioconductor package to identify Surface Protein Coding Genes. Bioinformatics Advances, 2024 (DOI: 10.1093/bioadv/vbae201)

Authors

Aurora Maurizio (auroramaurizio1@gmail.com), Anna Sofia Tascini (volpesofi@gmail.com), Marco Morelli (morelli.marco@hsr.it)

Help, Suggestions, and Contributions

Any contribution is highly appreciated! If you are interested in contributing to this project, please open an issue.

Owner

  • Name: Aurora Maurizio
  • Login: auroramaurizio
  • Kind: user
  • Location: Milan
  • Company: COSR @ IRCCS Ospedale San Raffaele

GitHub Events

Total
  • Issues event: 2
  • Watch event: 2
  • Push event: 34
  • Create event: 2
Last Year
  • Issues event: 2
  • Watch event: 2
  • Push event: 34
  • Create event: 2

Committers

Last synced: about 2 years ago

All Time
  • Total Commits: 190
  • Total Committers: 5
  • Avg Commits per committer: 38.0
  • Development Distribution Score (DDS): 0.284
Past Year
  • Commits: 140
  • Committers: 3
  • Avg Commits per committer: 46.667
  • Development Distribution Score (DDS): 0.25
Top Committers
Name Email Commits
maurizio.aurora@hsr.it m****a@h****t 136
tascini.annasofia t****a@u****t 46
tascini.annasofia v****i@g****m 5
Aurora Maurizio a****1@g****m 2
tascini.annasofia t****a@h****t 1
Committer Domains (Top 20 + Academic)

Issues and Pull Requests

Last synced: about 1 year ago

All Time
  • Total issues: 1
  • Total pull requests: 0
  • Average time to close issues: about 17 hours
  • Average time to close pull requests: N/A
  • Total issue authors: 1
  • Total pull request authors: 0
  • Average comments per issue: 0.0
  • Average comments per pull request: 0
  • Merged pull requests: 0
  • Bot issues: 0
  • Bot pull requests: 0
Past Year
  • Issues: 1
  • Pull requests: 0
  • Average time to close issues: about 17 hours
  • Average time to close pull requests: N/A
  • Issue authors: 1
  • Pull request authors: 0
  • Average comments per issue: 0.0
  • Average comments per pull request: 0
  • Merged pull requests: 0
  • Bot issues: 0
  • Bot pull requests: 0
Top Authors
Issue Authors
  • auroramaurizio (1)
Pull Request Authors
Top Labels
Issue Labels
Pull Request Labels

Packages

  • Total packages: 1
  • Total downloads: unknown
  • Total dependent packages: 0
  • Total dependent repositories: 0
  • Total versions: 8
  • Total maintainers: 1
bioconductor.org: SurfR

Surface Protein Prediction and Identification

  • Versions: 8
  • Dependent Packages: 0
  • Dependent Repositories: 0
Rankings
Dependent repos count: 0.0%
Dependent packages count: 31.3%
Average: 42.9%
Downloads: 97.4%
Maintainers (1)
Last synced: 6 months ago

Dependencies

DESCRIPTION cran
  • BiocFileCache * imports
  • DESeq2 * imports
  • SPsimSeq * imports
  • SummarizedExperiment * imports
  • TCGAbiolinks * imports
  • assertr * imports
  • biomaRt * imports
  • dplyr * imports
  • edgeR * imports
  • enrichR * imports
  • ggplot2 * imports
  • ggrepel * imports
  • grDevices * imports
  • graphics * imports
  • gridExtra * imports
  • httr * imports
  • knitr * imports
  • magrittr * imports
  • metaRNASeq * imports
  • openxlsx * imports
  • rhdf5 * imports
  • scales * imports
  • stats * imports
  • stringr * imports
  • tidyr * imports
  • utils * imports
  • venn * imports
  • testthat >= 3.0.0 suggests