SurfR
An Rpackage to identify cells membrane marker genes from bulkRNA sequencing data
Science Score: 26.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
○CITATION.cff file
-
✓codemeta.json file
Found codemeta.json file -
○.zenodo.json file
-
✓DOI references
Found 1 DOI reference(s) in README -
○Academic publication links
-
○Committers with academic emails
-
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (13.0%) to scientific vocabulary
Keywords
Repository
An Rpackage to identify cells membrane marker genes from bulkRNA sequencing data
Basic Info
Statistics
- Stars: 4
- Watchers: 1
- Forks: 0
- Open Issues: 0
- Releases: 0
Topics
Metadata Files
README.md
SurfR
Proteins at the cell surface connect intracellular and extracellular signaling networks and largely determine a cell’s capacity to communicate and interact with its environment.
Importantly, variations in transcriptomic profiles are often observed between healthy and diseased cells, presenting distinct sets of cell-surface proteins. Indeed, cell surface proteins i) may act as biomarkers for the detection of diseased cells in tissues or body fluids and ii) are the most prevalent target of pharmaceutical agents: 66% of approved human drugs listed in the DrugBank database target a cell-surface protein. The investigation of the cell surfaceome therefore could provide new possibilities for diagnosis, prognosis, treatment development, and therapy response evaluation.
What is SurfR
The SurfR package aims to provide a streamlined end-to-end workflow for identifying surface protein coding genes from expression data using computational prediction.
SurfR :
- Returns a list of of surface protein coding genes, starting from a list of genes of interest, the raw count matrix of your own RNA-seq experiment, or from bulk transcriptomic data automatically retrieved from public databases. Protein classification is based on a recently developed surfaceome predictor, called SURFY, based on machine learning.
- Allows automatic data retrieval from public databases such as GEO and TCGA. GEO queries are based on the ArchS4 pipeline. TCGA repository is interrogated through TCGAbiolinks.
- Provides a function for differential gene expression analysis. For this task it relies on DESeq2 package, starting from counts data.
- Offers the opportunity to increase the sample size of a cohort by integrating related datasets, therefore enhancing the power to detect differentially expressed genes of interest. Meta-analysis can be performed through metaRNASeq, taking into account inter-study variability that may arise from technical differences among studies (e.g., sample preparation, library protocols, batch effects) as well as additional biological variability.
- Gene ontology (GO) and pathway annotation can also be performed within SurfR to gain further insights about surface protein candidates.
- Includes functions to visualize DEG and enrichment results, including BarPlots, Histograms, Venn diagrams, and PCA plots to help users achieve efficient data interpretation.
Installation
To install this package, start R (version "4.4") and enter:
```{r install, eval = FALSE}
if (!require("BiocManager", quietly = TRUE)) install.packages("BiocManager") ```
When the package is available on Bioconductor, use
{r install-Bioconductor, eval = FALSE}
BiocManager::install("SurfR")
Development package version can be installed from GitHub using devtools:
devtools::install_github("auroramaurizio/SurfR")
Dependencies
This package is supported for macOS, and Linux (Windows not tested). SurfR works with R v4.4 or greater. Dependencies are indicated in the DESCRIPTION file, and can be automatically installed when installing the SurfR pacakge.
Vignettes
A comprehensive vignette provides an introduction to the SurfR package. Examples and use-cases are covered for each function. Additional RMD notebooks containing the use cases code described in the manuscript are available on GitHub: https://github.com/auroramaurizio/SurfR_UseCases.
Documentation
Instructions to run the main functions can be found consulting the vignette or by entering ?FunctionName (e.g. ?Splot) in the console after loading the package.
Citation
Maurizio, A., Tascini, A.S., Morelli, M. SurfR: Riding the wave of RNA-Seq data with a comprehensive Bioconductor package to identify Surface Protein Coding Genes. Bioinformatics Advances, 2024 (DOI: 10.1093/bioadv/vbae201)
Authors
Aurora Maurizio (auroramaurizio1@gmail.com), Anna Sofia Tascini (volpesofi@gmail.com), Marco Morelli (morelli.marco@hsr.it)
Help, Suggestions, and Contributions
Any contribution is highly appreciated! If you are interested in contributing to this project, please open an issue.
Owner
- Name: Aurora Maurizio
- Login: auroramaurizio
- Kind: user
- Location: Milan
- Company: COSR @ IRCCS Ospedale San Raffaele
- Website: https://research.hsr.it/en/centers/omics-sciences/aurora-maurizio.html
- Twitter: aurora_maurizio
- Repositories: 2
- Profile: https://github.com/auroramaurizio
GitHub Events
Total
- Issues event: 2
- Watch event: 2
- Push event: 34
- Create event: 2
Last Year
- Issues event: 2
- Watch event: 2
- Push event: 34
- Create event: 2
Committers
Last synced: about 2 years ago
Top Committers
| Name | Commits | |
|---|---|---|
| maurizio.aurora@hsr.it | m****a@h****t | 136 |
| tascini.annasofia | t****a@u****t | 46 |
| tascini.annasofia | v****i@g****m | 5 |
| Aurora Maurizio | a****1@g****m | 2 |
| tascini.annasofia | t****a@h****t | 1 |
Issues and Pull Requests
Last synced: about 1 year ago
All Time
- Total issues: 1
- Total pull requests: 0
- Average time to close issues: about 17 hours
- Average time to close pull requests: N/A
- Total issue authors: 1
- Total pull request authors: 0
- Average comments per issue: 0.0
- Average comments per pull request: 0
- Merged pull requests: 0
- Bot issues: 0
- Bot pull requests: 0
Past Year
- Issues: 1
- Pull requests: 0
- Average time to close issues: about 17 hours
- Average time to close pull requests: N/A
- Issue authors: 1
- Pull request authors: 0
- Average comments per issue: 0.0
- Average comments per pull request: 0
- Merged pull requests: 0
- Bot issues: 0
- Bot pull requests: 0
Top Authors
Issue Authors
- auroramaurizio (1)
Pull Request Authors
Top Labels
Issue Labels
Pull Request Labels
Packages
- Total packages: 1
- Total downloads: unknown
- Total dependent packages: 0
- Total dependent repositories: 0
- Total versions: 8
- Total maintainers: 1
bioconductor.org: SurfR
Surface Protein Prediction and Identification
- Homepage: https://github.com/auroramaurizio/SurfR
- Documentation: https://bioconductor.org/packages/release/bioc/vignettes/SurfR/inst/doc/SurfR.pdf
- License: GPL-3 + file LICENSE
-
Latest release: 1.4.1
published 8 months ago
Rankings
Maintainers (1)
Dependencies
- BiocFileCache * imports
- DESeq2 * imports
- SPsimSeq * imports
- SummarizedExperiment * imports
- TCGAbiolinks * imports
- assertr * imports
- biomaRt * imports
- dplyr * imports
- edgeR * imports
- enrichR * imports
- ggplot2 * imports
- ggrepel * imports
- grDevices * imports
- graphics * imports
- gridExtra * imports
- httr * imports
- knitr * imports
- magrittr * imports
- metaRNASeq * imports
- openxlsx * imports
- rhdf5 * imports
- scales * imports
- stats * imports
- stringr * imports
- tidyr * imports
- utils * imports
- venn * imports
- testthat >= 3.0.0 suggests