BreastSubtypeR
Science Score: 39.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
○CITATION.cff file
-
✓codemeta.json file
Found codemeta.json file -
✓.zenodo.json file
Found .zenodo.json file -
✓DOI references
Found 20 DOI reference(s) in README -
○Academic publication links
-
○Academic email domains
-
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (12.9%) to scientific vocabulary
Repository
Basic Info
- Host: GitHub
- Owner: yqkiuo
- Language: R
- Default Branch: main
- Size: 30.7 MB
Statistics
- Stars: 2
- Watchers: 1
- Forks: 2
- Open Issues: 0
- Releases: 0
Metadata Files
README.md
BreastSubtypeR 
BreastSubtypeR — assumption-aware, multi-method intrinsic molecular subtyping for breast cancer (R / Bioconductor)
Authors: Qiao Yang, Emmanouil G. Sifakis\ Affiliation: Department of Oncology-Pathology, Karolinska Institutet (Stockholm, Sweden)\ Paper: Yang Q., Hartman J., Sifakis E.G. BreastSubtypeR: A Unified R/Bioconductor Package for Intrinsic Molecular Subtyping in Breast Cancer Research. NAR Genomics and Bioinformatics (2025). Editor’s Choice.
Overview
BreastSubtypeR consolidates established gene-expression–based intrinsic subtyping methods into a single, reproducible R/Bioconductor package and provides a local Shiny app (iBreastSubtypeR) for users without programming experience.
Key goals: - reduce method misapplication across heterogeneous research cohorts, - enable direct cross-method benchmarking and exploration of discordant calls, - provide method-specific preprocessing and robust probe-to-gene mapping, - support privacy-preserving local analyses via a Shiny GUI.
Main features
- Comprehensive Intrinsic Subtyping: Integrates multiple published intrinsic subtyping algorithms (NC- and SSP-based), including PAM50 variants, AIMS, ssBC, sspbc, and others.
- Unified Multi-Method Interface (
BS_Multi): Run many classifiers from one consistent API and compare results side-by-side. - AUTO Mode (cohort-aware selection): Evaluates cohort diagnostics (e.g., receptor-status distribution, subtype purity, subgroup sizes) and programmatically disables classifiers whose assumptions are likely violated—reducing misclassification in skewed or small cohorts.
- Standardised Input & Method-Specific Normalisation: Supports raw RNA-seq counts, precomputed FPKM, and log₂-normalised microarray/nCounter matrices with automated, method-appropriate transformations.
- Optimised Probe/Gene Mapping: Entrez ID–based mapping and conflict resolution to maximise marker coverage across platforms.
- Interactive Shiny App (
iBreastSubtypeR): Local GUI that replicates core workflows for non-programmers and preserves data privacy. - Bioconductor distribution & reproducibility: Unit tests, vignettes and SummarizedExperiment compatibility to support reproducible deployment.
Methods included (single-method implementations)
| Method id | Short description | Group | Reference |
|------------------|-------------------|------------------|------------------|
| parker.original | Original PAM50 by Parker et al., 2009 | NC-based | Parker et al., 2009 |
| genefu.scale | PAM50 implementation as in the genefu R package (scaled version) | NC-based | Gendoo et al., 2016 |
| genefu.robust | PAM50 implementation as in the genefu R package (robust version) | NC-based | Gendoo et al., 2016 |
| cIHC | Conventional ER-balancing using immunohistochemistry (IHC) | NC-based | Ciriello et al., 2015 |
| cIHC.itr | Iterative version of cIHC | NC-based | Curtis et al., 2012 |
| PCAPAM50 | Selects IHC-defined ER subsets, then uses Principal component analysis (PCA) to create ESR1 expression-based ER-balancing | NC-based | Raj-Kumar et al., 2019 |
| ssBC | Subgroup-specific gene-centering PAM50 | NC-based | Zhao et al., 2015 |
| ssBC.v2 | Updated subgroup-specific gene-centering PAM50 with refined quantiles | NC-based | Fernandez-Martinez et al., 2020 |
| AIMS | Absolute Intrinsic Molecular Subtyping (AIMS) method | SSP-based | Paquet & Hallett, 2015 |
| sspbc | Single-Sample Predictors for Breast Cancer (AIMS adaptation) | SSP-based | Staaf et al., 2022 |
(See the package vignette for implementation details.)
Installation
Install the released version from Bioconductor:
``` r if (!require("BiocManager", quietly = TRUE)) install.packages("BiocManager")
BiocManager::install("BreastSubtypeR") ```
Or install the development version from GitHub:
``` r
Install devtools package if you haven't already
install.packages("devtools")
Install BreastSubtypeR from GitHub
devtools::install_github("yqkiuo/BreastSubtypeR") ```
Quick start examples
Note: this README uses example datasets included in the package. Replace BreastSubtypeRobj / OSLO2EMIT0obj with your own SummarizedExperiment object and clinical metadata..
1) Preprocessing & mapping
``` r library(BreastSubtypeR)
Example data shipped with the package
data("BreastSubtypeRobj") # package-specific object data("OSLO2EMIT0obj")
Map probes/ids to Entrez
datainput <- Mapping( OSLO2EMIT0obj$seobj, RawCounts = FALSE, method = "max", impute = TRUE, verbose = FALSE ) ```
2) Multi-method subtyping (user-defined methods)
``` r methods <- c("parker.original", "PCAPAM50", "sspbc") result <- BSMulti( datainput = data_input, methods = methods, Subtype = FALSE, hasClinical = FALSE )
View per-sample subtype calls (methods x samples)
head(result$ressubtypes[, 1:min(5, ncol(result$ressubtypes))], 5) ```
3) AUTO mode (cohort-aware selection)
``` r resultauto <- BSMulti( datainput = datainput, methods = "AUTO", Subtype = FALSE, hasClinical = FALSE )
Visualize subtype calls and inter-method concordance
VisMulti(resultauto$res_subtypes) ```
4) Launch the local Shiny app
``` r library(BreastSubtypeR) library(tidyverse) library(shiny) library(bslib)
iBreastSubtypeR() # interactive GUI (local) ```
Vignette & Documentation
A comprehensive usage guide is included as a vignette with the package—browse it through your R help system or find the rendered version in your documentation files.
For specific functions (like BS_Multi, Mapping, or iBreastSubtypeR), see their help pages (e.g., ?BS_Multi). The package manual also lists accepted input formats and parameter descriptions for ease of reference.
Contributing & issues
Contributions and issue reports are welcome. Please open issues or pull requests on the GitHub repository: https://github.com/yqkiuo/BreastSubtypeR/issues.
Citation
When using BreastSubtypeR in publications, please cite the package and the paper:
Yang Q., Hartman J., Sifakis E. G. (2025) BreastSubtypeR: A Unified R/Bioconductor Package for Intrinsic Molecular Subtyping in Breast Cancer Research. NAR Genomics and Bioinformatics. (Editor’s Choice).
You can also use citation("BreastSubtypeR") after installing the package to retrieve the canonical citation(s).
License
This project is released under the GPL-3 license.
Owner
- Name: Qiao Yang
- Login: yqkiuo
- Kind: user
- Location: Stockholm, Sweden
- Company: Karolinska Institutet
- Repositories: 1
- Profile: https://github.com/yqkiuo
Bioinformatic PhD student at Karolinska Institutet; Bioinformatic MSc in Sichuan University; BENG in Hangzhou Dianzi University
GitHub Events
Total
- Issues event: 1
- Watch event: 1
- Delete event: 10
- Issue comment event: 5
- Public event: 1
- Push event: 149
- Pull request event: 93
- Fork event: 1
- Create event: 9
Last Year
- Issues event: 1
- Watch event: 1
- Delete event: 10
- Issue comment event: 5
- Public event: 1
- Push event: 150
- Pull request event: 93
- Fork event: 1
- Create event: 9
Issues and Pull Requests
Last synced: 10 months ago
All Time
- Total issues: 1
- Total pull requests: 119
- Average time to close issues: N/A
- Average time to close pull requests: about 17 hours
- Total issue authors: 1
- Total pull request authors: 2
- Average comments per issue: 0.0
- Average comments per pull request: 0.06
- Merged pull requests: 108
- Bot issues: 0
- Bot pull requests: 0
Past Year
- Issues: 1
- Pull requests: 119
- Average time to close issues: N/A
- Average time to close pull requests: about 17 hours
- Issue authors: 1
- Pull request authors: 2
- Average comments per issue: 0.0
- Average comments per pull request: 0.06
- Merged pull requests: 108
- Bot issues: 0
- Bot pull requests: 0
Top Authors
Issue Authors
- guodudou2 (1)
Pull Request Authors
- yqkiuo (115)
- sifakise (4)
Top Labels
Issue Labels
Pull Request Labels
Packages
- Total packages: 1
- Total downloads: unknown
- Total dependent packages: 0
- Total dependent repositories: 0
- Total versions: 1
- Total maintainers: 1
bioconductor.org: BreastSubtypeR
Methods for breast cancer intrinsic subtyping
- Homepage: https://github.com/yqkiuo/BreastSubtypeR
- Documentation: https://bioconductor.org/packages/release/bioc/vignettes/BreastSubtypeR/inst/doc/BreastSubtypeR.pdf
- License: GPL (>= 3)
-
Latest release: 1.0.0
published about 1 year ago
Rankings
Maintainers (1)
Dependencies
- R >= 4.4.0 depends
- ComplexHeatmap >= 2.22.0 imports
- RColorBrewer >= 1.1 imports
- bslib >= 0.8.0 imports
- circlize >= 0.4.16 imports
- data.table >= 1.16.0 imports
- dplyr >= 1.1.0 imports
- e1071 >= 1.7 imports
- ggplot2 >= 3.5.1 imports
- ggrepel >= 0.9.6 imports
- impute >= 1.80.0 imports
- magrittr >= 2.0.3 imports
- methods * imports
- rlang >= 1.1.4 imports
- shiny >= 1.9.1 imports
- stringr >= 1.5.1 imports
- withr * imports
- knitr * suggests
- rmarkdown * suggests