Science Score: 57.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
    Found 2 DOI reference(s) in README
  • Academic publication links
  • Committers with academic emails
    3 of 5 committers (60.0%) from academic institutions
  • Institutional organization owner
    Organization sydneybiox has institutional domain (www.sydney.edu.au)
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (9.3%) to scientific vocabulary
Last synced: 6 months ago · JSON representation

Repository

Basic Info
Statistics
  • Stars: 13
  • Watchers: 8
  • Forks: 2
  • Open Issues: 9
  • Releases: 0
Created about 4 years ago · Last pushed 9 months ago
Metadata Files
Readme

README.md

scFeatures: Multi-view representations of single-cell and spatial data for disease outcome prediction

scFeatures is a tool that generates multi-view representations of single-cell and spatial data through the construction of a total of 17 feature types belonging to the following six categories.

  1. cell type proportions
  2. cell type specific gene expressions
  3. cell type specific pathway expressions
  4. cell type specific cell-cell interaction (CCI) scores
  5. overall aggregated gene expressions
  6. spatial metrics

Overview

Installation

The latest scFeatures can be installed using devtools:

library(devtools) devtools::install_github("SydneyBioX/scFeatures")

Quick start

scFeatures can be run using one line of code scfeatures_result <- scFeatures(data = data, sample = sample, celltype = celltype) which generates a list of dataframes containing all feature types in the form of samples x features.

Currently, scFeatures support scRNA-seq, spatial proteomics and spatial transcriptomics.

For scRNA-seq, run:

``` data("examplescrnaseq" , package = "scFeatures") data <- examplescrnaseq

scfeatures_result <- scFeatures(data = data@assays$RNA@data, sample = data$sample, celltype = data$celltype, type = "scrna",
ncores = 8,
species = "Homo sapiens") ```

For spatial proteomics, run:

```

note, spatial data requires spatial coordinates of each cell.

spatialCoords <- list( sample( 1:ncol(data), ncol(data)) , sample( 1:ncol(data), ncol(data) )) # generate fake coordinates

scfeaturesresult <- scFeatures(data = data@assays$RNA@data, sample = data$sample, celltype = data$celltype, type = "spatialp",
spatialCoords = spatialCoords, ncores = 8,
species = "Homo sapiens")

```

For spatial transcriptomics, run:

```

note, spatial data requires spatial coordinates of each cell.

spatialCoords <- list( sample( 1:ncol(data), ncol(data)) , sample( 1:ncol(data), ncol(data) ))

as well as predicted probability of cell types in each spot

spotProbability <- t(gtools::rdirichlet( ncol(data), rep(1, 5))) # simulate the cell type prediction result based on 5 cell types rownames( spotProbability) <- c("Cell type A", "Cell type B" , "Cell type C", "Cell type D", "Cell type E") colnames( spotProbability ) <- colnames(data)

scfeaturesresult <- scFeatures(data = data@assays$RNA@data, sample = data$sample, celltype = data$celltype, type = "spatialt",
spatialCoords = spatialCoords, spotProbability = spotProbability, ncores = 8,
species = "Homo sapiens")

```

Detailed vignette

Please see https://sydneybiox.github.io/scFeatures/articles/scFeatures_overview.html.

Reference

Cao, Y., Lin, Y., Patrick, E., Yang, P., & Yang, J. Y. H. (2022). scFeatures: multi-view representations of single-cell and spatial data for disease outcome prediction. In O. Vitek (Ed.), Bioinformatics (Vol. 38, Issue 20, pp. 4745–4753). Oxford University Press (OUP). https://doi.org/10.1093/bioinformatics/btac590

Owner

  • Name: Sydney Precision Data Science Centre
  • Login: SydneyBioX
  • Kind: organization
  • Location: Sydney, Australia

SPDSC alliance brings together multiple research groups and junior and senior researchers with shared interests in bioinformatics and computational sciences.

GitHub Events

Total
  • Issues event: 1
  • Watch event: 2
  • Push event: 4
Last Year
  • Issues event: 1
  • Watch event: 2
  • Push event: 4

Committers

Last synced: over 2 years ago

All Time
  • Total Commits: 442
  • Total Committers: 5
  • Avg Commits per committer: 88.4
  • Development Distribution Score (DDS): 0.509
Past Year
  • Commits: 276
  • Committers: 4
  • Avg Commits per committer: 69.0
  • Development Distribution Score (DDS): 0.409
Top Committers
Name Email Commits
Nicholas Robertson n****r@g****m 217
Yue Cao y****o@s****u 163
Yue Cao y****u@g****m 50
Ellis Patrick e****k@s****u 11
Yue Cao y****c@m****u 1
Committer Domains (Top 20 + Academic)

Packages

  • Total packages: 1
  • Total downloads:
    • bioconductor 5,248 total
  • Total dependent packages: 0
  • Total dependent repositories: 0
  • Total versions: 6
  • Total maintainers: 1
bioconductor.org: scFeatures

scFeatures: Multi-view representations of single-cell and spatial data for disease outcome prediction

  • Versions: 6
  • Dependent Packages: 0
  • Dependent Repositories: 0
  • Downloads: 5,248 Total
Rankings
Dependent repos count: 0.0%
Dependent packages count: 0.0%
Average: 32.7%
Downloads: 98.0%
Maintainers (1)
Last synced: 7 months ago

Dependencies

DESCRIPTION cran
  • ClassifyR * imports
  • DelayedArray * imports
  • DelayedMatrixStats * imports
  • EnsDb.Hsapiens.v79 * imports
  • EnsDb.Mmusculus.v79 * imports
  • GSVA * imports
  • Seurat * imports
  • ape * imports
  • dplyr * imports
  • ensembldb * imports
  • gtools * imports
  • msigdbr * imports
  • parallel * imports
  • plyr * imports
  • proxyC * imports
  • reshape2 * imports
  • spatstat.core * imports
  • spatstat.geom * imports
  • tidyr * imports
  • knitr * suggests
  • rmarkdown * suggests