https://github.com/bodenmillergroup/imcdatasets
ExperimentHub collection of imaging mass cytometry datasets
Science Score: 39.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
○CITATION.cff file
-
✓codemeta.json file
Found codemeta.json file -
✓.zenodo.json file
Found .zenodo.json file -
✓DOI references
Found 10 DOI reference(s) in README -
○Academic publication links
-
○Academic email domains
-
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (10.4%) to scientific vocabulary
Repository
ExperimentHub collection of imaging mass cytometry datasets
Basic Info
- Host: GitHub
- Owner: BodenmillerGroup
- License: gpl-3.0
- Language: R
- Default Branch: devel
- Homepage: https://bodenmillergroup.github.io/imcdatasets/
- Size: 6.91 MB
Statistics
- Stars: 7
- Watchers: 5
- Forks: 4
- Open Issues: 4
- Releases: 0
Metadata Files
README.md
imcdatasets
Documentation is available at: https://bodenmillergroup.github.io/imcdatasets/index.html
Introduction
The imcdatasets package is an extensible resource containing a set of publicly
available and curated Imaging Mass Cytometry datasets. Each dataset consists of
three data objects:
1. Single cell data in the form of a SingleCellExperiment or
SpatialExperiment class object.
2. Multichannel images formatted into a CytoImageList class object.
3. Cell segmentation masks formatted into a CytoImageList class object.
These formats facilitate accession and integration into R/Bioconductor workflows. The data objects are hosted on Bioconductor's ExperimentHub platform.
Installation
Release version
The release version
of imcdatasets requires R version >= 4.3 and Bioconductor version >= 3.18.
The current release of Bioconductor should be installed:
{r}
if (!requireNamespace("BiocManager", quietly = TRUE))
install.packages("BiocManager")
BiocManager::install(version = "3.18")
Then, imcdatasets can be installed from Bioconductor:
{r}
BiocManager::install("imcdatasets")
Development version
The development version
of imcdatasets requires R version >= 4.4 and Bioconductor version >= 3.19.
The development version of Bioconductor should be installed:
{r}
if (!requireNamespace("BiocManager", quietly = TRUE))
install.packages("BiocManager")
BiocManager::install(version='devel')
Then, imcdatasets can be installed from Bioconductor:
{r}
BiocManager::install("imcdatasets")
imcdatasets can also be installed from GitHub using devtools:
{r}
if (!requireNamespace("devtools", quietly = TRUE))
install.packages("devtools")
devtools::install_github("BodenmillerGroup/imcdatasets", build_vignettes = TRUE)
Dependencies
imcdatasets builds on data objects contained in the
SingleCellExperiment,
SpatialExperiment,
and cytomapper packages.
These packages can be installed as follows:
{r}
if (!requireNamespace("BiocManager", quietly = TRUE))
install.packages("BiocManager")
BiocManager::install(c("SingleCellExperiment", "SpatialExperiment", "cytomapper"))
Usage
To load imcdatasets in your R session, use:
{r}
library(imcdatasets)
Detailed information on how to access the datasets is available in the imcdatasets vignette.
The vignette can also be viewed directly in R:
{r}
vignette("imcdatasets")
Details
The imcdatasets package provides quick and easy access to published and
curated imaging mass cytometry datasets. Each dataset consists of three data
objects that can be retrieved individually:
Single cell data in the form of a
SingleCellExperimentor aSpatialExperimentclass object: This object contains cell-level expression values and metadata. TherowDataentry contain marker information while thecolDataentry contain cell-level metadata, including image names and cell numbers. Theassaysslots contain marker expression levels per cell: thecountsassay contains average ion counts per cell whereas the other assays contain counts transformations (details available in the documentation of each dataset).Multichannel images formatted into a
CytoImageListclass object. This object contains multichannel images and metadata, including channel names and image names.Cell segmentation masks formatted into a
CytoImageListclass object. This object contains single-channel images representing cell segmentation masks and metadata, including image names. The mask intensity values map to cell number values in theSingleCellExperimentobject so that single cell data can be associated to segmentation masks.
The three data objects can be mapped using the image names contained in the metadata of each object. Details are available in the vignette (see above).
For more information about the SingleCellExperiment, SpatialCellExperiment,
and CytoImageList objects, please refer to the
SingleCellExperiment,
SpatialExperiment,
and cytomapper packages,
respectively.
Available datasets
List of available datasets
- Damond2019Pancreas: Pancreas sections from organ donors with type 1
diabetes.
- Documentation: Damond2019Pancreas.
- Publication: Damond et al. Cell Metab (2019) 29(3):755-768.e5.
- Documentation: Damond2019Pancreas.
- HochSchulz2022Melanoma: Metastatic melanoma samples, including a panel
with co-detection of protein and RNA targets.
- Documentation: HochSchulz2022Melanoma.
- Publication: Hoch, Schulz et al. Sci Immunol (2022) 70(7):abk1692
- JacksonFischer2020BreastCancer: Tumour tissue from patients with breast
cancer.
- Documentation: JacksonFischer2020BreastCancer.
- Publication: Jackson, Fischer et al. Nature (2020) 578:615–620
- Zanotelli2020Spheroids: 3D spheroids generated from different cell
lines.
- Documentation: Zanotelli2020Spheroids.
- Publication: Zanotelli et al. Mol Syst Biol (2020) 16:e9798.
- Documentation: Zanotelli2020Spheroids.
- IMMUcan2022CancerExample: Example data from the
IMMUcan project.
- Documentation: IMMUcan2022CancerExample.
- Documentation: IMMUcan2022CancerExample.
Viewing available datasets in R
In R, currently available datasets can be viewed with:
{r}
imc <- imcdatasets::listDatasets()
imc <- as.data.frame(imc)
imc
Detailed information about each dataset is available in the help pages
(e.g., ?JacksonFischer_2020_BreastCancer).
For more information, please refer to the
ExperimentHub vignette.
Contributing
Suggestions for new Imaging Mass Cytometry datasets to include in the
imcdatasets package are welcome and can be made by
opening an issue on GitHub.
Guidelines about contributions and dataset formatting are provided in a dedicated vignette.
Citation
Damond N, Eling N, Fischer J, Hoch T (2024). imcdatasets: Collection of publicly available imaging mass cytometry (IMC) datasets. R package version 1.11.1, https://github.com/BodenmillerGroup/imcdatasets.
Authors
- Nicolas Damond (author, maintainer)
- Nils Eling (contributor)
- Jana Fischer (contributor)
- Tobias Hoch (contributor)
References
Owner
- Name: BodenmillerGroup
- Login: BodenmillerGroup
- Kind: organization
- Repositories: 83
- Profile: https://github.com/BodenmillerGroup
GitHub Events
Total
- Issues event: 2
- Watch event: 2
- Issue comment event: 1
- Fork event: 1
Last Year
- Issues event: 2
- Watch event: 2
- Issue comment event: 1
- Fork event: 1
Issues and Pull Requests
Last synced: 7 months ago
All Time
- Total issues: 16
- Total pull requests: 12
- Average time to close issues: 6 months
- Average time to close pull requests: about 1 month
- Total issue authors: 8
- Total pull request authors: 3
- Average comments per issue: 1.25
- Average comments per pull request: 1.42
- Merged pull requests: 11
- Bot issues: 0
- Bot pull requests: 0
Past Year
- Issues: 2
- Pull requests: 0
- Average time to close issues: N/A
- Average time to close pull requests: N/A
- Issue authors: 2
- Pull request authors: 0
- Average comments per issue: 0.0
- Average comments per pull request: 0
- Merged pull requests: 0
- Bot issues: 0
- Bot pull requests: 0
Top Authors
Issue Authors
- nilseling (4)
- ndamond (4)
- DarioS (3)
- Al3n70rn (1)
- Huimin721 (1)
- mjemons (1)
- mdeea (1)
- lassedochreden (1)
Pull Request Authors
- ndamond (8)
- nilseling (3)
- JanaFischer (1)