CytoML

A GatingML Interface for Cross Platform Cytometry Data Sharing

https://github.com/rglab/cytoml

Science Score: 33.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
  • codemeta.json file
  • .zenodo.json file
  • DOI references
    Found 1 DOI reference(s) in README
  • Academic publication links
    Links to: ncbi.nlm.nih.gov
  • Committers with academic emails
    1 of 21 committers (4.8%) from academic institutions
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (11.5%) to scientific vocabulary

Keywords from Contributors

bioconductor-packages genomics gene bioconductor immunology rna-seq single-cell bioinformatics labkey immunespace
Last synced: 10 months ago · JSON representation

Repository

A GatingML Interface for Cross Platform Cytometry Data Sharing

Basic Info
  • Host: GitHub
  • Owner: RGLab
  • License: agpl-3.0
  • Language: HTML
  • Default Branch: devel
  • Homepage:
  • Size: 190 MB
Statistics
  • Stars: 31
  • Watchers: 8
  • Forks: 15
  • Open Issues: 55
  • Releases: 4
Created about 10 years ago · Last pushed over 1 year ago
Metadata Files
Readme License

README.Rmd

---
output: github_document
editor_options: 
  
  chunk_output_type: inline
---

# CytoML: Cross-Platform Cytometry Data Sharing.

This package is designed to import/export the hierarchical gated cytometry data to and from R (specifically the [openCyto](https://github.com/RGLab/openCyto) framework) using the [`gatingML2.0`](https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4874733/) and [`FCS3.0`](http://isac-net.org/Resources/Standards/FCS3-1.aspx) cytometry data standards. This package makes use of the `GatingSet` R object and data model so that imported data can easily be manipulated and visualized in R using tools like [openCyto](https://github.com/RGLab/openCyto) and [ggCyto](https://github.com/RGLab/ggcyto).


## What problems does CytoML solve?

CytoML allows you to:

- Import manually gated data into R from [Diva](http://www.bdbiosciences.com/us/instruments/clinical/software/flow-cytometry-acquisition/bd-facsdiva-software/m/333333/overview), [FlowJo](https://www.flowjo.com/) and [Cytobank](https://cytobank.org/).
- Combine manual gating strategies with automated gating strategies in R.
- Export data gated manually, auto-gated, or gated using a combination of manual and automated strategies from R to [Diva](http://www.bdbiosciences.com/us/instruments/clinical/software/flow-cytometry-acquisition/bd-facsdiva-software/m/333333/overview), [FlowJo](https://www.flowjo.com/) and [Cytobank](https://cytobank.org/).
- Share computational flow analyses with users on other platforms.
- Perform comparative analyses between computational and manual gating approaches.


## INSTALLATION

CytoML can be installed in several ways:

### For all versions:

For all versions, you must have dependencies installed

```r
library(BiocManager)
# This should pull all dependencies.
BiocManager::install("openCyto") 

# Then install latest dependencies from github, using devtools.
install.packages("devtools") 
library(devtools) #load it

install_github("RGLab/flowWorkspace")
install_github("RGLab/openCyto")
```

### Installing from [BioConductor](https://www.bioconductor.org).

- [Current BioConductor Relase](https://doi.org/doi:10.18129/B9.bioc.CytoML)

```r
library(BiocManager)
#this should pull all dependencies.
BiocManager::install("CytoML", version = "devel") 
```

- [Current BioConductor Development Version](http://bioconductor.org/packages/devel/bioc/html/CytoML.html)

```r
library(BiocManager)
#this should pull all dependencies.
BiocManager::install("CytoML", version = "devel") 
```

### Installing from GitHub

- [Latest GitHub Version](https://github.com/RGLab/CytoML)

```r
install.packges("devtools")
devtools::install_github("RGLab/CytoML")
```

- [Latest GitHub Release](https://github.com/RGLab/CytoML/releases)

```r
install.packges("devtools")
devtools::install_github("RGLab/CytoML@*release")
```
## Reproducible examples from the CytoML paper

- A reproducible workflow can be found at the [RGLab site](http://www.rglab.org/CytoML), and was prepared with version 1.7.10 of CytoML, R v3.5.0, and dependencies that can be installed by:

```r
# We recomend using R version 3.5.0
devtools::install_github("RGLab/RProtoBufLib@v1.3.7")
devtools::install_github("RGLab/cytolib@v1.3.2")
devtools::install_github("RGLab/flowCore@v1.47.7")
devtools::install_github("RGLab/flowWorkspace@v3.29.7")
devtools::install_github("RGLab/openCyto@v1.19.2")
devtools::install_github("RGLab/CytoML@v1.7.10")
devtools::install_github("RGLab/ggcyto@v1.9.12")
```

## Examples

### Import data

To import data you need the xml workspace and the raw FCS files.

#### Import `gatingML` generated from [Cytobank](https://cytobank.org/).


```{r, message = FALSE, warning = FALSE, error = FALSE}
library(CytoML)
acsfile <- system.file("extdata/cytobank_experiment.acs", package = "CytoML")
ce <- open_cytobank_experiment(acsfile)
xmlfile <- ce$gatingML
fcsFiles <- list.files(ce$fcsdir, full.names = TRUE)
gs <- cytobank_to_gatingset(xmlfile, fcsFiles)
```

#### Import a [Diva](http://www.bdbiosciences.com/us/instruments/clinical/software/flow-cytometry-acquisition/bd-facsdiva-software/m/333333/overview) workspace.

```{r, message = FALSE, warning = FALSE, error = FALSE}
ws <- open_diva_xml(system.file('extdata/diva/PE_2.xml', package = "flowWorkspaceData"))
# The path to the FCS files is stored in ws@path.
# It can also be passed in to parseWorksapce via the `path` argument.
gs <- diva_to_gatingset(ws, name = 2, subset = 1, swap_cols = FALSE)
```


#### Interact with the gated data (`GatingSet`)

We need `flowWorkspace` to interact with the imported data.

```{r, message = FALSE, warning = FALSE, error = FALSE}
library(flowWorkspace)
```

We can visualize the gating tree as follows:

```{r}
#get the first sample
gh <- gs[[1]]

#plot the hierarchy tree
plot(gh)
```

For more information see the [flowWorkspace](http://www.github.com/RGLab/flowWorkspace) package.

We can print all the cell populations defined in the gating tree.

```{r}
#show all the cell populations(/nodes)
gs_get_pop_paths(gh)
```

We can extract the cell population statistics. 

```{r}
#show the population statistics
gh_pop_compare_stats(gh)
```

The `openCyto.count` column shows the cell counts computed via the import.
The `xml.count` column shows the cell counts computed by FlowJo (note not all platforms report cell counts in the workspace). It is normal for these to differ by a few cells due to numerical differences in the implementation of data transformations. CytoML and openCyto are *reproducing* the data analysis from the raw data based on the information in the workspace.

We can plot all the gates defined in the workspace.

```{r}
#plot the gates
plotGate(gh) 
```

#### Access information about cells in a specific population.

Because CytoML and flowWorkspace reproduce the entire analysis in a workspace in R, we have access to information about which cells are part of which cell populations.

flowWorkspace has convenience methods to extract the cells from specific cell populations:

```{r}
gh_pop_get_data(gh,"P3")
```
This returns a `flowFrame` with the cells in gate P3 (70% of the cells according to the plot).

The matrix of expression can be extracted from a `flowFrame` using the `exprs()` method from the `flowCore` package:

```{r}
library(flowCore)
e <- exprs(gh_pop_get_data(gh,"P3"))
class(e)
dim(e)
colnames(e)
#compute the MFI of the fluorescence channels.
colMeans(e[,8:15])
```

### Export gated data to other platforms.

In order to export gated data, it must be in `GatingSet` format.

#### Export a `GatingSet` from R to [Cytobank](https://cytobank.org/) or [FlowJo](https://www.flowjo.com/)

Load something to export.

```{r}
dataDir <- system.file("extdata",package="flowWorkspaceData")
gs <- load_gs(list.files(dataDir, pattern = "gs_manual",full = TRUE))
```

##### Export to Cytobank

```{r}
#Cytobank
outFile <- tempfile(fileext = ".xml")
gatingset_to_cytobank(gs, outFile)
```

##### Export to FlowJo

```{r}
#flowJo
outFile <- tempfile(fileext = ".wsp")
gatingset_to_flowjo(gs, outFile)
```

## Next Steps

See the [flowWorskspace](http://www.github.com/RGLab/flowWorkspace) and [openCyto](http://www.github.com/RGLab/openCyto] packages to learn more about what can be done with `GatingSet` objects.

## Code of conduct 

Please note that this project is released with a [Contributor Code of Conduct](CODE_OF_CONDUCT.md).
By participating in this project you agree to abide by its terms.

Owner

  • Name: RGLab
  • Login: RGLab
  • Kind: organization
  • Location: Seattle

Raphael Gottardo's Research Lab

GitHub Events

Total
  • Issues event: 9
  • Watch event: 3
  • Issue comment event: 6
  • Push event: 3
  • Pull request event: 4
  • Fork event: 1
  • Create event: 2
Last Year
  • Issues event: 9
  • Watch event: 3
  • Issue comment event: 6
  • Push event: 3
  • Pull request event: 4
  • Fork event: 1
  • Create event: 2

Committers

Last synced: about 2 years ago

All Time
  • Total Commits: 775
  • Total Committers: 21
  • Avg Commits per committer: 36.905
  • Development Distribution Score (DDS): 0.33
Past Year
  • Commits: 6
  • Committers: 2
  • Avg Commits per committer: 3.0
  • Development Distribution Score (DDS): 0.333
Top Committers
Name Email Commits
mikejiang w****2@f****g 519
Jacob Wagner j****r@f****g 132
mikejiang m****e@o****i 37
Nitesh Turaga n****a@g****m 26
gfinak g****k@g****m 14
whitews w****s@g****m 6
vobencha v****a@g****m 4
vobencha v****n@r****g 4
hpages@fhcrc.org h****s@f****g@b****8 4
J Wokaty j****y@s****u 4
Hervé Pagès h****s@f****g 4
Herve Pages h****s@f****g 4
Greg Finak g****g@o****i 3
Darío Hereñú m****a@g****m 2
Eric Leung e****c@e****m 2
J Wokaty j****y 2
Greg Finak g****k@f****g 2
Kayla-Morrell k****l@r****g 2
Mike Jiang w****2@f****g 2
Martin Morgan m****n@f****g 1
Raphael Gottardo r****h@r****g 1
Committer Domains (Top 20 + Academic)

Issues and Pull Requests

Last synced: 10 months ago

All Time
  • Total issues: 7
  • Total pull requests: 2
  • Average time to close issues: 9 days
  • Average time to close pull requests: about 24 hours
  • Total issue authors: 6
  • Total pull request authors: 1
  • Average comments per issue: 0.43
  • Average comments per pull request: 0.0
  • Merged pull requests: 2
  • Bot issues: 0
  • Bot pull requests: 0
Past Year
  • Issues: 7
  • Pull requests: 2
  • Average time to close issues: 9 days
  • Average time to close pull requests: about 24 hours
  • Issue authors: 6
  • Pull request authors: 1
  • Average comments per issue: 0.43
  • Average comments per pull request: 0.0
  • Merged pull requests: 2
  • Bot issues: 0
  • Bot pull requests: 0
Top Authors
Issue Authors
  • rwbaer (2)
  • Biomiha (1)
  • ztflaten (1)
  • malcook (1)
  • david-priest (1)
  • Bmacs (1)
  • Close-your-eyes (1)
  • klai001 (1)
Pull Request Authors
  • mikejiang (2)
  • alefrol638 (1)
Top Labels
Issue Labels
Pull Request Labels

Packages

  • Total packages: 1
  • Total downloads:
    • bioconductor 108,764 total
  • Total dependent packages: 3
  • Total dependent repositories: 0
  • Total versions: 7
  • Total maintainers: 1
bioconductor.org: CytoML

A GatingML Interface for Cross Platform Cytometry Data Sharing

  • Versions: 7
  • Dependent Packages: 3
  • Dependent Repositories: 0
  • Downloads: 108,764 Total
Rankings
Dependent repos count: 0.0%
Dependent packages count: 0.0%
Average: 4.0%
Downloads: 11.9%
Maintainers (1)
Last synced: 10 months ago

Dependencies

DESCRIPTION cran
  • Biobase * imports
  • RBGL * imports
  • RUnit * imports
  • RcppParallel * imports
  • Rgraphviz * imports
  • XML * imports
  • base64enc * imports
  • corpcor * imports
  • cytolib >= 2.3.10 imports
  • data.table * imports
  • dplyr * imports
  • flowCore >= 1.99.10 imports
  • flowWorkspace >= 4.1.8 imports
  • ggcyto >= 1.11.4 imports
  • grDevices * imports
  • graph * imports
  • graphics * imports
  • jsonlite * imports
  • lattice * imports
  • methods * imports
  • openCyto >= 1.99.2 imports
  • plyr * imports
  • stats * imports
  • tibble * imports
  • utils * imports
  • xml2 * imports
  • yaml * imports
  • flowWorkspaceData * suggests
  • knitr * suggests
  • parallel * suggests
  • rmarkdown * suggests
  • testthat * suggests