HuBMAPR
R client to discover and retrieve data on HuBMAP Data Portal
Science Score: 49.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
○CITATION.cff file
-
✓codemeta.json file
Found codemeta.json file -
✓.zenodo.json file
Found .zenodo.json file -
✓DOI references
Found 2 DOI reference(s) in README -
✓Academic publication links
Links to: biorxiv.org -
○Academic email domains
-
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (11.3%) to scientific vocabulary
Keywords
Repository
R client to discover and retrieve data on HuBMAP Data Portal
Basic Info
- Host: GitHub
- Owner: christinehou11
- Language: HTML
- Default Branch: main
- Homepage: https://christinehou11.github.io/HuBMAPR/
- Size: 5.1 MB
Statistics
- Stars: 3
- Watchers: 4
- Forks: 1
- Open Issues: 0
- Releases: 0
Topics
Metadata Files
README.md
HuBMAPR
'HuBMAP' data portal (https://portal.hubmapconsortium.org/) provides
an open, global bio-molecular atlas of the human body at the cellular
level. HuBMAPR package provides an alternative interface to access the
data exploration and file retrieval via R.
The HuBMAP Consortium offers several
APIs.
To achieve the main objectives, HuBMAPR package specifically integrates three
APIs:
Search API: The Search API is primarily searching relevant data information and is referenced to the Elasticsearch API.
Entity API: The Entity API is specifically utilized in the
bulk_data_transfer()function for Globus URL retrievalOntology API: The Ontology API is applied in the
organ()function to provide additional information about the abbreviation and corresponding full name of each organ.
Each API serves a distinct purpose with unique query capabilities, tailored to
meet various needs. Utilizing the httr2 and rjsoncons packages, HuBMAPR
effectively manages, modifies, and executes multiple requests via these APIs,
presenting responses in formats such as tibble or character. These outputs are
further modified for clarity in the final results from the HuBMAPR functions,
and these functions help reflect the data information of HuBMAP Data Portal as
much as possible.
HuBMAP Data incorporates three different identifiers:
HuBMAP ID, e.g. HBM399.VCTL.353
Universally Unique Identifier (UUID), e.g. 7036a70229eff1a51af965454dddbe7d
Digital Object Identifiers (DOI), e.g. 10.35079/HBM399.VCTL.353.
The HuBMAPR
package utilizes the UUID - a 32-digit hexadecimal number - and the more
human-readable HuBMAP ID as two common identifiers in the retrieved results.
Considering precision and compatibility with software implementation and data
storage, UUID serves as the primary identifier to retrieve data across various
functions, with the UUID mapping uniquely to its corresponding HuBMAP ID.
The systematic nomenclature is adopted for functions in the package by appending the entity category prefix to the concise description of the specific functionality. Most of the functions are grouped by entity categories, thereby simplifying the process of selecting the appropriate functions to retrieve the desired information associated with the given UUID from the specific entity category. The structure of these functions is heavily consistent across all entity categories with some exceptions for collection and publication.
Installation
HuBMAPR is a R package available in Bioconductor version $\geq$ 3.20 and
R version $\geq$ 4.4.0. You can install HuBMAPR by using the following
commands in R session from Bioconductor:
``` r if (!require("BiocManager", quietly = TRUE)) install.packages("BiocManager")
BiocManager::install("HuBMAPR")
Check Bioconductor installation
BiocManager::valid() ```
Additionally, you can install development version from GitHub:
r
BiocManager::install_github("christinehou11/HuBMAPR")
Use
$\textbf{Entity Category:}$
Dataset
Sample
Donor
Collection
Publication
$\textbf{Available records for [Entity Category]:}$
datasets()
samples()
donors()
collections()
publications()
$\textbf{The default columns from Entity Category]:}$
datasetsdefaultcolumns(as = c(“tibble”, “character”))
samples defaultcolumns(as = c(“tibble”, “character”)
donors defaultcolumns(as = c(“tibble”, “character”)
collections defaultcolumns(as = c(“tibble”, “character”)
publications defaultcolumns(as = c(“tibble”, “character”)
$\textbf{Single Record Information for [Entity Category] record:}$
Dataset
- datasetdetail(datasetuuid)
- datasetderived(datasetuuid)
- datasetmetadata(datasetuuid)
- datasetcontributors(datasetuuid)
Sample
- sampledetail(sampleuuid)
- samplederived(sampleuuid, entity_type = c("Dataset", "Sample"))
- samplemetadata(sampleuuid)
Donor
- donordetail(donoruuid)
- donor derived(donoruuid, entity_type = c("Dataset", "Sample"))
- donor metadata(donoruuid)
Collection
- collectiondetail(collectionuuid)
- collectiondata(collectionuuid)
- collectioncontributors(collectionuuid)
- collectioncontacts(collectionuuid)
- collectioninformation(collectionuuid)
Publication
- publicationdetail(publicationuuid)
- publication data(publicationuuid)
- publicationauthors(publicationuuid)
- publicationinformation(publicationuuid)
$\textbf{Provenance of a dataset/sample/donor}$:
- uuid_provenance(dataset/sample/donor uuid)
$\textbf{Additional information about organ abbreviation and its full name:}$
- organ()
$\textbf{Retrieve data files from one dataset single record:}$
- bulkdatatransfer(dataset_uuid)
View the article Explore Human BioMelecular Atlas Program Data Portal to read detailed examples.
Preprint
Pre-print: https://www.biorxiv.org/content/10.1101/2024.09.26.615227v1
Owner
- Login: christinehou11
- Kind: user
- Repositories: 1
- Profile: https://github.com/christinehou11
GitHub Events
Total
- Watch event: 3
- Delete event: 2
- Push event: 43
- Fork event: 1
- Create event: 5
Last Year
- Watch event: 3
- Delete event: 2
- Push event: 43
- Fork event: 1
- Create event: 5
Issues and Pull Requests
Last synced: 6 months ago
All Time
- Total issues: 9
- Total pull requests: 5
- Average time to close issues: about 10 hours
- Average time to close pull requests: about 5 hours
- Total issue authors: 3
- Total pull request authors: 2
- Average comments per issue: 1.56
- Average comments per pull request: 0.4
- Merged pull requests: 3
- Bot issues: 0
- Bot pull requests: 0
Past Year
- Issues: 9
- Pull requests: 5
- Average time to close issues: about 10 hours
- Average time to close pull requests: about 5 hours
- Issue authors: 3
- Pull request authors: 2
- Average comments per issue: 1.56
- Average comments per pull request: 0.4
- Merged pull requests: 3
- Bot issues: 0
- Bot pull requests: 0
Top Authors
Issue Authors
- mtmorgan (5)
- shazanfar (2)
- stephaniehicks (2)
Pull Request Authors
- mtmorgan (3)
- federicomarini (2)
Top Labels
Issue Labels
Pull Request Labels
Packages
- Total packages: 1
-
Total downloads:
- bioconductor 2,603 total
- Total dependent packages: 0
- Total dependent repositories: 0
- Total versions: 6
- Total maintainers: 1
bioconductor.org: HuBMAPR
Interface to 'HuBMAP'
- Homepage: https://christinehou11.github.io/HuBMAPR/ https://github.com/christinehou11/HuBMAPR
- Documentation: https://bioconductor.org/packages/release/bioc/vignettes/HuBMAPR/inst/doc/HuBMAPR.pdf
- License: Artistic-2.0
-
Latest release: 1.2.2
published 7 months ago
Rankings
Maintainers (1)
Dependencies
- JamesIves/github-pages-deploy-action v4.5.0 composite
- actions/checkout v4 composite
- r-lib/actions/setup-pandoc v2 composite
- r-lib/actions/setup-r v2 composite
- r-lib/actions/setup-r-dependencies v2 composite
- dplyr * imports
- httr2 * imports
- purrr * imports
- rjsoncons * imports
- rlang * imports
- stringr * imports
- tibble * imports
- tidyr * imports
- utils * imports
- whisker * imports
- BiocStyle * suggests
- knitr * suggests
- rmarkdown * suggests
- testthat >= 3.0.0 suggests
- JamesIves/github-pages-deploy-action releases/v4 composite
- actions/cache v3 composite
- actions/checkout v3 composite
- actions/upload-artifact master composite
- docker/build-push-action v4 composite
- docker/login-action v2 composite
- docker/setup-buildx-action v2 composite
- docker/setup-qemu-action v2 composite
- r-lib/actions/setup-pandoc v2 composite
- r-lib/actions/setup-r v2 composite