https://github.com/cnag-biomedical-informatics/beacon2-cbi-tools
Beacon v2 - CNAG Biomedical Informatics - Tools (Data ingestion tools)
https://github.com/cnag-biomedical-informatics/beacon2-cbi-tools
Science Score: 13.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
○CITATION.cff file
-
○codemeta.json file
-
○.zenodo.json file
-
✓DOI references
Found 2 DOI reference(s) in README -
○Academic publication links
-
○Academic email domains
-
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (8.6%) to scientific vocabulary
Keywords
23andme
beacon
beacon-friendly-format
beacon-v2
beacon2-reference-implementation
beacon2-ri
beacon2-ri-tools
bff
bff-portal
browser
cnag
data-exchange
json
microarray
mongodb
phenopackets-v2
pxf
snp
vcf
Last synced: 5 months ago
·
JSON representation
Repository
Beacon v2 - CNAG Biomedical Informatics - Tools (Data ingestion tools)
Basic Info
Statistics
- Stars: 3
- Watchers: 1
- Forks: 0
- Open Issues: 0
- Releases: 0
Fork of EGA-archive/beacon2-ri-tools
Topics
23andme
beacon
beacon-friendly-format
beacon-v2
beacon2-reference-implementation
beacon2-ri
beacon2-ri-tools
bff
bff-portal
browser
cnag
data-exchange
json
microarray
mongodb
phenopackets-v2
pxf
snp
vcf
Created over 2 years ago
· Last pushed 8 months ago
https://github.com/CNAG-Biomedical-Informatics/beacon2-cbi-tools/blob/main/
[](https://github.com/CNAG-Biomedical-Informatics/beacon2-cbi-tools/actions/workflows/docker-build.yml) [](https://github.com/cnag-biomedical-informatics/beacon2-cbi-tools/actions/workflows/documentation.yml)  [](https://www.gnu.org/licenses/gpl-3.0) [](https://hub.docker.com/r/manuelrueda/beacon2-cbi-tools/) [](https://hub.docker.com/r/manuelrueda/beacon2-ri-tools/) [](https://hub.docker.com/r/beacon2ri/beacon_reference_implementation/)  --- **New documentation:** https://cnag-biomedical-informatics.github.io/beacon2-cbi-tools **Docker Hub Image:** https://hub.docker.com/r/manuelrueda/beacon2-cbi-tools/tags **Legacy B2RI Documentation:** https://b2ri-documentation.readthedocs.io/ **Genetic Data Interpretation Disclaimer:** https://cnag-biomedical-informatics.github.io/beacon2-cbi-tools/about/disclaimer --- **Actively maintained by CNAG Biomedical Informatics** > **Note:** This repository was formerly known as **beacon2-ri-tools** (Beacon v2 Reference Implementation). It has been renamed to **beacon2-cbi-tools (CNAG Biomedical Informatics)** to better reflect its identity under CNAG. # Table of contents - [Description](#description) - [Overview](#overview) - [Tools included](#tools-included) - [System Diagram](#system-diagram) - [Roadmap](#roadmap) - [Installation](#installation) - [Containerized](#containerized-installation-recommended) - [Non-Containerized](#non-containerized-installation) - [Citation](#citation) - [Author](#author) - [License](#copyright-and-license) # DESCRIPTION ## Overview **beacon2-cbi-tools** is a suite of tools originally developed as part of the ELIXIRBeacon v2 Reference Implementation, now continuing under [CNAG](https://www.cnag.eu) Biomedical Informatics. It provides essential functionality around the Beacon Friendly Format (BFF) data exchange format, including: - **Validating XLSX/JSON** files against Beacon v2 schemas - **Converting VCF and microarray** files into BFF (genomicVariations) - **Loading BFF** data (metadata and genomic variations) **into MongoDB** This toolkit streamlines data preparation, validation, and ingestion for federated genomic and phenotypic data sharing under Beacon v2. The resulting BFF-formatted data **can be used with any implementation of the [Beacon v2 API specification](https://docs.genomebeacons.org/) that operates on MongoDB**. ## Tools Included ### [BFF-Tools script](https://github.com/CNAG-Biomedical-Informatics/beacon2-cbi-tools/tree/main/bin/README.md) (`bin/bff-tools`): A command-line tool for converting VCF data into BFF format and inserting the resulting BFF data into a MongoDB instance. The tool offers five modes: 1. **vcf**: Convert a VCF.gz file into BFF format. 2. **tsv**: Convert a SNP microarray file (e.g., from 23andme) into BFF format. 3. **load**: Load BFF-formatted data into a MongoDB instance. 4. **full**: Perform both TSV/VCF conversion and MongoDB loading. 5. **validate**: Validate XLSX or JSON metadata against Beacon v2 schemas and serialize into BFF. An [Excel template](https://github.com/CNAG-Biomedical-Informatics/beacon2-cbi-tools/blob/main/CINECA_synthetic_cohort_EUROPE_UK1/Beacon-v2-Models_CINECA_UK1.xlsx) is provided to help structure your metadata. ### [Utility Suite](https://github.com/CNAG-Biomedical-Informatics/beacon2-cbi-tools/tree/main/utils): A collection of support tools to aid in data ingestion. Key among them: - **[BFF-Browser](https://github.com/CNAG-Biomedical-Informatics/beacon2-cbi-tools/tree/main/utils/bff_browser)**: A web application for interactive visualization of BFF data, particularly `genomicVariations` and `individuals`. - **[BFF-Portal](https://github.com/CNAG-Biomedical-Informatics/beacon2-cbi-tools/tree/main/utils/bff_portal)**: A simple API and web application to query BFF data via MongoDB. ### [CINECA Synthetic Cohort - EUROPE_UK1](https://github.com/CNAG-Biomedical-Informatics/beacon2-cbi-tools/tree/main/CINECA_synthetic_cohort_EUROPE_UK1): A synthetic dataset for testing and demonstration purposes. ## System Diagram * Beacon v2 - CBI Tools * ___________ XLSX | | or | Metadata | (incl. Phenotypic data) JSON |__________| _________ | | | | | TSV | | bff-tools validate |______ | | | | Beacon v2 | bff-tools tsv | ____v____ ____v____ __________ ______ | | | | | | | | <---- Request | VCF | -----> | BFF | ---------> | Database | <----> | API | |_______| |_ _____| |__________| |_____| ----> Response | MongoDB bff-tools vcf | bff-tools load | | Optional (utils) | _____v_____ | | | utils/ | | bff- | | browser | Visualization | (beta) | |_________| -----------------------------------------------|||--------------------------- beacon2-cbi-tools e.g. beacon2-ri-api beacon2-pi-api java-beacon-v2.api ... ## Roadmap **Latest Update: May-2025** This repository has been widely adopted in Beacon v2 implementations and is also used internally at CNAG. As a result, we plan to continue its development. Some of our upcoming plans include: - **Implement Beacon 2.x specification changes** - For VCF: Adopt VRS nomenclature and transition away from LegacyVariation. Support for structural variants may be added. - For other entities: Align with the latest schema used in the BFF Validator and the Excel metadata template. - Update the **CINECA Synthetic Cohort** dataset. # INSTALLATION You can install **beacon2-cbi-tools** using one of two methods: ### Containerized Installation (Recommended) Follow the guide [here](docker/README.md) to use Docker for a streamlined setup. ### Non-Containerized Installation See [here](non-containerized/README.md) for manual installation instructions. # CITATION The author requests that any published work that utilizes these tools includes a citation to the following reference: Rueda, M, Ariosa R. "Beacon v2 Reference Implementation: a toolkit to enable federated sharing of genomic and phenotypic data". _Bioinformatics_, btac568, https://doi.org/10.1093/bioinformatics/btac568 # AUTHOR Written by Manuel Rueda, PhD. Info about CNAG Biomedical Informatics can be found at [https://www.cnag.eu](https://www.cnag.eu) # COPYRIGHT and LICENSE The software in this repository is copyrighted. See the LICENSE file included in this distribution.beacon2-cbi-tools
Owner
- Name: CNAG Biomedical Informatics
- Login: CNAG-Biomedical-Informatics
- Kind: organization
- Location: Spain
- Repositories: 1
- Profile: https://github.com/CNAG-Biomedical-Informatics
GitHub Events
Total
- Watch event: 1
- Delete event: 5
- Push event: 82
- Create event: 9
Last Year
- Watch event: 1
- Delete event: 5
- Push event: 82
- Create event: 9