https://github.com/cnag-biomedical-informatics/beacon2-cbi-tools

Beacon v2 - CNAG Biomedical Informatics - Tools (Data ingestion tools)

https://github.com/cnag-biomedical-informatics/beacon2-cbi-tools

Science Score: 13.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
  • codemeta.json file
  • .zenodo.json file
  • DOI references
    Found 2 DOI reference(s) in README
  • Academic publication links
  • Academic email domains
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (8.6%) to scientific vocabulary

Keywords

23andme beacon beacon-friendly-format beacon-v2 beacon2-reference-implementation beacon2-ri beacon2-ri-tools bff bff-portal browser cnag data-exchange json microarray mongodb phenopackets-v2 pxf snp vcf
Last synced: 5 months ago · JSON representation

Repository

Beacon v2 - CNAG Biomedical Informatics - Tools (Data ingestion tools)

Basic Info
  • Host: GitHub
  • Owner: CNAG-Biomedical-Informatics
  • License: gpl-3.0
  • Language: Perl
  • Default Branch: main
  • Homepage:
  • Size: 27.2 MB
Statistics
  • Stars: 3
  • Watchers: 1
  • Forks: 0
  • Open Issues: 0
  • Releases: 0
Fork of EGA-archive/beacon2-ri-tools
Topics
23andme beacon beacon-friendly-format beacon-v2 beacon2-reference-implementation beacon2-ri beacon2-ri-tools bff bff-portal browser cnag data-exchange json microarray mongodb phenopackets-v2 pxf snp vcf
Created over 2 years ago · Last pushed 8 months ago

https://github.com/CNAG-Biomedical-Informatics/beacon2-cbi-tools/blob/main/

beacon2-cbi-tools

beacon2-cbi-tools

[![Docker build](https://github.com/CNAG-Biomedical-Informatics/beacon2-cbi-tools/actions/workflows/docker-build.yml/badge.svg)](https://github.com/CNAG-Biomedical-Informatics/beacon2-cbi-tools/actions/workflows/docker-build.yml) [![Documentation Status](https://github.com/cnag-biomedical-informatics/beacon2-cbi-tools/actions/workflows/documentation.yml/badge.svg)](https://github.com/cnag-biomedical-informatics/beacon2-cbi-tools/actions/workflows/documentation.yml) ![Maintenance status](https://img.shields.io/badge/maintenance-actively--developed-brightgreen.svg) [![License: GPL v3](https://img.shields.io/badge/License-GPL%20v3-blue.svg)](https://www.gnu.org/licenses/gpl-3.0) [![Docker Pulls](https://badgen.net/docker/pulls/manuelrueda/beacon2-cbi-tools?icon=docker&label=beacon2-cbi-tools-pulls)](https://hub.docker.com/r/manuelrueda/beacon2-cbi-tools/) [![Docker Pulls](https://badgen.net/docker/pulls/manuelrueda/beacon2-ri-tools?icon=docker&label=deprecated-beacon2-ri-tools-pulls)](https://hub.docker.com/r/manuelrueda/beacon2-ri-tools/) [![Docker Pulls EGA-archive](https://badgen.net/docker/pulls/beacon2ri/beacon_reference_implementation?icon=docker&label=deprecated-EGA-archive-pulls)](https://hub.docker.com/r/beacon2ri/beacon_reference_implementation/) ![version](https://img.shields.io/badge/version-2.0.10-blue) --- **New documentation:** https://cnag-biomedical-informatics.github.io/beacon2-cbi-tools **Docker Hub Image:** https://hub.docker.com/r/manuelrueda/beacon2-cbi-tools/tags **Legacy B2RI Documentation:** https://b2ri-documentation.readthedocs.io/ **Genetic Data Interpretation Disclaimer:** https://cnag-biomedical-informatics.github.io/beacon2-cbi-tools/about/disclaimer --- **Actively maintained by CNAG Biomedical Informatics** > **Note:** This repository was formerly known as **beacon2-ri-tools** (Beacon v2 Reference Implementation). It has been renamed to **beacon2-cbi-tools (CNAG Biomedical Informatics)** to better reflect its identity under CNAG. # Table of contents - [Description](#description) - [Overview](#overview) - [Tools included](#tools-included) - [System Diagram](#system-diagram) - [Roadmap](#roadmap) - [Installation](#installation) - [Containerized](#containerized-installation-recommended) - [Non-Containerized](#non-containerized-installation) - [Citation](#citation) - [Author](#author) - [License](#copyright-and-license) # DESCRIPTION ## Overview **beacon2-cbi-tools** is a suite of tools originally developed as part of the ELIXIRBeacon v2 Reference Implementation, now continuing under [CNAG](https://www.cnag.eu) Biomedical Informatics. It provides essential functionality around the Beacon Friendly Format (BFF) data exchange format, including: - **Validating XLSX/JSON** files against Beacon v2 schemas - **Converting VCF and microarray** files into BFF (genomicVariations) - **Loading BFF** data (metadata and genomic variations) **into MongoDB** This toolkit streamlines data preparation, validation, and ingestion for federated genomic and phenotypic data sharing under Beacon v2. The resulting BFF-formatted data **can be used with any implementation of the [Beacon v2 API specification](https://docs.genomebeacons.org/) that operates on MongoDB**. ## Tools Included ### [BFF-Tools script](https://github.com/CNAG-Biomedical-Informatics/beacon2-cbi-tools/tree/main/bin/README.md) (`bin/bff-tools`): A command-line tool for converting VCF data into BFF format and inserting the resulting BFF data into a MongoDB instance. The tool offers five modes: 1. **vcf**: Convert a VCF.gz file into BFF format. 2. **tsv**: Convert a SNP microarray file (e.g., from 23andme) into BFF format. 3. **load**: Load BFF-formatted data into a MongoDB instance. 4. **full**: Perform both TSV/VCF conversion and MongoDB loading. 5. **validate**: Validate XLSX or JSON metadata against Beacon v2 schemas and serialize into BFF. An [Excel template](https://github.com/CNAG-Biomedical-Informatics/beacon2-cbi-tools/blob/main/CINECA_synthetic_cohort_EUROPE_UK1/Beacon-v2-Models_CINECA_UK1.xlsx) is provided to help structure your metadata. ### [Utility Suite](https://github.com/CNAG-Biomedical-Informatics/beacon2-cbi-tools/tree/main/utils): A collection of support tools to aid in data ingestion. Key among them: - **[BFF-Browser](https://github.com/CNAG-Biomedical-Informatics/beacon2-cbi-tools/tree/main/utils/bff_browser)**: A web application for interactive visualization of BFF data, particularly `genomicVariations` and `individuals`. - **[BFF-Portal](https://github.com/CNAG-Biomedical-Informatics/beacon2-cbi-tools/tree/main/utils/bff_portal)**: A simple API and web application to query BFF data via MongoDB. ### [CINECA Synthetic Cohort - EUROPE_UK1](https://github.com/CNAG-Biomedical-Informatics/beacon2-cbi-tools/tree/main/CINECA_synthetic_cohort_EUROPE_UK1): A synthetic dataset for testing and demonstration purposes. ## System Diagram * Beacon v2 - CBI Tools * ___________ XLSX | | or | Metadata | (incl. Phenotypic data) JSON |__________| _________ | | | | | TSV | | bff-tools validate |______ | | | | Beacon v2 | bff-tools tsv | ____v____ ____v____ __________ ______ | | | | | | | | <---- Request | VCF | -----> | BFF | ---------> | Database | <----> | API | |_______| |_ _____| |__________| |_____| ----> Response | MongoDB bff-tools vcf | bff-tools load | | Optional (utils) | _____v_____ | | | utils/ | | bff- | | browser | Visualization | (beta) | |_________| -----------------------------------------------|||--------------------------- beacon2-cbi-tools e.g. beacon2-ri-api beacon2-pi-api java-beacon-v2.api ... ## Roadmap **Latest Update: May-2025** This repository has been widely adopted in Beacon v2 implementations and is also used internally at CNAG. As a result, we plan to continue its development. Some of our upcoming plans include: - **Implement Beacon 2.x specification changes** - For VCF: Adopt VRS nomenclature and transition away from LegacyVariation. Support for structural variants may be added. - For other entities: Align with the latest schema used in the BFF Validator and the Excel metadata template. - Update the **CINECA Synthetic Cohort** dataset. # INSTALLATION You can install **beacon2-cbi-tools** using one of two methods: ### Containerized Installation (Recommended) Follow the guide [here](docker/README.md) to use Docker for a streamlined setup. ### Non-Containerized Installation See [here](non-containerized/README.md) for manual installation instructions. # CITATION The author requests that any published work that utilizes these tools includes a citation to the following reference: Rueda, M, Ariosa R. "Beacon v2 Reference Implementation: a toolkit to enable federated sharing of genomic and phenotypic data". _Bioinformatics_, btac568, https://doi.org/10.1093/bioinformatics/btac568 # AUTHOR Written by Manuel Rueda, PhD. Info about CNAG Biomedical Informatics can be found at [https://www.cnag.eu](https://www.cnag.eu) # COPYRIGHT and LICENSE The software in this repository is copyrighted. See the LICENSE file included in this distribution.

Owner

  • Name: CNAG Biomedical Informatics
  • Login: CNAG-Biomedical-Informatics
  • Kind: organization
  • Location: Spain

GitHub Events

Total
  • Watch event: 1
  • Delete event: 5
  • Push event: 82
  • Create event: 9
Last Year
  • Watch event: 1
  • Delete event: 5
  • Push event: 82
  • Create event: 9