quantum_labelling_tool

Data quality, maturity and utility labelling tool for the EHDS (HealthData@EU)

https://github.com/quantum-label/quantum_labelling_tool

Science Score: 67.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
    Found CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
    Found 3 DOI reference(s) in README
  • Academic publication links
    Links to: zenodo.org
  • Academic email domains
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (13.8%) to scientific vocabulary

Keywords

data-quality-assessment data-quality-framework data-quality-measurement data-quality-report dataset tools
Last synced: 6 months ago

Repository

Basic Info
  • Host: GitHub
  • Owner: quantum-label
  • License: other
  • Language: Python
  • Default Branch: main
  • Homepage: https://quantumproject.eu/
  • Size: 1.86 MB
Statistics
  • Stars: 6
  • Watchers: 6
  • Forks: 2
  • Open Issues: 9
  • Releases: 3
Topics
data-quality-assessment data-quality-framework data-quality-measurement data-quality-report dataset tools
Created over 1 year ago · Last pushed 8 months ago
Metadata Files
Readme Citation Zenodo

README.md

QUANTUM - The Health Data Quality Label

Cite the code: DOI

Introduction

The QUANTUM Data Quality Labelling Tool is a key component of the European Health Data Space (EHDS) initiative, designed to address the challenge of ensuring that health datasets are of high quality, accessible, and interoperable across EU member states. As healthcare data plays an increasingly vital role in research, policy, and innovation, this tool provides a standardized mechanism to evaluate and label the quality, utility, and maturity of datasets, supporting stakeholders such as healthcare institutions, research organizations, and policymakers.

Features

This tool empowers data holders by offering functionalities to:

  • Feature 1: Create catalogue and dataset metadata.
  • Feature 2: Assess dataset quality through a guided evaluation of the QUANTUM key dimensions (such as accuracy, accessibility, and compliance) using your own practices.
  • Feature 3: Generate and visualise data quality certificates (labels) and download them in the HealthData@EU standardized RDF format.
  • Feature 4: Facilitate compliance with EU-wide quality standards such as HealthDCAT-AP, DCAT, and DQV.
  • Feature 5: Provide a transparent and accessible method for assessing data holders' maturity.

Ultimately, the QUANTUM tool aims to foster trust in health data across Europe by promoting reliable, reusable datasets that can be confidently shared within the European Health Data Space.
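To make the RDF output mentioned in Feature 3 more concrete, the following is a minimal sketch of what a single quality measurement could look like when expressed with the W3C DQV and DCAT vocabularies. It is not the tool's actual export code: the function name `quality_measurement_turtle`, the example IRIs, and the choice of Turtle as serialization are assumptions made for illustration only.

```python
# Illustrative sketch only; the real export format of the tool may differ.
DQV = "http://www.w3.org/ns/dqv#"
DCAT = "http://www.w3.org/ns/dcat#"
XSD = "http://www.w3.org/2001/XMLSchema#"


def quality_measurement_turtle(dataset_iri, metric_iri, value):
    """Render one dqv:QualityMeasurement about a dataset as Turtle text.

    dataset_iri and metric_iri are hypothetical identifiers supplied by the caller.
    """
    return "\n".join([
        f"@prefix dqv: <{DQV}> .",
        f"@prefix dcat: <{DCAT}> .",
        f"@prefix xsd: <{XSD}> .",
        "",
        f"<{dataset_iri}> a dcat:Dataset .",
        "",
        # A blank node carries the measurement and points back at the dataset.
        "[] a dqv:QualityMeasurement ;",
        f"    dqv:computedOn <{dataset_iri}> ;",
        f"    dqv:isMeasurementOf <{metric_iri}> ;",
        f'    dqv:value "{value}"^^xsd:decimal .',
    ])


if __name__ == "__main__":
    print(quality_measurement_turtle(
        "https://example.org/dataset/1",           # hypothetical dataset IRI
        "https://example.org/metric/completeness",  # hypothetical metric IRI
        0.92,
    ))
```

Building the text by hand keeps the sketch free of third-party dependencies; a real implementation would more likely use an RDF library.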

System Requirements

Minimum Hardware Requirements

  • CPU: 1 core
  • RAM: 1 GB
  • Storage: 5 GB

Recommended Hardware Requirements

  • CPU: 2 to 4 cores
  • RAM: 2 to 4 GB
  • Storage: 10 to 20 GB

Software Requirements

  • Ubuntu 20.04.6 LTS or similar
  • Python 3.11
  • Docker 24.0.2 (if used)
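The requirements above can be checked before installing. The sketch below is one possible pre-flight check using only the standard library; the function names are hypothetical and not part of the repository. It assumes that Python 3.11 is required exactly as listed, and it skips the RAM check because the standard library has no portable way to read total memory.

```python
import os
import shutil
import sys

# Thresholds taken from the minimum requirements listed above.
MIN_CPU_CORES = 1
MIN_STORAGE_GB = 5
EXPECTED_PYTHON = (3, 11)


def check_requirements(python_version, cpu_cores, free_storage_gb):
    """Return a list of human-readable problems (empty when all checks pass)."""
    problems = []
    if tuple(python_version[:2]) != EXPECTED_PYTHON:
        problems.append(
            f"Python {python_version[0]}.{python_version[1]} found, 3.11 expected"
        )
    if cpu_cores < MIN_CPU_CORES:
        problems.append(f"{cpu_cores} CPU core(s) found, at least {MIN_CPU_CORES} needed")
    if free_storage_gb < MIN_STORAGE_GB:
        problems.append(
            f"{free_storage_gb:.1f} GB free storage, at least {MIN_STORAGE_GB} GB needed"
        )
    return problems


def check_this_machine(path="."):
    """Run the checks against the current interpreter and the disk holding `path`."""
    free_gb = shutil.disk_usage(path).free / 1024 ** 3
    return check_requirements(sys.version_info, os.cpu_count() or 0, free_gb)


if __name__ == "__main__":
    for problem in check_this_machine():
        print("WARNING:", problem)
```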

Installation steps

```bash
git clone [repository_url]
cd [repository_directory]
pip install -r requirements.txt
python manage.py migrate          # if the database has not been generated previously
python manage.py createsuperuser  # optional: create a root user (the current root user has "root" as both username and password)
```
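For unattended setups, the same installation steps could be driven from Python. This is only a sketch: `install_steps` and `run_steps` are hypothetical helpers, the clone target directory is passed explicitly to `git clone` (a small deviation from the manual steps), and `createsuperuser` is left optional because it prompts interactively.

```python
import subprocess
import sys


def install_steps(repository_url, repository_directory, create_superuser=False):
    """Return (argv, working_directory) pairs mirroring the installation steps."""
    python = sys.executable  # use the interpreter that runs this script
    steps = [
        (["git", "clone", repository_url, repository_directory], None),
        ([python, "-m", "pip", "install", "-r", "requirements.txt"], repository_directory),
        ([python, "manage.py", "migrate"], repository_directory),
    ]
    if create_superuser:
        steps.append(([python, "manage.py", "createsuperuser"], repository_directory))
    return steps


def run_steps(steps):
    """Execute each step in order and stop at the first failure."""
    for argv, cwd in steps:
        subprocess.run(argv, cwd=cwd, check=True)


if __name__ == "__main__":
    # Placeholders: replace with the real repository URL and target directory.
    for argv, cwd in install_steps("[repository_url]", "[repository_directory]"):
        print(cwd or ".", "$", " ".join(argv))
```

Running the script as shown only prints the planned commands; calling `run_steps` on the result would actually execute them.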

Execution

```bash
python manage.py runserver 0.0.0.0:8000
```

  • Open localhost:8000 in a web browser.
  • Open localhost:8000/admin to reach the Admin dashboard (by default, the login is root for both username and password).
  • To register a user: 1) create the user, 2) create an organization, 3) link the user to the organization (userorganization).
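Before opening the browser, it can be handy to confirm that the development server is answering. The following sketch uses only the standard library; `is_up` is a hypothetical helper, and it treats any HTTP response (including error statuses) as proof that the server is running.

```python
import urllib.error
import urllib.request


def is_up(url="http://localhost:8000/", timeout=3.0):
    """Return True if something answers HTTP requests at `url`."""
    # An empty ProxyHandler avoids routing local requests through a system proxy.
    opener = urllib.request.build_opener(urllib.request.ProxyHandler({}))
    try:
        with opener.open(url, timeout=timeout):
            return True
    except urllib.error.HTTPError:
        # The server replied with an error status, so it is reachable.
        return True
    except OSError:
        # Connection refused, timeout, DNS failure, and similar.
        return False


if __name__ == "__main__":
    print("web app reachable:", is_up())
    print("admin reachable:", is_up("http://localhost:8000/admin"))
```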

Dockerizing

  • Download the repository. When unzipped, the folder will be named "quantumlabellingtool"
  • Create a folder called "QUANTUM" in the location of your choice.
  • Inside "QUANTUM", create the folder "onlinequantumtool", which will contain the Django web app.
  • From "quantumlabellingtool" copy the following files and folders to the "QUANTUM/onlinequantumtool" folder:
    • code
    • quantum
    • static
    • staticfiles
    • templates
    • webapp
    • manage.py
  • Copy the content of "quantumlabellingtool/docker" inside "QUANTUM" (keep in mind that it contains an .env file, which may be hidden)
  • If the web app is to be served under a sub-path (e.g. myurl.com/tool-subpath/), set "SUBPATHNAME" in "quantum/settings.py" to the desired sub-path (e.g. /tool-subpath).
  • In the "QUANTUM" folder, build the image: docker build -t quantum_online_tool .
  • Once the image is built, start the services from the "QUANTUM" folder: docker-compose up
  • Open a shell in the web app container: docker exec -it quantumtoolwebapp bash
  • Fill the database with the initial information: python manage.py migrate
  • Create a superuser (root) for testing: python manage.py createsuperuser
  • Exit the container: exit
  • Open a shell in the database container: docker exec -it quantumtooldatabase bash
  • Add the initial data to the database by running the following commands in order:
    • mysql -u root -p
    • use quantum;
    • source /docker-entrypoint-initdb.d/init.sql;

  • See "Usage" sections
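The manual copy steps at the start of this section could also be scripted. The sketch below is one way to do it with the standard library; `prepare_docker_layout` is a hypothetical helper, not something shipped with the repository, and it assumes the folder and file names exactly as listed above.

```python
import shutil
from pathlib import Path

# Files and folders that the steps above copy into "QUANTUM/onlinequantumtool".
APP_ITEMS = ["code", "quantum", "static", "staticfiles", "templates", "webapp", "manage.py"]


def prepare_docker_layout(source_dir, target_root):
    """Copy the web app and the docker files into the layout described above.

    source_dir is the unzipped repository, target_root is the "QUANTUM" folder.
    Raises FileNotFoundError if an expected item is missing from source_dir.
    """
    source = Path(source_dir)
    root = Path(target_root)
    app_dir = root / "onlinequantumtool"
    app_dir.mkdir(parents=True, exist_ok=True)

    for name in APP_ITEMS:
        item = source / name
        if item.is_dir():
            shutil.copytree(item, app_dir / name, dirs_exist_ok=True)
        else:
            shutil.copy2(item, app_dir / name)

    # copytree also copies hidden files, so the .env file is included.
    shutil.copytree(source / "docker", root, dirs_exist_ok=True)
    return app_dir


if __name__ == "__main__":
    print(prepare_docker_layout("quantumlabellingtool", "QUANTUM"))
```

The build and docker-compose commands would then be run from the "QUANTUM" folder as described in the list.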

Technology Stack

Backend

  • Python 3
  • Django

Frontend

  • HTML
  • CSS*
  • JavaScript*

*Bootstrap 5.3 is used.

Database

Development

  • SQLite3

Production (Docker)

  • MariaDB
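In a Django project, a development/production split like this is usually expressed in the DATABASES setting. The sketch below shows one way to select the backend from environment variables. It is not taken from the repository's settings.py: the variable names (QUANTUM_ENV, DB_NAME, and so on) and the defaults are assumptions, apart from the database name "quantum" used in the Docker steps. Django connects to MariaDB through its MySQL backend.

```python
import os


def database_settings(env):
    """Build a Django-style DATABASES dict from a mapping of environment variables."""
    if env.get("QUANTUM_ENV", "development") == "production":
        return {
            "default": {
                "ENGINE": "django.db.backends.mysql",  # backend also used for MariaDB
                "NAME": env.get("DB_NAME", "quantum"),
                "USER": env.get("DB_USER", "root"),
                "PASSWORD": env.get("DB_PASSWORD", ""),
                "HOST": env.get("DB_HOST", "localhost"),
                "PORT": env.get("DB_PORT", "3306"),
            }
        }
    # Development: a local SQLite file next to manage.py.
    return {
        "default": {
            "ENGINE": "django.db.backends.sqlite3",
            "NAME": "db.sqlite3",
        }
    }


if __name__ == "__main__":
    # In a settings module this would read: DATABASES = database_settings(os.environ)
    print(database_settings(os.environ)["default"]["ENGINE"])
```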

Licence

This work is licensed under the Creative Commons Attribution-NonCommercial 4.0 International License. To view a copy of this license, visit http://creativecommons.org/licenses/by-nc/4.0/.

CC BY-NC 4.0

Credits

Version: 0.1

Authors: QUANTUM WP2: Claudio Proietti Mercuri (Sciensano), Ángel Sánchez-García (UPV), Nienke Schutte (Sciensano), Carlos Sáez (UPV); on behalf of QUANTUM.

Maintainers: Claudio Proietti Mercuri (Sciensano), Ángel Sánchez-García (UPV), Francisco Estupiñan Romero (IACS).

Acknowledgements: QUANTUM WP1, WP2, WP3 partners and all participants in the tool piloting for their feedback.

Funded by the EU QUANTUM project.

Owner

  • Name: QUANTUM project
  • Login: quantum-label
  • Kind: user
  • Location: Spain

The health data quality label

Citation (CITATION.cff)

cff-version: 1.2.0
message: "If you use this software, please cite it as below:"
authors:
- family-names: "Proietti Mercuri"
  given-names: "Claudio"
  orcid: "https://orcid.org/0009-0000-4373-0095"
- family-names: "Sánchez-García"
  given-names: "Ángel"
  orcid: "https://orcid.org/0000-0003-1843-9209"
- family-names: "Schutte"
  given-names: "Nienke"
  orcid: "https://orcid.org/0000-0002-8064-2569"
- family-names: "Sáez"
  given-names: "Carlos"
  orcid: "https://orcid.org/0000-0003-2678-8249"
title: "QUANTUM - The Health Data Quality Label"
version: "0.2.0"
doi: "10.5281/zenodo.15168150"
date-released: "2025-04-04"
url: "https://github.com/quantum-label/quantum_labelling_tool/"
license: "CC-BY-NC-4.0"
license-url: "https://creativecommons.org/licenses/by-nc/4.0/legalcode.en"

GitHub Events

Total
  • Create event: 49
  • Issues event: 81
  • Release event: 4
  • Watch event: 7
  • Delete event: 26
  • Issue comment event: 18
  • Member event: 5
  • Push event: 103
  • Pull request review event: 49
  • Pull request event: 105
  • Fork event: 2
Last Year
  • Create event: 49
  • Issues event: 81
  • Release event: 4
  • Watch event: 7
  • Delete event: 26
  • Issue comment event: 18
  • Member event: 5
  • Push event: 103
  • Pull request review event: 49
  • Pull request event: 105
  • Fork event: 2

Issues and Pull Requests

Last synced: 6 months ago

All Time
  • Total issues: 47
  • Total pull requests: 48
  • Average time to close issues: 6 days
  • Average time to close pull requests: 2 days
  • Total issue authors: 2
  • Total pull request authors: 2
  • Average comments per issue: 0.21
  • Average comments per pull request: 0.08
  • Merged pull requests: 40
  • Bot issues: 0
  • Bot pull requests: 0
Past Year
  • Issues: 47
  • Pull requests: 48
  • Average time to close issues: 6 days
  • Average time to close pull requests: 2 days
  • Issue authors: 2
  • Pull request authors: 2
  • Average comments per issue: 0.21
  • Average comments per pull request: 0.08
  • Merged pull requests: 40
  • Bot issues: 0
  • Bot pull requests: 0
Top Authors
Issue Authors
  • surfer8137 (30)
  • Cloddino95 (17)
Pull Request Authors
  • surfer8137 (32)
  • Cloddino95 (19)
Top Labels
Issue Labels
enhancement (5) bug (2) documentation (1)
Pull Request Labels
enhancement (3)