glucosego
A machine-learning derived heatmap for predicting hypoglycemia risk during exercise for people with type 1 diabetes
Science Score: 44.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
✓CITATION.cff file
Found CITATION.cff file -
✓codemeta.json file
Found codemeta.json file -
✓.zenodo.json file
Found .zenodo.json file -
○DOI references
-
○Academic publication links
-
○Academic email domains
-
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (10.3%) to scientific vocabulary
Repository
A machine-learning derived heatmap for predicting hypoglycemia risk during exercise for people with type 1 diabetes
Basic Info
- Host: GitHub
- Owner: cafoala
- License: mit
- Language: Jupyter Notebook
- Default Branch: main
- Size: 4.69 MB
Statistics
- Stars: 1
- Watchers: 1
- Forks: 0
- Open Issues: 0
- Releases: 1
Metadata Files
README.md
glucoseGo
A machine-learning derived heatmap for predicting hypoglycemia risk during exercise for people with type 1 diabetes
Overview
This repository contains a series of Jupyter Notebooks organized into three main folders: Preprocessing, Model Building, and Explainability. The project focuses on predicting hypoglycemia risk during exercise in individuals with Type 1 Diabetes using machine learning techniques.
Folder Structure
FOLDER 1: Preprocessing
Preprocessing of EXTOD 101 for ML: Prepares the EXTOD 101 data, consisting of 35 individuals with 8 weeks of data, for machine learning analysis.
Preprocessing EXTOD Education Dataset: Processes data from the EXTOD education pilot study, involving 106 participants, for machine learning.
Preprocessing JAEB T1-DEXI Dataset: Similar to the EXTOD education dataset, this notebook prepares the JAEB T1-DEXI data for analysis.
Preprocessing JAEB T1-DEXIP Dataset: Preprocesses data from the JAEB T1-DEXIP study, aligning with the overall project's methodology.
Target Creation: Develops target variables for machine learning and statistical analysis, focusing on hypo- and hyper-glycemia during and after exercise.
Final Preparation for Machine Learning: Finalizes data for model training and validation, ensuring quality and proper formatting.
Analysis of Participant Characteristics in Exercise Studies: Provides a comprehensive analysis of participant characteristics from various exercise studies.
FOLDER 2: Model Building
Forward Feature Selection: Implements a model-based approach to incrementally add features that improve model performance, specifically using ROC Area Under Curve.
Run ML Models: Executes Logistic Regression and XGBoost models on both full and reduced sets of features, comparing performance and visualizing results.
Creating Figures: Generates, visualizes, and analyzes results from various machine learning models through informative plots.
Heatmap (Contour Plot) of Model: Visualizes the model's predicted probability changes across starting glucose levels and exercise duration in a heatmap format.
Performance Evaluation of XGBoost Models on a Hold-Out Dataset: Assesses and compares the predictive performance of two XGBoost models on a 10% hold-out dataset.
FOLDER 3: Explainability
SHAP Analysis: Focuses on explaining machine learning models using SHAP values to understand the importance and impact of different features.
Subgroup Analysis of Two-Featured Model: Conducts a detailed subgroup analysis of a two-featured XGBoost model, exploring its performance across various patient profiles.
Calibration Curves: Assesses and visualizes calibration curves for binary classification models to evaluate their reliability.
Learning Curves: Generates and visualizes learning curves to understand the relationship between training set size and model performance.
General Information
- Each notebook contains a detailed explanation of its objectives, methodology, and the results obtained.
- The notebooks are designed to provide a comprehensive understanding of the process of developing and evaluating machine learning models for predicting hypoglycemia risk in Type 1 Diabetes patients.
Data Privacy Note
Due to privacy concerns and the sensitive nature of the medical data used in this project, the datasets are not publicly available in this repository. However, we may be able to share some anonymized data upon request. Please contact the project maintainers for more information.
Contributing
Feel free to contribute to this project by suggesting improvements, reporting bugs, or submitting pull requests. Please read CONTRIBUTING.md for guidelines on how to contribute. License
This project is licensed under the MIT License - see the LICENSE.md file for details.
Contact
For any queries or further information, please contact the project maintainers.
Owner
- Login: cafoala
- Kind: user
- Repositories: 2
- Profile: https://github.com/cafoala
Citation (CITATION.cff)
cff-version: 1.2.0 message: "If you use this software, please cite it as below." authors: - family-names: "Russon" given-names: "Catherine" orcid: "https://orcid.org/0000-0001-6785-6477" - family-names: "Allen" given-names: "Michael" orcid: "https://orcid.org/0000-0002-8746-9957" title: "glucoseGo" version: 1.0.0 doi: 10.5281/zenodo.1234 date-released: 2023-10-12 url: "https://github.com/cafoala/glucoseGo"
GitHub Events
Total
- Watch event: 1
Last Year
- Watch event: 1