deep-block
Science Score: 49.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
○CITATION.cff file
-
✓codemeta.json file
Found codemeta.json file -
✓.zenodo.json file
Found .zenodo.json file -
✓DOI references
Found 1 DOI reference(s) in README -
✓Academic publication links
Links to: medrxiv.org -
○Academic email domains
-
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (8.2%) to scientific vocabulary
Repository
Basic Info
- Host: GitHub
- Owner: taehojo
- Language: Jupyter Notebook
- Default Branch: master
- Size: 10.8 MB
Statistics
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
- Releases: 0
Metadata Files
README.md
Deep-Block
Genetic Variant Analysis for Alzheimer's Disease using LD (Beta version for preliminary experiments)
Overview
Deep-Block is a software tool designed to analyze whole-genome sequencing (WGS) data for identifying genetic variants associated with Alzheimer's disease (AD). It integrates advanced AI-driven genomics techniques with transformer models to capture long-range dependencies in genomic data, providing insights into the genetic basis of AD.
Key Features
Transformer-Based Algorithms: Deep-Block utilizes transformer-based algorithms specifically optimized for analyzing WGS data. These algorithms employ a novel tokenization strategy based on Linkage Disequilibrium (LD) blocks to effectively capture the genetic architecture.
Identification of Genetic Variants: By considering the natural correlations between Single Nucleotide Polymorphisms (SNPs) within LD blocks, Deep-Block can accurately identify genetic variants linked to AD. The self-attention mechanism of the transformer model helps uncover relationships between distant genomic regions, potentially discovering novel genomic loci associated with AD.
Handling Missing Data: Deep-Block incorporates various machine learning-based imputation methods to address missing data in genomic research. These methods, including Simple Imputer, GAN Imputer, K-NN Imputers, Iterative Imputer, and MissForest Imputer, demonstrate stable and statistically robust performance, enhancing the accuracy of the analysis.
Results
Deep-Block successfully identified key LD blocks and SNPs associated with Alzheimer's disease (AD):
Figure: Results of Deep-Block
Results of Deep-Block in identifying key LD blocks and SNPs associated with AD are summarized as follows:
- Training Graph: The transformer's training graph illustrates the optimization process during model training.

- Importance of LD Blocks: LD blocks play a crucial role in capturing the genetic architecture of AD. This visualization highlights the significance of LD blocks in the analysis.

- Identification of Significant SNPs: Deep-Block identifies significant SNPs within LD blocks. Notably, rs429358 emerged as the most important SNP, confirming known results and revealing new insights.

Department of Radiology & Imaging Sciences
Indiana Alzheimer's Disease Research Center
Center for Neuroimaging
Indiana University School of Medicine
Owner
- Name: Taeho Jo
- Login: taehojo
- Kind: user
- Location: Indiana, USA
- Company: Indiana University School of Medicine
- Website: https://taehojo.github.io
- Repositories: 4
- Profile: https://github.com/taehojo
Computational Biologist, Ph.D
GitHub Events
Total
- Push event: 13
Last Year
- Push event: 13