https://github.com/bowang-lab/integrao

Integrate Any Omics: Towards genome-wide data integration for patient stratification

https://github.com/bowang-lab/integrao

Science Score: 46.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
  • Academic publication links
    Links to: arxiv.org, nature.com
  • Committers with academic emails
    1 of 3 committers (33.3%) from academic institutions
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (14.5%) to scientific vocabulary
Last synced: 7 months ago · JSON representation

Repository

Integrate Any Omics: Towards genome-wide data integration for patient stratification

Basic Info
Statistics
  • Stars: 56
  • Watchers: 4
  • Forks: 11
  • Open Issues: 1
  • Releases: 2
Created over 2 years ago · Last pushed 11 months ago
Metadata Files
Readme License

README.md

IntegrAO: Integrate Any Omics

This is the official codebase for Integrate Any Omics: Towards genome-wide data integration for patient stratification.

Preprint   Documentation   PyPI version   License

Updates:

[2025.03.02] 🔥🔥🔥 We added the functionalities of extracting feature importance for the unsupervised and supervised IntegrAO models! Feel free to check it out here: Unsupervised integration feature importance and Supervised integration feature importance. Welcome for suggestions!

[2025.01.23] 🥳 IntegrAO is published on Nature Machine Intelligence!

[2024.01.15] 🥳 IntegrAO Preprint available!

🔨 Hardware requirements

IntegrAO package requires only a standard computer with enough RAM to support the in-memory operations.

🔨 Installation

IntegrAO works with Python >= 3.7. Please make sure you have the correct version of Python pre-installation.

  1. Create a virtual environment: conda create -n integrAO python=3.10 -y and conda activate integrAO
  2. Install Pytorch 2.1.0
  3. IntegrAO is available on PyPI. To install IntegrAO, run the following command: pip install integrao

For developing, clone this repo with following commands:

bash $ git clone this-repo-url $ cd IntegrAO $ pip install -r requirement.txt

🧬 Introduction

High-throughput omics profiling advancements have greatly enhanced cancer patient stratification. However, incomplete data in multi-omics integration presents a significant challenge, as traditional methods like sample exclusion or imputation often compromise biological diversity and dependencies. Furthermore, the critical task of accurately classifying new patients with partial omics data into existing subtypes is commonly overlooked. We introduce IntegrAO, an unsupervised framework integrating incomplete multi-omics and classifying new biological samples. IntegrAO first combines partially overlapping patient graphs from diverse omics sources and utilizes graph neural networks to produce unified patient embeddings.

An overview of IntegrAO can be seen below.

integrAO

📖 Tutorial

We offer the following tutorials for demonstration:

Citing IntegrAO

bash @article{ma2025moving, title={Moving towards genome-wide data integration for patient stratification with Integrate Any Omics}, author={Ma, Shihao and Zeng, Andy GX and Haibe-Kains, Benjamin and Goldenberg, Anna and Dick, John E and Wang, Bo}, journal={Nature Machine Intelligence}, volume={7}, number={1}, pages={29--42}, year={2025}, publisher={Nature Publishing Group} } }

GitHub Events

Total
  • Create event: 4
  • Release event: 1
  • Issues event: 13
  • Watch event: 47
  • Delete event: 1
  • Issue comment event: 19
  • Member event: 1
  • Push event: 25
  • Pull request event: 14
  • Fork event: 11
Last Year
  • Create event: 4
  • Release event: 1
  • Issues event: 13
  • Watch event: 47
  • Delete event: 1
  • Issue comment event: 19
  • Member event: 1
  • Push event: 25
  • Pull request event: 14
  • Fork event: 11

Committers

Last synced: 11 months ago

All Time
  • Total Commits: 55
  • Total Committers: 3
  • Avg Commits per committer: 18.333
  • Development Distribution Score (DDS): 0.073
Past Year
  • Commits: 26
  • Committers: 3
  • Avg Commits per committer: 8.667
  • Development Distribution Score (DDS): 0.154
Top Committers
Name Email Commits
Rex Ma r****2@g****m 51
npunw n****i@h****m 3
michaelhla m****a@c****u 1
Committer Domains (Top 20 + Academic)

Issues and Pull Requests

Last synced: 7 months ago

All Time
  • Total issues: 7
  • Total pull requests: 7
  • Average time to close issues: 12 days
  • Average time to close pull requests: less than a minute
  • Total issue authors: 5
  • Total pull request authors: 2
  • Average comments per issue: 3.29
  • Average comments per pull request: 0.0
  • Merged pull requests: 7
  • Bot issues: 0
  • Bot pull requests: 0
Past Year
  • Issues: 7
  • Pull requests: 7
  • Average time to close issues: 12 days
  • Average time to close pull requests: less than a minute
  • Issue authors: 5
  • Pull request authors: 2
  • Average comments per issue: 3.29
  • Average comments per pull request: 0.0
  • Merged pull requests: 7
  • Bot issues: 0
  • Bot pull requests: 0
Top Authors
Issue Authors
  • marioernestovaldes (3)
  • kjh-sdfmu (1)
  • Dadatata-JZ (1)
  • LudensZhang (1)
  • liux0614 (1)
Pull Request Authors
  • rexxxx1234 (7)
  • npunw (5)
Top Labels
Issue Labels
enhancement (1)
Pull Request Labels

Dependencies

pyproject.toml pypi
  • python ^3.9
requirement.txt pypi
  • matplotlib *
  • numpy *
  • pandas *
  • snfpy *
  • torch *
  • torchvision *