https://github.com/biocypher/omnipath-secondary-adapter

BioCypher adapter to retrieve data from Omnipath database (Networks, Intercell, Complexes, EnzymePTM and Annotations).

https://github.com/biocypher/omnipath-secondary-adapter

Science Score: 36.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
  • Academic publication links
  • Committers with academic emails
    1 of 2 committers (50.0%) from academic institutions
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (10.1%) to scientific vocabulary

Keywords

biocypher biological-networks knowledge-graph neo4j ontoweaver
Last synced: 6 months ago · JSON representation

Repository

BioCypher adapter to retrieve data from Omnipath database (Networks, Intercell, Complexes, EnzymePTM and Annotations).

Basic Info
  • Host: GitHub
  • Owner: biocypher
  • License: mit
  • Language: Jupyter Notebook
  • Default Branch: main
  • Homepage:
  • Size: 1.66 MB
Statistics
  • Stars: 0
  • Watchers: 1
  • Forks: 3
  • Open Issues: 6
  • Releases: 0
Topics
biocypher biological-networks knowledge-graph neo4j ontoweaver
Created over 1 year ago · Last pushed 8 months ago
Metadata Files
Readme License

README.md

Omnipath Secondary Adapter

About OmniPath

Omnipath is a database of molecular biology prior knowledge developed in Saez Lab and Korcsmaros Lab. It combines data from more than 100 resources and contains:

  • protein-protein and gene regulatory interactions
  • Enzyme and Post-Translational-Modifications(PTM) relationships
  • Protein complexes
  • Protein annotations
  • intercellular communication.

Omnipath database stores the information in five different tables containing the aforementioned data. - Networks - Enzyme-PTM - Complexes - Annotations - Intercell

> [!WARNING]

> This BioCypher adapter relies on the Ontoweaver library that works with Python 3.12. Ensure the #Python version is 3.12 in your virtual environment.

About this adapter

The main goal of this adapter is to enable users to retrieve data from the Omnipath database and use that information to generate a knowledge graph. For achieving this goal, we are going use BioCypher and OntoWeaver.

  • BioCypher:it is a framework for building biomedical knowledge graphs by integrating heterogeneous data into a structured, ontology-aligned graph representation.

  • OntoWeaver: it is a visual and declarative ontology-based tool for designing and managing data integration systems, enabling intuitive schema mapping and semantic consistency. OntoWeaver has been built built on top of Biocypher.

Prerequisites

  • Python 3
  • Poetry [recommended]: Python packaging and dependency manager.
  • git: version control manager

| Prerequisite | Version | Verify installation | How to install? | | --------------- | --------- | ------------------------ | --------------------------------------------------------------------- | | Python 3 | >=3.12 | $ python --version | link | | Poetry | 1.8 | $ poetry about | link | | git | >= 2.0 | $ git --version | link | | Neo4j server | 2025.04.0 | $ neo4j --version | link |

Usage

  1. Clone this repository and change to the directory: bash git clone https://github.com/biocypher/omnipath-secondary-adapter cd omnipath-secondary-adapter
  2. Install all the dependencies (pre-configured in the pyproject.toml file): bash poetry lock poetry install --no-root

  3. This current implementation generates Neo4j scripts for each table in the Omnipath database

| Omnipath table | Script parameter | Neo4j script generation | | ------------------ | :-------------------------------- | :-------------------------: | | Networks | -net or --networks | ✅ Done | | Enzyme-PTM | -enz or --enzyme-PTM | ✅ Done | | Complexes | -co or --complexes | ✅ Done | | Annotations | -an or --annotations | ✅ Done | | Intercell | -inter or --intercell | ✅ Done |

Use the following commands to generate import scripts for each Omnipath table:

Networks

bash poetry run python weave_knowledge_graph.py -net download

Enzyme-PTM

bash poetry run python weave_knowledge_graph.py -enz download

Complexess

bash poetry run python weave_knowledge_graph.py -co download

Annotations

bash poetry run python weave_knowledge_graph.py -an download

Intercell

bash poetry run python weave_knowledge_graph.py -inter download

4. Once the script has processed the data, you can verify a folder has been generated under biocypher-out. The folder is named after the created date (e.g., 20250610172616). This folder contains the following:

  • neo4j-admin-import-call.sh: script to import the the data in Neo4j.
  • CSV files:
    • *-header.csv: indicate the node/edge properties fields
    • *-part000.csv, *-part001.csv,...: store nodes/edges data based on the structure indicated in the *-header.csv files.
  1. Stop the running instances of Neo4j by running: bash sudo neo4j stop

  2. Run the import script generated by step 4. bash sudo bash biocypher-out/<datetime>/neo4j-admin-import-call.sh

  3. Start again Neo4j by running: bash sudo neo4j start

  4. Open the Neo4j platform on http://localhost:7474. You will be ask for credentials. By default the Neo4j credentials are:

  • Database User: neo4j
  • Password: neo4j

Note: change the password as requested.

  1. At the end, you have you Knowledge Graph! 🎉 Congratulations!

Owner

  • Name: biocypher
  • Login: biocypher
  • Kind: organization

GitHub Events

Total
  • Issues event: 37
  • Delete event: 2
  • Issue comment event: 14
  • Push event: 27
  • Pull request review comment event: 5
  • Pull request review event: 17
  • Pull request event: 36
  • Fork event: 4
  • Create event: 1
Last Year
  • Issues event: 37
  • Delete event: 2
  • Issue comment event: 14
  • Push event: 27
  • Pull request review comment event: 5
  • Pull request review event: 17
  • Pull request event: 36
  • Fork event: 4
  • Create event: 1

Committers

Last synced: 9 months ago

All Time
  • Total Commits: 82
  • Total Committers: 2
  • Avg Commits per committer: 41.0
  • Development Distribution Score (DDS): 0.268
Past Year
  • Commits: 82
  • Committers: 2
  • Avg Commits per committer: 41.0
  • Development Distribution Score (DDS): 0.268
Top Committers
Name Email Commits
Edwin Carreño e****o@g****m 60
Matthieu NAJM m****m@p****r 22
Committer Domains (Top 20 + Academic)

Issues and Pull Requests

Last synced: 6 months ago

All Time
  • Total issues: 23
  • Total pull requests: 41
  • Average time to close issues: about 1 month
  • Average time to close pull requests: about 21 hours
  • Total issue authors: 2
  • Total pull request authors: 2
  • Average comments per issue: 0.17
  • Average comments per pull request: 0.39
  • Merged pull requests: 36
  • Bot issues: 0
  • Bot pull requests: 0
Past Year
  • Issues: 23
  • Pull requests: 41
  • Average time to close issues: about 1 month
  • Average time to close pull requests: about 21 hours
  • Issue authors: 2
  • Pull request authors: 2
  • Average comments per issue: 0.17
  • Average comments per pull request: 0.39
  • Merged pull requests: 36
  • Bot issues: 0
  • Bot pull requests: 0
Top Authors
Issue Authors
  • ecarrenolozano (20)
  • njmmatthieu (3)
Pull Request Authors
  • ecarrenolozano (24)
  • njmmatthieu (17)
Top Labels
Issue Labels
enhancement (5) dev: use case (4) documentation (3) dev: prerequisite (1)
Pull Request Labels
enhancement (7) bug (2) dev: use case (2)

Dependencies

docker-compose.yml docker
  • biocypher/base 1.2.0
  • neo4j 4.4-enterprise
poetry.lock pypi
  • appdirs 1.4.4
  • biocypher 0.5.43
  • certifi 2024.7.4
  • charset-normalizer 3.3.2
  • colorama 0.4.6
  • colorlog 6.8.2
  • idna 3.7
  • isodate 0.6.1
  • more-itertools 10.3.0
  • neo4j 4.4.12
  • neo4j-utils 0.0.7
  • networkx 3.3
  • numpy 2.0.1
  • packaging 24.1
  • pandas 2.2.2
  • platformdirs 4.2.2
  • pooch 1.8.2
  • pyparsing 3.1.2
  • python-dateutil 2.9.0.post0
  • pytz 2024.1
  • pyyaml 6.0.1
  • rdflib 6.3.2
  • requests 2.32.3
  • six 1.16.0
  • stringcase 1.2.0
  • toml 0.10.2
  • tqdm 4.66.4
  • treelib 1.6.4
  • tzdata 2024.1
  • urllib3 2.2.2
pyproject.toml pypi
  • biocypher ^0.5.43
  • python ^3.10
.github/actions/initial_setup/action.yaml actions
  • actions/cache v3 composite
  • actions/setup-python v5 composite
  • snok/install-poetry v1 composite
.github/actions/install/action.yaml actions
  • actions/cache v3 composite
.github/actions/test/action.yaml actions
.github/workflows/test_and_code_quality.yaml actions
  • ./.github/actions/initial_setup * composite
  • ./.github/actions/install * composite
  • ./.github/actions/test * composite
  • actions/checkout v4 composite