https://github.com/astrazeneca/onto_merger

OntoMerger is an ontology alignment library for deduplicating knowledge graph nodes that represent the same domain.

Science Score: 23.0%

This score indicates how likely this project is to be science-related based on various indicators:

○
CITATION.cff file
○
codemeta.json file
○
.zenodo.json file
✓
DOI references
Found 1 DOI reference(s) in README
✓
Academic publication links
Links to: arxiv.org
○
Committers with academic emails
○
Institutional organization owner
○
JOSS paper metadata
○
Scientific vocabulary similarity
Low similarity (15.1%) to scientific vocabulary

Keywords

algorithm alignment biological-networks biology graph kg knowledge knowledge-graph mapping ontology ontology-alignment

Keywords from Contributors

rank coverage metric mrr personalization precision recall recommender recommender-system recsys

Last synced: 5 months ago · JSON representation

Repository

OntoMerger is an ontology alignment library for deduplicating knowledge graph nodes that represent the same domain.

Basic Info

Host: GitHub
Owner: AstraZeneca
License: apache-2.0
Language: HTML
Default Branch: main
Homepage:
Size: 32.8 MB

Statistics

Stars: 98
Watchers: 6
Forks: 6
Open Issues: 5
Releases: 0

Topics

algorithm alignment biological-networks biology graph kg knowledge knowledge-graph mapping ontology ontology-alignment

Created almost 4 years ago · Last pushed about 2 years ago

Metadata Files

Readme License

README.md

Paper | Documentation | External Resources

OntoMerger is an ontology alignment library for deduplicating knowledge graph nodes,(i.e. ontology concepts) that represent the same domain, e.g. diseases, and connecting them to form a single directed acyclic hierarchical graph (DAG) (i.e. an ontology class hierarchy). The library implements a pipeline that takes nodes, mappings and (disconnected) hierarchies as input and produces node merges and a connected hierarchy. It also provides analysis and data testing for fine tuning the inputs in order to further reduce duplication, as well as to increase connectivity.

Citing

If you find OntoMerger useful in your work or research, please consider adding the following citation:

```bibtex @misc{ontomerger, doi = {10.48550/ARXIV.2206.02238}, author = {Geleta, David and Nikolov, Andriy and ODonoghue, Mark and Rozemberczki, Benedek and Gogleva, Anna and Tamma, Valentina and Payne, Terry R.}, title = {OntoMerger: An Ontology Integration Library for Deduplicating and Connecting Knowledge Graph Nodes}, publisher = {arXiv}, year = {2022}, }

```

Getting Started

```python

from onto_merger.pipeline import Pipeline

initialise the pipeline

pipeline = Pipeline(projectfolderpath="../path/to/project")

run the process

pipeline.runalignmentandconnectionprocess()

view results in "../path/to/project/output/report/index.html"

```

Running tests

``` $ tox -e py

```

License

Apache 2.0 License

Credit

The Onto Merger logo is based on:

Owner

Name: AstraZeneca
Login: AstraZeneca
Kind: organization
Location: Global

Website: https://www.astrazeneca.com/
Repositories: 33
Profile: https://github.com/AstraZeneca

Data and AI: Unlocking new science insights

GitHub Events

Total

Watch event: 7

Last Year

Watch event: 7

Committers

Last synced: 8 months ago

All Time

Total Commits: 214
Total Committers: 4
Avg Commits per committer: 53.5
Development Distribution Score (DDS): 0.206

Past Year

Commits: 0
Committers: 0
Avg Commits per committer: 0.0
Development Distribution Score (DDS): 0.0

Top Committers

Name	Email	Commits
dgeleta	8****a	170
Rozemberczki	k**8@a**t	41
Ughetto, Michaël	m**o@a**m	2
Michaël Ughetto	m**o@g**m	1

Committer Domains (Top 20 + Academic)

astrazeneca.com: 1 astrazeneca.net: 1

Packages

Total packages: 1
Total downloads:
- pypi 6 last-month

Total dependent packages: 0
Total dependent repositories: 0
Total versions: 1
Total maintainers: 1

pypi.org: onto-merger

OntoMerger is an ontology alignment library for deduplicating knowledge graph nodes,(i.e. ontology concepts) that represent the *same domain*, e.g. diseases, and **connecting** them to form a single directed acyclic hierarchical graph (DAG) (i.e. an ontology class hierarchy).

Homepage: https://github.com/AstraZeneca/onto_merger
Documentation: https://onto-merger.readthedocs.io/
License: Apache License, Version 2.0
Latest release: 0.1.0
published over 3 years ago

Versions: 1
Dependent Packages: 0
Dependent Repositories: 0
Downloads: 6 Last month

Rankings

Dependent packages count: 6.6%

Stargazers count: 7.8%

Forks count: 15.7%

Average: 23.4%

Dependent repos count: 30.6%

Downloads: 56.1%

Maintainers (1)

davidgeleta

Last synced: 6 months ago

Dependencies

docs/requirements_1.txt pypi

Cython *
Jinja2 ==3.0.0
jupyter-sphinx *
markupsafe ==2.0.1
mock *
nbconvert *
nbsphinx *
nbsphinx_link *
numpy *
pandas *
six *
sphinx *
sphinx-autodoc-typehints *
sphinx-automodapi *
sphinx_rtd_theme *
sphinxcontrib-pseudocode *
tqdm *

requirements.txt pypi

dataclasses-json ==0.5.7
docopt ==0.6.2
great_expectations ==0.15.2
jinja2 ==3.0.3
jsonschema ==4.4.0
kaleido ==0.2.1
networkit ==9.1.1
networkx *
numpy ==1.21.5
pandas ==1.3.5
pandas-profiling ==3.1.0
plotly-express ==0.4.1
pytest *
pytest-cov *
pytest-runner *
ruamel.yaml ==0.17.17
tqdm ==4.64.0

setup.py pypi

dataclasses-json ==0.5.7
docopt ==0.6.2
great_expectations ==0.15.2
jinja2 ==3.0.3
jsonschema ==4.4.0
kaleido ==0.2.1
networkx *
numpy ==1.21.5
pandas ==1.3.5
pandas-profiling ==3.1.0
plotly-express ==0.4.1
ruamel.yaml ==0.17.17
tqdm ==4.64.0

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Open Source Science

https://github.com/astrazeneca/onto_merger

Science Score: 23.0%

Keywords

Keywords from Contributors

Repository

Basic Info

Statistics

Topics

Metadata Files

README.md

```

initialise the pipeline

run the process

view results in "../path/to/project/output/report/index.html"

```

Owner

GitHub Events

Total

Last Year

Committers

All Time

Past Year

Top Committers

Committer Domains (Top 20 + Academic)

Packages

pypi.org: onto-merger

Rankings

Maintainers (1)

Dependencies