https://github.com/aida-ugent/topoembedding
Topologically Regularized Data Embeddings
Science Score: 23.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
○CITATION.cff file
-
✓codemeta.json file
Found codemeta.json file -
○.zenodo.json file
-
○DOI references
-
✓Academic publication links
Links to: zenodo.org -
○Academic email domains
-
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (13.3%) to scientific vocabulary
Repository
Topologically Regularized Data Embeddings
Basic Info
- Host: GitHub
- Owner: aida-ugent
- License: gpl-3.0
- Language: HTML
- Default Branch: main
- Size: 14.1 MB
Statistics
- Stars: 7
- Watchers: 5
- Forks: 0
- Open Issues: 0
- Releases: 0
Metadata Files
readme.md

Topologically Regularized Data Embeddings
The code in this repository is accompanying the manuscript "Topologically Regularized Data Embeddings".
Installation
Python
We provide a conda_env.yml file listing the required packages. You can install by creating a new conda environment
bash
conda env create -f topembedding/conda_env.yml -p ./topo_env
conda activate topo_env/
(1) Install TopologyLayer
bash
pip install git+https://github.com/bruel-gabrielsson/TopologyLayer.git
(2) Install Aleph by following the instructions on their GitHub.
``` git clone https://github.com/Pseudomanifold/Aleph.git cd Aleph && mkdir build && cd build && cmake ../ && make && make test
cd bindings/python/aleph python setup.py install ```
(3) Optional: Install DioDe to exerun the pseudotime analysis in the CellCyle notebook.
R
To execute the R scripts in the Scripts folder you need:
* TDA
* ggplot2
* latex2exp
* gridExtra
Repository Content
Code
Try out the example config files in /Code/config by providing one of them to main.py.
bash
python main.py Code/config/synthetic_random_optimize.yaml
Note that 'cell_cycle.yaml' takes about 5 Minutes to run. Upon completion the final embeddings will be shown.
Data
- Synthetic data are generated using Data/datasets.py
- Cell trajectory (source): included as Data/CellCycle.rds
- Cell bifurcation (source): included as Data/CellBifurcation.rds
- Karate[1]: partially (graph) loaded from networkx and partially (weights) from Data/Karate.txt
- Harry Potter (source): included in Data/HarryPotter
Experiments
In this folder you find one notebook for every dataset to reproduce the results in the experiments section. You can open the .hml version to see the code with output.
Scripts
Folder "Scripts": contains two Jupyter notebooks and two R scripts to reproduce the visualizations of Section (2) in the manuscript.
Acknowledgements
The content of this repository is an extension of the code in topembedding developed by Robin Vandaele.
References
[1]: W.W. Zachary. An information flow model for conflict and fission in small groups. Journal of Anthropological Research, 33:452–473, 1977.
Owner
- Name: Ghent University Artificial Intelligence & Data Analytics Group
- Login: aida-ugent
- Kind: organization
- Email: tijl.debie@ugent.be
- Location: Ghent
- Website: aida.ugent.be
- Repositories: 36
- Profile: https://github.com/aida-ugent
GitHub Events
Total
Last Year
Issues and Pull Requests
Last synced: over 1 year ago
All Time
- Total issues: 0
- Total pull requests: 0
- Average time to close issues: N/A
- Average time to close pull requests: N/A
- Total issue authors: 0
- Total pull request authors: 0
- Average comments per issue: 0
- Average comments per pull request: 0
- Merged pull requests: 0
- Bot issues: 0
- Bot pull requests: 0
Past Year
- Issues: 0
- Pull requests: 0
- Average time to close issues: N/A
- Average time to close pull requests: N/A
- Issue authors: 0
- Pull request authors: 0
- Average comments per issue: 0
- Average comments per pull request: 0
- Merged pull requests: 0
- Bot issues: 0
- Bot pull requests: 0