https://github.com/acg-team/single-char-indel-asr-preserves-long-indels
This repository includes the scripts used for analysis investigating the dynamics of indels in mammalian orthologous proteins and the examination of the ancestral reconstruction of multiple-character indels under the PIP.
https://github.com/acg-team/single-char-indel-asr-preserves-long-indels
Science Score: 26.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
○CITATION.cff file
-
✓codemeta.json file
Found codemeta.json file -
○.zenodo.json file
-
✓DOI references
Found 5 DOI reference(s) in README -
○Academic publication links
-
○Academic email domains
-
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (10.4%) to scientific vocabulary
Keywords
Repository
This repository includes the scripts used for analysis investigating the dynamics of indels in mammalian orthologous proteins and the examination of the ancestral reconstruction of multiple-character indels under the PIP.
Basic Info
Statistics
- Stars: 0
- Watchers: 4
- Forks: 0
- Open Issues: 1
- Releases: 0
Topics
Metadata Files
readme.md
The Python scripts used for the paper entitled:
Single-character insertion-deletion model preserves long indels in ancestral sequence reconstruction
Requirments
For this tutorial, you should already have Python 3.9 or higher, jupyter notebook along with the following libraries:
numpy, ete3, biopython, seaborn, matplotlib, sklearn and pandas
Installation
To install the package you can simply download the repository and run the following command in the root directory.
Install the dependencies using this command:
console
pip3 install -r requirements.txt
Files
- mammals_01_stat.ipynb contains functions for indel pattern plots for mammalian data.
- mammals_02_dynamic_of_gaps.ipynb includes functions for calculating dynamic of gap pattern for each mammalian data.
- mammals_03_indel_length.ipynb includes functions for ploting indel length for mammalian data.
- simulation_01_acc.ipynb contains functions for computing accuracy of ARPIP inference on simulated data.
- simulation_02_dynamic_of_gaps.ipynb contains functions for calculating dynamic of gap pattern for each simulated data.
- simulation_03_stat.ipynb contains functions for indel pattern plots for simulated data.
- simulation_04_discussion.ipynb contains scripts for the appendix figures.
- requirements.txt contains library versions of dependencies.
To get the figures in the manuscript all the necessary files and scripts are provided here. Moreover, suplemental data is stored in another repository with this link.
Citation
Please cite:
Gholamhossein Jowkar, Julija Pecerska, Manuel Gil, and Maria Anisimova
Single-character insertion-deletion model preserves long indels in ancestral sequence reconstruction.
BioRxiv, 2024;
doi:10.1101/2024.03.09.584071
Author
Gholam-Hossein Jowkar E-mail
Owner
- Name: Applied Computational Genomics Team
- Login: acg-team
- Kind: organization
- Location: Wädenswil, Switzerland
- Website: https://www.zhaw.ch/de/lsfm/institute-zentren/ias/forschung/computational-genomics/
- Repositories: 29
- Profile: https://github.com/acg-team
Computational Genomics tools from Maria Anisimova and collaborators
GitHub Events
Total
Last Year
Issues and Pull Requests
Last synced: 9 months ago
All Time
- Total issues: 0
- Total pull requests: 2
- Average time to close issues: N/A
- Average time to close pull requests: N/A
- Total issue authors: 0
- Total pull request authors: 1
- Average comments per issue: 0
- Average comments per pull request: 0.0
- Merged pull requests: 0
- Bot issues: 0
- Bot pull requests: 0
Past Year
- Issues: 0
- Pull requests: 0
- Average time to close issues: N/A
- Average time to close pull requests: N/A
- Issue authors: 0
- Pull request authors: 0
- Average comments per issue: 0
- Average comments per pull request: 0
- Merged pull requests: 0
- Bot issues: 0
- Bot pull requests: 0
Top Authors
Issue Authors
Pull Request Authors
- junniest (2)
Top Labels
Issue Labels
Pull Request Labels
Dependencies
- biopython ==1.79
- ete3 ==3.1.2
- matplotlib ==3.7.1
- numpy ==1.23.3
- pandas ==1.5.0
- seaborn =0.13.0
- sklearn =1.1.3