conodictor
ConoDictor predicts conopeptides superfamily using generalized profiles and hidden Markov models.
Science Score: 67.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
✓CITATION.cff file
Found CITATION.cff file -
✓codemeta.json file
Found codemeta.json file -
✓.zenodo.json file
Found .zenodo.json file -
✓DOI references
Found 7 DOI reference(s) in README -
✓Academic publication links
Links to: researchgate.net -
○Committers with academic emails
-
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (13.5%) to scientific vocabulary
Keywords
Repository
ConoDictor predicts conopeptides superfamily using generalized profiles and hidden Markov models.
Basic Info
Statistics
- Stars: 2
- Watchers: 3
- Forks: 2
- Open Issues: 2
- Releases: 18
Topics
Metadata Files
README.md
ConoDictor
A fast and accurate prediction and classification tool for conopeptides
🗺️ Overview
Unlocking the Potential of Cone Snail Venom
Cone snails are a treasure trove of natural peptides with immense pharmacological and therapeutic potential. The advent of affordable RNA sequencing (RNAseq) has revolutionized the mining of novel bioactive conopeptides from venom gland transcriptomes. However, the complexity of bioinformatic analyses often impedes the discovery process.
Introducing ConoDictor 2
ConoDictor 2 is a standalone, user-friendly command-line tool designed to streamline the discovery of conopeptides. Building on a decade-old web server, we have significantly upgraded ConoDictor with modern tools and algorithms, and enhanced our classification models using new, high-quality sequences. The result is a program that is more accurate, faster, and compatible across multiple platforms.
Key Features
- Enhanced Accuracy and Speed: ConoDictor 2 processes entire venom gland transcriptomes, whether from raw reads or assembled contigs, in record time.
- Ease of Use: The program requires only the assembled transcriptome or raw reads file, in either DNA or amino acid format. ConoDictor 2 automatically recognizes the alphabet used.
- Advanced Prediction Capabilities: It runs predictions directly on the submitted or dynamically generated proteins file, aiming to identify the longest conopeptide precursor-like sequences.
Simplified Bioinformatics for Breakthrough Discoveries
With ConoDictor 2, researchers can bypass the intricate bioinformatic challenges and focus on uncovering the next generation of bioactive peptides from cone snail venom. Its robust performance and user-centric design make it an indispensable tool in venom research and drug discovery.
Installing
Install from Pip
You will first have to install ~~HMMER 3 and~~ Pftools to be able to run conodictor (as of version 2.4, conodictor does not need hmmer anymore as it use the wonderful pyhmmer library).
bash
pip install conodictor
Using containers
Docker
Accessible at https://hub.docker.com/u/ebedthan or on BioContainers.
bash
docker pull ebedthan/conodictor:latest
docker run ebedthan/conodictor:latest conodictor -h
Example of a run
bash
docker run --rm=True -v $PWD:/data -u $(id -u):$(id -g) ebedthan/conodictor:latest conodictor --out /data/outdir /data/input.fa.gz
See https://staph-b.github.io/docker-builds/run_containers/ for more informations on how to properly run a docker container.
Singularity
The singularity container does not need admin privileges making it suitable for university clusters and HPC.
bash
singularity build conodictor.sif docker://ebedthan/conodictor:latest
singularity exec conodictor.sif conodictor -h
Install from source
```bash
Download ConoDictor development version
git clone https://github.com/koualab/conodictor.git conodictor
Navigate to directory
cd conodictor
Install with poetry: see https://python-poetry.org
poetry install --no-dev
Enter the Python virtual environment with
poetry shell
Test conodictor is correctly installed
conodictor -h ```
If you do not want to go into the virtual environment just do:
bash
poetry run conodictor -h
💡 Example
bash
conodictor file.fa.gz
conodictor --out outfolder --cpus 4 --mlen 51 file.fa
Output files
The comma separeted-values file summary.csv can be easily viewed with any office suite, or text editor.
```csv sequence,hmmpred,pssmpred definitivepred SEQID1,A,A,A SEQID2,B,D,CONFLICT B and D SEQID_3,O1,O1,O1 ...
```
💭 Feedback
Issue tracker
Found a bug ? Have an enhancement request ? Head over to the GitHub issue tracker if you need to report or ask something. If you are filing in on a bug, please include as much information as you can about the issue, and try to recreate the same bug in a simple, easily reproducible situation.
⚖️ License
For commercial uses please contact Dominique Koua at dominique.koua@inphb.ci.
🔖 Citation
ConoDictor is a scientifc software, with a published paper in the Bioinformatics Advances journal. Please cite this article if you are using it in an academic work, for instance as: Koua, D., Ebou, A., & Dutertre, S. (2021). Improved prediction of conopeptide superfamilies with ConoDictor 2.0. Bioinformatics Advances, 1(1), vbab011. https://doi.org/10.1093/bioadv/vbab011
Dependencies
- Pftools
Used for PSSM prediction.
Schuepbach P et al. pfsearchV3: a code acceleration and heuristic to search PROSITE profiles. Bioinformatics 2013, 10.1093/bioinformatics/btt129
📚 References
HMMER 3
Used for HMM profile prediction.
Eddy SR, Accelerated Profile HMM Searches. PLOS Computational Biology 2011, 10.1371/journal.pcbi.1002195Pftools
Used for PSSM prediction.
Schuepbach P et al. pfsearchV3: a code acceleration and heuristic to search PROSITE profiles. Bioinformatics 2013, 10.1093/bioinformatics/btt129
Authors
Owner
- Name: Koua Research Group
- Login: koualab
- Kind: organization
- Email: dominique.koua@inphb.ci
- Location: Côte d'Ivoire
- Website: https://koualab.github.io
- Repositories: 1
- Profile: https://github.com/koualab
Bioinformatic and biostatistics research group
Citation (CITATION.cff)
cff-version: "1.1.0"
message: "If you use this software, please cite it as below."
authors:
-
family-names: Koua
given-names: Dominique
orcid: "https://orcid.org/0000-0002-9078-8844"
-
family-names: Ebou
given-names: Anicet
orcid: "https://orcid.org/0000-0003-4005-177X"
-
family-names: Dutertre
given-names: "Sébastien"
orcid: "https://orcid.org/0000-0002-2945-1484"
title: "Improved prediction of conopeptide superfamilies with ConoDictor 2.0"
version: "2.3.0"
doi: "10.1093/bioadv/vbab011"
date-released: 2021-07-07
GitHub Events
Total
Last Year
Committers
Last synced: over 2 years ago
Top Committers
| Name | Commits | |
|---|---|---|
| Anicet Ebou | a****u@g****m | 115 |
| yaiza612 | 6****2 | 2 |
Issues and Pull Requests
Last synced: over 1 year ago
All Time
- Total issues: 4
- Total pull requests: 15
- Average time to close issues: 1 day
- Average time to close pull requests: 3 months
- Total issue authors: 4
- Total pull request authors: 2
- Average comments per issue: 5.0
- Average comments per pull request: 1.27
- Merged pull requests: 1
- Bot issues: 0
- Bot pull requests: 13
Past Year
- Issues: 1
- Pull requests: 4
- Average time to close issues: N/A
- Average time to close pull requests: 6 months
- Issue authors: 1
- Pull request authors: 1
- Average comments per issue: 7.0
- Average comments per pull request: 0.75
- Merged pull requests: 0
- Bot issues: 0
- Bot pull requests: 4
Top Authors
Issue Authors
- sschrstn (1)
- giulippa (1)
- sarahfarhat (1)
- jthiels (1)
Pull Request Authors
- dependabot[bot] (16)
- yaiza612 (2)
Top Labels
Issue Labels
Pull Request Labels
Packages
- Total packages: 1
-
Total downloads:
- pypi 30 last-month
- Total dependent packages: 0
- Total dependent repositories: 0
- Total versions: 7
- Total maintainers: 1
pypi.org: conodictor
Prediction and classification of conopeptides
- Homepage: https://github.com/koualab/conodictor
- Documentation: https://github.com/koualab/conodictor
- License: GPL-3.0-only
-
Latest release: 2.4.1
published over 1 year ago
Rankings
Maintainers (1)
Dependencies
- black 21.12b0 develop
- click 8.1.3 develop
- flake8 4.0.1 develop
- mccabe 0.6.1 develop
- mypy-extensions 0.4.3 develop
- pathspec 0.10.1 develop
- platformdirs 2.5.2 develop
- pycodestyle 2.8.0 develop
- pyflakes 2.4.0 develop
- typing-extensions 4.3.0 develop
- bio 1.4.0
- biopython 1.79
- biothings-client 0.2.6
- certifi 2022.6.15.2
- charset-normalizer 2.1.1
- colorama 0.4.5
- cycler 0.11.0
- exitstatus 2.2.0
- fonttools 4.37.1
- idna 3.4
- kiwisolver 1.4.4
- matplotlib 3.5.3
- mygene 3.2.2
- numpy 1.23.3
- packaging 21.3
- pandas 1.4.4
- pillow 9.2.0
- pyfastx 0.8.4
- pyparsing 3.0.9
- python-dateutil 2.8.2
- pytz 2022.2.1
- requests 2.28.1
- setuptools-scm 6.4.2
- six 1.16.0
- tomli 1.2.3
- tqdm 4.64.1
- urllib3 1.26.12
- black ^21.12b0 develop
- flake8 ^4.0.1 develop
- bio ^1.3.3
- exitstatus ^2.2.0
- matplotlib ^3.5.1
- pandas ^1.3.5
- pyfastx ^0.8.4
- python ^3.8
- ubuntu 20.04 build