conodictor

ConoDictor predicts conopeptides superfamily using generalized profiles and hidden Markov models.

https://github.com/koualab/conodictor

Science Score: 67.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
    Found CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
    Found 7 DOI reference(s) in README
  • Academic publication links
    Links to: researchgate.net
  • Committers with academic emails
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (13.5%) to scientific vocabulary

Keywords

bioinformatics conopeptides hmm pssm
Last synced: 6 months ago · JSON representation ·

Repository

ConoDictor predicts conopeptides superfamily using generalized profiles and hidden Markov models.

Basic Info
  • Host: GitHub
  • Owner: koualab
  • License: gpl-3.0
  • Language: Python
  • Default Branch: main
  • Homepage:
  • Size: 90.3 MB
Statistics
  • Stars: 2
  • Watchers: 3
  • Forks: 2
  • Open Issues: 2
  • Releases: 18
Topics
bioinformatics conopeptides hmm pssm
Created about 5 years ago · Last pushed over 1 year ago
Metadata Files
Readme License Citation

README.md

ConoDictor

A fast and accurate prediction and classification tool for conopeptides

PyPI Wheel Language Pyver Downloads Docker License: GPL v3

🗺️ Overview

Unlocking the Potential of Cone Snail Venom

Cone snails are a treasure trove of natural peptides with immense pharmacological and therapeutic potential. The advent of affordable RNA sequencing (RNAseq) has revolutionized the mining of novel bioactive conopeptides from venom gland transcriptomes. However, the complexity of bioinformatic analyses often impedes the discovery process.

Introducing ConoDictor 2

ConoDictor 2 is a standalone, user-friendly command-line tool designed to streamline the discovery of conopeptides. Building on a decade-old web server, we have significantly upgraded ConoDictor with modern tools and algorithms, and enhanced our classification models using new, high-quality sequences. The result is a program that is more accurate, faster, and compatible across multiple platforms.

Key Features

  • Enhanced Accuracy and Speed: ConoDictor 2 processes entire venom gland transcriptomes, whether from raw reads or assembled contigs, in record time.
  • Ease of Use: The program requires only the assembled transcriptome or raw reads file, in either DNA or amino acid format. ConoDictor 2 automatically recognizes the alphabet used.
  • Advanced Prediction Capabilities: It runs predictions directly on the submitted or dynamically generated proteins file, aiming to identify the longest conopeptide precursor-like sequences.

Simplified Bioinformatics for Breakthrough Discoveries

With ConoDictor 2, researchers can bypass the intricate bioinformatic challenges and focus on uncovering the next generation of bioactive peptides from cone snail venom. Its robust performance and user-centric design make it an indispensable tool in venom research and drug discovery.

Installing

Install from Pip

You will first have to install ~~HMMER 3 and~~ Pftools to be able to run conodictor (as of version 2.4, conodictor does not need hmmer anymore as it use the wonderful pyhmmer library).

bash pip install conodictor

Using containers

Docker

Accessible at https://hub.docker.com/u/ebedthan or on BioContainers.

bash docker pull ebedthan/conodictor:latest docker run ebedthan/conodictor:latest conodictor -h

Example of a run

bash docker run --rm=True -v $PWD:/data -u $(id -u):$(id -g) ebedthan/conodictor:latest conodictor --out /data/outdir /data/input.fa.gz

See https://staph-b.github.io/docker-builds/run_containers/ for more informations on how to properly run a docker container.

Singularity

The singularity container does not need admin privileges making it suitable for university clusters and HPC.

bash singularity build conodictor.sif docker://ebedthan/conodictor:latest singularity exec conodictor.sif conodictor -h

Install from source

```bash

Download ConoDictor development version

git clone https://github.com/koualab/conodictor.git conodictor

Navigate to directory

cd conodictor

Install with poetry: see https://python-poetry.org

poetry install --no-dev

Enter the Python virtual environment with

poetry shell

Test conodictor is correctly installed

conodictor -h ```

If you do not want to go into the virtual environment just do:

bash poetry run conodictor -h

💡 Example

bash conodictor file.fa.gz conodictor --out outfolder --cpus 4 --mlen 51 file.fa

Output files

The comma separeted-values file summary.csv can be easily viewed with any office suite, or text editor.

```csv sequence,hmmpred,pssmpred definitivepred SEQID1,A,A,A SEQID2,B,D,CONFLICT B and D SEQID_3,O1,O1,O1 ...

```

💭 Feedback

Issue tracker

Found a bug ? Have an enhancement request ? Head over to the GitHub issue tracker if you need to report or ask something. If you are filing in on a bug, please include as much information as you can about the issue, and try to recreate the same bug in a simple, easily reproducible situation.

⚖️ License

GPL v3.

For commercial uses please contact Dominique Koua at dominique.koua@inphb.ci.

🔖 Citation

ConoDictor is a scientifc software, with a published paper in the Bioinformatics Advances journal. Please cite this article if you are using it in an academic work, for instance as: Koua, D., Ebou, A., & Dutertre, S. (2021). Improved prediction of conopeptide superfamilies with ConoDictor 2.0. Bioinformatics Advances, 1(1), vbab011. https://doi.org/10.1093/bioadv/vbab011

Dependencies

  • Pftools
    Used for PSSM prediction.
    Schuepbach P et al. pfsearchV3: a code acceleration and heuristic to search PROSITE profiles. Bioinformatics 2013, 10.1093/bioinformatics/btt129

📚 References

  • HMMER 3
    Used for HMM profile prediction.
    Eddy SR, Accelerated Profile HMM Searches. PLOS Computational Biology 2011, 10.1371/journal.pcbi.1002195

  • Pftools
    Used for PSSM prediction.
    Schuepbach P et al. pfsearchV3: a code acceleration and heuristic to search PROSITE profiles. Bioinformatics 2013, 10.1093/bioinformatics/btt129

Authors

Owner

  • Name: Koua Research Group
  • Login: koualab
  • Kind: organization
  • Email: dominique.koua@inphb.ci
  • Location: Côte d'Ivoire

Bioinformatic and biostatistics research group

Citation (CITATION.cff)

cff-version: "1.1.0"
message: "If you use this software, please cite it as below."
authors: 
  -
    family-names: Koua
    given-names: Dominique
    orcid: "https://orcid.org/0000-0002-9078-8844"
  -
    family-names: Ebou
    given-names: Anicet
    orcid: "https://orcid.org/0000-0003-4005-177X"
  -
    family-names: Dutertre
    given-names: "Sébastien"
    orcid: "https://orcid.org/0000-0002-2945-1484"

title: "Improved prediction of conopeptide superfamilies with ConoDictor 2.0"
version: "2.3.0"
doi: "10.1093/bioadv/vbab011"
date-released: 2021-07-07

GitHub Events

Total
Last Year

Committers

Last synced: over 2 years ago

All Time
  • Total Commits: 117
  • Total Committers: 2
  • Avg Commits per committer: 58.5
  • Development Distribution Score (DDS): 0.017
Past Year
  • Commits: 1
  • Committers: 1
  • Avg Commits per committer: 1.0
  • Development Distribution Score (DDS): 0.0
Top Committers
Name Email Commits
Anicet Ebou a****u@g****m 115
yaiza612 6****2 2

Issues and Pull Requests

Last synced: over 1 year ago

All Time
  • Total issues: 4
  • Total pull requests: 15
  • Average time to close issues: 1 day
  • Average time to close pull requests: 3 months
  • Total issue authors: 4
  • Total pull request authors: 2
  • Average comments per issue: 5.0
  • Average comments per pull request: 1.27
  • Merged pull requests: 1
  • Bot issues: 0
  • Bot pull requests: 13
Past Year
  • Issues: 1
  • Pull requests: 4
  • Average time to close issues: N/A
  • Average time to close pull requests: 6 months
  • Issue authors: 1
  • Pull request authors: 1
  • Average comments per issue: 7.0
  • Average comments per pull request: 0.75
  • Merged pull requests: 0
  • Bot issues: 0
  • Bot pull requests: 4
Top Authors
Issue Authors
  • sschrstn (1)
  • giulippa (1)
  • sarahfarhat (1)
  • jthiels (1)
Pull Request Authors
  • dependabot[bot] (16)
  • yaiza612 (2)
Top Labels
Issue Labels
bug (1)
Pull Request Labels
dependencies (16) enhancement (1)

Packages

  • Total packages: 1
  • Total downloads:
    • pypi 30 last-month
  • Total dependent packages: 0
  • Total dependent repositories: 0
  • Total versions: 7
  • Total maintainers: 1
pypi.org: conodictor

Prediction and classification of conopeptides

  • Versions: 7
  • Dependent Packages: 0
  • Dependent Repositories: 0
  • Downloads: 30 Last month
Rankings
Dependent packages count: 10.0%
Forks count: 19.1%
Stargazers count: 31.9%
Average: 32.3%
Downloads: 32.9%
Dependent repos count: 67.4%
Maintainers (1)
Last synced: 7 months ago

Dependencies

poetry.lock pypi
  • black 21.12b0 develop
  • click 8.1.3 develop
  • flake8 4.0.1 develop
  • mccabe 0.6.1 develop
  • mypy-extensions 0.4.3 develop
  • pathspec 0.10.1 develop
  • platformdirs 2.5.2 develop
  • pycodestyle 2.8.0 develop
  • pyflakes 2.4.0 develop
  • typing-extensions 4.3.0 develop
  • bio 1.4.0
  • biopython 1.79
  • biothings-client 0.2.6
  • certifi 2022.6.15.2
  • charset-normalizer 2.1.1
  • colorama 0.4.5
  • cycler 0.11.0
  • exitstatus 2.2.0
  • fonttools 4.37.1
  • idna 3.4
  • kiwisolver 1.4.4
  • matplotlib 3.5.3
  • mygene 3.2.2
  • numpy 1.23.3
  • packaging 21.3
  • pandas 1.4.4
  • pillow 9.2.0
  • pyfastx 0.8.4
  • pyparsing 3.0.9
  • python-dateutil 2.8.2
  • pytz 2022.2.1
  • requests 2.28.1
  • setuptools-scm 6.4.2
  • six 1.16.0
  • tomli 1.2.3
  • tqdm 4.64.1
  • urllib3 1.26.12
pyproject.toml pypi
  • black ^21.12b0 develop
  • flake8 ^4.0.1 develop
  • bio ^1.3.3
  • exitstatus ^2.2.0
  • matplotlib ^3.5.1
  • pandas ^1.3.5
  • pyfastx ^0.8.4
  • python ^3.8
Dockerfile docker
  • ubuntu 20.04 build