ngmaster

In silico multi-antigen sequence typing for Neisseria gonorrhoeae (NG-MAST)

https://github.com/mdu-phl/ngmaster

Science Score: 67.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
    Found 11 DOI reference(s) in README
  • Academic publication links
    Links to: ncbi.nlm.nih.gov, zenodo.org
  • Committers with academic emails
    1 of 6 committers (16.7%) from academic institutions
  • Institutional organization owner
    Organization mdu-phl has institutional domain (www.mduphl.unimelb.edu.au)
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (10.1%) to scientific vocabulary

Keywords from Contributors

bioinformatics genomics bioinformatics-pipeline global-health infectious-diseases next-generation-sequencing pathogen research sequencing
Last synced: 6 months ago · JSON representation

Repository

In silico multi-antigen sequence typing for Neisseria gonorrhoeae (NG-MAST)

Basic Info
  • Host: GitHub
  • Owner: MDU-PHL
  • License: gpl-3.0
  • Language: Python
  • Default Branch: master
  • Size: 5.09 MB
Statistics
  • Stars: 8
  • Watchers: 4
  • Forks: 5
  • Open Issues: 12
  • Releases: 6
Created over 10 years ago · Last pushed 10 months ago
Metadata Files
Readme License

README.md

PyPI - License PyPI - Python Version PyPI DOI:10.1099/mgen.0.000076

ngmaster

In silico multi-antigen sequence typing for **Neisseria *gonorrhoeae* (NG-MAST) and _Neisseria **gonorrhoeae_ sequence typing for antimicrobial resistance (NG-STAR).

Synopsis

ngmaster gono.fa FILE SCHEME NG-MAST/NG-STAR porB_NG-MAST tbpB penA mtrR porB_NG-STAR ponA gyrA parC 23S gono.fa ngmaSTar 4186/231 2569 241 23 42 100 100 10 2 100

Dependencies

Installation

PiPy

```

TODO how to integrate mlst dependency

pip3 install ngmaster ```

Brew

```

TODO how to integrate mlst dependency

brew install brewsci/bio/ngmaster ```

Conda

conda install -c bioconda ngmaster

Test

Once installed, you can run the following to ensure ngmaster is successfully working:

ngmaster --test

If everything works, you will see the following:

Running ngmaster.py on test example (NG-MAST 4186 / NG-STAR 231) ... FILE SCHEME NG-MAST/NG-STAR porB_NG-MAST tbpB penA mtrR porB_NG-STAR ponA gyrA parC 23S test.fa ngmaSTar 4186/231 2569 241 23 42 100 100 10 2 100 ... Test successful.

Usage

ngmaster -h

usage:
  ngmaster [OPTIONS] <fasta1> <fasta2> <fasta3> ... <fastaN>

In silico multi-antigen sequence typing for Neisseria gonorrhoeae (NG-MAST)
and Neisseria gonorrhoeae Sequence Typing for Antimicrobial Resistance (NG-STAR)

Please cite as:
  Kwong JC, Goncalves da Silva A, Howden BP and Seemann T.
  NGMASTER: in silico multi-antigen sequence typing for Neisseria gonorrhoeae (NG-MAST)
  GitHub: https://github.com/MDU-PHL/ngmaster

positional arguments:
  FASTA            input FASTA files eg. fasta1, fasta2, fasta3 ... fastaN

optional arguments:
  -h, --help       show this help message and exit
  --db DB          specify custom directory containing allele databases
                   directory must contain database sequence files (.tfa) and allele profile files (ngmast.txt / ngstar.txt)
                   in mlst format (see <https://github.com/tseemann/mlst#adding-a-new-scheme>)
  --csv            output comma-separated format (CSV) rather than tab-separated
  --printseq FILE  specify filename to save allele sequences to
  --minid MINID    DNA percent identity of full allele to consider 'similar' [~]
  --mincov MINCOV  DNA percent coverage to report partial allele at [?]
  --updatedb       update NG-MAST and NG-STAR allele databases from <https://rest.pubmlst.org/db/pubmlst_neisseria_seqdef>
  --assumeyes      assume you are certain you wish to update db
  --test           run test example
  --comments       include NG-STAR comments for each allele in output
  --version        show program's version number and exit

Quick start

To perform in silico NG-MAST and NG-STAR typing on FASTA files:

$ ngmaster <fasta1> <fasta2> <fasta3> ... <fastaN>

The NG-MAST and NG-STAR results and allele numbers are printed in tab-separated format to stdout.

  • ngmaster reports alleles according to the same rules that are implemented in mlst.
  • mlst's arguments --minid and --mincov are available directly in ngmaster
  • For each allele n:

Symbol | Meaning | Length | Identity --- | --- | --- | --- n | exact intact allele | 100% | 100% ~n | novel full length allele similar to n | 100% | ≥ --minid n? | partial match to known allele | ≥ --mincov | ≥ --minid - | allele missing | < --mincov | < --minid n,m | multiple alleles |   |  

To save results to a tab-separated text file, redirect stdout:

$ ngmaster <fasta1> <fasta2> <fasta3> ... <fastaN> > results.txt

To display results in comma-separated format, use the --csv option:

$ ngmaster --csv <fasta1> <fasta2> <fasta3> ... <fastaN>

To save sequences of the alleles to a file (eg. for uploading "new" sequences to PubMLST):

$ ngmaster --printseq [filename] <fasta1> <fasta2> <fasta3> ... <fastaN>

This will create two files:

  1. NGMAST__filename
  2. NGSTAR__filename

Updating the allele databases

To update the allele databases from PubMLST:
Warning: This will overwrite the existing databases so ensure you back them up if you wish to keep them.

$ ngmaster.py --updatedb

A copy of the old database is saved just in case, but is overwritten with each subsequent --updatedb.

To update the allele databases into a different folder (ie. not the /db folder in the ngmaster directory):

$ ngmaster.py --updatedb --db path/to/folder

This will download the database files into the folder path/to/folder. This can then be specified when running ngmaster using the --db path/to/folder option.

Creating a custom allele database

To create a custom allele database please follow the instructions for creating a custom mlst database described here. Usually, this should not be necessary, simply run ngmaster --update to update to the latest NG-MAST and NG-STAR schemes from PubMLST.

Citation

Kwong JC, Gonçalves da Silva A, Dyet K, Williamson DA, Stinear TP, Howden BP and Seemann T.
NGMASTER: in silico multi-antigen sequence typing for Neisseria gonorrhoeae Microbial Genomics 2016 Aug 25;2(8):e000076. PMID:28348871 DOI:10.1099/mgen.0.000076

Bugs

Please submit via the GitHub issues page.

Software Licence

GPLv2

References

Authors

  • Jason Kwong (@kwongjc)
  • Anders Gonçalves da Silva (@drandersgs)
  • Mark Schultz (@schultzm)
  • Torsten Seemann (@torstenseemann)
  • Andreas Stroehlein (@stroehleina)

Development

When incrementing version (i.e., minor patch), run the following:

bumpversion --verbose --dry-run --new-version <major.minor.patch> patch bumpversion --new-version <new.version.number> patch

The same can be done for minor and major numbers.

This will automatically commit and tag the commit with the new version number. It will also update the necessary location in the file.

Pushing to pypi

Must be uploaded to maintainer's account.

``` bumpversion --new-version git push

create distribution

python3 setup.py sdist bdist_wheel

run some checkes

twine check dist/*

upload to test pypi to see if everything works

twine upload --repository-url https://test.pypi.org/legacy/ dist/*

upload to pypi

twine upload dist/* ```

Owner

  • Name: Microbiological Diagnostic Unit Public Health Laboratory
  • Login: MDU-PHL
  • Kind: organization
  • Location: Melbourne, AUSTRALIA

Australian national reference laboratory for Salmonella, Listeria and EHEC.

GitHub Events

Total
  • Watch event: 2
  • Delete event: 1
  • Push event: 31
  • Pull request review event: 2
  • Pull request event: 2
  • Fork event: 1
  • Create event: 4
Last Year
  • Watch event: 2
  • Delete event: 1
  • Push event: 31
  • Pull request review event: 2
  • Pull request event: 2
  • Fork event: 1
  • Create event: 4

Committers

Last synced: over 2 years ago

All Time
  • Total Commits: 162
  • Total Committers: 6
  • Avg Commits per committer: 27.0
  • Development Distribution Score (DDS): 0.66
Past Year
  • Commits: 0
  • Committers: 0
  • Avg Commits per committer: 0.0
  • Development Distribution Score (DDS): 0.0
Top Committers
Name Email Commits
Jason Kwong k****j@g****m 55
Anders Goncalves da Silva a****s@g****m 45
Torsten Seemann t****n@g****m 30
Andreas Stroehlein a****n@u****u 26
schultzm d****z@g****m 3
Torsten Seemann t****n 3
Committer Domains (Top 20 + Academic)

Issues and Pull Requests

Last synced: 8 months ago

All Time
  • Total issues: 35
  • Total pull requests: 8
  • Average time to close issues: 30 days
  • Average time to close pull requests: 2 days
  • Total issue authors: 13
  • Total pull request authors: 5
  • Average comments per issue: 1.51
  • Average comments per pull request: 0.0
  • Merged pull requests: 6
  • Bot issues: 0
  • Bot pull requests: 2
Past Year
  • Issues: 0
  • Pull requests: 1
  • Average time to close issues: N/A
  • Average time to close pull requests: about 21 hours
  • Issue authors: 0
  • Pull request authors: 1
  • Average comments per issue: 0
  • Average comments per pull request: 0.0
  • Merged pull requests: 1
  • Bot issues: 0
  • Bot pull requests: 0
Top Authors
Issue Authors
  • tseemann (15)
  • kwongj (5)
  • DrReemaSingh (3)
  • andersgs (2)
  • SandersonHaley (2)
  • stroehleina (1)
  • maurkarr (1)
  • slvrshot (1)
  • kapsakcj (1)
  • simonrharris (1)
  • mschmerer (1)
  • jayminz (1)
  • waalkes (1)
Pull Request Authors
  • andersgs (3)
  • himal2007 (2)
  • dependabot[bot] (2)
  • schultzm (1)
  • stroehleina (1)
Top Labels
Issue Labels
enhancement (11) bug (11) help wanted (1) question (1)
Pull Request Labels
dependencies (2) bug (2) enhancement (1)

Packages

  • Total packages: 1
  • Total downloads:
    • pypi 52 last-month
  • Total dependent packages: 0
  • Total dependent repositories: 1
  • Total versions: 7
  • Total maintainers: 1
pypi.org: ngmaster

In silico multi-antigen sequence typing for Neisseria gonorrhoeae (NG-MAST)

  • Versions: 7
  • Dependent Packages: 0
  • Dependent Repositories: 1
  • Downloads: 52 Last month
Rankings
Dependent packages count: 10.0%
Forks count: 14.2%
Dependent repos count: 21.7%
Average: 22.0%
Stargazers count: 23.1%
Downloads: 40.8%
Maintainers (1)
Last synced: 7 months ago

Dependencies

Pipfile pypi
  • e1839a8 *
Pipfile.lock pypi
  • argparse ==1.4.0
  • beautifulsoup4 ==4.9.0
  • biopython ==1.76
  • bs4 ==0.0.1
  • certifi ==2020.4.5.1
  • chardet ==3.0.4
  • e1839a8 *
  • idna ==2.9
  • numpy ==1.18.4
  • requests ==2.23.0
  • soupsieve ==2.0
  • urllib3 ==1.25.9
setup.py pypi
  • argparse *
  • biopython *
  • bs4 *
  • requests *
.github/workflows/python-package.yml actions
  • actions/checkout v2 composite
  • actions/checkout master composite
  • actions/setup-python v1 composite
  • conda-incubator/setup-miniconda v2 composite
  • pypa/gh-action-pypi-publish master composite