BCAWT

BCAWT: Automated tool for codon usage bias analysis for molecular evolution - Published in JOSS (2019)

https://github.com/aliyoussef96/bcaw-tool

Science Score: 95.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
    Found 3 DOI reference(s) in README and JOSS metadata
  • Academic publication links
    Links to: ncbi.nlm.nih.gov, joss.theoj.org
  • Committers with academic emails
    1 of 2 committers (50.0%) from academic institutions
  • Institutional organization owner
  • JOSS paper metadata
    Published in Journal of Open Source Software

Scientific Fields

Mathematics Computer Science - 63% confidence
Last synced: 6 months ago · JSON representation

Repository

BCAWT: Automated tool for codon usage bias analysis for molecular evolution

Basic Info
  • Host: GitHub
  • Owner: AliYoussef96
  • License: mit
  • Language: Python
  • Default Branch: master
  • Homepage:
  • Size: 4.52 MB
Statistics
  • Stars: 5
  • Watchers: 1
  • Forks: 1
  • Open Issues: 1
  • Releases: 1
Created over 6 years ago · Last pushed about 1 year ago
Metadata Files
Readme Contributing License

README.md

BCAWT: Automated tool for codon usage bias analysis for molecular evolution

Build Status Documentation Status PyPI version status

BCAW tool Updates

Now you can run BCAW tool using a GUI software that can work on any operating system. It is very easy to use. For more information and to download it: BCAWT-GUI.

Statement of Need

There are no tools available enable users to run a whole automated workflow for codon usage bias analysis. Using python 3.7 BCAW Tool ( Bio Codon Analysis Workflow Tool ) was developed to address this problem. BCAW Tool manages a complete automated workflow to analyze the codon usage bias for genes and genomes of any organism. With minimum coding skills.

For more details about codon usage bias , and the equations used in BCAWT see.

Dependencies

1- Biopython

2- pandas

3- cai2

4- scipy

5- matplotlib

6- numpy

7- prince

Installation Instructions

Using pip

python pip install git+https://github.com/AliYoussef96/BCAW-Tool.git

Note: Python >=3.7 is required.

Contribution Guidelines

Contributions to the software are welcome

For bugs and suggestions, the most effective way is by raising an issue on the github issue tracker. Github allows you to classify your issues so that we know if it is a bug report, feature request or feedback to the authors.

If you wish to contribute some changes to the code then you should submit a pull request How to create a Pull Request? documentation on pull requests

Usage

Auto testing

Note here we try to test the result of BCAW tool and not the modules, for testing the modules in the package use test.py

First download fasta file containing the coding sequence ( you can download any fasta file containing gene sequences to be analyzed from NCBI database).

or just download that file Test file

then run ( It will automatically run a test on the results files ):

```python from BCAWT import BCAWTautotest path = "Testfolder" # absolute path to the directory to save the result in testfasta = "Testfastafile" # absolute path to the fasta file that will be tested BCAWTautotest.autotest(path, testfasta)

processing....

BCAWTautotest.autocheckfiles(path) # note: this test assumes that in the result folder nothing except the result files form the above function. ```

Main Usage

python from BCAWT import BCAWT BCAWT.BCAW(['Ecoli.fasta'],'save_path',genetic_code_=11,Auto=True)

Input

```

mainfastafile (list): list of string of the file's path or file-like object

save_path (str): absolute path to the directory to save the result in, default = the current directory

reffastafile (list): list of string of the file's path or file-like object, default = None

Auto (bool): default = False, if reffastafile not None.

geneticcode (int) : default = 1, The Genetic Codes number described by NCBI

``` Important Note: BCAW tool expect coding sequences as input and not genes, for more information about what the difference between them you can take a look here

To obtain such fasta file for a species of interest

Say that the species of interest is Escherichia coli str. K-12 substr. MG1655:

1- Go to the NCBI database.

2- In the search bar write ( Escherichia coli str. K-12 substr. MG1655, complete genome ).

3- choose one of the results ( depending on what you want in your analysis ).

3- On the right of the page, you will find send to option. From sent to select Coding Sequences then FASTA nucleotides Finally, press on Create File

For NCBI Genomes Download (FTP) FAQ

Output

The expected CSV files output

|CSV file name|Description| |------------|-----------| | ATCG | contains ; gene id, GC, GC1, GC2, GC3, GC12, AT, AT3 A3, T3, C3, G3, GRAVY, AROMO and, Gene Length | | CARSCU | contains ; each RSCU result for each codon in each genes | | CARSCUcodons | contains ; correspondence analysis first 4 axis for each codon | | CA_RSCUgenes | contains ; correspondence analysis first 4 axis for each gene | | CAI | contains ; gene id and CAI index | | ENc | contains ; gene id and ENc index. | | P2-index | contains ; gene id and P2 index | | optimal codons | contains; putative optimal codons detected |

All output plots from BCAW tool analysis for coding sequence from Escherichia coli

Fig 1

Documentations

  1. An intro to the codon usage bias >> CUB introduction
  2. For more information about the equations used to analyze CUB in the BCAW tool >> Equations
  3. For more information about the output >> Output
  4. For more information about the abbreviations used >> Abbreviations table

Citation

Anwar, (2019). BCAWT: Automated tool for codon usage bias analysis for molecular evolution. Journal of Open Source Software, 4(42), 1500, https://doi.org/10.21105/joss.01500

Owner

  • Name: Ali Youssef
  • Login: AliYoussef96
  • Kind: user
  • Location: Egypt
  • Company: Cairo university

Trying to be a Bioinformatician

JOSS Publication

BCAWT: Automated tool for codon usage bias analysis for molecular evolution
Published
October 20, 2019
Volume 4, Issue 42, Page 1500
Authors
Ali Mostafa Anwar ORCID
Department of Genetics, Faculty of Agriculture, Cairo University, 12613, Cairo, Egypt
Editor
Melissa Gymrek ORCID
Tags
Codon Usage Analysis Automated Workflow Putative optimal codons

GitHub Events

Total
  • Issues event: 1
  • Issue comment event: 1
  • Push event: 25
Last Year
  • Issues event: 1
  • Issue comment event: 1
  • Push event: 25

Committers

Last synced: 7 months ago

All Time
  • Total Commits: 403
  • Total Committers: 2
  • Avg Commits per committer: 201.5
  • Development Distribution Score (DDS): 0.002
Past Year
  • Commits: 29
  • Committers: 1
  • Avg Commits per committer: 29.0
  • Development Distribution Score (DDS): 0.0
Top Committers
Name Email Commits
Ali Youssef 4****6 402
Benjamin Lee b****e@c****u 1
Committer Domains (Top 20 + Academic)

Issues and Pull Requests

Last synced: 6 months ago

All Time
  • Total issues: 14
  • Total pull requests: 1
  • Average time to close issues: 2 months
  • Average time to close pull requests: 39 minutes
  • Total issue authors: 4
  • Total pull request authors: 1
  • Average comments per issue: 1.79
  • Average comments per pull request: 1.0
  • Merged pull requests: 1
  • Bot issues: 0
  • Bot pull requests: 0
Past Year
  • Issues: 0
  • Pull requests: 0
  • Average time to close issues: N/A
  • Average time to close pull requests: N/A
  • Issue authors: 0
  • Pull request authors: 0
  • Average comments per issue: 0
  • Average comments per pull request: 0
  • Merged pull requests: 0
  • Bot issues: 0
  • Bot pull requests: 0
Top Authors
Issue Authors
  • Benjamin-Lee (11)
  • YDD99 (1)
  • symonovar (1)
  • Liiii007 (1)
Pull Request Authors
  • Benjamin-Lee (1)
Top Labels
Issue Labels
documentation (4)
Pull Request Labels

Packages

  • Total packages: 1
  • Total downloads:
    • pypi 34 last-month
  • Total dependent packages: 0
  • Total dependent repositories: 1
  • Total versions: 6
  • Total maintainers: 1
pypi.org: bcawt

Manages a complete workflow to analysis the codon usage bias

  • Versions: 6
  • Dependent Packages: 0
  • Dependent Repositories: 1
  • Downloads: 34 Last month
Rankings
Dependent packages count: 10.1%
Stargazers count: 21.5%
Dependent repos count: 21.6%
Average: 21.8%
Forks count: 22.6%
Downloads: 33.2%
Maintainers (1)
Last synced: 6 months ago

Dependencies

requirements.txt pypi
  • CAI *
  • biopython *
  • matplotlib *
  • numpy *
  • pandas *
  • prince *
  • scipy *
setup.py pypi