https://github.com/aliyoussef96/vhcub
Virus-Host Codon Usage Co-Adaptation Analysis
Science Score: 49.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
○CITATION.cff file
-
✓codemeta.json file
Found codemeta.json file -
✓.zenodo.json file
Found .zenodo.json file -
✓DOI references
Found 5 DOI reference(s) in README -
✓Academic publication links
Links to: ncbi.nlm.nih.gov -
○Committers with academic emails
-
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (12.0%) to scientific vocabulary
Repository
Virus-Host Codon Usage Co-Adaptation Analysis
Basic Info
- Host: GitHub
- Owner: AliYoussef96
- License: lgpl-3.0
- Language: R
- Default Branch: master
- Size: 286 KB
Statistics
- Stars: 5
- Watchers: 1
- Forks: 1
- Open Issues: 0
- Releases: 1
Metadata Files
README.md
vhcub
Virus-Host Codon Usage Co-Adaptation Analysis

vhcub is an R package to analyze the co-adaptation of codon usage between a virus and its host.
Installation
1- Using devtools:
R
if (!requireNamespace("devtools", quietly=TRUE)){
install.packages("devtools")}
devtools::install_github('AliYoussef96/vhcub')
2- vhcub was developed using R and available on CRAN:
R
install.packages("vhcub")
3- If 1 and 2 installation instructions failed, please try:
```R if (!requireNamespace("BiocManager", quietly=TRUE)) install.packages("BiocManager") BiocManager::install("coRdon") BiocManager::install("Biostrings")
install.packages("vhcub") ```
The following measures are implemented in the package
- ENC, effective number of codons (Novembre, 2002),
- SCUO, synonymous codon usage orderliness (Wan et al., 2004),
- Codon Adaptation Index, CAI (Sharp and Li, 1987),
- Relative Codon Deoptimization Index, RCDI (Puigbò et al, 2010),
- Relative Synonymous Codon Usage, RSCU (Sharp and Li, 1987),
- Also, it provides a statistical dinucleotide over- and underrepresentation with three different models.
Usage
Using vhcub to study the CUB of a virus, its host and the co-adaptation between them is straightforward.
For example;
coding sequences for both Escherichia virus T4 and its host Escherichia coli were downloaded in fasta format from the NCBI database.
```R
read virus and host fasta files
fasta <- fasta.read("EscherichiavirusT4.fasta","Escherichiacoli.fasta") fasta.virus <- fasta[[1]] fasta.host <- fasta[[2]]
Calculate the GC overall all content as well as GC at first, second and third codon positions for the virus
gc.df <- GC.content(fasta.virus)
Calculate zscore using syncodon model for statistical dinucleotide over- and underrepresentation
syncodon <- dinuc.syncodon(fasta.virus,permutations=100)
Calculate zscore using base model for statistical dinucleotide over- and underrepresentation
base <- dinuc.base(fasta.virus,permutations=100)
Calculate zscore using codon model for statistical dinucleotide over- and underrepresentation
codon <- dinuc.codon(fasta.virus,permutations=100)
Calculate ENc values for the virus and its host
enc.df.virus <- ENc.values(fasta.virus) enc.df.host <- ENc.values(fasta.host)
Calculate SCUO values for the virus
scuo.df <- SCUO.values(fasta.virus)
Calculate CAI values for the virus using the host sequences as a reference genes set
cai.df <- CAI.values(fasta.virus, enc.df.host, fasta.host, genetic.code="11")
Calculate RSCU values for the virus and its host
rscu.virus <- RSCU.values(fasta.virus) rscu.host <- RSCU.values(fasta.host)
Calculate SiD value for the virus
SiD <- SiD.value(rscu.host,rscu.virus)
Calculate RCDI values for the virus
rcdi.df <- RCDI.values(fasta.virus,fasta.host, enc.df.host) ```
Furthermore, vhcub uses ggplot to visualize two important plots named ENc-GC3 plot and PR2plot, which help to explain what are the factors influence a virus's evolution concerning its CUB.
```R ENc.GC3plot(enc.df.virus,gc.df)
PR2.plot function need fasta sequences inputs that just only include the four-fold codons
PR2.plot(fasta.virus) ```
Contribution Guidelines
Contributions to the package are welcome
For bugs and suggestions, the most effective way is by raising an issue on the github issue tracker. Github allows you to classify your issues so that we know if it is a bug report, feature request or feedback to the authors.
If you wish to contribute some changes to the code then you should submit a pull request How to create a Pull Request? documentation on pull requests
Citation
Anwar AM, Soudy M and Mohamed R. vhcub: Virus-host codon usage co-adaptation analysis [version 1; peer review: 2 approved]. F1000Research 2019, 8:2137 (https://doi.org/10.12688/f1000research.21763.1)
Owner
- Name: Ali Youssef
- Login: AliYoussef96
- Kind: user
- Location: Egypt
- Company: Cairo university
- Website: https://www.linkedin.com/in/ali-youssef-455a92130/
- Repositories: 3
- Profile: https://github.com/AliYoussef96
Trying to be a Bioinformatician
GitHub Events
Total
- Watch event: 1
- Push event: 1
Last Year
- Watch event: 1
- Push event: 1
Committers
Last synced: over 2 years ago
Top Committers
| Name | Commits | |
|---|---|---|
| Ali Youssef | a****9@g****m | 31 |
| Ali Youssef | 4****6 | 31 |
Issues and Pull Requests
Last synced: 7 months ago
All Time
- Total issues: 1
- Total pull requests: 0
- Average time to close issues: 2 days
- Average time to close pull requests: N/A
- Total issue authors: 1
- Total pull request authors: 0
- Average comments per issue: 2.0
- Average comments per pull request: 0
- Merged pull requests: 0
- Bot issues: 0
- Bot pull requests: 0
Past Year
- Issues: 0
- Pull requests: 0
- Average time to close issues: N/A
- Average time to close pull requests: N/A
- Issue authors: 0
- Pull request authors: 0
- Average comments per issue: 0
- Average comments per pull request: 0
- Merged pull requests: 0
- Bot issues: 0
- Bot pull requests: 0
Top Authors
Issue Authors
- Zjianglin (1)
Pull Request Authors
Top Labels
Issue Labels
Pull Request Labels
Dependencies
- Biostrings * imports
- coRdon * imports
- ggplot2 * imports
- seqinr * imports
- stringr * imports
- testthat * suggests