mitos_scripts.jl
Command Line Scripts for the MIToS Package to Analyze Protein Sequence and Structure
Science Score: 44.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
✓CITATION.cff file
Found CITATION.cff file -
✓codemeta.json file
Found codemeta.json file -
✓.zenodo.json file
Found .zenodo.json file -
○DOI references
-
○Academic publication links
-
○Academic email domains
-
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (15.3%) to scientific vocabulary
Repository
Command Line Scripts for the MIToS Package to Analyze Protein Sequence and Structure
Basic Info
- Host: GitHub
- Owner: MIToSOrg
- License: mit
- Language: Julia
- Default Branch: main
- Size: 49.8 KB
Statistics
- Stars: 0
- Watchers: 0
- Forks: 1
- Open Issues: 0
- Releases: 0
Metadata Files
README.md
MIToS_Scripts
MIToS_Scripts.jl is a command-line interface (CLI) script collection designed to facilitate various bioinformatics tasks. Any researcher with basic terminal knowledge can easily utilize these scripts without Julia's coding experience.
Scripts
- Buslje09.jl: Calculates the corrected Mutual Information (MI/MIp) as described by Buslje et al., 2009.
- BLMI.jl: Computes corrected mutual information using BLOSUM62-based pseudocounts, as described in the MIToS publication (Zea et al., 2016).
- Conservation.jl: Calculates Shannon entropy and Kullback-Leibler divergence for each column in a Multiple Sequence Alignment (MSA).
- DownloadPDB.jl: Downloads gzipped files from the Protein Data Bank (PDB).
- Distances.jl: Computes inter-residue distances in a PDB file.
- SplitStockholm.jl: Splits a Stockholm file containing multiple alignments into separate compressed files for each MSA.
- AlignedColumns.jl: Creates a Stockholm file with aligned columns from a Pfam Stockholm file, removing insertions, and saves the mapping of residue numbers in UniProt and the original MSA columns.
- PercentIdentity.jl: Calculates the percentage identity between all sequences in an MSA and provides statistics such as mean, median, and minimum values.
- MSADescription.jl: Provides statistics for a given Stockholm file, including the number of columns, sequences, and clusters after Hobohm I clustering at 62% identity. It also reports the mean and standard deviation of sequence coverage and gap percentage.
Installation
To install MIToS_Scripts.jl, you only need Julia 1.9 or later installed on your
system. Executing julia in the terminal to open the Julia REPL, and finally, run the
following command:
julia
using Pkg
Pkg.add("https://github.com/MIToSOrg/MIToS_Scripts.jl")
Then, you can get the location of the installed scripts by running the following command:
julia
using MIToS_Scripts
joinpath(pkgdir(MIToS_Scripts), "scripts")
You can run them from that location or copy them to a directory in your PATH.
Usage
You can execute each provided script from your command line. For example, to run the Buslje09.jl
script—if you are located in the folder where it is the scripts—use:
bash
julia Buslje09.jl input_msa_file
Refer to the documentation of each script for specific usage instructions; you can access
it by running the script with the --help or -h flag:
bash
julia Buslje09.jl -h
License
This project is licensed under the MIT License. See the LICENSE file for details.
References
- Buslje, C. M., Santos, J., Delfino, J. M., & Nielsen, M. (2009). Correction for phylogeny, small number of observations and data redundancy improves the identification of coevolving amino acid pairs using mutual information. Bioinformatics, 25(9), 1125-1131.
- Zea, D. J., Anfossi, D., Nielsen, M., & Marino-Buslje, C. (2017). MIToS. jl: mutual information tools for protein sequence analysis in the Julia language. Bioinformatics, 33(4), 564-565.
Citing
See CITATION.bib for the relevant reference(s).
Owner
- Name: MIToSOrg
- Login: MIToSOrg
- Kind: organization
- Email: diegozea@gmail.com
- Repositories: 1
- Profile: https://github.com/MIToSOrg
Julia packages for protein sequence and structure analysis
Citation (CITATION.bib)
@article{10.1093/bioinformatics/btw646,
author = {Zea, Diego J and Anfossi, Diego and Nielsen, Morten and Marino-Buslje, Cristina},
title = "{MIToS.jl: mutual information tools for protein sequence analysis in the Julia language}",
journal = {Bioinformatics},
volume = {33},
number = {4},
pages = {564-565},
year = {2016},
month = {11},
abstract = "{MIToS is an environment for mutual information analysis and a framework for protein multiple sequence alignments (MSAs) and protein structures (PDB) management in Julia language. It integrates sequence and structural information through SIFTS, making Pfam MSAs analysis straightforward. MIToS streamlines the implementation of any measure calculated from residue contingency tables and its optimization and testing in terms of protein contact prediction. As an example, we implemented and tested a BLOSUM62-based pseudo-count strategy in mutual information analysis.The software is totally implemented in Julia and supported for Linux, OS X and Windows. It’s freely available on GitHub under MIT license: http://mitos.leloir.org.ar.Supplementary data are available at Bioinformatics online.}",
issn = {1367-4803},
doi = {10.1093/bioinformatics/btw646},
url = {https://doi.org/10.1093/bioinformatics/btw646},
eprint = {https://academic.oup.com/bioinformatics/article-pdf/33/4/564/25146747/btw646\_supp.pdf},
}
GitHub Events
Total
- Push event: 4
- Pull request event: 1
- Fork event: 1
Last Year
- Push event: 4
- Pull request event: 1
- Fork event: 1