mitos_scripts.jl

Command Line Scripts for the MIToS Package to Analyze Protein Sequence and Structure

https://github.com/mitosorg/mitos_scripts.jl

Science Score: 44.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
    Found CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
  • Academic publication links
  • Academic email domains
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (15.3%) to scientific vocabulary
Last synced: 6 months ago · JSON representation ·

Repository

Command Line Scripts for the MIToS Package to Analyze Protein Sequence and Structure

Basic Info
  • Host: GitHub
  • Owner: MIToSOrg
  • License: mit
  • Language: Julia
  • Default Branch: main
  • Size: 49.8 KB
Statistics
  • Stars: 0
  • Watchers: 0
  • Forks: 1
  • Open Issues: 0
  • Releases: 0
Created over 1 year ago · Last pushed 10 months ago
Metadata Files
Readme License Citation

README.md

MIToS_Scripts

Build Status Aqua

MIToS_Scripts.jl is a command-line interface (CLI) script collection designed to facilitate various bioinformatics tasks. Any researcher with basic terminal knowledge can easily utilize these scripts without Julia's coding experience.

Scripts

  • Buslje09.jl: Calculates the corrected Mutual Information (MI/MIp) as described by Buslje et al., 2009.
  • BLMI.jl: Computes corrected mutual information using BLOSUM62-based pseudocounts, as described in the MIToS publication (Zea et al., 2016).
  • Conservation.jl: Calculates Shannon entropy and Kullback-Leibler divergence for each column in a Multiple Sequence Alignment (MSA).
  • DownloadPDB.jl: Downloads gzipped files from the Protein Data Bank (PDB).
  • Distances.jl: Computes inter-residue distances in a PDB file.
  • SplitStockholm.jl: Splits a Stockholm file containing multiple alignments into separate compressed files for each MSA.
  • AlignedColumns.jl: Creates a Stockholm file with aligned columns from a Pfam Stockholm file, removing insertions, and saves the mapping of residue numbers in UniProt and the original MSA columns.
  • PercentIdentity.jl: Calculates the percentage identity between all sequences in an MSA and provides statistics such as mean, median, and minimum values.
  • MSADescription.jl: Provides statistics for a given Stockholm file, including the number of columns, sequences, and clusters after Hobohm I clustering at 62% identity. It also reports the mean and standard deviation of sequence coverage and gap percentage.

Installation

To install MIToS_Scripts.jl, you only need Julia 1.9 or later installed on your system. Executing julia in the terminal to open the Julia REPL, and finally, run the following command:

julia using Pkg Pkg.add("https://github.com/MIToSOrg/MIToS_Scripts.jl")

Then, you can get the location of the installed scripts by running the following command:

julia using MIToS_Scripts joinpath(pkgdir(MIToS_Scripts), "scripts")

You can run them from that location or copy them to a directory in your PATH.

Usage

You can execute each provided script from your command line. For example, to run the Buslje09.jl script—if you are located in the folder where it is the scripts—use:

bash julia Buslje09.jl input_msa_file

Refer to the documentation of each script for specific usage instructions; you can access it by running the script with the --help or -h flag:

bash julia Buslje09.jl -h

License

This project is licensed under the MIT License. See the LICENSE file for details.

References

  1. Buslje, C. M., Santos, J., Delfino, J. M., & Nielsen, M. (2009). Correction for phylogeny, small number of observations and data redundancy improves the identification of coevolving amino acid pairs using mutual information. Bioinformatics, 25(9), 1125-1131.
  2. Zea, D. J., Anfossi, D., Nielsen, M., & Marino-Buslje, C. (2017). MIToS. jl: mutual information tools for protein sequence analysis in the Julia language. Bioinformatics, 33(4), 564-565.

Citing

See CITATION.bib for the relevant reference(s).

Owner

  • Name: MIToSOrg
  • Login: MIToSOrg
  • Kind: organization
  • Email: diegozea@gmail.com

Julia packages for protein sequence and structure analysis

Citation (CITATION.bib)

@article{10.1093/bioinformatics/btw646,
    author = {Zea, Diego J and Anfossi, Diego and Nielsen, Morten and Marino-Buslje, Cristina},
    title = "{MIToS.jl: mutual information tools for protein sequence analysis in the Julia language}",
    journal = {Bioinformatics},
    volume = {33},
    number = {4},
    pages = {564-565},
    year = {2016},
    month = {11},
    abstract = "{MIToS is an environment for mutual information analysis and a framework for protein multiple sequence alignments (MSAs) and protein structures (PDB) management in Julia language. It integrates sequence and structural information through SIFTS, making Pfam MSAs analysis straightforward. MIToS streamlines the implementation of any measure calculated from residue contingency tables and its optimization and testing in terms of protein contact prediction. As an example, we implemented and tested a BLOSUM62-based pseudo-count strategy in mutual information analysis.The software is totally implemented in Julia and supported for Linux, OS X and Windows. It’s freely available on GitHub under MIT license: http://mitos.leloir.org.ar.Supplementary data are available at Bioinformatics online.}",
    issn = {1367-4803},
    doi = {10.1093/bioinformatics/btw646},
    url = {https://doi.org/10.1093/bioinformatics/btw646},
    eprint = {https://academic.oup.com/bioinformatics/article-pdf/33/4/564/25146747/btw646\_supp.pdf},
}

GitHub Events

Total
  • Push event: 4
  • Pull request event: 1
  • Fork event: 1
Last Year
  • Push event: 4
  • Pull request event: 1
  • Fork event: 1