get_mnv

Identifies multiple SNVs within the same codon, reclassifies them as MNVs, and accurately computes resulting amino acid changes from genomic reads

https://github.com/pathogenomics-lab/get_mnv

Science Score: 57.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
    Found CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
    Found 2 DOI reference(s) in README
  • Academic publication links
  • Academic email domains
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (12.6%) to scientific vocabulary

Keywords

annotation bam mnv rust snp snv vcf
Last synced: 6 months ago · JSON representation ·

Repository

Identifies multiple SNVs within the same codon, reclassifies them as MNVs, and accurately computes resulting amino acid changes from genomic reads

Basic Info
Statistics
  • Stars: 6
  • Watchers: 1
  • Forks: 1
  • Open Issues: 0
  • Releases: 1
Topics
annotation bam mnv rust snp snv vcf
Created over 2 years ago · Last pushed 12 months ago
Metadata Files
Readme Contributing License Code of conduct Citation Security

README.md

get_MNV

[![get_mnv](https://img.shields.io/badge/get_mnv-rust-%23ff8000?style=flat-square)](https://github.com/PathoGenOmics-Lab/get_MNV) [![License: GPL v3](https://img.shields.io/badge/license-GPL%20v3-%23af64d1?style=flat-square)](https://github.com/PathoGenOmics-Lab/get_MNV/blob/main/LICENSE) [![Anaconda-Server Badge](https://img.shields.io/conda/dn/bioconda/get_mnv.svg?style=flat-square)](https://anaconda.org/bioconda/get_mnv) [![Anaconda-Version Badge](https://anaconda.org/bioconda/get_mnv/badges/version.svg)](https://anaconda.org/bioconda/get_mnv) [![DOI](https://img.shields.io/badge/doi-10.5281%2Fzenodo.13907423-%23ff0077?style=flat-square)](https://doi.org/10.5281/zenodo.13907423) [![PGO](https://img.shields.io/badge/PathoGenOmics-lab-red?style=flat-square)](https://github.com/PathoGenOmics-Lab)

Paula Ruiz-Rodriguez1 and Mireia Coscolla1
1. Institute for Integrative Systems Biology, I2SysBio, University of Valencia-CSIC, Valencia, Spain

get Multi-Nucleotide Variants

get_MNV is a tool designed to identify Multi-Nucleotide Variants (MNVs) within the same codon in genomic sequences. MNVs occur when multiple Single Nucleotide Variants (SNVs) are present within the same codon, leading to the translation of a different amino acid. This tool addresses limitations in current annotation programs like ANNOVAR or SnpEff, which are primarily designed to work with individual SNVs and might overlook the actual amino acid changes resulting from MNVs. **get_MNV** seeks to address this issue, enhancing the comprehensiveness of genetic variant interpretation.

get_MNV

**IMPORTANT this script works with SNV against a reference, insertions and deletions modifiying reading frame are not currently supported** ## 💾 Features - MNV Identification: Detects SNVs occurring within the same codon and reclassifies them as MNVs. - Accurate Amino Acid Change Calculation: Computes the resulting amino acid changes based on genomic reads. - Integration with BAM and VCF Files: Supports input from VCF files for variants and optional BAM files for aligned reads. - Quality Analysis: Allows setting a minimum Phred quality threshold to filter out low-quality reads. ## 🛠️ Installation You can install get_MNV via conda, mamba (for unix/mac) or downloading [the binary file](https://github.com/PathoGenOmics-Lab/get_MNV/releases/download/1.0.0/get_mnv) (unix): ### 🐍 Using conda ``` conda install -c bioconda get_mnv ``` ### 🐍 Using mamba ``` mamba install -c bioconda get_mnv ``` ### 📨 Using binary ``` wget https://github.com/PathoGenOmics-Lab/get_MNV/releases/download/1.0.0/get_mnv ``` # 📎 Usage ``` get_mnv [OPTIONS] --vcf --fasta --genes ``` ## 🗃️ Options: - -v, --vcf : VCF file containing the SNVs. (Required) - -b, --bam : BAM file with aligned reads. (Optional) - -f, --fasta : FASTA file with the reference sequence. (Required) - -g, --genes : File containing gene information. (Required) - -q, --quality : Minimum Phred quality score (default: 20). ## Example: ``` get_mnv \ --vcf variants.vcf \ --bam reads.bam \ --fasta reference.fasta \ --genes genes.txt \ --quality 30 ``` ## Input File Formats - VCF File: Should contain the identified SNVs. - BAM File: (Optional) Genomic reads aligned to the reference sequence. - FASTA File: Reference genomic sequence. - Gene File: A tab-delimited text file with the following structure per line (GeneName,GeneStart,GeneEnd,Strand): ```bash Rv0007_Rv0007 9914 10828 + ileT_Rvnt01 10887 10960 + alaT_Rvnt02 11112 11184 + Rv0008c_Rv0008c 11874 12311 - ppiA_Rv0009 12468 13016 + Rv0010c_Rv0010c 13133 13558 - ``` ## 🎴Output The program generates a TSV file named .MNV.tsv containing the following information: - Gene: Name of the gene. - Positions: Positions of the variants. - Base Changes: Nucleotide base changes. - AA Changes: Resulting amino acid changes. - SNP AA Changes: Amino acid changes if considering individual SNVs. - Variant Type: Type of variant (SNP, MNV, or SNP/MNV). - Change Type: Type of change at the protein level (Synonymous, Non-synonymous, Stop gained). - SNP Reads: (If BAM provided) Count of reads supporting each SNP. - MNV Reads: (If BAM provided) Count of reads supporting the MNV. Example: ``` Gene Positions Base Changes AA Changes SNP AA Changes Variant Type Change Type SNP Reads MNV Reads Rv0095c_Rv0095c 104838 T Asp126Glu Asp126Glu SNP Non-synonymous 0 16 Rv0095c_Rv0095c 104941,104942 T,G Gly92Gln Gly92Glu; Gly92Arg MNV Non-synonymous 0,0 25 esxL_Rv1198 1341044 C His13His His13His SNP Synonymous 0 41 esxL_Rv1198 1341083 G Ala26Ala Ala26Ala SNP Synonymous 0 12 esxL_Rv1198 1341102,1341103 T,C Arg33Ser Arg33Cys; Arg33Pro MNV Non-synonymous 0,0 11 ``` # 📉 Limitations - The script currently works only with SNVs compared against a reference sequence. - Insertions and deletions that modify the reading frame are not supported in this version. ---

✨ [Contributors]((https://github.com/PathoGenOmics-Lab/AMAP/graphs/contributors))

get_MNV is developed with ❤️ by:

Paula Ruiz-Rodriguez

💻 🔬 🤔 🔣 🎨 🔧

Mireia Coscolla

🔍 🤔 🧑‍🏫 🔬 📓
This project follows the [all-contributors](https://github.com/all-contributors/all-contributors) specification ([emoji key](https://allcontributors.org/docs/en/emoji-key)). --- # Fun ## 3D model logo Click for the stl file

get_MNV

Owner

  • Name: PathoGenOmics Lab
  • Login: PathoGenOmics-Lab
  • Kind: organization
  • Location: Spain

Citation (CITATION.cff)

# This CITATION.cff file was generated with cffinit.
# Visit https://bit.ly/cffinit to generate yours today!

cff-version: 1.2.0
title: get_MNV
message: 'If you use this software, please cite it as below.'
type: software
authors:
  - family-names: Ruiz-Rodriguez
    given-names: Paula
    orcid: 'https://orcid.org/0000-0003-0727-5974'
    affiliation: >-
      Institute for Integrative Systems Biology (I2SysBio,
      University of Valencia - CSIC), FISABIO Joint Research
      Unit “Infection and Public Health”, Paterna, Spain
    email: paula.ruiz-rodriguez@uv.es
  - family-names: Coscolla
    given-names: Mireia
    orcid: 'https://orcid.org/0000-0003-0752-0538'
    affiliation: >-
      Institute for Integrative Systems Biology (I2SysBio,
      University of Valencia - CSIC), FISABIO Joint Research
      Unit “Infection and Public Health”, Paterna, Spain
    email: mireia.coscolla@csic.es
url: 'https://github.com/PathoGenOmics-Lab/get_MNV'
license: GPL-3.0
version: 1.0.0
date-released: '2024-10-05'
doi: 10.5281/zenodo.13907423

GitHub Events

Total
  • Watch event: 3
  • Push event: 14
  • Fork event: 1
  • Create event: 2
Last Year
  • Watch event: 3
  • Push event: 14
  • Fork event: 1
  • Create event: 2