amr

In this repository, we have compiled the WGS analysis schematics of the bacterial isolates obtained from poultry manure of layer and broiler poultry farms

https://github.com/dinesh2k19/amr

Keywords

antimicrobial-resistance bacterial-genome-analysis manure poultry-birds wgs

Basic Info

Host: GitHub
Owner: dinesh2k19
License: cc0-1.0
Default Branch: main
Homepage: https://github.com/dinesh2k19/amr
Size: 1.78 MB

Statistics

Stars: 0
Watchers: 1
Forks: 0
Open Issues: 0
Releases: 1

Created over 3 years ago · Last pushed over 1 year ago

Poultry manure bacterial isolates

Understanding which antimicrobial resistance genes (ARGs) occur in poultry litter, and which bacteria harbour them, may provide insights into the potential for gene transmission to human microbial communities and opportunistic pathogens. This highlights the significance of a One-Health approach that considers the intricate relationships between human, animal, and environmental health.
The outcomes provide insights into how genes, phenotypes, and environmental variables interact to shape AMR profiles in poultry settings, with implications for agriculture and public health.

Graphical abstract

GA3

Whole genome sequence (WGS) analysis workflow

The WGS analysis workflow includes sequence quality checking, quality control, genome assembly, assembly quality assessment, taxonomy assignment, annotation of assembled contigs, and antimicrobial resistance gene prediction.

Setup Conda Environment for workflow management

Miniconda is a lightweight version of Anaconda that includes only Conda and its dependencies. If you need a full package, install Anaconda instead of Miniconda. For detailed instructions, please refer to https://docs.anaconda.com/anaconda/install/

Analysis workflow steps

We created a conda environment named env_amr, then installed and configured all the required packages and dependencies in it to execute the analysis workflow steps with the following commands.

Raw sequence quality assessment using FASTQC Toolkit (https://www.bioinformatics.babraham.ac.uk/projects/fastqc/)

bash $ conda create -n env_amr -c bioconda fastqc

bash $ conda activate env_amr

bash $ fastqc -o /fastqc_output -t 64 /data/*.fastq.gz

Adapter trimming and low quality filtering using PrinSeq-Lite v0.20.4 and fastp, and reports using MultiQC version 1.20

bash $ perl prinseq-lite.pl -fastq Sample1_R1.fastq.gz -fastq2 Sample1_R2.fastq.gz -min_qual_mean 30 -min_len 50 -ns_max_n 0

bash $ fastp -i Sample1_R1.fastq.gz -I Sample1_R2.fastq.gz -o S1_clean_R1.fastq.gz -O S1_clean_R2.fastq.gz -h S1_fastp_report.html -j S1_fastp_report.json --low_complexity_filter --cut_mean_quality 30 --length_required 50 --thread 16
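The trimming commands above are written for a single sample (Sample1). The following is a minimal sketch for finding every read pair in a directory so the same step can be repeated per sample. It assumes files follow the `<sample>_R1.fastq.gz` / `<sample>_R2.fastq.gz` naming pattern; the function name `list_pairs` and the `reads/` directory in the usage comment are hypothetical and not part of the original workflow.

```shell
# List read pairs found in a directory as: sample<TAB>R1 path<TAB>R2 path
# Assumption: files are named <sample>_R1.fastq.gz and <sample>_R2.fastq.gz.
list_pairs() {
    dir="$1"
    for r1 in "$dir"/*_R1.fastq.gz; do
        # If nothing matches, the glob stays literal; skip it.
        [ -e "$r1" ] || continue
        r2="${r1%_R1.fastq.gz}_R2.fastq.gz"
        sample="$(basename "$r1" _R1.fastq.gz)"
        if [ -e "$r2" ]; then
            printf '%s\t%s\t%s\n' "$sample" "$r1" "$r2"
        else
            # Report unpaired files on stderr so they are not silently dropped.
            printf 'warning: no R2 mate for %s\n' "$r1" >&2
        fi
    done
}

# Example use (hypothetical directory name):
#   list_pairs reads/ > pairs.tsv
# Each line of pairs.tsv can then supply the -i/-I (fastp) or
# -fastq/-fastq2 (prinseq-lite) arguments for one sample.
```

Writing the pairs to a table first, rather than running the tool inside the loop, makes it easy to check that no sample is missing a mate before any trimming starts.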

Generate MultiQC report

bash $ multiqc . -o multiqc_report/

Genome Assembly

Trimmed and filtered Illumina paired-end reads were assembled on a local server using Unicycler, an assembly pipeline for bacterial genomes, with the following command-line options.

bash $ unicycler -1 S1_filt_reads_R1.fastq.gz -2 S1_filt_reads_R2.fastq.gz -o output_dir --mode bold

Genome quality check and statistics using CheckM v2 and QUality ASsessment Tool (QUAST) v5.0.2

Genome assembly statistics were obtained for all samples using CheckM v2, and QUAST reports were generated.

bash $ checkm lineage_wf -x fasta output_dir/ checkm_report/

bash $ quast genome_dir/assembly.fasta -o quast_report/ --threads 64
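To make the assembly statistics concrete, here is a minimal sketch of how contig count, total length, and N50 can be derived from a FASTA file using only `awk` and `sort`. It illustrates the N50 definition (the length of the contig at which the running total of contig lengths, sorted longest first, reaches half of the assembly size). It is not a substitute for the QUAST report, and unlike QUAST it applies no minimum contig length filter. The function name `assembly_stats` is hypothetical.

```shell
# Print contig count, total length and N50 for a FASTA file.
assembly_stats() {
    # Step 1: emit one length per contig (sequences may span several lines).
    awk '
        /^>/ { if (len > 0) print len; len = 0; next }
        { gsub(/[ \t\r]/, ""); len += length($0) }
        END { if (len > 0) print len }
    ' "$1" | sort -rn | awk '
        # Step 2: lengths arrive longest first; accumulate until half the total.
        { lens[NR] = $1; total += $1 }
        END {
            half = total / 2
            for (i = 1; i <= NR; i++) {
                run += lens[i]
                if (run >= half) { n50 = lens[i]; break }
            }
            printf "contigs=%d total_bp=%d N50=%d\n", NR, total, n50
        }
    '
}

# Example use:
#   assembly_stats genome_dir/assembly.fasta
```

A quick check of this kind can be useful for spotting a failed assembly before running the slower CheckM step.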

Genome based taxonomy assignment using (GTDB-Tk) and multi-locus sequence typing (MLST) using PubMLST server (https://pubmlst.org/)

Taxonomy was assigned to the bacterial genomes using the Genome Taxonomy Database (GTDB), and multi-locus sequence typing (MLST) was performed on the PubMLST server.

bash $ gtdbtk classify_wf --genome_dir genome_dir/ --output_dir gtdbtk_report/ --cpus 64

Gene prediction using Prokka and annotation

Functional annotation was performed using subsystems from the SEED Servers (http://www.theseed.org/servers) and the Rapid Annotation using Subsystem Technology (RAST) Server (https://rast.nmpdr.org/), together with eggNOG-mapper v2 backed by precomputed eggNOG v5.0 clusters.

Identification of the Antimicrobial Resistance Genes (ARGs)

ARGs were identified from the nucleotide sequences of each bacterial genome assembly using the Resistance Gene Identifier (RGI), which predicts resistance genes based on homology and SNP models from the Comprehensive Antibiotic Resistance Database (CARD) (https://card.mcmaster.ca/).
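Once ARG predictions are exported as a tab-separated table, a per-category tally is often the first summary needed. The sketch below counts rows per distinct value of a named column in any TSV file with a header row. The function name `count_by_column` is hypothetical, and the column name `Drug Class` in the usage comment is an assumption about the RGI tabular output header, so it should be checked against the actual file.

```shell
# Count rows per distinct value of a named column in a tab-separated file.
# Usage: count_by_column <file.tsv> <column name>
count_by_column() {
    awk -F '\t' -v col="$2" '
        NR == 1 {
            # Locate the requested column by its header name.
            for (i = 1; i <= NF; i++) if ($i == col) idx = i
            if (!idx) { print "column not found: " col > "/dev/stderr"; exit 1 }
            next
        }
        { counts[$idx]++ }
        END { for (k in counts) printf "%s\t%d\n", k, counts[k] }
    ' "$1" | sort
}

# Example use (file and column names are assumptions):
#   count_by_column sample1_rgi.txt "Drug Class"
```

Looking the column up by name rather than by position keeps the summary working if the column order differs between tool versions.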

Prediction of phage sequences using PHASTER (PHAge Search Tool Enhanced Release) server (https://phaster.ca/)

The PHASTER web server was used for rapid identification and annotation of prophage sequences within the bacterial genome assemblies.

Analysis of virulence factors

Virulence factors were analysed using the VFDB server, version 2022 (http://www.mgc.ac.cn/cgi-bin/VFs/v5/main.cgi).

Analysis of integrative and conjugative elements (ICEs), integrative and mobilizable elements (IMEs), and cis-mobilizable elements (CIMEs)

The ICEfinder server (https://bioinfo-mml.sjtu.edu.cn/ICEfinder/ICEfinder.html) was used to detect ICEs and IMEs in the bacterial genomes.

Heavy metal resistance genes (HMRGs) analysis

HMRG sequences were downloaded to build a custom database and analysed using the hidden Markov model (HMM) profile of the Pfam protein family PF13801 (Heavy-metal resistance): https://www.ebi.ac.uk/interpro/entry/pfam/PF13801/

Genomic feature visualisation using Proksee server (https://proksee.ca/)

Proksee is a suite of command line tools for performing assembly and evaluation of microbial genomes. The Proksee server was used for genome assembly, annotation, and visualization, producing interactive circular and linear genome maps of the respective bacterial genome assemblies and their ARGs.

Schematic

Analysis-workflow

Taxonomic identification using 16S ribosomal RNA gene sequencing

MEGA 11 (Version 11.0.13) was used for multiple sequence alignment, and the phylogenetic analysis was done using a maximum likelihood approach with 1000 bootstrap replicates. The aligned sequences were then used to generate a circular phylogenetic tree with the iTOL server (https://itol.embl.de/). All 16S rRNA gene sequences have been submitted to the NCBI public repository (GenBank accession numbers PP053433-PP053445).

Genome Analysis and Visualisation in Proksee Server

Proksee (https://proksee.ca) provides users with a powerful, easy-to-use, and feature-rich system for assembling, annotating, analysing, and visualizing bacterial genomes. Proksee accepts Illumina sequence reads as compressed FASTQ files or pre-assembled contigs in raw, FASTA, or GenBank format.

The circular genome map below shows the features of a selected isolate (Acinetobacter indicus BRLT-5) obtained from broiler litter samples:

BRLT5

Raw Data Availability

The 16S rRNA nucleotide sequences of the selected poultry litter bacterial isolates used in this study are available in the National Center for Biotechnology Information (NCBI) database under accession numbers PP053433-PP053445. The raw reads generated through WGS have been submitted to the NCBI Sequence Read Archive (SRA) under BioProject accession number PRJNA1058239.

Funding Information

This research work was supported by Gujarat Biotechnology Research Centre (GBRC), Department of Science and Technology (DST), Government of Gujarat, Gandhinagar, India. The authors gratefully acknowledge the UKRI-GCRF One Health Poultry Hub (OHPH), UK for supporting the research activity. The Department of Biotechnology (DBT), Government of India, New Delhi is gratefully acknowledged for financial assistance to Animesh Tripathi in the form of a Junior Research Fellowship (DBTHRDPMU/JRF/BET-20/2020/AL/293).

Citation

If you find this repository helpful, we would appreciate it if you could cite the following publication:

Tripathi, Animesh, Anjali Jaiswal, Dinesh Kumar, Ramesh Pandit, Damer Blake, Fiona Tomley, Madhvi Joshi, Chaitanya G. Joshi, and Suresh Kumar Dubey. "High occurrence of antimicrobial resistance genes in bacteria isolated from poultry manure assessed using bacterial genome sequencing.

Owner

Name: Dinesh Kumar
Login: dinesh2k19
Kind: user
Location: Hyderabad, Telangana, India 500078
Company: Birla Institute of Technology and Science (BITS) Pilani Hyderabad Campus

Website: https://sites.google.com/view/dinesh2k19/
Twitter: dinesh_2k19
Repositories: 1
Profile: https://github.com/dinesh2k19

Postdoctoral Researcher at BITS-Pilani, Hyderabad Campus

Citation (citations.md)

## Please cite the following primary sources and databases

## Pipeline tools

- ## FastQC
  > https://www.bioinformatics.babraham.ac.uk/projects/fastqc/

- ## PRINSEQ: PReprocessing and INformation of SEQuence data (http://prinseq.sourceforge.net)
  > Schmieder R, Edwards R. Quality control and preprocessing of metagenomic datasets. Bioinformatics 2011;27:863–4. doi: 10.1093/bioinformatics/btr026.

- ## MultiQC 
  > Ewels P, Magnusson M, Lundin S, Käller M. MultiQC: summarize analysis results for multiple tools and samples in a single report. Bioinformatics 2016;32:3047–8. doi: 10.1093/bioinformatics/btw354.

- ## Unicycler 
  > Wick RR, Judd LM, Gorrie CL, Holt KE. Unicycler: Resolving bacterial genome assemblies from short and long sequencing reads. PLOS Comput Biol 2017;13:e1005595. doi: 10.1371/journal.pcbi.1005595.

- ## CheckM 
  > Parks DH, Imelfort M, Skennerton CT, Hugenholtz P, Tyson GW. CheckM: assessing the quality of microbial genomes recovered from isolates, single cells, and metagenomes. Genome Res 2015;25:1043–55. doi: 10.1101/gr.186072.114.

- ## QUAST 
  > Gurevich A, Saveliev V, Vyahhi N, Tesler G. QUAST: quality assessment tool for genome assemblies. Bioinformatics 2013;29:1072–5. doi: 10.1093/bioinformatics/btt086.

- ## GTDB-Tk
  > Chaumeil P-A, Mussig AJ, Hugenholtz P, Parks DH. GTDB-Tk: a toolkit to classify genomes with the Genome Taxonomy Database. Bioinformatics 2020;36:1925–7. doi: 10.1093/bioinformatics/btz848.

- ## PubMLST Server 
  > Jolley KA, Bray JE, Maiden MCJ. Open-access bacterial population genomics: BIGSdb software, the PubMLST.org website and their applications. Wellcome Open Res 2018;3:124. doi: 10.12688/wellcomeopenres.14826.1.

- ## CARD Database
  > Alcock BP, Huynh W, Chalil R, Smith KW, Raphenya AR, Wlodarski MA, et al. CARD 2023: expanded curation, support for machine learning, and resistome prediction at the Comprehensive Antibiotic Resistance Database. Nucleic Acids Res 2023;51:D690–9. doi: 10.1093/nar/gkac920.

- ## Proksee Server
  > Grant JR, Enns E, Marinier E, Mandal A, Herman EK, Chen C, et al. Proksee: in-depth characterization and visualization of bacterial genomes. Nucleic Acids Res 2023;51:W484–92. doi: 10.1093/nar/gkad326.

- ## The SEED Server 
  > Overbeek R, Olson R, Pusch GD, Olsen GJ, Davis JJ, Disz T, et al. The SEED and the Rapid Annotation of microbial genomes using Subsystems Technology (RAST). Nucleic Acids Res 2014;42:D206–14. doi: 10.1093/nar/gkt1226.

- ## PHASTER 
  > Arndt D, Grant JR, Marcu A, Sajed T, Pon A, Liang Y, et al. PHASTER: a better, faster version of the PHAST phage search tool. Nucleic Acids Res 2016;44:W16–21. doi: 10.1093/nar/gkw387.

- ## PHAST
  > Zhou Y, Liang Y, Lynch KH, Dennis JJ, Wishart DS. PHAST: A Fast Phage Search Tool. Nucleic Acids Res 2011;39:W347–52. doi: 10.1093/nar/gkr485.

- ## VFDB 2022
  > Liu B, Zheng D, Zhou S, Chen L, Yang J. VFDB 2022: a general classification scheme for bacterial virulence factors. Nucleic Acids Res 2022;50:D912–7. doi: 10.1093/nar/gkab1107.

- ## eggNOG 5.0
  > Huerta-Cepas J, Szklarczyk D, Heller D, Hernández-Plaza A, Forslund SK, Cook H, et al. eggNOG 5.0: a hierarchical, functionally and phylogenetically annotated orthology resource based on 5090 organisms and 2502 viruses. Nucleic Acids Res 2019;47:D309–14. doi: 10.1093/nar/gky1085.

- ## eggNOG-mapper v2
  > Cantalapiedra CP, Hernández-Plaza A, Letunic I, Bork P, Huerta-Cepas J. eggNOG-mapper v2: Functional annotation, orthology assignments, and domain prediction at the metagenomic scale. Mol Biol Evol 2021;38:5825–9. doi: 10.1093/molbev/msab293.

## Tools
- tRNAscan-SE 2.0 https://doi.org/10.1093/nar/gkab688
- Aragorn https://doi.org/10.1093/nar/gkh152
- Diamond https://doi.org/10.1038/s41592-021-01101-x
- BLAST+ https://doi.org/10.1186/1471-2105-10-421
- HMMER https://doi.org/10.1371/journal.pcbi.1002195

## Databases
- UniProt: https://doi.org/10.1093/nar/gky1049
- RefSeq: https://doi.org/10.1093/nar/gkx1068
- COG: https://doi.org/10.1093/bib/bbx117
- KEGG: https://doi.org/10.1093/bioinformatics/btz859
- Pfam: https://doi.org/10.1093/nar/gky995
- VFDB: https://doi.org/10.1093/nar/gky1080

## Software packaging/containerisation tools

- [Anaconda](https://anaconda.com)

- [Bioconda](https://pubmed.ncbi.nlm.nih.gov/29967506/)

  > Grüning B, Dale R, Sjödin A, Chapman BA, Rowe J, Tomkins-Tinch CH, Valieris R, Köster J; Bioconda Team. Bioconda: sustainable and comprehensive software distribution for the life sciences. Nat Methods. 2018 Jul;15(7):475-476.

  > da Veiga Leprevost F, Grüning B, Aflitos SA, Röst HL, Uszkoreit J, Barsnes H, Vaudel M, Moreno P, Gatto L, Weber J, Bai M, Jimenez RC, Sachsenberg T, Pfeuffer J, Alvarez RV, Griss J, Nesvizhskii AI, Perez-Riverol Y. BioContainers: an open-source and community-driven framework for software standardization. Bioinformatics. 2017 Aug 15;33(16):2580-2582.
