amr
In this repository, we have compiled the WGS analysis schematics of the bacterial isolates obtained from poultry manure of layer and broiler poultry farms
Science Score: 67.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
✓CITATION.cff file
Found CITATION.cff file -
✓codemeta.json file
Found codemeta.json file -
✓.zenodo.json file
Found .zenodo.json file -
✓DOI references
Found 6 DOI reference(s) in README -
✓Academic publication links
Links to: zenodo.org -
○Academic email domains
-
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (10.6%) to scientific vocabulary
Keywords
Repository
In this repository, we have compiled the WGS analysis schematics of the bacterial isolates obtained from poultry manure of layer and broiler poultry farms
Basic Info
- Host: GitHub
- Owner: dinesh2k19
- License: cc0-1.0
- Default Branch: main
- Homepage: https://github.com/dinesh2k19/amr
- Size: 1.78 MB
Statistics
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
- Releases: 1
Topics
Metadata Files
README.md
Poultry manure bacterial isolates
- Comprehending the existence of Antimicrobial Resistant Genes (ARGs) in poultry litter and bacteria harbouring them may provide insights into the potentiality of gene transmission into microbial communities of humans and opportunistic pathogens, highlighting the significance of a One-Health approach that considers the intricate relationships between humans, animals, and environmental health.
- The outcomes provide insights into the interaction of genes, phenotypes, and environmental variables in forming AMR profiles in poultry contexts, which has implications for agricultural settings and public health.
Graphical abstract
Whole genome sequence (WGS) analysis workflow
- WGS Analysis workflow includes sequence quality checking, quality control, taxonomy assignment, genome assembly, assembly quality assessment, assembled contig annotation, antimicrobial resistance, and taxonomy predictions.
Setup Conda Environment for workflow management
- Miniconda is a lightweight version of Anaconda that includes only Conda and its dependencies. If you need a full package, install Anaconda instead of Miniconda. For detailed instructions, please refer to https://docs.anaconda.com/anaconda/install/
Analysis workflow steps
- We created a conda environment named
env_amrand installed all the required packages, dependencies were installed and configured in this environment to execute the analysis workflow steps as per following commands. - Raw sequence quality assessment using FASTQC Toolkit (https://www.bioinformatics.babraham.ac.uk/projects/fastqc/)
bash $ conda create -n env_amr -c bioconda fastqc
bash
$ conda activate env_amr
bash
$ fastqc -o /fastqc_output -t 64 /data/*.fastq.gz
Adapter trimming and low quality filtering using PrinSeq-Lite v0.20.4 and reports using MultiQC version 1.20
bash
$ perl prinseq-lite.pl -fastq Sample1_R1.fastq.gz -fastq2 Sample1_R2.fastq.gz -min_qual_mean 30 -min_len 50 -ns_max_n 0
bash
$ fastp -i Sample1_R1.fastq.gz -I Sample1_R2.fastq.gz -o S1_clean_R1.fastq.gz -O S1_clean_R2.fastq.gz -h S1_fastp_report.html -j S1_fastp_report.json --low_complexity_filter --cut_mean_quality 30 --min_length 50 --thread 16
Generate MultiQC report
bash
$ multiqc . -o multiqc_report/
Genome Assembly
- Illumina Paired-end trimmed and filtered reads were assembled using Unicycler with the command line options at local server. Unicycler is an assembly pipeline for bacterial genomes.
bash $ unicycler -1 S1_filt_reads_R1.fastq.gz -2 S1_filt_reads_R2.fastq.gz -o output_dir --assembly_method bold## Genome quality check and statistics using CheckM v2 and QUality ASsessment Tool (QUAST) v5.0.2 - We obtained the genomic assembly statistics using CheckM v2 for all the samples and obtained the QUAST reports
bash $ checkm lineage_wf -x fasta output_dir/ checkm_report/
bash
$ quast -i genome_dir/assembly.fasta -o quast_report/ --threads 64 --m bacterial
Genome based taxonomy assignment using (GTDB-Tk) and multi-locus sequence typing (MLST) using PubMLST server (https://pubmlst.org/)
- The bacterial genomes were assigned taxonomy using Genome based taxonomy database and processed for the multi-locus strain typing by PubMLST server.
bash
$ gtdbtk classify_wf --genome_dir genome_dir/ --output_dir gtdbtk_report/ --cpus 64
Gene prediction using Prokka and annotation
- Subsystems utilizing the SEED Servers (http://www.theseed.org/servers) and Rapid Annotation using Subsystem Technology (RAST) Server (https://rast.nmpdr.org/) and eggNOG-mapper v2 assisted with precomputed eggNOG v5.0 clusters were used to perform functional annotation.
Idenitfication of the Antimicrobial Resistant Genes (ARGs)
- The ARGs were identified using the Resistance Gene Identifier (RGI) from respective bacterial genome assembly nucleotide sequences based on homology and SNP models using Comprehensive Antibiotic Resistance Database (CARD) (https://card.mcmaster.ca/)
Prediction of phage sequences using PHASTER (PHAge Search Tool Enhanced Release) server (https://phaster.ca/).
- PHASTER web server was used for the rapid identification and annotation of prophage sequences within bacterial genome assemblies.
Analysis of virulence factors
- Analysis was performed using VFDB server version 2022 (http://www.mgc.ac.cn/cgi-bin/VFs/v5/main.cgi)
Analysis of Integrative and conjugative elements (ICEs), integrative and mobilizable elements (IMEs), and cis-mobilizable elements (CIMEs)
- ICEfinder server was used (https://bioinfo-mml.sjtu.edu.cn/ICEfinder/ICEfinder.html) for the detection of ICEs/IMEs of bacterial genomes.
The heavy metal resistant genes (HMRGs) analysis
- The HMRGs sequences were downloaded to create a custom database and analysed using the Hidden Markov models (HMMs) profile protein family (PF13801: Heavy-metal resistance) https://www.ebi.ac.uk/interpro/entry/pfam/PF13801/
Genomic feature visulisation using Proksee server (https://proksee.ca/). Proksee is a suite of command line tools for performing assembly and evaluation of microbial genomes.
- Proksee server system was used for genome assembly, annotation and visualization, featuring interactive circular and linear genome maps for the respective bacterial genome assemblies, and ARGs
Schematic
Taxonomic identification using 16S ribosomal RNA gene sequencing
MEGA 11 (Version 11.0.13) was employed for multiple sequence alignment, while the phylogenetic investigation was done using a maximum likelihood approach with 1000 bootstrap replicates. Using the iTOL server, aligned sequences were subsequently utilized to generate a circular phylogenetic tree (https://itol.embl.de/). All 16S rRNA gene sequences have been submitted to the NCBI public repository (GenBank Accession numbers PP053433-PP053445)
Genome Analysis and Visualisation in Proksee Server
Proksee (https://proksee.ca) provides users with a powerful, easy-to-use, and feature-rich system for assembling, annotating, analysing, and visualizing bacterial genomes. Proksee accepts Illumina sequence reads as compressed FASTQ files or pre-assembled contigs in raw, FASTA, or GenBank format.
The circular genome visualised below incidate the features of selected isolate (Acinetobacter indicus BRLT-5) isolated from broiler litter samples:
Raw Data Availaility
Accession numbers (PP053433-PP053445) of 16S rRNA nucleotide sequences of selected poultry litter bacterial isolates used in this study are available in National Centre for Biotechnology Information (NCBI) database and the raw reads generated through WGS has been submitted to the NCBI under Sequence Read Archive (SRA) submission with Bio-project accession number PRJNA1058239.
Funding Information
This research work was supported by Gujarat Biotechnology Research Centre (GBRC), Department of Science and Technology (DST), Government of Gujarat, Gandhinagar, India. Authors would like to gratefully acknowledge the UKRI-GCRF One Health Poultry Hub (OHPH), UK for supporting research activity. Department of Biotechnology (DBT), Government of India, New Delhi gratefully acknowledged for financial assistance to Animesh Tripathi in the form of Junior Research Fellow (DBTHRDPMU/JRF/BET-20/2020/AL/293).
Citation
If you find it helpful, we would appreciate if you could cite the following publication:
Tripathi, Animesh, Anjali Jaiswal, Dinesh Kumar, Ramesh Pandit, Damer Blake, Fiona Tomley, Madhvi Joshi, Chaitanya G. Joshi, and Suresh Kumar Dubey. "High occurrence of antimicrobial resistance genes in bacteria isolated from poultry manure assessed using bacterial genome sequencing.
Owner
- Name: Dinesh Kumar
- Login: dinesh2k19
- Kind: user
- Location: Hyderabad, Telangana, India 500078
- Company: Birla Institute of Technology and Science (BITS) Pilani Hyderabad Campus
- Website: https://sites.google.com/view/dinesh2k19/
- Twitter: dinesh_2k19
- Repositories: 1
- Profile: https://github.com/dinesh2k19
Postdoctoral Researcher at BITS-Pilani, Hyderabad Campus
Citation (citations.md)
## Please cite the following primary sources and databases ## Pipeline tools - ## FastQC > https://www.bioinformatics.babraham.ac.uk/projects/fastqc/ - ## PRINSEQ: PReprocessing and INformation of SEQuence data (http://prinseq.sourceforge.net) > Schmieder R, Edwards R. Quality control and preprocessing of metagenomic datasets. Bioinformatics 2011;27:863–4. doi: 10.1093/bioinformatics/btr026. - ## MultiQC > Ewels P, Magnusson M, Lundin S, Käller M. MultiQC: summarize analysis results for multiple tools and samples in a single report. Bioinformatics 2016;32:3047–8. doi: 10.1093/bioinformatics/btw354. - ## Unicycler > Wick RR, Judd LM, Gorrie CL, Holt KE. Unicycler: Resolving bacterial genome assemblies from short and long sequencing reads. PLOS Comput Biol 2017;13:e1005595. doi: 10.1371/journal.pcbi.1005595. - ## CheckM > Parks DH, Imelfort M, Skennerton CT, Hugenholtz P, Tyson GW. CheckM: assessing the quality of microbial genomes recovered from isolates, single cells, and metagenomes. Genome Res 2015;25:1043–55. doi: 10.1101/gr.186072.114. - ## QUAST > Gurevich A, Saveliev V, Vyahhi N, Tesler G. QUAST: quality assessment tool for genome assemblies. Bioinformatics 2013;29:1072–5. doi: 10.1093/bioinformatics/btt086. - ## GTDB-Tk > Chaumeil P-A, Mussig AJ, Hugenholtz P, Parks DH. GTDB-Tk: a toolkit to classify genomes with the Genome Taxonomy Database. Bioinformatics 2020;36:1925–7. doi: 10.1093/bioinformatics/btz848. - ## PubMLST Server > Jolley KA, Bray JE, Maiden MCJ. Open-access bacterial population genomics: BIGSdb software, the PubMLST.org website and their applications. Wellcome Open Res 2018;3:124. doi: 10.12688/wellcomeopenres.14826.1. - ## CARD Database > Alcock BP, Huynh W, Chalil R, Smith KW, Raphenya AR, Wlodarski MA, et al. CARD 2023: expanded curation, support for machine learning, and resistome prediction at the Comprehensive Antibiotic Resistance Database. Nucleic Acids Res 2023;51:D690–9. doi: 10.1093/nar/gkac920. - ## Proksee Server > Grant JR, Enns E, Marinier E, Mandal A, Herman EK, Chen C, et al. Proksee: in-depth characterization and visualization of bacterial genomes. Nucleic Acids Res 2023;51:W484–92. doi: 10.1093/nar/gkad326. - ## The SEED Server > Overbeek R, Olson R, Pusch GD, Olsen GJ, Davis JJ, Disz T, et al. The SEED and the Rapid Annotation of microbial genomes using Subsystems Technology (RAST). Nucleic Acids Res 2014;42:D206–14. doi: 10.1093/nar/gkt1226. - ## PHASTER > Arndt D, Grant JR, Marcu A, Sajed T, Pon A, Liang Y, et al. PHASTER: a better, faster version of the PHAST phage search tool. Nucleic Acids Res 2016;44:W16–21. doi: 10.1093/nar/gkw387. - ## PHAST > Zhou Y, Liang Y, Lynch KH, Dennis JJ, Wishart DS. PHAST: A Fast Phage Search Tool. Nucleic Acids Res 2011;39:W347–52. doi: 10.1093/nar/gkr485. - ## VFDB 2022 > Liu B, Zheng D, Zhou S, Chen L, Yang J. VFDB 2022: a general classification scheme for bacterial virulence factors. Nucleic Acids Res 2022;50:D912–7. doi: 10.1093/nar/gkab1107. - ## eggNOG 5.0 > Huerta-Cepas J, Szklarczyk D, Heller D, Hernández-Plaza A, Forslund SK, Cook H, et al. eggNOG 5.0: a hierarchical, functionally and phylogenetically annotated orthology resource based on 5090 organisms and 2502 viruses. Nucleic Acids Res 2019;47:D309–14. doi: 10.1093/nar/gky1085. - ## eggNOG-mapper v2 > Cantalapiedra CP, Hernández-Plaza A, Letunic I, Bork P, Huerta-Cepas J. eggNOG-mapper v2: Functional annotation, orthology assignments, and domain prediction at the metagenomic scale. Mol Biol Evol 2021;38:5825–9. doi: 10.1093/molbev/msab293. ## Tools - tRNAscan-SE 2.0 https://doi.org/10.1093/nar/gkab688 - Aragorn https://doi.org/10.1093/nar/gkh152 - Diamond https://doi.org/10.1038/s41592-021-01101-x - BLAST+ https://doi.org/10.1186/1471-2105-10-421 - HMMER https://doi.org/10.1371/journal.pcbi.1002195 ## Databases - UniProt: https://doi.org/10.1093/nar/gky1049 - RefSeq: https://doi.org/10.1093/nar/gkx1068 - COG: https://doi.org/10.1093/bib/bbx117 - KEGG: https://doi.org/10.1093/bioinformatics/btz859 - Pfam: https://doi.org/10.1093/nar/gky995 - VFDB: https://doi.org/10.1093/nar/gky1080 ## Software packaging/containerisation tools - [Anaconda](https://anaconda.com) - [Bioconda](https://pubmed.ncbi.nlm.nih.gov/29967506/) > Grüning B, Dale R, Sjödin A, Chapman BA, Rowe J, Tomkins-Tinch CH, Valieris R, Köster J; Bioconda Team. Bioconda: sustainable and comprehensive software distribution for the life sciences. Nat Methods. 2018 Jul;15(7):475-476. > da Veiga Leprevost F, Grüning B, Aflitos SA, Röst HL, Uszkoreit J, Barsnes H, Vaudel M, Moreno P, Gatto L, Weber J, Bai M, Jimenez RC, Sachsenberg T, Pfeuffer J, Alvarez RV, Griss J, Nesvizhskii AI, Perez-Riverol Y. BioContainers: an open-source and community-driven framework for software standardization. Bioinformatics. 2017 Aug 15;33(16):2580-2582.
GitHub Events
Total
- Release event: 1
- Watch event: 1
- Push event: 20
- Create event: 1
Last Year
- Release event: 1
- Watch event: 1
- Push event: 20
- Create event: 1