variantcatalogue_fix

This is a fix to the nf-core/variantcatalogue.

https://github.com/pinegraphite123/variantcatalogue_fix

Science Score: 44.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
    Found CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
  • Academic publication links
  • Academic email domains
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (2.4%) to scientific vocabulary
Last synced: 6 months ago · JSON representation ·

Repository

This is a fix to the nf-core/variantcatalogue.

Basic Info
  • Host: GitHub
  • Owner: Pinegraphite123
  • License: mit
  • Language: Python
  • Default Branch: fix-forked-pipeline
  • Size: 0 Bytes
Statistics
  • Stars: 0
  • Watchers: 0
  • Forks: 0
  • Open Issues: 0
  • Releases: 0
Created 10 months ago · Last pushed 10 months ago
Metadata Files
Readme Changelog Contributing License Code of conduct Citation

README.md

This is a modified version of originally abandoned nf-core/VariantCatalogue

Modification: 1: MT workflow has been deleted, along with unused params 2: --region-bed in Deepvariant removed 3: added VCF indexing and trimming (deletes unreadable contig) just before Hail steps

The VCF files from Hail are usable with Maftools. First convert them to .maf files (Filter for PASS), then read in Maftools in R. This step is possible to be pipelined with VEP, might be fixed in a future update.

Owner

  • Login: Pinegraphite123
  • Kind: user

Citation (CITATIONS.md)

# nf-core/variantcatalogue: Citations

## [The varaint catalogue pipeline](https://www.biorxiv.org/content/10.1101/2022.10.03.508010v2)
> Correard S, Abdallah OE M, Hewitson B, Wasserman WW. The variant catalogue pipeline: A workflow to generate a background variant library from Whole Genome Sequences. bioRxiv 2022.10.03.508010; doi: https://doi.org/10.1101/2022.10.03.508010

## [nf-core](https://pubmed.ncbi.nlm.nih.gov/32055031/)

> Ewels PA, Peltzer A, Fillinger S, Patel H, Alneberg J, Wilm A, Garcia MU, Di Tommaso P, Nahnsen S. The nf-core framework for community-curated bioinformatics pipelines. Nat Biotechnol. 2020 Mar;38(3):276-278. doi: 10.1038/s41587-020-0439-x. PubMed PMID: 32055031.

## [Nextflow](https://pubmed.ncbi.nlm.nih.gov/28398311/)

> Di Tommaso P, Chatzou M, Floden EW, Barja PP, Palumbo E, Notredame C. Nextflow enables reproducible computational workflows. Nat Biotechnol. 2017 Apr 11;35(4):316-319. doi: 10.1038/nbt.3820. PubMed PMID: 28398311.

## Pipeline tools


- [BCFTools](https://pubmed.ncbi.nlm.nih.gov/21903627/)

  > Li H: A statistical framework for SNP calling, mutation discovery, association mapping and population genetical parameter estimation from sequencing data. Bioinformatics. 2011 Nov 1;27(21):2987-93. doi: 10.1093/bioinformatics/btr509. PubMed PMID: 21903627; PubMed Central PMCID: PMC3198575.

- [BWA-MEM](https://arxiv.org/abs/1303.3997v2)

  > Li H: Aligning sequence reads, clone sequences and assembly contigs with BWA-MEM. arXiv 2013. doi: 10.48550/arXiv.1303.3997

- [DeepVariant](https://www.nature.com/articles/nbt.4235)

  > Poplin, R., Chang, PC., Alexander, D. et al. A universal SNP and small-indel variant caller using deep neural networks. Nat Biotechnol 36, 983–987 (2018). doi: 10.1038/nbt.4235.

- [EnsemblVEP](https://pubmed.ncbi.nlm.nih.gov/27268795/)

  > McLaren W, Gil L, Hunt SE, et al.: The Ensembl Variant Effect Predictor. Genome Biol. 2016 Jun 6;17(1):122. doi: 10.1186/s13059-016-0974-4. PubMed PMID: 27268795; PubMed Central PMCID: PMC4893825.

- [FastQC](https://www.bioinformatics.babraham.ac.uk/projects/fastqc/)

- [GATK](https://pubmed.ncbi.nlm.nih.gov/20644199/)

  > McKenna A, Hanna M, Banks E, et al.: The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. Genome Res. 2010 Sep;20(9):1297-303. doi: 10.1101/gr.107524.110. Epub 2010 Jul 19. PubMed PMID: 20644199; PubMed Central PMCID: PMC2928508.

- [GLnexus](https://academic.oup.com/bioinformatics/article/36/24/5582/6064144)

  > Yun T, Li H, Chang PC, Lin MF, Carroll A, McLean CY. Accurate, scalable cohort variant calls using DeepVariant and GLnexus. Robinson P, ed. Bioinformatics. 2021;36(24):5582-5589. doi:10.1093/bioinformatics/btaa1081

- [Hail](https://github.com/hail-is/hail)
> Hail Team. Hail 0.2. https://github.com/hail-is/hail

- [Mosdepth](https://academic.oup.com/bioinformatics/article/34/5/867/4583630)

  > Brent S Pedersen, Aaron R Quinlan, Mosdepth: quick coverage calculation for genomes and exomes, Bioinformatics, Volume 34, Issue 5, 01 March 2018, Pages 867–868. doi: 10.1093/bioinformatics/btx699. PubMed PMID: 29096012. PubMed Central PMCID: PMC6030888.

- [MultiQC](https://pubmed.ncbi.nlm.nih.gov/27312411/)
  > Ewels P, Magnusson M, Lundin S, Käller M. MultiQC: summarize analysis results for multiple tools and samples in a single report. Bioinformatics. 2016 Oct 1;32(19):3047-8. doi: 10.1093/bioinformatics/btw354. Epub 2016 Jun 16. PubMed PMID: 27312411; PubMed Central PMCID: PMC5039924.

- [picard-tools](http://broadinstitute.github.io/picard)

- [SAMtools](https://pubmed.ncbi.nlm.nih.gov/19505943/)

  > Li H, Handsaker B, Wysoker A, Fennell T, Ruan J, Homer N, Marth G, Abecasis G, Durbin R; 1000 Genome Project Data Processing Subgroup. The Sequence Alignment/Map format and SAMtools. Bioinformatics. 2009 Aug 15;25(16):2078-9. doi: 10.1093/bioinformatics/btp352. Epub 2009 Jun 8. PubMed PMID: 19505943; PubMed Central PMCID: PMC2723002.

- [TRIMMOMATIC](http://www.usadellab.org/cms/?page=trimmomatic)

  > Bolger, A. M., Lohse, M., & Usadel, B. (2014). Trimmomatic: A flexible trimmer for Illumina Sequence Data. Bioinformatics, btu170. doi: 10.1093/bioinformatics/btu170

## Software packaging/containerisation tools

- [Anaconda](https://anaconda.com)

  > Anaconda Software Distribution. Computer software. Vers. 2-2.4.0. Anaconda, Nov. 2016. Web.

- [Bioconda](https://pubmed.ncbi.nlm.nih.gov/29967506/)

  > Grüning B, Dale R, Sjödin A, Chapman BA, Rowe J, Tomkins-Tinch CH, Valieris R, Köster J; Bioconda Team. Bioconda: sustainable and comprehensive software distribution for the life sciences. Nat Methods. 2018 Jul;15(7):475-476. doi: 10.1038/s41592-018-0046-7. PubMed PMID: 29967506.

- [BioContainers](https://pubmed.ncbi.nlm.nih.gov/28379341/)

  > da Veiga Leprevost F, Grüning B, Aflitos SA, Röst HL, Uszkoreit J, Barsnes H, Vaudel M, Moreno P, Gatto L, Weber J, Bai M, Jimenez RC, Sachsenberg T, Pfeuffer J, Alvarez RV, Griss J, Nesvizhskii AI, Perez-Riverol Y. BioContainers: an open-source and community-driven framework for software standardization. Bioinformatics. 2017 Aug 15;33(16):2580-2582. doi: 10.1093/bioinformatics/btx192. PubMed PMID: 28379341; PubMed Central PMCID: PMC5870671.

- [Docker](https://dl.acm.org/doi/10.5555/2600239.2600241)

- [Singularity](https://pubmed.ncbi.nlm.nih.gov/28494014/)
  > Kurtzer GM, Sochat V, Bauer MW. Singularity: Scientific containers for mobility of compute. PLoS One. 2017 May 11;12(5):e0177459. doi: 10.1371/journal.pone.0177459. eCollection 2017. PubMed PMID: 28494014; PubMed Central PMCID: PMC5426675.

GitHub Events

Total
  • Push event: 1
  • Create event: 2
Last Year
  • Push event: 1
  • Create event: 2

Dependencies

.github/workflows/awsfulltest.yml actions
  • actions/upload-artifact v3 composite
  • nf-core/tower-action v3 composite
.github/workflows/awstest.yml actions
  • actions/upload-artifact v3 composite
  • nf-core/tower-action v3 composite
.github/workflows/branch.yml actions
  • mshick/add-pr-comment v1 composite
.github/workflows/ci.yml actions
  • actions/checkout v3 composite
  • nf-core/setup-nextflow v1 composite
.github/workflows/fix-linting.yml actions
  • actions/checkout v3 composite
  • actions/setup-node v3 composite
.github/workflows/linting.yml actions
  • actions/checkout v3 composite
  • actions/setup-node v3 composite
  • actions/setup-python v4 composite
  • actions/upload-artifact v3 composite
  • mshick/add-pr-comment v1 composite
  • nf-core/setup-nextflow v1 composite
  • psf/black stable composite
.github/workflows/linting_comment.yml actions
  • dawidd6/action-download-artifact v2 composite
  • marocchino/sticky-pull-request-comment v2 composite
modules/local/bcftools/norm/meta.yml cpan
modules/local/bwa/mem/meta.yml cpan
modules/local/collecthsmetrics/meta.yml cpan
modules/local/deepvariant/meta.yml cpan
modules/local/filtermutectcalls/meta.yml cpan
modules/local/gatk4/leftalignandtrimvariants/meta.yml cpan
modules/local/gatk4/mergemutectstats/meta.yml cpan
modules/local/gatk4/printreads/meta.yml cpan
modules/local/gatk4/samtofastq/meta.yml cpan
modules/local/haplocheck/meta.yml cpan
modules/local/variantfiltration/meta.yml cpan
modules/nf-core/bcftools/annotate/meta.yml cpan
modules/nf-core/bcftools/index/meta.yml cpan
modules/nf-core/bcftools/norm/meta.yml cpan
modules/nf-core/bcftools/view/meta.yml cpan
modules/nf-core/bwa/index/meta.yml cpan
modules/nf-core/bwa/mem/meta.yml cpan
modules/nf-core/custom/dumpsoftwareversions/meta.yml cpan
modules/nf-core/deepvariant/meta.yml cpan
modules/nf-core/ensemblvep/vep/meta.yml cpan
modules/nf-core/fastqc/meta.yml cpan
modules/nf-core/gatk4/filtermutectcalls/meta.yml cpan
modules/nf-core/gatk4/leftalignandtrimvariants/meta.yml cpan
modules/nf-core/gatk4/markduplicates/meta.yml cpan
modules/nf-core/gatk4/mergemutectstats/meta.yml cpan
modules/nf-core/gatk4/mergevcfs/meta.yml cpan
modules/nf-core/gatk4/mutect2/meta.yml cpan
modules/nf-core/gatk4/printreads/meta.yml cpan
modules/nf-core/gatk4/samtofastq/meta.yml cpan
modules/nf-core/gatk4/variantfiltration/meta.yml cpan
modules/nf-core/glnexus/meta.yml cpan
modules/nf-core/haplocheck/meta.yml cpan
modules/nf-core/mosdepth/meta.yml cpan
modules/nf-core/multiqc/meta.yml cpan
modules/nf-core/picard/collecthsmetrics/meta.yml cpan
modules/nf-core/picard/collectwgsmetrics/meta.yml cpan
modules/nf-core/samtools/index/meta.yml cpan
modules/nf-core/tabix/tabix/meta.yml cpan
modules/nf-core/trimmomatic/meta.yml cpan
modules/nf-core/bcftools/index/environment.yml pypi
modules/nf-core/bcftools/view/environment.yml pypi
pyproject.toml pypi