Science Score: 44.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
✓CITATION.cff file
Found CITATION.cff file -
✓codemeta.json file
Found codemeta.json file -
✓.zenodo.json file
Found .zenodo.json file -
○DOI references
-
○Academic publication links
-
○Academic email domains
-
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (7.5%) to scientific vocabulary
Repository
Basic Info
- Host: GitHub
- Owner: umccr
- License: mit
- Language: Nextflow
- Default Branch: main
- Size: 2.51 MB
Statistics
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 7
- Releases: 0
Metadata Files
README.md
sash
sash is the UMCCR post-processing WGS workflow. The workflow takes DRAGEN small variant calls and oncoanalyser results
as input to perform annotation, prioritisation, rescue and filtering, and reporting for the WGS variant data.
Additionally, sash runs several sensors for biomarker assessment and genomic characterisation including HRD status,
mutational signatures, purity/ploidy, MSI, and TMB.
While the sash workflow utilises a range of tools and software, it is most closely coupled with
bolt, a Python package that implements the UMCCR post-processing logic and supporting
functionality.
Table of contents
Summary
The general processes sash runs include:
- gpgr for generating the summary Cancer Report
- PCGR to report processed small somatic variants (annotated, rescued, filtered, prioritised)
- CPSR to report processed small germline variants (filtered, annotated, prioritised)
- linxreport to collate SV annotations and plots from LINX
- MultiQC for reporting various WGS statistics / metrics for QC
- SAGE variant calling to supplement DRAGEN small somatic variants
- PURPLE for TMB, MSI, CNV calling, and purity / ploidy estimation
- HRDetect and CHORD for HRD inference
- MutationalPatterns to fit mutational signatures
Requirements
- Java
- Nextflow ≥22.10.6
- Docker
Usage
Create a samplesheet
text
id,subject_name,sample_name,filetype,filepath
subject_a.example,subject_a,sample_germline,dragen_germline_dir,/path/to/dragen_germline/
subject_a.example,subject_a,sample_somatic,dragen_somatic_dir,/path/to/dragen_somatic/
subject_a.example,subject_a,sample_somatic,oncoanalyser_dir,/path/to/oncoanalyser/
Execute analysis
bash
nextflow run scwatts/sash \
-profile docker \
--input samplesheet.csv \
--ref_data_path /path/to/reference_data/ \
--outdir output/
Owner
- Name: UMCCR
- Login: umccr
- Kind: organization
- Email: oliver.hofmann@unimelb.edu.au
- Location: Melbourne, Australia
- Website: https://umccr.github.io/
- Twitter: umccr
- Repositories: 126
- Profile: https://github.com/umccr
University of Melbourne Centre for Cancer Research
Citation (CITATIONS.md)
# umccr/sash: Citations ## [nf-core](https://pubmed.ncbi.nlm.nih.gov/32055031/) > Ewels PA, Peltzer A, Fillinger S, Patel H, Alneberg J, Wilm A, Garcia MU, Di Tommaso P, Nahnsen S. The nf-core framework for community-curated bioinformatics pipelines. Nat Biotechnol. 2020 Mar;38(3):276-278. doi: 10.1038/s41587-020-0439-x. PubMed PMID: 32055031. ## [Nextflow](https://pubmed.ncbi.nlm.nih.gov/28398311/) > Di Tommaso P, Chatzou M, Floden EW, Barja PP, Palumbo E, Notredame C. Nextflow enables reproducible computational workflows. Nat Biotechnol. 2017 Apr 11;35(4):316-319. doi: 10.1038/nbt.3820. PubMed PMID: 28398311. ## Pipeline tools - [FastQC](https://www.bioinformatics.babraham.ac.uk/projects/fastqc/) - [MultiQC](https://pubmed.ncbi.nlm.nih.gov/27312411/) > Ewels P, Magnusson M, Lundin S, Käller M. MultiQC: summarize analysis results for multiple tools and samples in a single report. Bioinformatics. 2016 Oct 1;32(19):3047-8. doi: 10.1093/bioinformatics/btw354. Epub 2016 Jun 16. PubMed PMID: 27312411; PubMed Central PMCID: PMC5039924. ## Software packaging/containerisation tools - [Anaconda](https://anaconda.com) > Anaconda Software Distribution. Computer software. Vers. 2-2.4.0. Anaconda, Nov. 2016. Web. - [Bioconda](https://pubmed.ncbi.nlm.nih.gov/29967506/) > Grüning B, Dale R, Sjödin A, Chapman BA, Rowe J, Tomkins-Tinch CH, Valieris R, Köster J; Bioconda Team. Bioconda: sustainable and comprehensive software distribution for the life sciences. Nat Methods. 2018 Jul;15(7):475-476. doi: 10.1038/s41592-018-0046-7. PubMed PMID: 29967506. - [BioContainers](https://pubmed.ncbi.nlm.nih.gov/28379341/) > da Veiga Leprevost F, Grüning B, Aflitos SA, Röst HL, Uszkoreit J, Barsnes H, Vaudel M, Moreno P, Gatto L, Weber J, Bai M, Jimenez RC, Sachsenberg T, Pfeuffer J, Alvarez RV, Griss J, Nesvizhskii AI, Perez-Riverol Y. BioContainers: an open-source and community-driven framework for software standardization. Bioinformatics. 2017 Aug 15;33(16):2580-2582. doi: 10.1093/bioinformatics/btx192. PubMed PMID: 28379341; PubMed Central PMCID: PMC5870671. - [Docker](https://dl.acm.org/doi/10.5555/2600239.2600241) - [Singularity](https://pubmed.ncbi.nlm.nih.gov/28494014/) > Kurtzer GM, Sochat V, Bauer MW. Singularity: Scientific containers for mobility of compute. PLoS One. 2017 May 11;12(5):e0177459. doi: 10.1371/journal.pone.0177459. eCollection 2017. PubMed PMID: 28494014; PubMed Central PMCID: PMC5426675.
GitHub Events
Total
- Issues event: 11
- Delete event: 2
- Issue comment event: 34
- Push event: 124
- Pull request review comment event: 31
- Pull request review event: 16
- Pull request event: 7
- Create event: 12
Last Year
- Issues event: 11
- Delete event: 2
- Issue comment event: 34
- Push event: 124
- Pull request review comment event: 31
- Pull request review event: 16
- Pull request event: 7
- Create event: 12
Issues and Pull Requests
Last synced: 10 months ago
All Time
- Total issues: 7
- Total pull requests: 6
- Average time to close issues: 1 day
- Average time to close pull requests: 5 days
- Total issue authors: 2
- Total pull request authors: 2
- Average comments per issue: 0.57
- Average comments per pull request: 0.0
- Merged pull requests: 1
- Bot issues: 0
- Bot pull requests: 0
Past Year
- Issues: 7
- Pull requests: 6
- Average time to close issues: 1 day
- Average time to close pull requests: 5 days
- Issue authors: 2
- Pull request authors: 2
- Average comments per issue: 0.57
- Average comments per pull request: 0.0
- Merged pull requests: 1
- Bot issues: 0
- Bot pull requests: 0
Top Authors
Issue Authors
- qclayssen (5)
- alexiswl (2)
Pull Request Authors
- qclayssen (5)
- alexiswl (1)