Recent Releases of https://github.com/broadinstitute/long-read-pipelines
https://github.com/broadinstitute/long-read-pipelines - lrp_4.0.56
Updated to GATK 4.6.1.0 and added new parameter for extra memory for joint calling (#505)
Updated to GATK 4.6.1.0 for joint genotyping.
Updated the following tasks to use GATK 4.6.1.0:
- GnarlyGenotypeGVCFs
- GenotypeGVCFs
- ImportGVCFs
Added parameter for extra memory to ImportGVCFs
- Jupyter Notebook
Published by github-actions[bot] 11 months ago
https://github.com/broadinstitute/long-read-pipelines - lrp_4.0.55
Added workaround for batch python issue. (#504)
- Jupyter Notebook
Published by github-actions[bot] about 1 year ago
https://github.com/broadinstitute/long-read-pipelines - lrp_4.0.54
Fix to HRP2/3 deletion detection for GCP batch backend. (#503)
Updated HRP2/3 workflow to fix Batch backend issues.
Now uses
Filefor the bam file input.
- Jupyter Notebook
Published by github-actions[bot] about 1 year ago
https://github.com/broadinstitute/long-read-pipelines - lrp_4.0.53
Fixes to runtime attributes for workflows for new GCP Batch engine (#501)
- Fixed all
bootDiskSizeGbentries to be ints.
- Jupyter Notebook
Published by github-actions[bot] about 1 year ago
https://github.com/broadinstitute/long-read-pipelines - lrp_4.0.52
Set Conda py version for miniwdl (#502)
set to use latest miniwdl version
set python version to 3.12 in CI/CD
set python version to 3.13 in CI/CD, set conda environment to 3.12
Added composite action.yml
added checkout before setup env action
new line in setup env yml
setting lfs set to true
setting lfs set to true to cd and ci_nightly
Co-authored-by: bshifaw bshifaw@broadinstitute.com
- Jupyter Notebook
Published by github-actions[bot] about 1 year ago
https://github.com/broadinstitute/long-read-pipelines - lrp_4.0.51
Added base_bucket parameter to reports to fix issue. (#499)
- Jupyter Notebook
Published by github-actions[bot] about 1 year ago
https://github.com/broadinstitute/long-read-pipelines - lrp_4.0.50
Create stand-alone workflow for short read malaria QC (#498)
- Added QCAssessment workflow to run stand-alone QC on ILMN samples. (Primarily for malaria work)
- Added QC WDL to dockstore.
- Jupyter Notebook
Published by github-actions[bot] about 1 year ago
https://github.com/broadinstitute/long-read-pipelines - lrp_4.0.49
Added stand-alone workflow for barcode/fingerprint extraction (#497)
- Added workflow to extract SNP barcodes / fingerprints from a VCF (
ExtractSnpBarcodeFromVcfFile) - Added
ExtractSnpBarcodeFromVcfFileto dockstore.
- Jupyter Notebook
Published by github-actions[bot] about 1 year ago
https://github.com/broadinstitute/long-read-pipelines - lrp_4.0.48
Changed docker image for mosdepth to use gcr instead of quay (#496)
- Jupyter Notebook
Published by github-actions[bot] about 1 year ago
https://github.com/broadinstitute/long-read-pipelines - lrp_4.0.47
Improvement: Add updated bubble plots to summary reports (#495)
fix reading drug resistance text file and change page title
fix another spot parsing the DR text file
add new bubble plot with support for hom/het/missing variants
update docker image tag to latest
change source of drug resistances to workspace table, not text file
- Jupyter Notebook
Published by github-actions[bot] over 1 year ago
https://github.com/broadinstitute/long-read-pipelines - lrp_4.0.46
Fixed drug resistance calculations for missing data; added marker output to data table (#494)
Fixed bug where functionally annotated output VCF would only be set if the gcs_output dir was specified.
Removed all finalization steps from
PfalciparumTypeDrugResistanceMarkers.Fixed drug res analysis to handle
missingvshom_refUpdated
CreateDrugResistanceSummaryto usemissingandhom_refAdded individual marker outputs.
- Jupyter Notebook
Published by github-actions[bot] over 1 year ago
https://github.com/broadinstitute/long-read-pipelines - lrp_4.0.45
Add WDL Workflow: Generate summary reports for malaria samples (#493)
- Jupyter Notebook
Published by github-actions[bot] over 1 year ago
https://github.com/broadinstitute/long-read-pipelines - lrp_4.0.44
Added minimum for miniwdl version to use (#492)
Co-authored-by: bshifaw bshifaw@broadinstitute.com
- Jupyter Notebook
Published by github-actions[bot] over 1 year ago
https://github.com/broadinstitute/long-read-pipelines - lrp_4.0.43
Updated pip cache step to version 4 (#491)
Co-authored-by: bshifaw bshifaw@broadinstitute.com
- Jupyter Notebook
Published by github-actions[bot] over 1 year ago
https://github.com/broadinstitute/long-read-pipelines - lrp_4.0.42
Added DeepTrio Workflow (#490)
Added
DeepTriotask toDeepVariant.wdlAdded DeepTrio wdl with workflow to call DeepTrio.
Added
DeepTrioworkflow to.dockstore.yml
- Jupyter Notebook
Published by github-actions[bot] over 1 year ago
https://github.com/broadinstitute/long-read-pipelines - lrp_4.0.41
set a single python version in git_pages (#489)
Co-authored-by: bshifaw bshifaw@broadinstitute.com
- Jupyter Notebook
Published by github-actions[bot] over 1 year ago
https://github.com/broadinstitute/long-read-pipelines - lrp_4.0.40
Added system deps in nightly test (#488)
Co-authored-by: bshifaw bshifaw@broadinstitute.com
- Jupyter Notebook
Published by github-actions[bot] over 1 year ago
https://github.com/broadinstitute/long-read-pipelines - lrp_4.0.39
Made MarkDuplicates optional to enable Amplicon data processing. (#486)
- Jupyter Notebook
Published by github-actions[bot] over 1 year ago
https://github.com/broadinstitute/long-read-pipelines - lrp_4.0.38
Updated python version in git actions from 3.7 to 3.13 (#487)
Updated python version in git actions from 3.7 to 3.13
Update python setup action version from 2 to 4
Update python setup action version from 4 to 5
Install libbz2-dev in deps
Added additional deps in "Install dependencies"
update tox py version from py37 to 313
created separate step for system deps, added step to make sure miniconda is installed, added cache for pip deps
removed miniconda step, replaced the system dep installation step
Added system deps and pip caching to cd.yml
Co-authored-by: bshifaw bshifaw@broadinstitute.com
- Jupyter Notebook
Published by github-actions[bot] over 1 year ago
https://github.com/broadinstitute/long-read-pipelines - lrp_4.0.37
Fixed typo in zarr output file. (#484)
- Made zarr output file optional
- Jupyter Notebook
Published by github-actions[bot] over 1 year ago
https://github.com/broadinstitute/long-read-pipelines - lrp_4.0.36
Made zarr conversion optional in SRJointCallGVCFsWithGenomicsDB.wdl (#483)
- Jupyter Notebook
Published by github-actions[bot] over 1 year ago
https://github.com/broadinstitute/long-read-pipelines - lrp_4.0.35
Added snpeffdbidentifier for Functional annotation. (#482)
- Removed hard-coded plasmodium snpeff database specification.
- Added new input
snpeff_db_identifierfor a snpeff database specification to SRJointCallGVCFsWithGenomicsDB and FunctionalAnnotation`
- Jupyter Notebook
Published by github-actions[bot] over 1 year ago
https://github.com/broadinstitute/long-read-pipelines - lrp_4.0.34
Reserved more non-heap memory for the JVM when calling HaplotypeCaller (#481)
- Updated HaplotypeCaller task to reserve 20% or 4gb of memory for off-heap use (whichever is larger). This prevents out of memory errors occurring in native libraries that HaplotypeCaller calls into (i.e. the GKL).
- Jupyter Notebook
Published by github-actions[bot] over 1 year ago
https://github.com/broadinstitute/long-read-pipelines - lrp_4.0.33
SRJointCallGVCFsWithGenomicsDB.wdl now prefers dbsnp_vcf to known_sites_vcf for genotyping (#480)
- Added ResolveMapKeysInPriorityOrder to resolve keys in a WDL Map object in priority order. This enables fallbacks / preferences for different entries in a map (such as the refmap).
- For genotyping with the gnarly genotyper, the dbsnp resource now prefers dbsnpvcf from the RefMap. If this does not exist, it will fall back to knownsitesvcf. This allows different resources to be used for BQSR (which prefers knownsitesvcf in SRFlowcell) and the gnarly genotyper, which now prefers dbsnpvcf.
- Jupyter Notebook
Published by github-actions[bot] over 1 year ago
https://github.com/broadinstitute/long-read-pipelines - lrp_4.0.32
Updated SRUtils.BamToFq and SRBamToFq with cram support and better memory / cpu settings (#479)
- Made finalization optional in
SRBamToFq. - Now
BamToFqsorts and converts in 2 separate commands. - Now
BamToFqutilizes all available processors. BamToFqcpu and ram now 8 cores and 16gb respectively.
- Jupyter Notebook
Published by github-actions[bot] over 1 year ago
https://github.com/broadinstitute/long-read-pipelines - lrp_4.0.31
Fixed unbound variable error in some cases of QC failure. (#478)
- When $mosdepthqcstatus was true and $callable_status was false, there
was a bug that caused an
unbound variableexception (message had not been defined in that code path).
- Jupyter Notebook
Published by github-actions[bot] over 1 year ago
https://github.com/broadinstitute/long-read-pipelines - lrp_4.0.30
Fixed bug when workflow had no inputs/outputs. (#477)
- Fixed the bug where if a workflow had no inputs or outputs the backup
process would fail due to a python
KeyError.
- Jupyter Notebook
Published by github-actions[bot] over 1 year ago
https://github.com/broadinstitute/long-read-pipelines - lrp_4.0.29
Increased minimum boot disk size to 25Gb for all tasks. (#476)
Fixed critical issue with boot disk size in Terra. Google likely changed their image and the boot disk is no longer big enough. Updated boot disk sizes to be 25 gb (or larger) by default.
- Jupyter Notebook
Published by github-actions[bot] over 1 year ago
https://github.com/broadinstitute/long-read-pipelines - lrp_4.0.28
Added more QC metrics and a pass / fail assessment to SRFlowcell (#475)
- Changed read quality stratification numbers to better suit ILMN data in
SRFlowcell. - Added QC pass / fail assessment to
SRFlowcell. Defaults are set up for P. falciparum short read data, but can be adapted to other use cases.
- Jupyter Notebook
Published by github-actions[bot] over 1 year ago
https://github.com/broadinstitute/long-read-pipelines - lrp_4.0.27
Added support for giving background samples to joint calling workflow. (#474)
This adds in the ability to specify an Array[Array[File]] containing GVCFs to include as background samples in a joint call in SRJointCallGVCFsWithGenomicsDB. This input is optional.
- Jupyter Notebook
Published by github-actions[bot] over 1 year ago
https://github.com/broadinstitute/long-read-pipelines - lrp_4.0.26
Updates for Malaria pipelines: human read decontamination and WDLs for on-prem Malaria pipelines (#473)
- Added WDLS for Broad on-prem Malaria Pipelines
- Updated dockstore with Broad on prem malaria pipelines
- Finished implementing
bowtie2based read decontamination - Enabled read decontamination in
SRFlowcell - Updated
Utils.SortSamto usesamtools
- Jupyter Notebook
Published by github-actions[bot] over 1 year ago
https://github.com/broadinstitute/long-read-pipelines - lrp_4.0.25
Updated interval extraction for short read joint calling for sharding. (#471)
- Jupyter Notebook
Published by github-actions[bot] over 1 year ago
https://github.com/broadinstitute/long-read-pipelines - lrp_4.0.24
Fixed JVM memory spec for VariantUtils::AnnotateVcfWithBedRegions (#469)
- Jupyter Notebook
Published by github-actions[bot] over 1 year ago
https://github.com/broadinstitute/long-read-pipelines - lrp_4.0.23
Updated drug resistance marker code to account for het genotypes. (#467)
Updated drug resistance marker code.
Now
PfalciparumTypeDrugResistanceMarkers::CallDrugResistanceMutationspulls info directly from the VCF file using pysam, rather than a series of shell commands.Now
PfalciparumTypeDrugResistanceMarkers::CallDrugResistanceMutationspreserves het/hom genotype information in the raw output file.PfalciparumDrugResistanceSummary.wdlnow calls intoPfalciparumTypeDrugResistanceMarkersforCreateDrugResistanceSummary.Minor refactoring of names.
Added VCF Index as input to
PfalciparumTypeDrugResistanceMarkers.wdlFixed output names in
PfalciparumDrugResistanceSummary.wdl.
- Jupyter Notebook
Published by github-actions[bot] almost 2 years ago
https://github.com/broadinstitute/long-read-pipelines - lrp_4.0.22
Updates to ONT workflows to bring them closer to current conventions (#466)
- Documentation updates for
SRWholeGenome.wdlandONTWholeGenome.wdl VariantUtils::MergeAndSortVCFsnow has a fall-back to bcftools concat with normal compression if--naivefails.- Fixed
VariantUtils::MergeAndSortVCFsto uselr-basic:0.1.2 CallVariantsONT.wdlnow takes aref_mapinstead of reference fasta/fai/dict.CallVariantsONT.wdlnow filters out themt_chr_nameifcall_small_vars_on_mitochondriais set tofalse.
- Jupyter Notebook
Published by github-actions[bot] almost 2 years ago
https://github.com/broadinstitute/long-read-pipelines - lrp_4.0.21
Hotfix for BackupWorkspace.wdl and minor change to the hrp2/hrp3 detection wdl. (#464)
Added
volatile: trueto `BackupWorkspace.wdl task.Increased disk space for
PfalciparumHrp2Hrp3Status.wdlFixed disk size to depend on all inputs for
PfalciparumHrp2Hrp3Status.wdl
- Jupyter Notebook
Published by github-actions[bot] almost 2 years ago
https://github.com/broadinstitute/long-read-pipelines - lrp_4.0.20
Changed output names of drug resistances in Pf drug resistance WDLs. (#463)
- Names of drug resistance outputs in
PfalciparumTypeDrugResistanceMarkers.wdlhave been changed to the form:drug_status_DRUGNAME(e.g.drug_status_chloroquine)
- Jupyter Notebook
Published by github-actions[bot] almost 2 years ago
https://github.com/broadinstitute/long-read-pipelines - lrp_4.0.19
Cleaned up short read workflows, minor feature additions (#461)
- Added docker image and WDL to backup workspaces.
- Added
util-malaria-coidocker image. - Fixed bug in
SRFlowcellthat preferentially pulled the existing aligned bam rather than the merged aligned bam. - Cleaned up redundant / unnecessary inputs and outputs from
SRWholeGenome.wdl - Enabled
SRWholeGenome.wdlandHaplotypeCaller.wdlto use aninterval_listto call variants in subsets of the genome. - Renamed
ONTPfTypeDrugResistanceMarkers.wdltoPfalciparumTypeDrugResistanceMarkers.wdl. - Renamed
ONTPfHrp2Hrp3Status.wdltoPfalciparumHrp2Hrp3Status.wdl PfalciparumDrugResistanceSummary.wdlnow produces individual drug resistances/sensitivites as additional outputs.PfalciparumDrugResistanceSummary.wdlis now deprecated.PfalciparumDrugResistanceMarkers.wdlnow produces drug resistances/sensitivites as well as the summary file. This deprecatesPfalciparumDrugResistanceSummary.wdl.- Added
PfalciparumPolygenomicityBarcodeEstimate.wdlto estimate CoI based on the 24 SNP molecular barcode. - Cleaned up some outputs in
SRJointCallGVCFsWithGenomicsDB.wdl. - Added flag for dangling end recovery to
HaplotypeCaller.wdl. - Minor fixes for MiniWDL style suggestions (
TrainCnnFilters.wdl,LRCNVs.wdl,AlignedMetrics.wdl,FastQC.wdl,SRUtils.wdl,Utils.wdl,VariantUtils.wdl,SRJointGenotyping.wdl,NanoPlot.wdl,Pf_Niare_HaplotypeCaller.wdl)
- Jupyter Notebook
Published by github-actions[bot] almost 2 years ago
https://github.com/broadinstitute/long-read-pipelines - lrp_4.0.18
Reworked SRJointGenotyping sharding to be configurable at runtime. (#455)
Sharding now splits the VCF files into chunks by number of bp. This allows for wider sharding / better scalability for large callsets. By default the behavior is to shard by contig - this is controlled by setting the number of bp per shard to be very large (999999999). If the number of bp per shard is larger than the contig size, the whole contig is used.
Changed
MakeSitesOnlyVCFtask to usebcftools.Updated joint call WDL to accept interval lists.
- Jupyter Notebook
Published by github-actions[bot] almost 2 years ago
https://github.com/broadinstitute/long-read-pipelines - lrp_4.0.17
Made the finalize steps optional for workflows in the srmalaria workspace. (#453)
- Made
gcs_root_out_diroptional inSRFlowcell - Made
gcs_out_root_diroptional in SRWholeGenome - Made
gcs_out_root_diroptional forSRJointCallGVCFsWithGenomicsDB
- Jupyter Notebook
Published by github-actions[bot] about 2 years ago
https://github.com/broadinstitute/long-read-pipelines - lrp_4.0.16
Updated short read variant calling and joint calling pipelines to run at scale (#450)
Single-sample variant calling:
- Fixied critical typo in HaplotypeCaller disk spec.
- Fixed wdl-computed divide by zero error in
SRFlowcelloutputs. - Updated
MergeVCFsto have an option to name an output as a GVCF. - Removed deprecated GC logging flags from GATK commands.
- Fixed issue in
FastQCif no read qualities are in the file. - Added missing annotation groups to ReblockGVCFs.
- HaplotypeCaller WDL now returns the reblocked GVCF.
Joint Calling:
* Added option to use gnarly genotyper
* Added het inputs to joint genotyping
* Fixed java memory allocation in joint genotyping to be based on memory of the VM, not hard-coded
* Fixed name of outdir in ConvertToZarrStore to be correct for this workflow.
* Updated the zarr conversion to use parallel Dask processes and to log to stdout.
* Upped default zarr conversion memory to 32gb.
VETS:
* Added stack trace logging for errors in ExtractVariantAnnotations,
TrainVariantAnnotationsModel, and ScoreVariantAnnotations.
* Removed HAPCOMP, HAPDOM, and HEC from default annotations for SNP and INDEL VETS filtration on joint-called VCF files. Need to do more testing / debugging to include these in joint calling.
Misc: * Fixed the name of the workflow in ExpandedDrugResistanceMarkerAggregation to reflect the actual name. * Miscellaneous updates for debugging
- Jupyter Notebook
Published by github-actions[bot] about 2 years ago
https://github.com/broadinstitute/long-read-pipelines - lrp_4.0.15
make submissions folder deletion better (#445)
- make submissions folder deletion better
- Jupyter Notebook
Published by github-actions[bot] about 2 years ago
https://github.com/broadinstitute/long-read-pipelines - lrp_4.0.14
Several updates for making Malaria Joint Calling easier. (#449)
Reintroduced
ReblockGVCFstep.Updated HaplotypeCaller::ReblockGVCF to point to latest Docker image with fix for ReblockGVCF annotations.
Updated HaplotypeCaller::ReblockGVCF to use 2 cores.
Disabled DeepVariant/Pepper calling in
SRWholeGenomeby default.Removed
SRJointGenotyping::ReblockGVCF- it should only be defined in one place.Removed hard filtered output file.
Updating all GATK 4.3 tasks to GATK 4.5
Disabled QC when running on a singe bam file input.
Added a runtime_attr override for HaplotypeCaller subworkflow.
Update Utils.wdl
Modified the disks from " LOCAL" to " SSD"
Updating tasks to use
SSDrather thanLOCALdisk.Moved HaplotypeCaller and ReblockGVCF to SSD from HDD
Fixed a bug in
compute_sr_stats.pythat allowednansUpdated
sr-utilsdocker to usemambaconda env solver.Updates to
sr-utilsdocker image.Updated conda solver in
sr-utilstomamba.Fixed minor deprecation warning in
compute_sr_stats.pyFixed issue in
compute_sr_stats.pythat caused certain inputs to fail due to missing base qualities ornanvalues.Updated version of
sr-utilsdocker image to0.2.2.Updated
sr-utilsdocker image to version0.2.2Added note for updating nightly GATK build.
Co-authored-by: Shadi Zaheri 74751641+shadizaheri@users.noreply.github.com
- Jupyter Notebook
Published by github-actions[bot] about 2 years ago
https://github.com/broadinstitute/long-read-pipelines - lrp_4.0.13
update dockstore cli in git action yml (#447)
Co-authored-by: bshifaw bshifaw@broadinstitute.com
- Jupyter Notebook
Published by github-actions[bot] about 2 years ago
https://github.com/broadinstitute/long-read-pipelines - lrp_4.0.12
Add WDL Workflow: Plots metrics for SV Calls for Multiple Samples (#438)
SvQcplots.wdl draft
Added concat sv stats
added compile sv stats by sample
added runtime in compileSVstats
added addcoverage task
added plotting task
adding docker for plotting
Co-authored-by: Steve Huang shuang@broadinstitute.org
- Update wdl/pipelines/TechAgnostic/Visualization/SvQCPlots.wdl
specified bcfQuerySV task docker
Co-authored-by: Steve Huang shuang@broadinstitute.org
add meta section to SvQCPlots.wdl
add option to name output plot notebook SvQCPlots.wdl
Minor additions (#444)
Co-authored-by: Steve Huang shuang@broadinstitute.org
update docker to ubuntu for addCoverageToSVstats
containerized compilesvstats
added a full path to compilesvstats.py
fix compilesvstats.py svtypes
updated papermillbase docker version build
Co-authored-by: bshifaw bshifaw@broadinstitute.com Co-authored-by: Steve Huang shuang@broadinstitute.org
- Jupyter Notebook
Published by github-actions[bot] about 2 years ago
https://github.com/broadinstitute/long-read-pipelines - lrp_4.0.11
Adding Base Image for Papermill Docker (#437)
- adding docker for papermill
Co-authored-by: bshifaw bshifaw@broadinstitute.com
- Jupyter Notebook
Published by github-actions[bot] over 2 years ago
https://github.com/broadinstitute/long-read-pipelines - lrp_4.0.10
Adding in all the work from the long-term Special Project Malaria branch (#440)
- Adding in all the work from the long-term Special Project Malaria branch
- Jupyter Notebook
Published by github-actions[bot] over 2 years ago
https://github.com/broadinstitute/long-read-pipelines - lrp_4.0.9
Added section on external contribution to development Readme (#435)
Added section on external contribution
Added links to repo site to main repo README.md
Co-authored-by: bshifaw bshifaw@broadinstitute.com
- Jupyter Notebook
Published by github-actions[bot] over 2 years ago
https://github.com/broadinstitute/long-read-pipelines - lrp_4.0.8
Merge pull request #428 from broadinstitute/tws_cleanup
repo cleanup
- Jupyter Notebook
Published by github-actions[bot] over 2 years ago
https://github.com/broadinstitute/long-read-pipelines - lrp_4.0.7
trivial update to .gitignore
- Jupyter Notebook
Published by github-actions[bot] over 2 years ago
https://github.com/broadinstitute/long-read-pipelines - lrp_4.0.6
Fix dockstore validator step in yml to insure action fails if step fails (#415)
Updated java version
manually checking dockstore validation results
Co-authored-by: bshifaw bshifaw@broadinstitute.com
- Jupyter Notebook
Published by github-actions[bot] almost 3 years ago
https://github.com/broadinstitute/long-read-pipelines - lrp_4.0.5
fix a missing space in dockstore.yml (#413)
- Jupyter Notebook
Published by github-actions[bot] almost 3 years ago
https://github.com/broadinstitute/long-read-pipelines - lrp_4.0.4
Send nightly status message to #lrma-repos, rather than #lrma (#411)
- Jupyter Notebook
Published by github-actions[bot] almost 3 years ago
https://github.com/broadinstitute/long-read-pipelines - lrp_4.0.3
add CleanupIntermediate to dockstore.yml (#409)
- Jupyter Notebook
Published by github-actions[bot] about 3 years ago
https://github.com/broadinstitute/long-read-pipelines - lrp_4.0.2
deprecate sniffles-1 and CLR WG workflow (#408)
- Jupyter Notebook
Published by github-actions[bot] about 3 years ago
https://github.com/broadinstitute/long-read-pipelines - lrp_4.0.1
finalize merged FASTQ in PBAssembleWithHifiasm (#405)
- Jupyter Notebook
Published by github-actions[bot] about 3 years ago
https://github.com/broadinstitute/long-read-pipelines - lrp_4.0.0
Major changes from the last major release include
- WDL directory structure change which organizes .wdls into specific analysis folders
- Actions to generate a repository Git page to list WDL workflow documentation and Developer Guide
- Jupyter Notebook
Published by bshifaw about 3 years ago
https://github.com/broadinstitute/long-read-pipelines - lrp_3.0.76
Update Continuous integration to use bumpversion tool (#404)
Updated cd.yml to use bumpversion tool action
Updated .bumpversion.cfg to specify tag_name and have version match current repo version
Co-authored-by: bshifaw bshifaw@broadinstitute.com
- Jupyter Notebook
Published by github-actions[bot] about 3 years ago
https://github.com/broadinstitute/long-read-pipelines - lrp_3.0.75
Add bumpversion cfg (#403)
- add .bumpversion.cfg
Co-authored-by: bshifaw bshifaw@broadinstitute.com
- Jupyter Notebook
Published by github-actions[bot] about 3 years ago
https://github.com/broadinstitute/long-read-pipelines - lrp_3.0.74
Update task descriptions (#402)
updated description to wdl files in task Alignment, Assembly, and Preprocessing
updated description to wdl files in task Transcriptomics
updated description to wdl files in task QC
updated description to wdl files in task Utility
updated description to wdl files in task VariantCalling
fixes to pass miniwdl validation
fixes to pass womtool validation
Update wdl/tasks/QC/Fingerprinting.wdl
Update wdl/tasks/Utility/ONTUtils.wdl
Co-authored-by: Steve Huang shuang@broadinstitute.org
Co-authored-by: bshifaw bshifaw@broadinstitute.com Co-authored-by: Steve Huang shuang@broadinstitute.org
- Jupyter Notebook
Published by github-actions[bot] about 3 years ago
https://github.com/broadinstitute/long-read-pipelines - lrp_3.0.73
development docs draft (#396)
development docs draft
added intro for dockerstyleguide.md
add to dockerstyleguide.md
add to wdlstyleguide.md
renamed devguide to developmentguide
rewrote overview, added visuals for directory structure
Minor fixes, Added Workflow Deployment in README.md
Co-authored-by: bshifaw bshifaw@broadinstitute.com
- Jupyter Notebook
Published by github-actions[bot] about 3 years ago
https://github.com/broadinstitute/long-read-pipelines - lrp_3.0.72
Fix for Repo Git Page action yml (#398)
added make docs dir if not exist
add graphviz as an installation requirement
switching to mac environment
Co-authored-by: bshifaw bshifaw@broadinstitute.com
- Jupyter Notebook
Published by github-actions[bot] about 3 years ago
https://github.com/broadinstitute/long-read-pipelines - lrp_3.0.71
Automatically Generate Repository Git Page (#391)
add git page action
Add scripts to create dot and png files for workflows
Add scripts to add png link to md
Add mkdocs.yml file
changed png link to be relative path instead of absolute.
Added script to add header to wdl markdown, and remove task info from wdl workflow markdown
edited wdlviz.py to add the option to name the output file by a specified name, edited createwdlvisual.py to use wdl basename for dot file
Co-authored-by: bshifaw bshifaw@broadinstitute.com
- Jupyter Notebook
Published by github-actions[bot] about 3 years ago
https://github.com/broadinstitute/long-read-pipelines - lrp_3.0.70
New workflows to deal with ONT flowcells that have multiple basecall directories (#325)
- one workflow that essentially copies the current ONTFlowcell, but aims at a basecall directory
- one workflow that plays the role of ONTFlowcell, but actually aggregates results from the WF above
- Jupyter Notebook
Published by github-actions[bot] about 3 years ago
https://github.com/broadinstitute/long-read-pipelines - lrp_3.0.69
add miniwdl and shellcheck installation to cd.yml (#397)
Co-authored-by: bshifaw bshifaw@broadinstitute.com
- Jupyter Notebook
Published by github-actions[bot] about 3 years ago
https://github.com/broadinstitute/long-read-pipelines - lrp_3.0.68
reheader FP vcfs from full hg38 to no_alt (#344)
- Jupyter Notebook
Published by github-actions[bot] about 3 years ago
https://github.com/broadinstitute/long-read-pipelines - lrp_3.0.67
Move wdl files to central directory called wdl (#393)
moved wdls to wdl folder, updated testing location for wdl validation
updated wdl paths in dockstore.yml
renamed 'deprecated_wdl' to 'deprecated', updated docstore.yml accordingly
Co-authored-by: bshifaw bshifaw@broadinstitute.com
- Jupyter Notebook
Published by github-actions[bot] about 3 years ago
https://github.com/broadinstitute/long-read-pipelines - lrp_3.0.66
Standardize wdl description (#392)
standardized pipeline/ONT workflow meta and parameter_meta bocks
standardized pipeline/pacbio workflow meta and parameter_meta bocks
standardized pipeline/techAgnostic workflow meta and parameter_meta bocks
Co-authored-by: bshifaw bshifaw@broadinstitute.com
- Jupyter Notebook
Published by github-actions[bot] about 3 years ago
https://github.com/broadinstitute/long-read-pipelines - lrp_3.0.65
Update hifiasm assembly (only) workflow (#390)
MAJOR CHANGES:
- HiFiasm
- update hifiasm docker
- formulated resource allocation
- As requested, pipeline generates alternate contigs as well. Note this essentially forces two assemblies on one set of FASTQs, as haplotig-generating assemblies and alternate-generating assemblies are generated with different options.
- QUAST
- update and shrink Quast docker
- bigger disk for Quast task
- now supports multiple assemblies being evaluated at the same time
- It's almost reference-free now:
- variant calling is no longer offered
- only depends on a reference (e.g. T2T) for assembly evaluation (Quast)
MINOR CHANGES:
- update various tasks
- optionally compress files when finalizing
- merge MergeFastqs and MergeFastqGzs into one task
- more memory for MergeFastqs; uses LOCAL SSD when files are not too large
- Jupyter Notebook
Published by github-actions[bot] about 3 years ago
https://github.com/broadinstitute/long-read-pipelines - lrp_3.0.64
Kvg smrtlink updates (#389)
Fix to handle a change in where SMRTLink keeps the run date field
Updated docker image and WDL tasks to make use of updated PacBio metadata parser
Specify version number for lr-basic docker image in MergeAndSortVCFs
Fixed an issue where Sniffles2 wasn't always run when it should be
- Jupyter Notebook
Published by github-actions[bot] about 3 years ago
https://github.com/broadinstitute/long-read-pipelines - lrp_3.0.63
Kvg update downloaders (#379)
- Updated cloud downloaders
- Jupyter Notebook
Published by github-actions[bot] about 3 years ago
https://github.com/broadinstitute/long-read-pipelines - lrp_3.0.62
Rev to Guppy 6.4.6 (#386)
- Jupyter Notebook
Published by github-actions[bot] about 3 years ago
https://github.com/broadinstitute/long-read-pipelines - lrp_3.0.61
Adding WDL validation with miniwdl to nightly and push tests (#388)
added test for miniwdl validation
added shellcheck to push and nightly actions so that miniwdl can validate command in tasks
Longbow.wdl and PBFlowcell.wdl changes to satisfy miniwdl validation
Postprocessing_Tasks.wdl, Longbow.wdl, and MASSeq.wdl changes to satisfy miniwdl validation
Assigning ConvertToHailMT call an alias to avoid having similar names as wdl workflow name
miniwdl 1.9.0 does not allow concatenating optional variables with none-optional Strings. Using select_first to set optional to none-optional.
fix womtool validation error for ONTFlowcell.wdl regarding "Cannot dereference a Map[String, String]? value using a String key"
updated wdls such that they user separate instead of sep
add end of file line Structs.wdl
Co-authored-by: bshifaw bshifaw@broadinstitute.com Co-authored-by: Steve Huang shuang@broadinstitute.org
- Jupyter Notebook
Published by github-actions[bot] about 3 years ago
https://github.com/broadinstitute/long-read-pipelines - lrp_3.0.60
WDL Reorganisation (#384)
Add pipeline, tasks, and structs folder to root dir.
update pbccs.wdl, AlignAndCheckFingerprintCCS.wdl path in dockstore.yml
updated several import paths for wdl workflows
added dockstore validation action, updated dockstore.yml
"git moved" wdl workflows to pipeline folder
"git moved" wdl workflows to task folder
Renamed wdl to deprecated_wdl
Added ReadMe.md deprecated, updated imported for deprecated wdls
removed PBCCSNoSharding.wdl and ExtractAouData.wdl from dockstore.yml
updated wdl validate py script to point to new dir
added getreversewdl_deps.py
added draftreleasenotes.py, script to get wdls edited between two branches
removed MalariaAssembly.wdl from .dockstore.yml
add ArgumentParser for draftreleasenotes.py
minor change to raising error for missing file in testwdlvalidity.py
Co-authored-by: bshifaw bshifaw@broadinstitute.com Co-authored-by: Steve Huang shuang@broadinstitute.org
- Jupyter Notebook
Published by github-actions[bot] about 3 years ago
https://github.com/broadinstitute/long-read-pipelines - lrp_3.0.59
Merge pull request #377 from broadinstitute/kvg_glnexus
Added GLNexus GVCF joint calling.
- Jupyter Notebook
Published by github-actions[bot] over 3 years ago
https://github.com/broadinstitute/long-read-pipelines - lrp_3.0.58
Terra changed where submissions go (now at gs://
- Jupyter Notebook
Published by github-actions[bot] over 3 years ago
https://github.com/broadinstitute/long-read-pipelines - lrp_3.0.57
update help message with -g
- Jupyter Notebook
Published by github-actions[bot] over 3 years ago
https://github.com/broadinstitute/long-read-pipelines - lrp_3.0.56
fix an error in the step checking multiple bams sharing the same sample name
- Jupyter Notebook
Published by github-actions[bot] over 3 years ago
https://github.com/broadinstitute/long-read-pipelines - lrp_3.0.55
add sample_id to workflow (#369)
- Jupyter Notebook
Published by github-actions[bot] over 3 years ago
https://github.com/broadinstitute/long-read-pipelines - lrp_3.0.54
Create Sniffles2 Docker and workflow. Update variant calling pipeline with Sniffles2. (#365)
- Jupyter Notebook
Published by github-actions[bot] over 3 years ago
https://github.com/broadinstitute/long-read-pipelines - lrp_3.0.53
Merge pull request #368 from broadinstitute/jtsupdatelongbowfor3pumifix
Updated longbow to v0.5.40 for 3' UMI length fix.
- Jupyter Notebook
Published by github-actions[bot] over 3 years ago
https://github.com/broadinstitute/long-read-pipelines - lrp_3.0.52
Significantly reduce NanoPlotFromBam wall-clock time * use a new docker which supports parallelization * drop supp alns
- Jupyter Notebook
Published by github-actions[bot] almost 4 years ago
https://github.com/broadinstitute/long-read-pipelines - lrp_3.0.51
Merge pull request #360 from broadinstitute/jtsupdatemasseqlongbowversion
Updating to longbow v0.5.39 to fix pomegranate heap overflow.
- Jupyter Notebook
Published by github-actions[bot] almost 4 years ago
https://github.com/broadinstitute/long-read-pipelines - lrp_3.0.50
Merge pull request #357 from broadinstitute/jtsfixtagsformasseqv0
Added single-cell tag cleanup for v0 of the MAS-ISO-seq pipeline
- Jupyter Notebook
Published by github-actions[bot] almost 4 years ago
https://github.com/broadinstitute/long-read-pipelines - lrp_3.0.49
Merge pull request #356 from broadinstitute/jtsmas-seqoptimizations
- Reduced cost of
pbflowcellfor MAS-ISO-seq samples. - Added step to fix
longbowtags that have position information for reads that are reverse-complemented.
- Jupyter Notebook
Published by github-actions[bot] almost 4 years ago
https://github.com/broadinstitute/long-read-pipelines - lrp_3.0.48
Merge pull request #340 from broadinstitute/kvgpbmasseq
Bring the MAS-ISO-seq workflows into our production pipelines
- Jupyter Notebook
Published by github-actions[bot] almost 4 years ago
https://github.com/broadinstitute/long-read-pipelines - lrp_3.0.47
New workflow that follows up to process the result of the demultiplexing workflow
performs alignment to specified reference (practically avoid sharding BAMs as we've seen issues of sharding demultiplexed BAMs)
performs fingerprint verification by default (offers option to turn off FP check)
collects alignment metrics
- Jupyter Notebook
Published by github-actions[bot] almost 4 years ago
https://github.com/broadinstitute/long-read-pipelines - lrp_3.0.46
New unified workflow for handling barcoded & on-instrument CCSed SMRTCells
- summary on raw input PBI
- smrttools-based metrics generation
- demultiplexing and demultiplexing 2ndary metrics generation and check all expected barcodes are detected
- default to extract Hifi reads from the input CCS bam, and offers choice if demultiplexing with CCS bam or Hifi bam
- printout processing date on success
- Jupyter Notebook
Published by github-actions[bot] almost 4 years ago
https://github.com/broadinstitute/long-read-pipelines - lrp_3.0.45
A quick fix of pb align task (#341)
use tab explicitly, this avoids errors when lab enters spaces in the RG line
ignore vscode stuff
- Jupyter Notebook
Published by github-actions[bot] almost 4 years ago
https://github.com/broadinstitute/long-read-pipelines - lrp_3.0.44
randomly assign zones to resource-demanding tasks (#328)
- DV-Pepper-Margin subworkflows
- Clair
- PBSV
random zone generation happens only once per genome, attempting to balance wall-clock time and cross-zone egress
- Jupyter Notebook
Published by github-actions[bot] almost 4 years ago
https://github.com/broadinstitute/long-read-pipelines - lrp_3.0.43
Kvg pf drug resistance (#337)
- Added a simple workflow to type some common drug resistance markers in P. falciparum.
- Jupyter Notebook
Published by github-actions[bot] almost 4 years ago
https://github.com/broadinstitute/long-read-pipelines - lrp_3.0.42
Kvg lr malaria cigass ucad (#336)
- Workflow to test HRP2/HRP3 presence/absence in P. falciparum data
- Jupyter Notebook
Published by github-actions[bot] almost 4 years ago
https://github.com/broadinstitute/long-read-pipelines - lrp_3.0.41
Parse on-instrument CCS report (#334)
test HA test sample
found better way to check and re-org a bit
- Jupyter Notebook
Published by github-actions[bot] almost 4 years ago
https://github.com/broadinstitute/long-read-pipelines - lrp_3.0.40
Fix PB run info (#332)
tiny fix for detectruninfo.py, update dockers accordingly
fix how to detech if CCS'ed
- Jupyter Notebook
Published by github-actions[bot] about 4 years ago
https://github.com/broadinstitute/long-read-pipelines - lrp_3.0.39
default resource allocation for DV-pepper pipeline was off (#333)
- Jupyter Notebook
Published by github-actions[bot] about 4 years ago
https://github.com/broadinstitute/long-read-pipelines - lrp_3.0.38
minimum disk 1 (#331)
- Jupyter Notebook
Published by github-actions[bot] about 4 years ago
https://github.com/broadinstitute/long-read-pipelines - lrp_3.0.37
Small fixes to PBFlowcell (#326)
- handle updated ccs report
- new task for computing appropriate # of LOCAL ssd disks due to new PAPI change
- to increase throughput, use randomly assigned zones for ShardLongReads (avoiding issue 324)
- Jupyter Notebook
Published by github-actions[bot] about 4 years ago
https://github.com/broadinstitute/long-read-pipelines - lrp_3.0.36
Update FP workflow (#318)
allow users to specify explicitly which gt VCF to use
fix a bug where VCF filtering may filter away all contig lines in the header
- Jupyter Notebook
Published by github-actions[bot] about 4 years ago
https://github.com/broadinstitute/long-read-pipelines - lrp_3.0.35
Some cleanup (#320)
drop longshot from pipeline calls (not removing it all together yet)
check on sample names when merging
avoid pipefail when there're multiple read groups
- Jupyter Notebook
Published by github-actions[bot] about 4 years ago
https://github.com/broadinstitute/long-read-pipelines - lrp_3.0.34
Misc updates to the WGS pipelines (#302)
Separate small, SV call, and:
- FixSnifflesVCF guard against when only a subset of issues exist in the VCF
- Drop Clair3 from CLR data, as that's not supported
FOUR SMALL FIXES:
- guard against spaces in LB field in input BAM....
- optionally check PacBio ShardLongreads outputs (sometimes EOF mark missing)
- use DV 1.3.0, bundled in custom docker with resource monitoring script
- improved PBSV resource allocation based on scatter scheme choice, as PAPI 10 observed for high coverage data when not proecssing per chr
provide default values to WGS pipelines
- Jupyter Notebook
Published by github-actions[bot] about 4 years ago