Recent Releases of smoove
smoove - allow setting max depth
previously regions with > 1000 depth were filtered out. That can now be set with an environment variable:
export SMOOVE_MAX_DEPTH=20000
smoove call ...
- Go
Published by brentp almost 5 years ago
smoove - larger chroms, more samples
as of this release, the argument to smoove paste can be a file ending with .list that has one file per sample.
v0.2.7
- use csi index for bams to support larger organisms
- fix ordering of error-checking (#146)
- support for longer file-lists (#151)
- Go
Published by brentp about 5 years ago
smoove - shorten run-time for problematic samples
v0.2.5
- slight sensitivity increase
- fix bug in VCF parser that could manifest inside of
smoove annotate - reduce run time for problematic samples. using a kdtree.
- @brwnj fixed the docker container so that conversion to singularity works without futzing.
- Go
Published by brentp over 6 years ago
smoove - general bug-fixes
v0.2.4
- add ##reference=$fasta to vcf header. (#58)
- fix smoove annotate for recent ensembl GFFs which lack transcript features
- fix bug in lumpy_filter for some versions of GCC (https://github.com/arq5x/lumpy-sv/pull/300)
- just report message on weird alignments instead of failing (#68)
- lower lumpy z cutoff from 4 to 2.75. because of distribution of fragment lengths, this does not affect much, but does improve sensitivity after duphold-DHFFC filtering slightly.
- Go
Published by brentp almost 7 years ago
smoove - diagnostics and lower FPR
v0.2.3
- save disc and split counts and output a plot as QC
- annotate: annotate with up and downstream 5KB. so any gene within 5kb will be annotated.
- smoove merge now outputs a plot that shows the number of split and discordant reads for each sample.
- smoove now filters about 25% more split reads by looking at how the read is consumed this results in a 5% reduction in false positive genome-in-a-bottle deletions > 1KB while leaving the true positives unchanged relative to version 0.2.2. (Thanks @ernfrid for feedback on this feature).
- Go
Published by brentp over 7 years ago
smoove -
NOTE that this release requires a very recent version of mosdepth
v0.2.2
- huge reduction in number of discordant reads sent to lumpy for some problematic samples. this is work by @ernfrid which reduces lumpy runtime for all cases and dramatically lowers it for cases that were previously problematic due to large numbers of discordant reads. On our Genome in a Bottle test sample, this drops the number of discordant reads from 1,434,810 to 332,384; a 4.3 fold reduction while leaving the final smoove output unchanged except for 2 BNDs that were removed. This reduction in reads will be even more dramatic in problematic samples.
- switch to use svtyper>=0.7.0 with
--max_ci_distparameter. To avoid this set:export SMOOVE_NO_MAX_CI=xx# any value will work; however, it's higly recommended to use as-is. - cnvnator changes and filtering. (still requires github.com/brentp/CNVnator fork).
- bugfix for case when no output directory was specified.
- use mosdepth with --fast-mode NOTE this requires the latest version of mosdepth (0.2.4 or greater)
- Go
Published by brentp over 7 years ago
smoove - general improvements in sensitivity for small events
this release was driven by feedback from Dave Larson (@ernfrid) and Ira Hall's group. It results in improved sensitivity for smaller events (at a small cost of specificity). It also fixes a bug with the duphold runner.
v0.2.1
- fix bug in smoove duphold for samples > threads that resulted in stalling
- smoove is now more discerning about reads that are soft-clipped on both ends as these could be due to inversions. if a read is not flipped relative to its mate and it is soft-clipped on both ends, it is still removed.
- in some cases reads with a large NM (mismatches) were not filtered because they
had an unexpected type (uint8).
smoovenow checks more types and for some bams will remove more records which improves specificity. - smoove will now be more conservative with
NMcounting. an insertion of 7 bases is counted as an NM of 7. smoove will now correct this to an NM of 1 so it counts the number of events rather than the number of bases of each event. - simplify rules for filtering and make them less strict.
- Go
Published by brentp over 7 years ago
smoove - s/Number/Integer/
the type of SHQ and MSHQ was invalid in previous versions that is fixed in this release.
- Go
Published by brentp almost 8 years ago
smoove - more error handling
v0.1.9
- better error message in merge.
- use set -e in process calls so we don't exit with 0 even on failure
- Go
Published by brentp about 8 years ago
smoove - no --sum in smoove merge
this release will slightly improve sensitivity when using smoove in cohort mode.
- Go
Published by brentp about 8 years ago
smoove - variant prioritization and quality
The smoove annotate command now adds a SHQ (Smoove Het Quality) tag to every sample format) a value of 4 is a high quality call and the value of 1 is low quality. -1 is non-het.
It also adds a MSHQ for Mean SHQ to the INFO field which is the mean SHQ score across all heterozygous samples for that variant.
As a first pass, users can look for variants with MSHQ > 3.
This value is created from a data-driven model of het quality trained on 603 samples in large pedigrees and on > 500 quartets.
- Go
Published by brentp about 8 years ago
smoove - specificity improvements
found a read filter that removes about 30% of false-positive calls in a test-case using the CHM1/CHM13 pseudo-diploid.
- Go
Published by brentp about 8 years ago
smoove - sensitivity improvements
NOTE: there is a change to svtyper that we found in evaluating smoove that will reduce the FPR from genotyping. Please update to svtyper>0.6.0 for best results.
v0.1.5
- sensitivity increases for variants without split reads.
- use a fixed length of 4 in merge (gives much better merging for
smoove merge) - add new annotate command which takes a gff and annotates with gene names
- increase max_reads argument to svtyper to reduce false negatives
- remove QCFail and duplicate reads from split and disc.bams. This is now
part of
smoovebut also fixed inlumpy_filterupstream.
- Go
Published by brentp about 8 years ago
smoove - fix for --removepr
v0.1.4
- fix bug when using
-x/--removeprwith--genotype(thanks @chapmanb for test-case)
- Go
Published by brentp about 8 years ago
smoove - mosdepth compat
the only change compared to v0.1.2 is to support the most recent mosdepth output (previous versions of mosdepth will also work with this version).
- Go
Published by brentp about 8 years ago
smoove - usability changes
v0.1.2
- bcftools and gsort are now required.
- output never goes to STDOUT
- output contigs to header
- add svtools to docker
- add option to remove PRPOS and PREND
- Go
Published by brentp about 8 years ago