Recent Releases of https://github.com/acenglish/kanpig
https://github.com/acenglish/kanpig - v1.1.0
- New
--aballele balance threshold lowers frequency of assigning erroneous compound heterozygous genotypes - Experimental
--squishchanges gpenalty behavior to preferring simpler paths - Fix score's gpenalty
- FORMAT/NE now uses neighborhood's start position for consistent labeling across samples
Full Changelog: https://github.com/ACEnglish/kanpig/compare/v1.0.2...v1.1.0
- Rust
Published by ACEnglish 11 months ago
https://github.com/acenglish/kanpig - v1.0.2
Minor patch
- UI tweaks, including cleaner gt parameters
- Slight htslib bam record and other minor efficiency improvement
- Fix bug arising from ploidy==1 regions sometimes.
- Fix minor bugs around PS and HP tag counting/record keeping
Full Changelog: https://github.com/ACEnglish/kanpig/compare/v1.0.1...v1.0.2
- Rust
Published by ACEnglish about 1 year ago
https://github.com/acenglish/kanpig - v1.0.1
Faster, Smaller, More Accurate Genotyping
Kanpig now has sub-commands gt and plup. The new plup command will extract reads and their SVs from a bam file into a small file that's useful for long-term storage of reads.
* gt improvements:
* can parse plup files much more quickly (up to 8x) than parsing a bam. Though bam parsing is also now ~2x faster.
* parse PS and HP information from haplotagged reads to increase genotyping accuracy as well as record long-range phasing information in the output VCF.
* now uses kmedoid clustering instead of kmeans, resulting in a modest improvement to genotyping accuracy.
Full Updates: https://github.com/ACEnglish/kanpig/wiki/Updates#v101 Full Changelog: https://github.com/ACEnglish/kanpig/compare/v0.3.1...v1.0.1
- Rust
Published by ACEnglish about 1 year ago
https://github.com/acenglish/kanpig - v0.3.1
Consistency improvements
- New filtering of haplotypes without paths increases accuracy
- New path scoring improves accuracy and consistency
- ZS and SS FORMAT fields replaced by KS reporting the score
- Requiring reads to span the full variant graph window including --chunksize buffer increases accuracy
- Exhaustive search of partial haplotypes
- Slight runtime reduction from avoidance of redundant path searches
Full Changelog: https://github.com/ACEnglish/kanpig/compare/v0.3.0...v0.3.1
- Rust
Published by ACEnglish over 1 year ago
https://github.com/acenglish/kanpig - v0.3.0
General improvements
- ~8% speed increase from less work in the path-searching
- Partial haplotypes bug fix increases accuracy
- Fixed SQ and FT fields
- Dedicated writing thread helps reduce memory usage by preventing a backlog of completed variants while reading
- Default
--outis stdout to allow easier compression/indexing (e.g.kanpig .. | bcftools sort -O z -o out.vcf.gz) - IUPAC codes are fixed by kanpig according to vcf specifications (Issue #1)
- Fixed filtering of symbolic alts and BNDs
- Argument validation
Full Changelog: https://github.com/ACEnglish/kanpig/compare/v0.2.0...v0.3.0
- Rust
Published by ACEnglish over 1 year ago
https://github.com/acenglish/kanpig - v0.2.0
- Up to 40% reduction in runtime
- Hemizygous and sex chromosome aware genotyping with new
--ploidy-bed - Variants with alternate alleles of stars, monozygotic reference, and BNDs are filtered out
- Comparing PathScores by their average size and sequence similarity increases accuracy
Full Changelog: https://github.com/ACEnglish/kanpig/compare/v0.1.2...v0.2.0
- Rust
Published by ACEnglish almost 2 years ago
https://github.com/acenglish/kanpig - v0.1.2
- New optional hompolymer filter doesn't kmerize long homopolymers
- Improved logging info
- Correcting GQ field
- Correcting kmer counting
- Small speed/memory/io improvements
- Off-loaded annotation work from the single writer thread to the worker threads and using a large multiple of page size for the BufWriter capacity
- Fewer bam file opens
- Fewer clone operations
Full Changelog: https://github.com/ACEnglish/kanpig/compare/v0.1.0...v0.1.2
- Rust
Published by ACEnglish almost 2 years ago