Rasusa
Rasusa: Randomly subsample sequencing reads to a specified coverage - Published in JOSS (2022)
compression_benchmark
Benchmarking FASTQ compression with 'mature' compression algorithms
https://github.com/biojulia/fastx.jl
Parse and process FASTA and FASTQ formatted files of biological sequences.
https://github.com/biojulia/readdatastores.jl
Datastores for reads, not your papa's FASTQ files.
fastqrepair
A pipeline that can be used to recover corrupted FASTQ.gz files, drop or fix uncompliant reads, remove unpaired reads, and settles reads that became disordered
fetch_ngs
Workflow to Fetch Public Sequencing Data and Metadata Using iSeq and MrBiomics Module.
grepq
grepq: A Rust application that quickly filters FASTQ files by matching sequences to a set of regular expressions - Published in JOSS (2025)
detaxizer
A pipeline to identify (and remove) certain sequences from raw genomic data. Default taxon to identify (and remove) is Homo sapiens. Removal is optional.
viralmetagenome
Detect iSNV and construct whole viral genomes from metagenomic samples
phredsort
`phredsort` is a cli tool for sorting sequences in a FASTQ file by their quality scores
https://github.com/ajodeh-juma/bixcop-2021-python
Simple to moderate python programming tasks with a focus in bioinformatics
nf-fastq2bam
Nextflow pipeline to convert FASTQ files to BAM for WGS and RNA-seq data