Recent Releases of Metagenomic classification with KrakenUniq on low-memory computers

Metagenomic classification with KrakenUniq on low-memory computers - KrakenUniq v1.0.4

In this release we removed the requirement to have "file" command in the Docker or Singularity image (thanks @lskatz and @boulund) by eliminating use of "file" in determining the compression type for paired read files. We now also support gzipped/bzipped fasta files with paired reads. We now force --preload switch when building the database for speed.

@alekseyzimin

Scientific Software - Peer-reviewed - C++
Published by alekseyzimin almost 3 years ago

Metagenomic classification with KrakenUniq on low-memory computers - KrakenUniq v1.0.3

This release adds Dockerfile (thanks @Jessime), and adds few changes to the documentation.

Scientific Software - Peer-reviewed - C++
Published by alekseyzimin about 3 years ago

Metagenomic classification with KrakenUniq on low-memory computers - KrakenUniq v1.0.2

This release fixes an issue with possibly incorrect output produced when running multiple krakenuniq processes in the same folder with --paired input files.

Scientific Software - Peer-reviewed - C++
Published by alekseyzimin over 3 years ago

Metagenomic classification with KrakenUniq on low-memory computers - KrakenUniq v1.0.1

This release includes a fix for https://github.com/fbreitwieser/krakenuniq/pull/117 . Thanks to @boulund

Scientific Software - Peer-reviewed - C++
Published by alekseyzimin over 3 years ago

Metagenomic classification with KrakenUniq on low-memory computers - KrakenUniq v1.0.0

This is the official 1.0.0 release of KrakenUniq. This release fixes the bug with downloading databases (Issue #87). Thanks to @clescoat for the fix!

@alekseyzimin

Scientific Software - Peer-reviewed - C++
Published by alekseyzimin over 3 years ago

Metagenomic classification with KrakenUniq on low-memory computers - KrakenUniq v0.7.3

This maintenance release provides the following updates: (1) fixes issues with building large databases (2) installs Jellyfish version 1 KRAKENUNIQINSTALLDIR/jellyfish-install/bin/, if -j switch is used (KrakenUniq requires Jellyfish version 1 to build databases) (3) fixes --work-on-disk option (#97)

@alekseyzimin

Scientific Software - Peer-reviewed - C++
Published by alekseyzimin over 3 years ago

Metagenomic classification with KrakenUniq on low-memory computers - KrakenUniq v0.7.2

This maintenance release fixes the use of --paired option in krakenuniq, and the failure at the last step (report) of building a krakenuniq database.

@alekseyzimin

Scientific Software - Peer-reviewed - C++
Published by alekseyzimin over 3 years ago

Metagenomic classification with KrakenUniq on low-memory computers - KrakenUniq v0.7.1

This minor release fixes a bug in the Makefile that resulted in installation of unusable executables countunique and setlcas. The bug resulted in fatal error in building new krakenuniq database. Classification with an existing database was not affected.

Thanks to @alekseyzimin for fixing the bug.

Scientific Software - Peer-reviewed - C++
Published by alekseyzimin over 3 years ago

Metagenomic classification with KrakenUniq on low-memory computers - KrakenUniq v0.7

New option for low-memory computers: --preload-size By default, KrakenUniq performs memory mapping to load the database; i.e., it does not load the entire database into main memory. (Kraken 1 employs the same strategy.) This makes classification of larger read datasets much slower, but it allows KrakenUniq to run on machines with low available main memory. If enough free RAM is available to hold the entire database in main memory, users are recommended to explicitly load the entire database prior to classification using the flag --preload, which dramatically speeds up the classification, often by a factor of 20 or more.

To improve the performance when not enough main memory is available to load the entire database into RAM, we added a new capability to KrakenUniq. When using this new feature, only a chunk of the database is loaded into memory at a time, after which the algorithm iterates over the reads and looks up all k-mers in those reads that are matching in this database chunk. This process is repeated until the entire database has been processed. The k-mer lookups are then merged, and reads are classified based on the results of the full database. This new feature makes it feasible to run KrakenUniq on very large datasets and huge databases on virtually any computer, even a laptop, while providing exact classifications that are identical to those of KrakenUniq in its other modes. Users can employ this feature with --preload-size and specify the amount of available main memory that they want to use for loading chunks of the database, e.g., --preload-size 8G or --preload-size 500M.

Automatic detection of compressed fastq/fasta input The input format (fastq or fasta, bzip2 or gzip compressed) is now detected automatically. No need to use --fasta-input, --fastq-input, --gzip-compressed or --bzip2-compressed switches.

Thanks to @cpockrandt for developing the code and @salzberg and @alekseyzimin for the initial idea, suggestions and testing.

Scientific Software - Peer-reviewed - C++
Published by alekseyzimin almost 4 years ago

Metagenomic classification with KrakenUniq on low-memory computers - KrakenUniq v0.6 with improved preloading of the database

KrakenUniq (and also Kraken) often ran very slow with really big databases. The problem was that --preload didn't truly force to load the DB in memory, so it spends forever (many days) going back and forth to disk. With the fix included in this release, krakenuniq ran in 16 minutes on a database where before it took >100 hours.

Thanks a lot to @alekseyzimin for the development and contribution of the fix and @salzberg for reporting the issue!

Scientific Software - Peer-reviewed - C++
Published by fbreitwieser over 4 years ago

Metagenomic classification with KrakenUniq on low-memory computers - KrakenUniq v0.5.8

  • Enable resetting taxIDs
  • Check for taxid mappings without AC version
  • Use estimated k-mer count for database build

Scientific Software - Peer-reviewed - C++
Published by fbreitwieser about 7 years ago

Metagenomic classification with KrakenUniq on low-memory computers - KrakenUniq v0.5.7

  • Fix build issues introduced in previous release
  • Fix running of krakenuniq on OSX

Scientific Software - Peer-reviewed - C++
Published by fbreitwieser about 7 years ago

Metagenomic classification with KrakenUniq on low-memory computers - KrakenUniq v0.5.6

  • Feature added: Setting LCAs hierarchically when building database
  • Streamling dustmasking of sequences
  • Check that report_file is defined before writing to it
  • Enable building with clang and without OpenMP on OSX

Scientific Software - Peer-reviewed - C++
Published by fbreitwieser about 7 years ago

Metagenomic classification with KrakenUniq on low-memory computers - KrakenUniq v0.5.5

Fixed extraction of reads with parameter -i - would stop prematurely

Scientific Software - Peer-reviewed - C++
Published by fbreitwieser about 7 years ago

Metagenomic classification with KrakenUniq on low-memory computers - KrakenUniq v0.5.3

Fix naming of scripts.

Scientific Software - Peer-reviewed - C++
Published by fbreitwieser over 7 years ago

Metagenomic classification with KrakenUniq on low-memory computers - KrakenUniq v0.5.2

Scientific Software - Peer-reviewed - C++
Published by fbreitwieser over 7 years ago

Metagenomic classification with KrakenUniq on low-memory computers - KrakenUniq v0.5.1

Fix to building issues for count_unique

Scientific Software - Peer-reviewed - C++
Published by fbreitwieser over 7 years ago

Metagenomic classification with KrakenUniq on low-memory computers - KrakenHLL v0.4.9

Scientific Software - Peer-reviewed - C++
Published by fbreitwieser over 7 years ago

Metagenomic classification with KrakenUniq on low-memory computers - KrakenHLL v0.4.8

Scientific Software - Peer-reviewed - C++
Published by fbreitwieser almost 8 years ago

Metagenomic classification with KrakenUniq on low-memory computers - KrakenHLL v0.4.2

Fixed Jellyfish install

Scientific Software - Peer-reviewed - C++
Published by fbreitwieser almost 8 years ago

Metagenomic classification with KrakenUniq on low-memory computers - KrakenHLL v0.4.1

Fixes script paths, re https://github.com/bioconda/bioconda-recipes/issues/8474 .

Scientific Software - Peer-reviewed - C++
Published by fbreitwieser almost 8 years ago

Metagenomic classification with KrakenUniq on low-memory computers - v0.4

Scientific Software - Peer-reviewed - C++
Published by fbreitwieser almost 8 years ago

Metagenomic classification with KrakenUniq on low-memory computers - v0.4-beta release

KrakenHLL is a fast metagenomics classifier that provides taxon k-mer coverage information along with read classifications. This release will go out of beta once issues #1, #5 and #7 are fixed.

Scientific Software - Peer-reviewed - C++
Published by fbreitwieser almost 8 years ago