Recent Releases of veba

[2.5.1] - 2025.04.12

Added

Added install-gpu.sh which installs GPU accelerated environments when applicable (i.e., VEBA-binning-prokaryotic_env and VEBA-binning-viral_env)
Added Dockerfile-GPU which is experimental

Changed

Changed install.sh so it only installs CPU-based environments Issue #167
Changed containerize_environments.sh so it only installs CPU-based environments Issue #167

Deprecated

Deprecated VirFinder algorithm in binning-viral.py so now only geNomad is supported

- Python
Published by jolespin about 1 year ago

[2.5.0] - 2025.04.10

Added

Added VAMB support to binning-prokaryotic.py (now a default binner) and binning_wrapper.py.
Added automatic gzipping of output files based on .gz extension in edgelist_to_clusters.py using pyexeggutor.open_file_writer.
Added xxhash dependency to VEBA-binning-prokaryotic_env for bin name reproducibility (Issue #140).
Added -e/--exclude and -d/--domain_predictions options to filter_binette_results.py for removing eukaryotic genomes and setting up domain assignments (Issue #153).
Added semibin2-[biome] option to binning-prokaryotic.py allowing specification of multiple biomes (e.g., semibin2-global, semibin2-ocean), replacing --semibin2_biome (Issue #155).
Added --semibin2_orf_finder option to binning_wrapper.py.
Added genome_statistics.tsv.gz, gene_statistics.cds.tsv.gz, gene_statistics.rRNA.tsv.gz, and gene_statistics.tRNA.tsv.gz outputs to essentials.py.
Added --identifiers, --index_name, and --no_header options to convert_metabat2_coverage.py for broader applicability, including VAMB.
Added -l eukaryota_odb12 as default but also allow --auto-lineage-euk for BUSCO in binning-eukaryotic.py

Changed

Changed binning-eukaryotic.py behavior to provide a solution to BUSCO Issue #447
Changed CHANGELOG.md format to best practice Keep a Changelog
Changed prodigal-gv to pyrodigal-gv in multithreaded mode for binning-viral.py for performance.
Removed metacoag from the default set of binning algorithms in binning-prokaryotic.py.
Updated geNomad to v1.11.0 and geNomad database to v1.8 to resolve numpy import errors (Issue #160).
Updated Pyrodigal usage in binning-eukaryotic.py for organelles to allow piping and threading.
Updated BUSCO to v5.8.3 and associated databases.
Updated Tiara to Tiara-NAL in VEBA-binning-prokaryotic_env and VEBA-binning-eukaryotic_env to enable stdin usage.
Updated biosynthetic.py to use antiSMASH v7 (Issue #159).
Changed behavior when --taxon fungi is specified: precomputed genes are not used due to formatting issues.
Simplified the method for adding headers to Diamond outputs in biosynthetic.py.
Changed Dockerfile working directory from /tmp/ to /home/.
Integrated Tiara and consensus_domain_classification.py into the binette step of binning-prokaryotic.py.
Renamed database identifier from VDB to VEBA-DB.
Updated CheckM2 and Binette versions in binning-prokaryotic.py.
Updated CheckM2 Diamond database included in VEBA-DB_v9 (Issue #154).
Removed usage of precomputed genes in the SemiBin2 wrapper due to SemiBin2/issue-#185.
Allowed faulty return codes in iterative mode for binette to permit convergence in genome recovery.

Fixed

Fixed CONDA_ENVS_PATH detection in the veba controller executable to correctly handle environments outside the base Conda directory.
Fixed bug where VFDB hits were incorrectly counted as MIBiG in biosynthetic.py (Issue #141).
Fixed --tta_threshold argument in biosynthetic.py which was previously defined but not connected to the command execution.
Removed capitalization from column headers in filter_binette_results.py output.
Fixed missing --antismash_options argument connection in biosynthetic.py.

Removed

Removed CONCOCT support from binning-eukaryotic.py.

Deprecated

Deprecated amplicon.py module in favor of external pipelines like nf-core/ampliseq.

- Python
Published by jolespin about 1 year ago

v2.4.2 fixed a small bug where de bruijn graph for MEGAHIT wasn't included in output directory if the graph was created [2025.2.1] - Added --megahitbuilddebruijngraph to make de-Bruijn graph construction for MEGAHIT optional in assembly.py