Recent Releases of assembly_finder
assembly_finder - v0.8.0
assembly_finder v0.8.0 - 24-10-2024
Simplified help output, bumped ncbi-datasets version and added option to only download summary tables.
Features
- Added option to only download summary tables by @farchaab in https://github.com/metagenlab/assembly_finder/pull/24 ## Changes
- Updated workflows and bump tool versions by @farchaab in https://github.com/metagenlab/assembly_finder/pull/23
- Simplified help output fefa3899710162b6fa77f70f6bed6c8b11e01947 b4b8fb6a6515fd6d5b224184d2a567dbff4f53d1
Full Changelog: https://github.com/metagenlab/assembly_finder/compare/v0.7.7...v0.8.0
- Python
Published by farchaab over 1 year ago
assembly_finder - v0.7.7
Summary
Small release to fix docker file, update unit tests and other small fixes
Additions
- 2621c5cd2e6f87b3f6ab3782390cec655b2020da Added unit tests for taxa and accessions string inputs
- be970e813a9022260a4ee9e41d3dd4e83b57a9cc Added ps in container (for usage in nextflow), and install curl, rsync and p7zip via apt
Changes
- 82c790406675c82135c6e53905cdc553d8a104fd Added taxon column when downloading taxa in assembly_summary.tsv
- 621e8b4ba0d2a33ef8d952ff49931b9b235cfb6e Removed curl, rsync and p7zip from env.yaml
- 066cf8a6749be42b40a35b7fabc339a75559cde3 Bumped version
Fixes
- d0fc28b6570d2b87ae456ad0e0fd0414e169e02a Return query when it does not contain GCF, GCA or "_"
PRs
- Fixed github workflows by @farchaab in https://github.com/metagenlab/assembly_finder/pull/21
- Release 0.7.7 by @farchaab in https://github.com/metagenlab/assembly_finder/pull/22
Full Changelog: https://github.com/metagenlab/assembly_finder/compare/v0.7.6...v0.7.7
- Python
Published by farchaab almost 2 years ago
assembly_finder - Release 0.7.6
Summary
Small release to fix container, conda-envs and unit-tests
Additions
71f057b835d6fe61548047b14db548af10b7c054 Added CITATION.cff file
Changes
- Conda:
- 1896d62569fcaef09478e062f62f4d36c6f15aaf Seperated utils conda envs
- 6d7724707ac5029f9eabe00c4d6d31451be03a28 Froze env versions
- 4175e0ca010efdb594f7fa0f4462613164608b81 Seperated csvtk and taxonkit envs
- a8acd3c1c6db56749461ca56457742d7e3c422ec Changed conda prefix to .snakemake/conda
- 6fce3120f69831b3e3287a6cf28f32f18a6ce58f Use p7zip instead of unzip for extracting archive on macos
- Container:
- 0a7e034bebe9260d6124c88ffa10551a62d34aaa Added all dependencies in env.yaml
Fixes
- d845dc2462de5f8232272c889485024309c3d497 Fixed arch64 container build
- 4bbc993588d6cc42038e7c6fc01aa991ba780b22 Fixed XDGCACHEHOME
- d3cb2a93bd4dd67bf68b010386cef9918cf8ba0c Fixed unexpected characters in query wildcard
- 9b18f69 Include genome and seq-report by default (fixes missing input files)
- 050dff4f288f0b014e269f15c3bf42c79232bdc5 Fixed macos-latest unit-test
Full Changelog: https://github.com/metagenlab/assembly_finder/compare/v0.7.5...v0.7.6
- Python
Published by farchaab almost 2 years ago
assembly_finder - Release 0.7.5
What's Changed
- Fix genomes paths performance by @farchaab in https://github.com/metagenlab/assembly_finder/pull/16
Full Changelog: https://github.com/metagenlab/assembly_finder/compare/v0.7.4...v0.7.5
- Python
Published by farchaab almost 2 years ago
assembly_finder - assembly_finder v0.7.4
Summary
Small patch to fix api-key arg
Full Changelog: https://github.com/metagenlab/assembly_finder/compare/v0.7.3...v0.7.4
- Python
Published by farchaab about 2 years ago
assembly_finder - assembly_finder v0.7.3
Summary
Small patch to download reference or representative genomes by default, and docs update
Full Changelog: https://github.com/metagenlab/assembly_finder/compare/v0.7.2...v0.7.3
- Python
Published by farchaab about 2 years ago
assembly_finder - assembly_finder v0.7.2
Summary
Small patch to fix taxonkit env and change temp directory cleanup
Full Changelog: https://github.com/metagenlab/assembly_finder/compare/v0.7.1...v0.7.2
- Python
Published by farchaab about 2 years ago
assembly_finder - Release 0.7.1
Summary
Release to add snaketool template and taxonkit for lineage taxonomy
What's Changed
- fb6d86132b740917315aec7806c526ca8a43caf9 Added snaketool template
- 1f5f8bde77a87471ee15431b6f4ae09a2db7e11b Added unit tests
- c551224e6e7667d3d3e1d7f37aa2d12efa8d8a4d Added taxonkit for lineage table
- b8f4b8d532e542f40118c844b73d96f52d5a1655 Cleanup for json and jsonl files
- 4704373484ccd90c2b61d931e9dc093d2ea20229 97b2b547d49616def2449bc3bb996bca5a7c3660 Added conda envs for each rule
- 8fbc583062b7f5a4ec40c99fa677b66979a3261a Updated docs
Full Changelog: https://github.com/metagenlab/assembly_finder/compare/v0.6.2...v0.7.1
- Python
Published by farchaab about 2 years ago
assembly_finder - assembly_finder v0.6.2
Summary
Small release to fix version and taxonomy
What's Changed
- 6fad4952e0e17a1d2ffcdd040cad33165e04e9c0 Removes floats in lineage_id, and uppercase rank
- 573e8bed48310d2a0a0c264166913e40ea4e49d8 Adds version without empty line
Full Changelog: https://github.com/metagenlab/assembly_finder/compare/v0.6.1...v0.6.2
- Python
Published by farchaab about 2 years ago
assembly_finder - assembly_finder v0.6.1
Summary
Small release to fix spaces in genome paths
What's Changed
- edcedad2c8f04c32672b408231365880df581646 Fixes spaces in genome paths
- 3583df172298e1d6fbb837265d21b74ac8bad915 Appends long metavars like lineage ranks to help
Full Changelog: https://github.com/metagenlab/assembly_finder/compare/v0.6.0...v0.6.1
- Python
Published by farchaab over 2 years ago
assembly_finder - assembly_finder v0.6.0
Summary
Major release to use ncbi-datasets-cli to fetch genomes, assembly info and taxonomy
What's Changed
- Release 0.6.0 by @farchaab in https://github.com/metagenlab/assembly_finder/pull/11
Full Changelog: https://github.com/metagenlab/assembly_finder/compare/v0.4.2...v0.6.0
- Python
Published by farchaab over 2 years ago
assembly_finder - assembly_finder v0.4.2
Summary
- Small release to fix genbank paths and checksum verification
Changes
- a98eacd5083c87ec501279ebb06fb61c675ea374 Changed default exclude to anomalous
- 25264d8aea11fa64cee7d009dd783f5fd46ae222 Use python os.path for rule paths
- f9307883959a6b52f9a120a20322d96218161398 Changed default outdir
Fixes
- fedcb6f859e91608907fe74d0ce2486edeb310d5 fixes absolute paths when downloading from genbank
- 25264d8aea11fa64cee7d009dd783f5fd46ae222 fixes #9
- Python
Published by farchaab over 2 years ago
assembly_finder - assembly_finder v0.4.1
Summary
- Small patch to fix input table parsing
###
Fixes - 800f44b Fixed entries and nb conversion to integers
- Python
Published by farchaab over 2 years ago
assembly_finder - assembly_finder v0.4.0
Summary
- Input table requires column names now
- Auto complete of missing columns when input is a table
- Added a more flexible input search (assembly accessions, names...)
- Added retries in downloads
- Bug fixes
- updated README
Additions
- e38d74e Added input table examples
- 39db04a Added auto complete of missing columns in input table
- d5575c3 Added retries in download rules
Changes
- fbe383a Improved params names
- a1d8ebb d1cacbc Changed input search to support assembly names, accessions and uid
- 3f4f6f0 Renamed extensions param to suffix
- d6a8b96 Use column names in input table
- e9d7e70 Changed params ordering
- 8ed3daa Improved click help message (click choices)
- 43bac2f Snakemake args are now added with "--" (--force intead of force for example)
- 1bfd3c7 Updated documentation in README
###
Fixes - 2f76629 Fixed index error when no fasta is downloaded
- 3f4f6f0 Fixed checksum extension bug
- Python
Published by farchaab over 2 years ago
assembly_finder - assembly_finder v0.3.3
Summary
- Added option to download assembly reports, fasta and more (gff,gtf ...)
- Bug fixes
## Details
###
Additions - 306278ccff83fbd4feb83cec099c97fe040800de Added option to download assembly fasta, reports, gff, gtf... #5
###
Changes - 75374611e07acab99b9c32568e1e211ce9d59933 Simplified params names
- f570581c75863636f4c92a8ad4b8dc91063e0d54 Removed unnecessary dictionary
###
Fixes - ab40351cba5fa1aed8af44a2c61e130772ec47ad 5c98c715d1787723a6eec98d0811f8937a1b6658 Fixed boolean params
- 9f837933b07f1de993956b59a2dd2554e71f04e9 Fixed assembly level filter #4
- Python
Published by farchaab over 2 years ago
assembly_finder - assembly_finder v0.3.2
Summary
- Small update to fix the bioconda package
## Details
###
Fixes - 2ac37e012e542c19a5a3c69a49aa7c2b5f0ba26b Add Snakefile and scripts in bioconda package
- bb9a9e4e7ed1590cbbfb235f2dba88d6490646d7 Fixed empty assembly method, and sequencing technology columns from assembly report
- Python
Published by farchaab over 2 years ago
assembly_finder - assembly_finder v0.3.1
assembly_finder v0.3.1
Summary
- Small update to use up-to-date aspera-cli from bioconda
## Details ###Changes - e4b85d34366aadff9f38e586e423e660fbd5abee Changed path to ascp private key
- 931114fef070ad098570abbb3461c0306e048b8a Use updated aspera-cli from bioconda instead of HCC
- Python
Published by farchaab over 2 years ago
assembly_finder - assembly_finder v0.3.0
Summary of changes
- Simplified assembly_finder cli
- Added checksum verification
- Split summary table into three tables for sequence, assembly and taxonomy summary
- Allow different parameters (assembly status, refseq category ...) for each entry
- Improvements and bugfixes
## Details
###
Additions - 25e7f0598a53579fcf2bf37d0e4455540aeb37f1 81efbf6f9107a00f85f2e9585c8c544602af4e60 Added sequencing technology, organism name, Biosource submission types in assembly table
- 81efbf6f9107a00f85f2e9585c8c544602af4e60 Added sequence table to have sequence info (header to taxid correspondance)
- 81efbf6f9107a00f85f2e9585c8c544602af4e60 Added taxonomy table to have linear taxonomy info
- 12dfee45f8c3cc8362103b7d4563a1a82bae2b5e Allow for different assembly search parameters for each entry
- dbca07a07e45173a8530e39e4a4cd428bb28c929 Added sha256 checksum verification
- 5907660f279b3e8b3e58d400271c55a77f254a5b 56a0389f8d1189ef72c67dd5955b0737bdbf6318 Added taxa.sqlite rules and option to avoid saving it in the conda env
Changes
- 81efbf6f9107a00f85f2e9585c8c544602af4e60 Split summary table into three tables for sequence, assembly and taxonomy summary
- 12dfee45f8c3cc8362103b7d4563a1a82bae2b5e Changed sort priority to refseq category then assembly status, followed by coverage and N50
- a17e3a16b4235a69dcd8f6e1e53ddea0c031c8ff Improved directory structure (moved scripts to bin directory)
- cb1f4afb88e1e09783d42243ec639def7e305e2f Removed run sub-command for simpler cli
- 25e7f0598a53579fcf2bf37d0e4455540aeb37f1 Renamed summary table headers (no more CamelCase !)
- 43a8d0ea71c7d99839f35c2fa5ad1e462bd1d498 Removed required NCBI email and key
- 2335ea7389dee36fc22f2ea940dc3eaa8c6be32a Removed wget option
- 8c947f47e8586665804d7f9b898b5106294011c1 Updated dependency versions
###
Fixes - 6f43cd7218ac51c48ae7bae791d0ed6798b58a91 Fixed number of assembly selection with or without rank filtering
- Python
Published by farchaab over 2 years ago
assembly_finder -
- added wrapper
- added ntargetrank to retrieve n genomes of each target rank (e.g species) within a taxon (e.g an entire family)
- Python
Published by tpillone almost 5 years ago
assembly_finder - First alpha release
Pipeline that searches and download genomes from NCBI's assembly database.
For this release assembly_finder:
Takes taxonomy identifiers or taxonomic ranks plus the number of assemblies to download as inputs in a tsv file.
Outputs a summary table listing information on the downloaded assemblies.
Has a search term that includes all assemblies except the ones from metagenomes (by default).
Has a filtering function that prioritizes reference or representative genomes (by default).
- Python
Published by idfarbanecha almost 7 years ago