Recent Releases of zeldarose
zeldarose - v0.12.0
Changed
- Python bumped to
>= 3.10, tests up to3.13 - Bumped datasets to
>= 3.0, < 3.2 - Bumped lightning to
< 2.6 - Remove hard dependency on
sentencepiece. Users can still install it if the tokenizer they use needs it, but their release policy is too brittle to allow them to block us, especially since it's only a quality of life dependency for us. - Bumped torch to
< 2.7 - Bumped tokenizers to
< 0.22 - Bumped transformers to allow
< 5.0, skipping versions from4.41to4.43
Full Changelog: https://github.com/LoicGrobol/zeldarose/compare/v0.11.0...v0.12.0
- Python
Published by LoicGrobol over 1 year ago
zeldarose - v0.10.0
Changed
- Bumped minimal (Pytorch) Lightning version to
2.0.0 - Pytorch compatibility changed to
>= 2.0, < 2.4 - 🤗 datasets compatibility changed to
>= 2.18, < 2.20 - Added support for the new lightning precision plugins.
Full Changelog: https://github.com/LoicGrobol/zeldarose/compare/v0.9.0...v0.10.0
- Python
Published by LoicGrobol about 2 years ago
zeldarose - v0.9.0
Fixed
- Training a m2m100 model on a language (code) not originally included in its tokenizer now works.
Changed
- Pytorch compatibility changed to
>= 2.0, < 2.3 - 🤗 datasets compatibility changed to
>= 2.18, < 2.19
Full Changelog: https://github.com/LoicGrobol/zeldarose/compare/v0.8.0...v0.9.0
- Python
Published by LoicGrobol about 2 years ago
zeldarose - v0.8.0
Fixed
- Fixed multiple save when using step-save-period in conjunction with bach accumulation (close #30)
Changed
- Maximum Pyorch compatibility bumped to 2.1
max_stepsandmax_epochscan now be set in the tuning config. Setting them via command line options is deprecated and will be removed in a future version.
- Python
Published by LoicGrobol over 2 years ago
zeldarose - v0.7.3 — Bug Fix
Fixed
- Behaviour when asking for denoising in mBART with a model that has no mask token.
- Python
Published by LoicGrobol over 3 years ago
zeldarose - v0.7.2 — Now with a doc??!?
Fixed
- In mBART training, loss scaling now works as it was supposed to.
- We have a documentation now! Check it out at https://zeldarose.readthedocs.io, it will get better over time (hopefully!).
- Python
Published by LoicGrobol over 3 years ago
zeldarose - v0.7.1 Bug fix
Fixed
- Translate loss logging is not always zero anymore.
- Python
Published by LoicGrobol over 3 years ago
zeldarose - Now with mBART translations!
The main highlight of this release is the addition of mBART training as a task, so far slightly different from the original one, but similar enough to work in our tests.
Added
- The
--tf32-modeoption allows to select the level of NVidia Ampère matmul otpimisations. - The
--seedoption allows to fix a random seed. - The
mbarttask allows training general seq2seq and translation models. - A
zeldarose command that serves as entry point for both tokenizer and transformer training.
Changed
- BREAKING
--use-fp16has been replaced by--precision, which allows to also use fp64 and bfloat. Previous behaviour can be emulated with--precision 16. - Remove the GPU stats logging from the profile mode since Lightning stopped supporting it
- Switched TOML library from toml to tomli
- BREAKING Bumped the min version of several dependency
pytorch-lightning >= 1.8.0torch >= 1.12
- Bumped max version of several dependency
datasets < 2.10pytorch-lightning < 1.9tokenizers < 0.14
- Python
Published by LoicGrobol over 3 years ago
zeldarose - v0.6.0 — Dependencies compatibilities
This one to fix compatibilities issues with our dependencies. Bumps minimal versions and add upper version limits.
Changed
- Bumped
torchmetricsminimal version to 0.9 - Bumped
datasets minimal version to 2.4 - Bumped
torchmax version to 1.12
Fixed
- Dataset fingerprinting/caching issues #31
Full Changelog: https://github.com/LoicGrobol/zeldarose/compare/v0.5.0...v0.6.0
- Python
Published by LoicGrobol almost 4 years ago
zeldarose - v0.5.0 — Housekeeping
The minor bump is because we have several new minimal version requirements (and to fairly recent versions with that). Otherwise, this is mostly internal stuff.
Added
lintextra that install linting tools and plugins- Config for flakeheaven
- Support for
pytorch-lightning 1.6
Changed
- Move packaging config to
pyproject.tomland requiresetuptools>=61. click_pathlibis no longer a dependency andclickhas a minimal version of8.0.3
Full Changelog: https://github.com/LoicGrobol/zeldarose/compare/v0.4.0...v0.5.0
- Python
Published by LoicGrobol about 4 years ago
zeldarose - v0.4.0 — experimental ELECTRA
Added
- Replaced Token Detection (ELECTRA-like) pretraining
- Some of the API is still provisional, the priority was to get it out, a nicer interface will hopefully come later.
--val-check-periodand--step-save-periodallowing to evaluate and save a model decoupled from epochs. This should be useful for training with very long epochs.- The datasets path in
zeldarose-transformercan now be 🤗 hub handles. See--help.
Changed
- The command line options have been changed to reflect change in Lightning
--acceleratoris now used for devices, tested values are"cpu"and"gpu"--strategynow specifies how to train, tested values areNone(missing),"ddp","ddp_sharded""ddp_spawn"and"ddp_sharded_spawn".- No more option to select sharded training, use the strategy alias for that
--n-gpushas been renamed to--num-devices.--n-workersand--n-nodeshave been respectively renamed to--num-workersand--num-nodes.
- Training task configs now have a
typeconfig key to specify the task type - Lightning progress bars are now provided by Rich
- Now supports Pytorch 1.11 and Python 3.10
Internal
- Tests now run in Pytest using the console-scripts plugin for smoke tests.
- Smoke tests now include
ddp_spawntests and tests on gpu devices if available. - Some refactoring for better factorization of the common utilities for MLM and RTD.
- Python
Published by LoicGrobol over 4 years ago
zeldarose - v0.3.4 —Lightning bump
Just bumping pytorch-lightning to the current minor version.
- Python
Published by LoicGrobol over 4 years ago
zeldarose - v0.3.3 — bugfix release
Changed
max_stepsis automatically inferred from the tuning config if a number of lr decay steps is givenmax_epochsis now optional (if bothmax_stepsandmax_epochsare unset and no lr schedule is provided, Lightning's default will be used)find_unused_parametersis now disabled in DDP mode, unless in profile mode- Bumped lightning to 1.4.x
Fixed
- Linear decay now properly takes the warmup period into account
- Python
Published by LoicGrobol over 4 years ago
zeldarose - v0.3.2 — switch to torchmetrics
We now internally use torchmetrics, which improves the stability of accuracy computations
Fixed
- Accuracy should stop NaN-ing
- Empty lines in datasets are now ignored
- Python
Published by LoicGrobol about 5 years ago
zeldarose - v0.3.0 — flattening some creases
Changed
- Stop saving tokenizers in legacy format
- Create data dir if they don't exist
- Python
Published by LoicGrobol about 5 years ago
zeldarose - v0.2.0 – Now eating less RAM
Added
--checkpointoption to load an existing lightning checkpoint- DDP sharding is now also possible with
ddp_spawn
Changed
- Text datasets are now loaded line-by-line by default and the block mode has been removed.
- We now use 🤗 datasets as backend, so the datasets are implemented as memory-mapped files with dynamic loaders instead of being held in RAM. This significantly decrease RAM consumption for a very decent speed cost and allows us to train on much larger datasets.
- GPU usage is now logged in
--profilemode when relevant. - LR is now logged.
Removed
- The
--line-by-lineflag has been removed, since this is now the default behaviour. - The
zeldarose-create-cachehas been removed, since dataset processing now works correctly in ddp. - The
datamodule has been completely rewritten and the Dataset classes are no more. mlm.masked_accuracysince it was not used anywhere.
Fixed
- Logging has been improved for internal pytorch warnings and pytorch-lightning and 🤗 transformers.
- Python
Published by LoicGrobol about 5 years ago