Recent Releases of litgpt
litgpt - v0.5.10
What's Changed
- qwen2.5 long context by @ysjprojects in https://github.com/Lightning-AI/litgpt/pull/1933
- debug installing
torchfor Thunder by @Borda in https://github.com/Lightning-AI/litgpt/pull/2072 - Fix in
convert_hf_checkpointrelated to Gemma 3 by @mseeger in https://github.com/Lightning-AI/litgpt/pull/2062 - debug some failing standalone tests with compiler by @Borda in https://github.com/Lightning-AI/litgpt/pull/2075
- Qwen3 MoE by @ysjprojects in https://github.com/Lightning-AI/litgpt/pull/2060
- Deferring import of torch in config to allow faster import by @JackUrb in https://github.com/Lightning-AI/litgpt/pull/2079
- update bug-report/issue with reproducing in Studio by @Borda in https://github.com/Lightning-AI/litgpt/pull/2081
- ci: show the longest tests for improvement by @Borda in https://github.com/Lightning-AI/litgpt/pull/2083
- docs: Add documentation for OpenAI-compatible API in LitGPT deployment by @bhimrazy in https://github.com/Lightning-AI/litgpt/pull/2082
- doc: add
n_query_groupsto attention notation table by @raishish in https://github.com/Lightning-AI/litgpt/pull/2092 - doc: add comments for clarifying query / KV groups by @raishish in https://github.com/Lightning-AI/litgpt/pull/2093
- Complete pending todos in testing by @raishish in https://github.com/Lightning-AI/litgpt/pull/2088
- pin: restrict datasets version to <4.0.0 for compatibility by @Borda in https://github.com/Lightning-AI/litgpt/pull/2095
- add/debug Lit CI by @Borda in https://github.com/Lightning-AI/litgpt/pull/2094
- add
api_pathparameter to cli options to allow custom API endpoint configuration by @botirk38 in https://github.com/Lightning-AI/litgpt/pull/2080 - Fix commands in pretrain.md by @H4kor in https://github.com/Lightning-AI/litgpt/pull/2097
New Contributors
- @JackUrb made their first contribution in https://github.com/Lightning-AI/litgpt/pull/2079
- @raishish made their first contribution in https://github.com/Lightning-AI/litgpt/pull/2092
- @botirk38 made their first contribution in https://github.com/Lightning-AI/litgpt/pull/2080
- @H4kor made their first contribution in https://github.com/Lightning-AI/litgpt/pull/2097
Full Changelog: https://github.com/Lightning-AI/litgpt/compare/v0.5.9...v0.5.10
- Python
Published by Borda 6 months ago
litgpt - v0.5.9
What's Changed
- inputposmaxp1 as a Python integer by @Andrei-Aksionov in https://github.com/Lightning-AI/litgpt/pull/2016
- add testing for py3.12 & py3.13 by @Borda in https://github.com/Lightning-AI/litgpt/pull/2025
- ci: update guardian for PRs by @Borda in https://github.com/Lightning-AI/litgpt/pull/2043
- Add devcontainer by @twsl in https://github.com/Lightning-AI/litgpt/pull/2035
- Remove dependency from config to utils by @lukemerrick in https://github.com/Lightning-AI/litgpt/pull/2034
- fix: Add fallback chat template by @andyland in https://github.com/Lightning-AI/litgpt/pull/2040
- Add optional sys prompt by @twsl in https://github.com/Lightning-AI/litgpt/pull/2036
- fix: Pretraining text files with recent litdata versions by @andyland in https://github.com/Lightning-AI/litgpt/pull/2048
- tests: mark
test_evaluate_scriptas flaky by @Borda in https://github.com/Lightning-AI/litgpt/pull/2049 - simplify the GPU testing flow by @Borda in https://github.com/Lightning-AI/litgpt/pull/2053
- ci: extend testing with
ubuntu-24.04by @Borda in https://github.com/Lightning-AI/litgpt/pull/2056 - Remove litserve version constraint by @twsl in https://github.com/Lightning-AI/litgpt/pull/2055
- req: pin
bitsandbytes>=0.45.2,<0.45.5by @Borda in https://github.com/Lightning-AI/litgpt/pull/2057 - ci: use Thunder dev images for testing by @Borda in https://github.com/Lightning-AI/litgpt/pull/2054
- Update spacing in README.md by @Borda in https://github.com/Lightning-AI/litgpt/pull/2058
- Transformers version bump by @KaelanDt in https://github.com/Lightning-AI/litgpt/pull/2029
- Qwen3 Dense by @ysjprojects in https://github.com/Lightning-AI/litgpt/pull/2044
- phi-4 reasoning models by @ysjprojects in https://github.com/Lightning-AI/litgpt/pull/2047
- adding logger args by @ysjprojects in https://github.com/Lightning-AI/litgpt/pull/1973
- Qwen3 MoE Preliminary: add intermediate_size argument to MLP modules by @ysjprojects in https://github.com/Lightning-AI/litgpt/pull/2046
- OLMo 2 by @ysjprojects in https://github.com/Lightning-AI/litgpt/pull/1897
- bump: testing with PT 2.7.1 by @Borda in https://github.com/Lightning-AI/litgpt/pull/2063
- Add Dependabot for Pip & GitHub Actions by @Borda in https://github.com/Lightning-AI/litgpt/pull/2066
- build(deps): bump litdata from 0.2.45 to 0.2.49 by @dependabot in https://github.com/Lightning-AI/litgpt/pull/2068
New Contributors
- @twsl made their first contribution in https://github.com/Lightning-AI/litgpt/pull/2035
- @lukemerrick made their first contribution in https://github.com/Lightning-AI/litgpt/pull/2034
- @prabhuteja12 made their first contribution in https://github.com/Lightning-AI/litgpt/pull/2027
- @andyland made their first contribution in https://github.com/Lightning-AI/litgpt/pull/2040
- @KaelanDt made their first contribution in https://github.com/Lightning-AI/litgpt/pull/2029
Full Changelog: https://github.com/Lightning-AI/litgpt/compare/v0.5.8...v0.5.9
- Python
Published by Borda 9 months ago
litgpt - v0.5.8
Many great updates!
What's Changed
- add missing r1 prompt style by @ali-alshaar7 in https://github.com/Lightning-AI/litgpt/pull/1929
- fix incremental save for PyTorch 2.6 by @t-vi in https://github.com/Lightning-AI/litgpt/pull/1928
- Update pyproject.toml by @syntheticgio in https://github.com/Lightning-AI/litgpt/pull/1939
- fix: resolve failing CI by @Borda in https://github.com/Lightning-AI/litgpt/pull/1944
- handle wrapped thundermodules in generate by @t-vi in https://github.com/Lightning-AI/litgpt/pull/1955
- fix skip condition by @t-vi in https://github.com/Lightning-AI/litgpt/pull/1956
- ci: use HF cache by @Borda in https://github.com/Lightning-AI/litgpt/pull/1958
- nits for CI by @Andrei-Aksionov in https://github.com/Lightning-AI/litgpt/pull/1940
- ci: split HF caching by @Borda in https://github.com/Lightning-AI/litgpt/pull/1960
- bump: PT 2.6 +
bitsandbytes& standalone tests by @Borda in https://github.com/Lightning-AI/litgpt/pull/1959 - prune whitespaces for code readability by @Borda in https://github.com/Lightning-AI/litgpt/pull/1962
- fixing various typos in examples & tutorials by @Borda in https://github.com/Lightning-AI/litgpt/pull/1963
- fix
n_query_groupsfor llama-3.1-405b by @ysjprojects in https://github.com/Lightning-AI/litgpt/pull/1946 - tests: make flaky test due to connection issues by @Borda in https://github.com/Lightning-AI/litgpt/pull/1964
- Fix: incorrect gradient accumulation steps bug by @ysjprojects in https://github.com/Lightning-AI/litgpt/pull/1947
- fix: use default
num_nodes=1for back-compatibility by @Borda in https://github.com/Lightning-AI/litgpt/pull/1967 - Do not wrap LoRA layers with FSDP by @janEbert in https://github.com/Lightning-AI/litgpt/pull/1538
- Speculative decoding: Base implementation by @Andrei-Aksionov in https://github.com/Lightning-AI/litgpt/pull/1938
- Better clarity on SFT dataset attributes by @ysjprojects in https://github.com/Lightning-AI/litgpt/pull/1970
- Enforce Consistent Formatting and Validation for YAML Files by @Borda in https://github.com/Lightning-AI/litgpt/pull/1977
- Apply Standard Formatting and Fix Import & Test Name Issues by @Borda in https://github.com/Lightning-AI/litgpt/pull/1981
- Setting
config.sliding_window_layer_strideexplicity by @ysjprojects in https://github.com/Lightning-AI/litgpt/pull/1972 - feat: add linear rope type by @k223kim in https://github.com/Lightning-AI/litgpt/pull/1982
- feat: update tests for transformers 4.50.2 by @k223kim in https://github.com/Lightning-AI/litgpt/pull/1983
- fix:
test_tokenizer_against_hfby @Borda in https://github.com/Lightning-AI/litgpt/pull/1984 - feat: replace sliding window type with offset by @k223kim in https://github.com/Lightning-AI/litgpt/pull/1989
- ci: with
pull_request_targetby @Borda in https://github.com/Lightning-AI/litgpt/pull/1992 - Phi4 mini by @ysjprojects in https://github.com/Lightning-AI/litgpt/pull/1949
- aggregate
val_lossby @ysjprojects in https://github.com/Lightning-AI/litgpt/pull/1971 - feat: add local base freq for rope by @k223kim in https://github.com/Lightning-AI/litgpt/pull/1993
- test: flexible wait for serve start by @Borda in https://github.com/Lightning-AI/litgpt/pull/1996
- fix: replace sliding window configuration parameters to sliding windows indices by @k223kim in https://github.com/Lightning-AI/litgpt/pull/1995
- QwQ-32B by @ysjprojects in https://github.com/Lightning-AI/litgpt/pull/1952
- feat: run thunder tests as part of LitGPT CI by @deependujha in https://github.com/Lightning-AI/litgpt/pull/1975
- try pyupgrade-up py38 by @Borda in https://github.com/Lightning-AI/litgpt/pull/1999
- [1/4] feat: add gemma 3 27b by @k223kim in https://github.com/Lightning-AI/litgpt/pull/1998
- [2/4] add gemma 3 1b by @k223kim in https://github.com/Lightning-AI/litgpt/pull/2000
- feat: add gemma-3-12b by @k223kim in https://github.com/Lightning-AI/litgpt/pull/2002
- [3/4] feat: add gemma 3 4b by @k223kim in https://github.com/Lightning-AI/litgpt/pull/2001
- Add resume for adapter_v2, enable continued finetuning for adapter by @altria-zewei-wang in https://github.com/Lightning-AI/litgpt/pull/1354
- Fix/loading gemma 3 1b by @pquadri in https://github.com/Lightning-AI/litgpt/pull/2004
- feat: add gemma 3 in readme and tutorials by @k223kim in https://github.com/Lightning-AI/litgpt/pull/2005
- add borda as codeowner by @t-vi in https://github.com/Lightning-AI/litgpt/pull/2007
- example for full finetuning with python code by @astrobdr in https://github.com/Lightning-AI/litgpt/pull/1331
- feat: add tests for gemma3 by @k223kim in https://github.com/Lightning-AI/litgpt/pull/2006
- [pre-commit.ci] pre-commit suggestions by @pre-commit-ci in https://github.com/Lightning-AI/litgpt/pull/2009
- building tutorials as mkdocs pages by @Borda in https://github.com/Lightning-AI/litgpt/pull/2011
- Add mlflow logger support by @topikachu in https://github.com/Lightning-AI/litgpt/pull/1985
- fix support for
litserve>0.2.4by @ali-alshaar7 in https://github.com/Lightning-AI/litgpt/pull/1994 - Cast tensors in KVCache only when needed by @Andrei-Aksionov in https://github.com/Lightning-AI/litgpt/pull/2017
- feat: load only text weights from multimodal gemma by @pquadri in https://github.com/Lightning-AI/litgpt/pull/2008
- Feature: Adds support for OpenAISpec in litgpt serve by @bhimrazy in https://github.com/Lightning-AI/litgpt/pull/1943
- fix typo by @Lynsoo in https://github.com/Lightning-AI/litgpt/pull/2018
- drop upper bounds in dependencies by @t-vi in https://github.com/Lightning-AI/litgpt/pull/2022
- prepare 0.5.8 by @t-vi in https://github.com/Lightning-AI/litgpt/pull/2023
New Contributors
- @syntheticgio made their first contribution in https://github.com/Lightning-AI/litgpt/pull/1939
- @deependujha made their first contribution in https://github.com/Lightning-AI/litgpt/pull/1975
- @altria-zewei-wang made their first contribution in https://github.com/Lightning-AI/litgpt/pull/1354
- @pquadri made their first contribution in https://github.com/Lightning-AI/litgpt/pull/2004
- @astrobdr made their first contribution in https://github.com/Lightning-AI/litgpt/pull/1331
- @pre-commit-ci made their first contribution in https://github.com/Lightning-AI/litgpt/pull/2009
- @topikachu made their first contribution in https://github.com/Lightning-AI/litgpt/pull/1985
- @bhimrazy made their first contribution in https://github.com/Lightning-AI/litgpt/pull/1943
- @Lynsoo made their first contribution in https://github.com/Lightning-AI/litgpt/pull/2018
Full Changelog: https://github.com/Lightning-AI/litgpt/compare/v0.5.7...v0.5.8
- Python
Published by t-vi 10 months ago
litgpt - v0.5.7
What's Changed
- Add Deepseek r1 distill llama models by @ali-alshaar7 in https://github.com/Lightning-AI/litgpt/pull/1922
New Contributors
- @ali-alshaar7 made their first contribution in https://github.com/Lightning-AI/litgpt/pull/1922
Full Changelog: https://github.com/Lightning-AI/litgpt/compare/v0.5.6...v0.5.7
- Python
Published by t-vi about 1 year ago
litgpt - v0.5.5
What's Changed
- Post-release setup for 0.5.5.dev1 by @Andrei-Aksionov in https://github.com/Lightning-AI/litgpt/pull/1885
- Falcon3 by @ysjprojects in https://github.com/Lightning-AI/litgpt/pull/1881
- ChatML prompt template by @ysjprojects in https://github.com/Lightning-AI/litgpt/pull/1882
- Small fixes and refactoring by @mseeger in https://github.com/Lightning-AI/litgpt/pull/1861
- Drop interleave placement in QKV matrix by @Andrei-Aksionov in https://github.com/Lightning-AI/litgpt/pull/1013
- Bump PyTorch, PyTorch-Lightning and BnB versions by @Andrei-Aksionov in https://github.com/Lightning-AI/litgpt/pull/1893
- Pin version of mistune in check links workflow by @Andrei-Aksionov in https://github.com/Lightning-AI/litgpt/pull/1895
- Skip converting .safetensors to .bin by @ysjprojects in https://github.com/Lightning-AI/litgpt/pull/1853
- Some improvements for KV caching by @mseeger in https://github.com/Lightning-AI/litgpt/pull/1891
- added query-key norm to accomodate OLMo2 by @ysjprojects in https://github.com/Lightning-AI/litgpt/pull/1894
- Improve HF download speed by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1899
- Bump version for 0.5.5 release by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1901
New Contributors
- @mseeger made their first contribution in https://github.com/Lightning-AI/litgpt/pull/1861
Full Changelog: https://github.com/Lightning-AI/litgpt/compare/v0.5.4...v0.5.5
- Python
Published by rasbt about 1 year ago
litgpt - v0.5.4
What's Changed
- 0.5.3 post release setup by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1817
- Add cff file by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1818
- Deprecate Support for Dolly, Nous-Hermes, Redpajama-Incite, Vicuna and H2O Danube Models. by @ParagEkbote in https://github.com/Lightning-AI/litgpt/pull/1821
- Adding OLMo by @aflah02 in https://github.com/Lightning-AI/litgpt/pull/1827
- Adding Qwen2.5 by @ysjprojects in https://github.com/Lightning-AI/litgpt/pull/1834
- Restore SlimPajama preprocessing code by @aflah02 in https://github.com/Lightning-AI/litgpt/pull/1840
- Add QwQ-32B-Preview by @ysjprojects in https://github.com/Lightning-AI/litgpt/pull/1844
- Add Mixtral-8x22B by @ysjprojects in https://github.com/Lightning-AI/litgpt/pull/1845
- add Llama-3.3-70B-Instruct by @ysjprojects in https://github.com/Lightning-AI/litgpt/pull/1859
- add Salamandra by @ysjprojects in https://github.com/Lightning-AI/litgpt/pull/1857
- Qwen2.5: fix block size for Coder series by @ysjprojects in https://github.com/Lightning-AI/litgpt/pull/1856
- fix: add missing"," by @vra in https://github.com/Lightning-AI/litgpt/pull/1855
- fix llama3.3 readme url by @ysjprojects in https://github.com/Lightning-AI/litgpt/pull/1862
- Set torch.load(...,
weights_only=False) in litgpt/api.py by @Andrei-Aksionov in https://github.com/Lightning-AI/litgpt/pull/1874 - Add Qwen2.5 math by @ysjprojects in https://github.com/Lightning-AI/litgpt/pull/1863
- Add SmolLM2 by @ysjprojects in https://github.com/Lightning-AI/litgpt/pull/1848
- Add Mistral-Large-Instruct-2411 by @ysjprojects in https://github.com/Lightning-AI/litgpt/pull/1876
- Bump version for 0.5.4 release by @Andrei-Aksionov in https://github.com/Lightning-AI/litgpt/pull/1883
- Temporary remove Thunder to make a release by @Andrei-Aksionov in https://github.com/Lightning-AI/litgpt/pull/1884
New Contributors
- @ParagEkbote made their first contribution in https://github.com/Lightning-AI/litgpt/pull/1821
- @ysjprojects made their first contribution in https://github.com/Lightning-AI/litgpt/pull/1834
- @vra made their first contribution in https://github.com/Lightning-AI/litgpt/pull/1855
Full Changelog: https://github.com/Lightning-AI/litgpt/compare/v0.5.3...v0.5.4
- Python
Published by Andrei-Aksionov about 1 year ago
litgpt - v0.5.3
What's Changed
- Post-release setup for 0.5.3.dev1 by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1799
- Add Phi 3 128k model by @deveworld in https://github.com/Lightning-AI/litgpt/pull/1800
- Add token counts to compute performance by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1801
- Fixed the issue that precision is always "32-true". by @jianpingw in https://github.com/Lightning-AI/litgpt/pull/1802
- Add Nvidia Llama 3.1 70B Nemotron weights by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1803
- Choose evaluation example from test set by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1804
- Pretrain tok sec by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1805
- typo in converttolitgpt command by @wasifferoze in https://github.com/Lightning-AI/litgpt/pull/1807
- Move distributed all_reduce import into a function by @IvanYashchuk in https://github.com/Lightning-AI/litgpt/pull/1810
- Remove hardcoded 32-precision conversion by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1814
New Contributors
- @deveworld made their first contribution in https://github.com/Lightning-AI/litgpt/pull/1800
- @jianpingw made their first contribution in https://github.com/Lightning-AI/litgpt/pull/1802
- @wasifferoze made their first contribution in https://github.com/Lightning-AI/litgpt/pull/1807
- @IvanYashchuk made their first contribution in https://github.com/Lightning-AI/litgpt/pull/1810
Full Changelog: https://github.com/Lightning-AI/litgpt/compare/v0.5.2...v0.5.3
- Python
Published by rasbt over 1 year ago
litgpt - v0.5.2
What's Changed
- Fix prompt style usage in evaluation example by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1790
- Pin to PyTorch 2.4.1 by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1796
- Fix step iteration bug in finetuning scripts by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1794
- Housekeeping: remove unused code and fix tests by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1795
Full Changelog: https://github.com/Lightning-AI/litgpt/compare/v0.5.1...v0.5.2
- Python
Published by rasbt over 1 year ago
litgpt - v0.5.1
What's Changed
- v0.5.0 post release setup by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1774
- Be more specific about missign RoPE parameters by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1781
- Use correct Llama 3.1 and 3.2 context lengths by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1779
- Fixing Llama 3.1 and 3.2 Maximum Context Length by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1782
- Use more realistic RoPE tests by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1785
- AMD (MI250X) support by @TensorTemplar in https://github.com/Lightning-AI/litgpt/pull/1775
- Tidy up RoPE by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1786
- Bump version for 0.5.1 bugfix release by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1787
New Contributors
- @TensorTemplar made their first contribution in https://github.com/Lightning-AI/litgpt/pull/1775
Full Changelog: https://github.com/Lightning-AI/litgpt/compare/v0.5.0...v0.5.1
- Python
Published by rasbt over 1 year ago
litgpt - v0.5.0
What's Changed
- Post 0.4.13 release set up by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1755
- Add missing explanation on how to use a finetuned model by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1756
- Bump lightning version to latest stable release (2.4.0) by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1765
- Improve rope by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1745
- Add bnb.nn.StableEmbedding for quantized training by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1770
- [fix][1760] Added fix for the missing
contextkey issue in dolly! by @pytholic in https://github.com/Lightning-AI/litgpt/pull/1766 - Fix Llama 3.2 tokenizer by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1772
New Contributors
- @pytholic made their first contribution in https://github.com/Lightning-AI/litgpt/pull/1766
Full Changelog: https://github.com/Lightning-AI/litgpt/compare/v0.4.13...v0.5.0
- Python
Published by rasbt over 1 year ago
litgpt - v0.4.13
What's Changed
- Make 0.4.13.dev1 version by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1722
- Enable MPS support for LitGPT by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1724
- Simplify MPS support by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1726
- Add Chainlit Studio by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1728
- Fixing the tokenizer for slimpajama data preparation by @tomaslaz in https://github.com/Lightning-AI/litgpt/pull/1734
- Add pretrain conversion by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1735
- Typo fix and formatting improvements in API Trainer docs by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1736
- bump macos to m1 by @t-vi in https://github.com/Lightning-AI/litgpt/pull/1725
- Improve filepath handling in unit tests by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1737
- Add a more informative message in case text exceeds context size by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1738
- Update Thunder README.md by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1740
- Add sliding window attention to Mistral and Phi 3 by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1741
- Extend context length for sliding window tests by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1742
- Fix jsonarparse version by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1748
- Update RoPE tests by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1746
- Make json parsing more robust by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1749
- Support for optimizers which don't have "fused" parameter such as grokadamw and 8bit bnb by @mtasic85 in https://github.com/Lightning-AI/litgpt/pull/1744
- Increase rtol and atol in Gemma 2 for macOS by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1751
- Repair json files by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1752
- Llama 3.2 weights by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1750
- Bump version to 0.4.13 for new release by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1753
- Temporarily take out thunder dependency for deployment by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1754
New Contributors
- @tomaslaz made their first contribution in https://github.com/Lightning-AI/litgpt/pull/1734
- @mtasic85 made their first contribution in https://github.com/Lightning-AI/litgpt/pull/1744
Full Changelog: https://github.com/Lightning-AI/litgpt/compare/v0.4.12...v0.4.13
- Python
Published by rasbt over 1 year ago
litgpt - v0.4.12
What's Changed
- Preserve eos in encoding when maxseqlength = -1 by @sanderland in https://github.com/Lightning-AI/litgpt/pull/1694
- Add
batched_next_token()andbatched_sample()by @apaz-cli in https://github.com/Lightning-AI/litgpt/pull/1693 - bump thunder dependency to main by @t-vi in https://github.com/Lightning-AI/litgpt/pull/1701
- add support for batched input_pos to model by @t-vi in https://github.com/Lightning-AI/litgpt/pull/1700
- Add
batched_generate_fn()by @apaz-cli in https://github.com/Lightning-AI/litgpt/pull/1702 - minor Readme update/typos by @Borda in https://github.com/Lightning-AI/litgpt/pull/1710
- Fix device Error in Decode Stream by @Motsepe-Jr in https://github.com/Lightning-AI/litgpt/pull/1707
- Temporarily remove MPS device support by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1718
- Add uvloop as a dependency by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1719
- Change default precision on macOS by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1720
- Bump for v0.4.12 by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1721
New Contributors
- @Motsepe-Jr made their first contribution in https://github.com/Lightning-AI/litgpt/pull/1707
Full Changelog: https://github.com/Lightning-AI/litgpt/compare/v0.4.11...v0.4.12
- Python
Published by rasbt over 1 year ago
litgpt - v0.4.11
What's Changed
- Add distribute=None to python-api.md by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1676
- Make LitGPT LLM API compatible with PyTorch Lightning Trainer 1/2 by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1667
- Auto device handling in LLM API by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1677
- Fix KV cache issue in LLM API by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1678
- Improved benchmark utils by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1679
- Add PR benchmark util for internal use by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1680
- Added git hash to benchmark utility. by @apaz-cli in https://github.com/Lightning-AI/litgpt/pull/1681
- Spelling fix by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1685
- Add Microsoft Phi 3.5 checkpoint by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1687
- Update checknvlinkconnectivity by @sanderland in https://github.com/Lightning-AI/litgpt/pull/1684
- Make number of generated tokens consistent with CLI by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1690
- Avoid error when executing benchmark util outside a git folder by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1691
- Combine
generate()functions by @apaz-cli in https://github.com/Lightning-AI/litgpt/pull/1675 - Bumb version to 0.4.11 by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1695
- Fix falcon prompt template by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1696
New Contributors
- @sanderland made their first contribution in https://github.com/Lightning-AI/litgpt/pull/1684
Full Changelog: https://github.com/Lightning-AI/litgpt/compare/v0.4.10...v0.4.11
- Python
Published by rasbt over 1 year ago
litgpt - v0.4.10
What's Changed
- Support Tensor Parallel in Python API by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1661
- Swap old Llama model with Phi-3 by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1666
- Update azure-gpu-test.yml by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1669
- Support the refactored API in litgpt serve by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1668
- Multi-gpu serving by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1670
- Add Mistral Large 123B by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1673
- Bumb version to 0.4.10 for next release by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1674
Full Changelog: https://github.com/Lightning-AI/litgpt/compare/v0.4.9...v0.4.10
- Python
Published by rasbt over 1 year ago
litgpt - v0.4.9
What's Changed
- Update LitServe version and tests by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1654
- Support for using large models in the Python API via sequential generation by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1637
- Add a PyTorch Lightning example by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1656
- Refactor Python API to introduce new distribute method (part of a larger refactor for PTL support) by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1657
- Fix some issues with circular and relative imports by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1658
- Optionally return benchmark info in Python API by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1660
- Bumb version for 0.4.9 release by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1664
Full Changelog: https://github.com/Lightning-AI/litgpt/compare/v0.4.8...v0.4.9
- Python
Published by rasbt over 1 year ago
litgpt - v0.4.8
What's Changed
- Adds unit test to test for parity between streaming and non-streaming API by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1650
- Add Gemma 2 2B by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1651
- Pin litserve version by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1652
- Version bumb for Gemma 2 2B release by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1653
Full Changelog: https://github.com/Lightning-AI/litgpt/compare/v0.4.7...v0.4.8
- Python
Published by rasbt over 1 year ago
litgpt - v0.4.7
What's Changed
- Apply prompt style for tp.py and sequentially.py by @Andrei-Aksionov in https://github.com/Lightning-AI/litgpt/pull/1629
- Fix prompt docstring in Python API by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1635
- Update windows cpu-tests.yml by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1630
- Remove NumPy < 2.0 pin by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1631
- Fix kv-cache issue in Python API streaming mode by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1633
- Updates installation requirements to install minimal required packages for basic use by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1634
- Faster safetensors conversion when downloading model by @awaelchli in https://github.com/Lightning-AI/litgpt/pull/1624
- Add Sebastian as code owner by @awaelchli in https://github.com/Lightning-AI/litgpt/pull/1641
- Add missing super() call in data modules by @awaelchli in https://github.com/Lightning-AI/litgpt/pull/1639
- Update Lightning version to 2.4.0 pre by @awaelchli in https://github.com/Lightning-AI/litgpt/pull/1640
- Add tunable kvcache with error handling for nonsense inputs. by @apaz-cli in https://github.com/Lightning-AI/litgpt/pull/1636
- Use Python API in serve code by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1644
- Fix autodownload + conversion issue by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1645
- Properly clear kv-cache by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1647
- Fix error raising where maxreturnedtokens > maxseqlength_setting by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1648
- Add quantization support to litgpt serve by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1646
- Bump for 0.4.7 release by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1649
Full Changelog: https://github.com/Lightning-AI/litgpt/compare/v0.4.6...v0.4.7
- Python
Published by rasbt over 1 year ago
litgpt - v0.4.6
What's Changed
- Change default top_k to 50 everywhere for consistency by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1592
- Fix kv-cache clearing in Python API and Serve by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1596
- dynamic KV Cache batching by @aniketmaurya in https://github.com/Lightning-AI/litgpt/pull/1600
- Remove non-used eos_id in Python API by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1594
- Add quantization test and revert lightning version by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1605
- Dynamically set kv-cache size in serve by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1602
- Update LitData version and restore previous LitData assertions in tests by @awaelchli in https://github.com/Lightning-AI/litgpt/pull/1609
- Gemma 2:
9band27bversions by @Andrei-Aksionov in https://github.com/Lightning-AI/litgpt/pull/1545 - Update config hub table qlora sections by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1611
- maxreturnedtokens -> maxnewtokens by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1612
- Add warning about pretrain preprocessing by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1618
- Print warning about unsupported repo_ids by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1617
- Restore capability to load alternative weights by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1620
- Enable unbalanced number of layers in sequential generation by @awaelchli in https://github.com/Lightning-AI/litgpt/pull/1623
- Llama 3.1 8B and 70B checkpoints by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1619
- Add Llama 3.1 405B config by @awaelchli in https://github.com/Lightning-AI/litgpt/pull/1622
- Bumb version to 0.4.6 for next release (Gemma 2 and Llama 3.1) by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1626
Full Changelog: https://github.com/Lightning-AI/litgpt/compare/v0.4.5...v0.4.6
- Python
Published by rasbt over 1 year ago
litgpt - v0.4.5
What's Changed
- Update LitData to latest version 0.2.16 by @awaelchli in https://github.com/Lightning-AI/litgpt/pull/1573
- Fixed warning during model download and conversion. by @bhargavyagnik in https://github.com/Lightning-AI/litgpt/pull/1574
- Bump lightning version by @awaelchli in https://github.com/Lightning-AI/litgpt/pull/1579
- chat/base.py: extend checkpoint_dir before accessing it by @Andrei-Aksionov in https://github.com/Lightning-AI/litgpt/pull/1575
- Chat: dynamically set kv-cache size by @Andrei-Aksionov in https://github.com/Lightning-AI/litgpt/pull/1583
- Add slow interconnect warning by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1577
- Mathstral checkpoints by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1587
- Make sure quantization is used correctly by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1585
- Improve mathstral tests by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1589
- Fixes an issue where the LitGPT Python API was consuming too much memory by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1590
- Increase version number for 0.4.5 release by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1591
New Contributors
- @bhargavyagnik made their first contribution in https://github.com/Lightning-AI/litgpt/pull/1574
Full Changelog: https://github.com/Lightning-AI/litgpt/compare/v0.4.4...v0.4.5
- Python
Published by rasbt over 1 year ago
litgpt - v0.4.4
What's Changed
- Tokenizer: fix issue when decoding a single token at a time by @Andrei-Aksionov in https://github.com/Lightning-AI/litgpt/pull/1559
- Filter out false positive PyTorch scheduler warning by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1562
- Add an explanation on how to evaluate on a custom test set by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1564
- Remove duplicated bos_token for CodeLlama by @alealv in https://github.com/Lightning-AI/litgpt/pull/1566
- Fix SFTDataset sample generation by @alealv in https://github.com/Lightning-AI/litgpt/pull/1568
- Add Python API section to 0 to LitGPT docs by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1570
- Fix multi-turn prompting error handling and extra newline by @apaz-cli in https://github.com/Lightning-AI/litgpt/pull/1569
- Add automatic downloading to CLI by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1571
- Bumb version for 0.4.4 release by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1572
New Contributors
- @alealv made their first contribution in https://github.com/Lightning-AI/litgpt/pull/1566
- @apaz-cli made their first contribution in https://github.com/Lightning-AI/litgpt/pull/1569
Full Changelog: https://github.com/Lightning-AI/litgpt/compare/v0.4.3...v0.4.4
- Python
Published by rasbt over 1 year ago
litgpt - v0.4.3
What's Changed
- Fix Twine Build Issue by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1530
- Fix Mixtral prompt style by @janEbert in https://github.com/Lightning-AI/litgpt/pull/1531
- Raise warning when loading a large model on a CPU device by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1532
- Tutorial
convert_from_litgptdoc fix to removeoutput_dircommand argument by @jentfoo in https://github.com/Lightning-AI/litgpt/pull/1533 - Test against eager attention on GPTNeoX by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1537
- Bump to 0.4.3.dev0 version by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1536
- Fix for LM Eval harness 0.4.3 by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1542
- Add
Phi-3-mini-4k-instructcheckpoint by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1341 - Update phi-3 prompt template by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1544
- Add progress bar for weight loading by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1534
- Remove redundant code in checkpoint conversion script by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1547
- Update how evaluate tasks are listed by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1550
- Fix generate issue in finetuning script when finetuning with small datasets by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1551
- Tokenizer: check that
bosandeostokens are applied w.r.t. to a provided config/args by @Andrei-Aksionov in https://github.com/Lightning-AI/litgpt/pull/1527 - Add num_nodes argument by @janEbert in https://github.com/Lightning-AI/litgpt/pull/1546
- Add Phi-3 Configs by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1553
- Update to version 0.4.3 for new release by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1554
New Contributors
- @janEbert made their first contribution in https://github.com/Lightning-AI/litgpt/pull/1531
- @jentfoo made their first contribution in https://github.com/Lightning-AI/litgpt/pull/1533
Full Changelog: https://github.com/Lightning-AI/litgpt/compare/v0.4.2...v0.4.3
- Python
Published by rasbt over 1 year ago
litgpt - v0.4.2
What's Changed
- Disable "validating ..." comment if initial_validation is False by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1499
- Add simple issue templates by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1501
- Add note about dependencies by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1502
- Update test_readme.py by @Andrei-Aksionov in https://github.com/Lightning-AI/litgpt/pull/1505
- Print message when unnecessary can't be deleted by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1507
- Bumb version by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1509
- LitGPT Python API v1 by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1463
- Fix pip upgrade command on windows by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1513
- add
LLM.generatewith streaming by @aniketmaurya in https://github.com/Lightning-AI/litgpt/pull/1511 - Fix and update Python API docs by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1514
- Update LLM import and make stream/non-stream option consistent by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1515
- Extend Python API to initialize model with random weights by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1512
- Add Python API download option by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1516
- Add Rope group size as constexpr to fix thunder tests by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1520
- Explain how to access pretraining dataset options by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1510
- Bump thunder by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1519
- Add automatic checkpoint downloading in Python API by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1521
- Tokenizer: prefer HF Tokenizer by @Andrei-Aksionov in https://github.com/Lightning-AI/litgpt/pull/1518
- iterate response from streaming server by @aniketmaurya in https://github.com/Lightning-AI/litgpt/pull/1523
- Add new python API to readme by @williamFalcon in https://github.com/Lightning-AI/litgpt/pull/1525
- Remove
init="local"option by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1524 - Readme improvements by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1526
- Add
evaluateto Readme by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1528
Full Changelog: https://github.com/Lightning-AI/litgpt/compare/v0.4.1...v0.4.2
- Python
Published by rasbt over 1 year ago
litgpt - v0.4.1
What's Changed
- Fix typos in Download Model Weights documentation by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1477
- 'Add models' docs by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1478
- Exclude prompt from generated response by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1485
- Make litgpt serve consistent with LitServe by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1466
- add multi turn prompt by @aniketmaurya in https://github.com/Lightning-AI/litgpt/pull/1487
- Add docs explaining the streaming mode by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1484
- Fix Mistral 7B v0.3 weight download by @davmacario in https://github.com/Lightning-AI/litgpt/pull/1486
- Fixes for latest lm_eval by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1489
- bugfix: LLM generated text with containing
{KEY}would result in error by @aniketmaurya in https://github.com/Lightning-AI/litgpt/pull/1490 - Add SAMBA to project gallery by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1493
- Add
resume="auto"by @awaelchli in https://github.com/Lightning-AI/litgpt/pull/1492 - Fix imports by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1491
- Pin NumPy version by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1495
- Version bump for new release to fix NumPy issue by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1496
Full Changelog: https://github.com/Lightning-AI/litgpt/compare/v0.4.0...v0.4.1
- Python
Published by rasbt over 1 year ago
litgpt - v0.4.0
What's Changed
- Set litdata < 0.2.6 by @carmocca in https://github.com/Lightning-AI/litgpt/pull/1400
- Remove per-file CLIs by @carmocca in https://github.com/Lightning-AI/litgpt/pull/1397
- Simillar -> Similar by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1405
- LitData: set
iterate_over_allto False forCombinedStreamingDatasetby @Andrei-Aksionov in https://github.com/Lightning-AI/litgpt/pull/1404 - Allow multiline prompts by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1279
- Explain dataset options by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1407
- Support
no_syncwith Thunder FSDP by @carmocca in https://github.com/Lightning-AI/litgpt/pull/1414 - Minimal Python example by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1410
- Fix bug where LitData doesn't use seed by @bradfordlynch in https://github.com/Lightning-AI/litgpt/pull/1425
- Add prompt style mapping for llama3 by @davmacario in https://github.com/Lightning-AI/litgpt/pull/1406
- Simplify code by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1429
- OptimizerArgs by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1409
- Fix optimizer init with fused=True by @carmocca in https://github.com/Lightning-AI/litgpt/pull/1434
- Fix learning rate calculation in pretrain by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1435
- Align readme by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1438
- Pin litdata by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1440
- Fix README.md alignment by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1439
- Update README.md for one last time by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1442
- A more centered look by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1449
- New CLI by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1437
- Update error message by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1453
- Explain how to list all available models by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1455
- Detect tensor cores by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1456
- Check checkpoint_dir and add
checkpointsto path by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1454 - Add MicroLlama training support by @keeeeenw in https://github.com/Lightning-AI/litgpt/pull/1457
- Streaming for serving with chat's generate function by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1426
- Fix sequence length bug by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1462
- Add
lr_warmup_steps,max_stepsvalues validation by @shenxiangzhuang in https://github.com/Lightning-AI/litgpt/pull/1460 - Fix issue where path in merge_lora is overwritten by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1465
- Option to skip expensive final validation by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1372
- Allow batch size "auto" setting in evaluate by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1469
- Warn users when there is a bnb mismatch by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1468
- Allow batch argument with batch recomputation by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1470
- LitGPT Python API draft by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1459
- Bump version for PyPI release by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1476
New Contributors
- @bradfordlynch made their first contribution in https://github.com/Lightning-AI/litgpt/pull/1425
- @davmacario made their first contribution in https://github.com/Lightning-AI/litgpt/pull/1406
- @keeeeenw made their first contribution in https://github.com/Lightning-AI/litgpt/pull/1457
Full Changelog: https://github.com/Lightning-AI/litgpt/compare/v0.3.1...v0.4.0
- Python
Published by rasbt over 1 year ago
litgpt - Development release 0.4.0.dev0
What's Changed
- Streaming for serving with chat's generate function by @rasbt in #1426
- Add MicroLlama training support by @keeeeenw in #1457
- Check checkpoint_dir and add
checkpointsto path by @rasbt in #1454 - Detect tensor cores by @rasbt in #1456
- Explain how to list all available models by @rasbt in #1455
- Update error message by @rasbt in #1453
- New CLI by @rasbt in #1437
- A more centered look by @rasbt in #1449
- Update README.md for one last time by @rasbt in #1442
- Fix README.md alignment by @rasbt in #1439
- Pin litdata by @rasbt in #1440
- Align readme by @rasbt in #1438
- Fix learning rate calculation in pretrain by @rasbt in #1435
- Fix optimizer init with fused=True by @carmocca in #1434
- OptimizerArgs by @rasbt in #1409
- Simplify code by @rasbt in #1429
- Add prompt style mapping for llama3 by @davmacario in #1406
- Fix bug where LitData doesn't use seed by @bradfordlynch in #1425
- Minimal Python example by @rasbt in #1410
- Support
no_syncwith Thunder FSDP by @carmocca in #1414 - Explain dataset options by @rasbt in #1407
- Allow multiline prompts by @rasbt in #1279
- LitData: set
iterate_over_allto False forCombinedStreamingDatasetby @Andrei-Aksionov in #1404 - Simillar -> Similar by @rasbt in #1405
- Remove per-file CLIs by @carmocca in #1397
- Set litdata < 0.2.6 by @carmocca in #1400
Full Changelog: View on GitHub
- Python
Published by rasbt over 1 year ago