Recent Releases of oumi

What's Changed

Update README.md with v0.3 release by @stefanwebb in https://github.com/oumi-ai/oumi/pull/1904
Rename BaseExperimentalDpoDataset to BaseDpoDataset by @oelachqar in https://github.com/oumi-ai/oumi/pull/1903
[tiny] Pin exact omegaconf version by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1905
Add gpt-oss-120b Together AI inference config by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1907
feat: Add basic hugging face model management functionality in CLI by @aniruddh-alt in https://github.com/oumi-ai/oumi/pull/1897
[Breaking change] Change default model dtype from fp32 to "auto" by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1910
Add inference config for gpt-oss by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1909
Update dataset planner to use dependency injection by @jgreer013 in https://github.com/oumi-ai/oumi/pull/1913
Modify document ingestion to read bytes by @jgreer013 in https://github.com/oumi-ai/oumi/pull/1917
Update dependencies by @oelachqar in https://github.com/oumi-ai/oumi/pull/1908
Update dependencies [part 2] by @oelachqar in https://github.com/oumi-ai/oumi/pull/1912
Add processor support to AnalyzeConfig for vision-language datasets by @ryan-arman in https://github.com/oumi-ai/oumi/pull/1916
Unpin flash-attn version by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1918
Add august webinar to the readme. by @taenin in https://github.com/oumi-ai/oumi/pull/1919
Add Conversation-Level Analysis by @ryan-arman in https://github.com/oumi-ai/oumi/pull/1914
Deprecate experimental ring attention support by @oelachqar in https://github.com/oumi-ai/oumi/pull/1921
[tiny] Cleanup redundant specifications of lora_dropout 0.0 by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1926
Update README.md to highlight gpt-oss support by @stefanwebb in https://github.com/oumi-ai/oumi/pull/1927
Add lora target_parameters field and upgrade peft library by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1925
Improve e2e eval tests by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1924
Update README.md - updated after Aug webinar by @stefanwebb in https://github.com/oumi-ai/oumi/pull/1930
refactor: Build tokenizer inside analysis_utils based on input config by @ryan-arman in https://github.com/oumi-ai/oumi/pull/1928
Replace Luma link with YouTube recording for August webinar by @stefanwebb in https://github.com/oumi-ai/oumi/pull/1931
Ryan arman add analysis summary by @ryan-arman in https://github.com/oumi-ai/oumi/pull/1929
Add KTO support for preference tuning by @efsiatras in https://github.com/oumi-ai/oumi/pull/1538
[tiny] Rename mac to macos in config names by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1932
GGUF configs, MacOS LlamaCPP configs by @penfever in https://github.com/oumi-ai/oumi/pull/1923
Enable gpt-oss training and vLLM inference by @penfever in https://github.com/oumi-ai/oumi/pull/1920
Judge CLI | Display Overall Score by @kaisopos in https://github.com/oumi-ai/oumi/pull/1934
Replace mentions of deprecated huggingface-cli tool with hf by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1933
Add support for custom datasets in Analyze by @ryan-arman in https://github.com/oumi-ai/oumi/pull/1935
Judge | Validate Input Dataset by @kaisopos in https://github.com/oumi-ai/oumi/pull/1938
Add Pre-Populated GitHub Issue Link On Failures by @rlehman221 in https://github.com/oumi-ai/oumi/pull/1936
Adding Verbose Flag by @rlehman221 in https://github.com/oumi-ai/oumi/pull/1940
[tiny] Update .gitignore by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1944
Upgrade verl to 0.5.0 by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1941
Add shell script for launching jobs on NERSC Perlmutter HPC cluster by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1945
Update llamacppinference_engine.py by @penfever in https://github.com/oumi-ai/oumi/pull/1946
Simplify github issue creation in cli by @rlehman221 in https://github.com/oumi-ai/oumi/pull/1949
Add a parsable enum state field to the launcher to convey job status by @taenin in https://github.com/oumi-ai/oumi/pull/1950
Penfever/api and macos configs by @penfever in https://github.com/oumi-ai/oumi/pull/1947
Penfever/deepspeed by @penfever in https://github.com/oumi-ai/oumi/pull/1886
Run GitHub workflows in merge queue instead of on PR events by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1952
Dependency Upgrade For V4 by @rlehman221 in https://github.com/oumi-ai/oumi/pull/1942
Allow initializing DatasetAnalyzer directly with a dataset object by @ryan-arman in https://github.com/oumi-ai/oumi/pull/1953
Follow-up updates to DeepSpeed PR by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1954
Add vision dpo support, refactor dpo by @oelachqar in https://github.com/oumi-ai/oumi/pull/1922
Enable users to log data samples during training for debugging by @shanghongsim in https://github.com/oumi-ai/oumi/pull/1943
[tiny] Remove add_special_tokens kwarg from vision DPO dataset processor by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1957
Fix CPU tests by @taenin in https://github.com/oumi-ai/oumi/pull/1958
Penfever/llamacpp role fix by @penfever in https://github.com/oumi-ai/oumi/pull/1960
Update synthesis with usability feedback by @jgreer013 in https://github.com/oumi-ai/oumi/pull/1939

New Contributors

@efsiatras made their first contribution in https://github.com/oumi-ai/oumi/pull/1538
@rlehman221 made their first contribution in https://github.com/oumi-ai/oumi/pull/1936

Full Changelog: https://github.com/oumi-ai/oumi/compare/v0.3.0...v0.4.0

- Python
Published by wizeng23 10 months ago

Oumi v0.3 Changelog

🔧 Model Quantization (NEW)

Quantization is a crucially important family of methods for reducing model size, for example, prior to deployment. Oumi now supports applying Activation-aware Weight Quantization (AWQ) to all models. See how in our notebook.

Usage Example:

```

Quick start - quantize TinyLlama to 4-bit

oumi quantize --method awqq40 --model "TinyLlama/TinyLlama-1.1B-Chat-v1.0" --output quantized_model

With configuration file

oumi quantize --config quantization_config.yaml ```

⚖️ Judge API V2 (MAJOR UPDATE)

LLM-as-a-Judge is a method for using foundation models to reliably evaluate other foundation models. We’ve overhauled Oumi’s LLM-as-Judge interface for ease-of-use and flexibility. Check out our notebook here.

Usage Example:

``` from oumi.judges.simple_judge import SimpleJudge

Built-in truthfulness judge

simplejudge = SimpleJudge(judgeconfig="oumi://configs/projects/judges/generic/truthfulness.yaml")

dataset = [{"request": "What is the capital of France?", "response": "Rome"}] outputs = simple_judge.judge(dataset) ```

🎯 Adaptive Inference (NEW)

💪 Adaptive Inference, as we term it, refers to new features in Oumi for resuming training (or any task) when a job has crashed, as well as optimizing inference parallelization to maximize bandwidth. Learn more in our notebook.

🛠️ Developer Experience

Updated contributing guidelines
Enhanced documentation
Tutorial notebook fixes
Improved error handling and testing
MLflow integration improvements
Multi-node verl Slurm job support
Rich logging handler option

New Contributors

@amarpal made their first contribution in https://github.com/oumi-ai/oumi/pull/1831
@42Shawn made their first contribution in https://github.com/oumi-ai/oumi/pull/1837

Full Changelog: https://github.com/oumi-ai/oumi/compare/v0.2.1...v0.3.0

- Python
Published by wizeng23 11 months ago

oumi - v0.3.0

Oumi v0.3 Changelog

🔧 Model Quantization (NEW)

Usage Example:

```

Quick start - quantize TinyLlama to 4-bit

oumi quantize --method awqq40 --model "TinyLlama/TinyLlama-1.1B-Chat-v1.0" --output quantized_model

With configuration file

oumi quantize --config quantization_config.yaml ```

⚖️ Judge API V2 (MAJOR UPDATE)

Usage Example:

``` from oumi.judges.simple_judge import SimpleJudge

Built-in truthfulness judge

simplejudge = SimpleJudge(judgeconfig="oumi://configs/projects/judges/generic/truthfulness.yaml")

dataset = [{"request": "What is the capital of France?", "response": "Rome"}] outputs = simple_judge.judge(dataset) ```

🎯 Adaptive Inference (NEW)

🛠️ Developer Experience

Updated contributing guidelines
Enhanced documentation
Tutorial notebook fixes
Improved error handling and testing
MLflow integration improvements
Multi-node verl Slurm job support
Rich logging handler option

New Contributors

@amarpal made their first contribution in https://github.com/oumi-ai/oumi/pull/1831
@42Shawn made their first contribution in https://github.com/oumi-ai/oumi/pull/1837

Full Changelog: https://github.com/oumi-ai/oumi/compare/v0.2.1...v0.3.0

- Python
Published by wizeng23 11 months ago

oumi - v0.2.1

What's Changed

Set inferonline and inferfrom_file to private by @jgreer013 in https://github.com/oumi-ai/oumi/pull/1745
Update launch.md by @shanghongsim in https://github.com/oumi-ai/oumi/pull/1781
Add adaptive semaphore to enable future adaptive throughput scenarios by @jgreer013 in https://github.com/oumi-ai/oumi/pull/1780
Fix a pyright regression by @taenin in https://github.com/oumi-ai/oumi/pull/1783
Judge API V2 | Fix judge config from repo path by @kaisopos in https://github.com/oumi-ai/oumi/pull/1782
Add permutable attributes and combination sampling for data synthesis by @jgreer013 in https://github.com/oumi-ai/oumi/pull/1773
Removed collator in finetuning tutorial notebook by @shanghongsim in https://github.com/oumi-ai/oumi/pull/1788
Update our contributing guidelines. by @taenin in https://github.com/oumi-ai/oumi/pull/1789
Add adaptive concurrency controller in preparation for adaptive inference by @jgreer013 in https://github.com/oumi-ai/oumi/pull/1784
Fixed issue with final conversations not consistently being saved by @jgreer013 in https://github.com/oumi-ai/oumi/pull/1795
Add support for ingesting datasets for synthesis by @jgreer013 in https://github.com/oumi-ai/oumi/pull/1790
Add support for adaptive inference by @jgreer013 in https://github.com/oumi-ai/oumi/pull/1791
Add support for Example Sources in Synthesis by @jgreer013 in https://github.com/oumi-ai/oumi/pull/1797
Webinar announcement and other news by @stefanwebb in https://github.com/oumi-ai/oumi/pull/1800
Added utm_source parameters by @stefanwebb in https://github.com/oumi-ai/oumi/pull/1802
Add code to handle document ingestion by @jgreer013 in https://github.com/oumi-ai/oumi/pull/1796
Add code for handling basic document segmentation by @jgreer013 in https://github.com/oumi-ai/oumi/pull/1803
Update mflow support in oumi trainer by @oelachqar in https://github.com/oumi-ai/oumi/pull/1804
Add multi-node verl SLURM job by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1798
Fixed various tutorial notebooks by @shanghongsim in https://github.com/oumi-ai/oumi/pull/1792
Add parameter logging to oumi trainer by @oelachqar in https://github.com/oumi-ai/oumi/pull/1807
Judge API V2 | Enable prompt variable replacement by YAML by @kaisopos in https://github.com/oumi-ai/oumi/pull/1805
[tiny] Update train config comment header by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1809
Add experimental option to use the rich logging handler by @oelachqar in https://github.com/oumi-ai/oumi/pull/1810

New Contributors

@shanghongsim made their first contribution in https://github.com/oumi-ai/oumi/pull/1781

Full Changelog: https://github.com/oumi-ai/oumi/compare/v0.2.0...v0.2.1

- Python
Published by kaisopos 11 months ago

oumi - v0.2.0

Highlights

GRPO support for trl and verl trainers

Oumi now supports GRPO training for both the trl and verl libraries! This allows you to run GRPO training with no/low code using Oumi's configs. You can also benefit from other features of the Oumi platform, such as custom evaluation and launching remote jobs.

Running GRPO training in Oumi is as simple as:

Create a reward function, and register it to Oumi's reward function registry using @register("<my_reward_fn>", RegistryType.REWARD_FUNCTION).
Create a dataset class to process your HF dataset into the format needed for your target framework, and register it to Oumi's dataset registry using @register_dataset("@hf-org-name/my-dataset-name").
Create an Oumi training config with your model, dataset, reward function, and hyperparameters. For specific details on setting up the config for GRPO, see our documentation.
Launch the training job locally using the oumi train CLI, or launch a remote job using the oumi launch CLI.

For an end-to-end example using Oumi + trl, check out our notebook walkthrough. For verl, check out our multi-modal Geometry3K config. Finally, check out our blog post for more information.

Models built with Oumi: HallOumi and CoALM

We’re proud to announce the release of two models built with Oumi: HallOumi and CoALM! Both of these were trained on Oumi, and we provide recipes to reproduce their training from scratch.

🧀 HallOumi: A truly open-source claim verification (hallucination detection) model developed by Oumi, outperforming Claude Sonnet, OpenAI o1, DeepSeek R1, Llama 405B, and Gemini Pro at only 8B parameters. Check out the Oumi recipe to train the model here.
🤖 CoALM: Conversational Agentic Language Model (CoALM) is a a unified approach that integrates both conversational and agentic capabilities. It includes an instruction tuning dataset and three trained models (8B, 70B, 405B). The project was a partnership between the ConvAI Lab at UIUC and Oumi, and the paper was accepted to ACL. Check out the Oumi recipes to train the models here.

New model support: Llama 4, Qwen3, Falcon H1, and more

We’ve added support for many recent models to Oumi, with tested recipes that work out-of-the-box!

Vision Language Models
Text-to-text LLMs
- Falcon-H1 and Falcon-E
- Qwen3
- Phi-4-reasoning

Support for Slurm and Frontier clusters

At Oumi, we want unify and simplify the processes for running jobs on remote clusters. We have now added support for launching jobs on Slurm clusters, and on Frontier, a supercomputer at the Oak Ridge Leadership Computing Facility.

What's Changed

[bugfix] Allow prerelease when building docker image by @oelachqar in https://github.com/oumi-ai/oumi/pull/1753
Update link to Oumi banner image in README by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1752
docs: add a badge and link to the social network Twitter by @Radovenchyk in https://github.com/oumi-ai/oumi/pull/1751
Support OLCF (Oak Ridge Leadership Computing Facility) Frontier HPC cluster in Oumi launcher by @nikg4 in https://github.com/oumi-ai/oumi/pull/1721
Judge API V2 | Core Functionality by @kaisopos in https://github.com/oumi-ai/oumi/pull/1717
Update oumi distributed torchrun to fallback to oumi train -c cfg.yaml .... on a single-node with 1 GPU by @nikg4 in https://github.com/oumi-ai/oumi/pull/1755
deps: Upgrade verl to 0.4.0 by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1749
add DCVLR logo to readme by @penfever in https://github.com/oumi-ai/oumi/pull/1754
Judge API V2 | Few-Shots by @kaisopos in https://github.com/oumi-ai/oumi/pull/1746
Update infer.md to fix a broken link by @ryan-arman in https://github.com/oumi-ai/oumi/pull/1756
Judge API V2 | minor nit by @kaisopos in https://github.com/oumi-ai/oumi/pull/1757
[Evaluation] Disabling flaky MMMU test by @kaisopos in https://github.com/oumi-ai/oumi/pull/1758
Automatically tail SkyPilot logs by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1761
Enable vLLM for trl GRPO jobs by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1760
Judge API V2 | Implement CLI by @kaisopos in https://github.com/oumi-ai/oumi/pull/1759
Updates to Oumi news for May, June by @stefanwebb in https://github.com/oumi-ai/oumi/pull/1763
Additional news items by @stefanwebb in https://github.com/oumi-ai/oumi/pull/1764
Judge API V2 | Support for built-in judges by @kaisopos in https://github.com/oumi-ai/oumi/pull/1762
[bug] safetensors v0.6.0rc0 is causing a regression, prevent upgrading by @oelachqar in https://github.com/oumi-ai/oumi/pull/1772
[verl] Support resuming from checkpoint by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1766
Upgrade accelerate and peft by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1774
[tiny] Pin flash-attn version by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1775
Pin the version of lm_eval to prevent a breaking change in the 4.9 release by @taenin in https://github.com/oumi-ai/oumi/pull/1777
Update inference to resume from temporary result file when possible by @jgreer013 in https://github.com/oumi-ai/oumi/pull/1734
[tiny] Fix gradient checkpointing for Oumi trainer by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1778
[tiny] Remove use_liger argument by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1779
Judge API V2 | Merge Judge and Inference configs by @kaisopos in https://github.com/oumi-ai/oumi/pull/1776

Full Changelog: https://github.com/oumi-ai/oumi/compare/v0.1.14...v0.2.0

- Python
Published by wizeng23 12 months ago

oumi - v0.1.14

What's Changed

Record latency histograms in base inference engine by @nikg4 in https://github.com/oumi-ai/oumi/pull/1702
Feat: add falcon-e integration by @younesbelkada in https://github.com/oumi-ai/oumi/pull/1705
[tiny] Minor update to fix the failing pre-commit checks by @oelachqar in https://github.com/oumi-ai/oumi/pull/1707
Add collator kwargs field to DataParams by @oelachqar in https://github.com/oumi-ai/oumi/pull/1708
[vision] Add option to process images individually by @oelachqar in https://github.com/oumi-ai/oumi/pull/1706
Update dev_setup.md to correct the order of steps by @ryan-arman in https://github.com/oumi-ai/oumi/pull/1709
Add configs for molmo support by @oelachqar in https://github.com/oumi-ai/oumi/pull/1710
[tiny] fix pre-commits checks on a fresh install by @oelachqar in https://github.com/oumi-ai/oumi/pull/1711
Add config for the Molmo O variant by @oelachqar in https://github.com/oumi-ai/oumi/pull/1712
Add experimental molmo grpo config and train aliases by @oelachqar in https://github.com/oumi-ai/oumi/pull/1713
Update installation.md to fix subversion handling by adding required … by @ryan-arman in https://github.com/oumi-ai/oumi/pull/1715
Frontier: Fix -n param in launcher script by @nikg4 in https://github.com/oumi-ai/oumi/pull/1720
Fix Falcon H1 dependency setup by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1723
letter count notebook improvements by @penfever in https://github.com/oumi-ai/oumi/pull/1697
[vision] Update vision feature generator to support training on completions only by @oelachqar in https://github.com/oumi-ai/oumi/pull/1722
[tiny] fix bug with vl collator by @oelachqar in https://github.com/oumi-ai/oumi/pull/1725
Add data synthesis config, params, and unit tests by @jgreer013 in https://github.com/oumi-ai/oumi/pull/1700
Add support for additional exception types for remote inference engine, as well as fast failing for non-retryable status codes. by @jgreer013 in https://github.com/oumi-ai/oumi/pull/1704
Adds DPO + QLoRA example for Falcon-H1 by @stefanwebb in https://github.com/oumi-ai/oumi/pull/1719
Update inference to always write intermediate results to file. by @jgreer013 in https://github.com/oumi-ai/oumi/pull/1724
Added doc for new QLoRA param by @stefanwebb in https://github.com/oumi-ai/oumi/pull/1727
Readme for Falcon-E and note on extra dependencies required by @stefanwebb in https://github.com/oumi-ai/oumi/pull/1729
Add generic vision dataset by @oelachqar in https://github.com/oumi-ai/oumi/pull/1726
[tiny][bug] make git cmd optional by @oelachqar in https://github.com/oumi-ai/oumi/pull/1730
[tiny][bug] Add missing molmo feature by @oelachqar in https://github.com/oumi-ai/oumi/pull/1731
[tiny] Update phi3-vision configs to use oumi trainer by @oelachqar in https://github.com/oumi-ai/oumi/pull/1733
Minor bugfixes for 2 clouds in launcher code by @nikg4 in https://github.com/oumi-ai/oumi/pull/1728
Update dev_setup.md to add additional instructions by @ryan-arman in https://github.com/oumi-ai/oumi/pull/1736
Update trl to 0.18 by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1693
Update Verl trainer to export models in HF format by @nikg4 in https://github.com/oumi-ai/oumi/pull/1714
Add lmms-lab/multimodal-open-r1-8k-verified dataset by @oelachqar in https://github.com/oumi-ai/oumi/pull/1732
Add placeholders for DCVLR by @oelachqar in https://github.com/oumi-ai/oumi/pull/1738
add debug logging capabilities to collators by @aniruddh-alt in https://github.com/oumi-ai/oumi/pull/1678
[bug] update trainer to save processor when training with fsdp by @oelachqar in https://github.com/oumi-ai/oumi/pull/1742
Add model revision param by @oelachqar in https://github.com/oumi-ai/oumi/pull/1740
Add ability to customize HF model config via model.model_kwargs by @oelachqar in https://github.com/oumi-ai/oumi/pull/1741
Add docker release workflow by @oelachqar in https://github.com/oumi-ai/oumi/pull/1743
[bug] fix rank/local rank parsing for docker env by @oelachqar in https://github.com/oumi-ai/oumi/pull/1747
deps: Update vLLM to 0.8.3 by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1739
[docs] update dcvlr readme by @oelachqar in https://github.com/oumi-ai/oumi/pull/1748
Dcvlr by @penfever in https://github.com/oumi-ai/oumi/pull/1750

New Contributors

@younesbelkada made their first contribution in https://github.com/oumi-ai/oumi/pull/1705
@ryan-arman made their first contribution in https://github.com/oumi-ai/oumi/pull/1709
@stefanwebb made their first contribution in https://github.com/oumi-ai/oumi/pull/1719
@aniruddh-alt made their first contribution in https://github.com/oumi-ai/oumi/pull/1678

Full Changelog: https://github.com/oumi-ai/oumi/compare/v0.1.13...v0.1.14

- Python
Published by wizeng23 about 1 year ago

oumi - v0.1.13

What's Changed

Update dev_setup.md by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1641
[tiny] Remove vllm install commands by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1643
Support for custom processor args: misc improvements by @nikg4 in https://github.com/oumi-ai/oumi/pull/1642
Add Countdown dataset and reward function by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1645
Adding LoRA train config for Qwen-VL 2.0 by @optas in https://github.com/oumi-ai/oumi/pull/1637
[Evaluation] Convenience function for standard config retrieval by @kaisopos in https://github.com/oumi-ai/oumi/pull/1644
Add demo script by @oelachqar in https://github.com/oumi-ai/oumi/pull/1647
[bug] fix build errors by @oelachqar in https://github.com/oumi-ai/oumi/pull/1649
Adding LoRA train config for SmolVLM by @optas in https://github.com/oumi-ai/oumi/pull/1639
[tiny] Update cli help shorthand by @oelachqar in https://github.com/oumi-ai/oumi/pull/1648
Oelachqar/update hooks by @oelachqar in https://github.com/oumi-ai/oumi/pull/1650
Add verl PPO trainer by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1646
Fix a missing dependency in the verl trainer. by @taenin in https://github.com/oumi-ai/oumi/pull/1651
Integrate verl GRPO trainer into train script by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1652
Update e2e tests to run on lambda by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1653
Add Qwen3 32B configs by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1661
Add Qwen3 30B A3B configs by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1665
[verl] Populate verl config from Oumi config by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1659
Provide option to configure label_ignore_index in training config by @nikg4 in https://github.com/oumi-ai/oumi/pull/1666
[Documentation] Custom Evaluations (PR 1-of-2) by @kaisopos in https://github.com/oumi-ai/oumi/pull/1664
InterVL-3.0 SFT with limited training capabilities by @optas in https://github.com/oumi-ai/oumi/pull/1663
Add verl GRPO Countdown configs by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1668
Set explicit permissions for our test workflows. by @taenin in https://github.com/oumi-ai/oumi/pull/1670
Add support for repetition_penalty in GrpoParams by @REDDITARUN in https://github.com/oumi-ai/oumi/pull/1654
Fix broken tests due to precommit violations by @taenin in https://github.com/oumi-ai/oumi/pull/1671
[Documentation] Custom Evaluations (PR 2-of-2) by @kaisopos in https://github.com/oumi-ai/oumi/pull/1669
Migrate to logger.warning usage by @emmanuel-ferdman in https://github.com/oumi-ai/oumi/pull/1673
Update the Oumi launcher and e2e tests to support runpod. by @taenin in https://github.com/oumi-ai/oumi/pull/1672
Switch back to using GCP for e2e tests. by @taenin in https://github.com/oumi-ai/oumi/pull/1675
Mark an e2e test as is_lora by @taenin in https://github.com/oumi-ai/oumi/pull/1676
Add Phi4 reasoning plus configs by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1674
Fix a test breakage caused by a new Click version (8.2.0) by @taenin in https://github.com/oumi-ai/oumi/pull/1679
chore: edited the link to the stars badge by @Radovenchyk in https://github.com/oumi-ai/oumi/pull/1681
Update verl GRPO countdown configs by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1682
[very nit] center oumi logo in the cli by @oelachqar in https://github.com/oumi-ai/oumi/pull/1683
[tiny] Update training environments doc by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1686
Add Geometry3K VLM dataset by @nikg4 in https://github.com/oumi-ai/oumi/pull/1687
Add torchao version to pyproject.toml by @nikg4 in https://github.com/oumi-ai/oumi/pull/1688
[Feature] Save evaluation config as YAML in output_dir #1546 by @asish-kun in https://github.com/oumi-ai/oumi/pull/1680
Create a script to calculate memory used during training by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1441
Support VLM-s with VERL_GRPO trainer by @nikg4 in https://github.com/oumi-ai/oumi/pull/1689
docs: Add GRPO/verl documentation by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1690
Update GRPO letter counting reward function and hparams for stability by @jgreer013 in https://github.com/oumi-ai/oumi/pull/1692
[GRPO] Update letter counting notebook by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1694
Add Lambda Inference Engine by @oelachqar in https://github.com/oumi-ai/oumi/pull/1695
Basic shell script for launching jobs on OLCF Frontier HPC cluster by @nikg4 in https://github.com/oumi-ai/oumi/pull/1691
Add CoALM dataset class by @oelachqar in https://github.com/oumi-ai/oumi/pull/1696
Added exponential backoff and content-type error handling in remote inference engine by @abhiramvad in https://github.com/oumi-ai/oumi/pull/1685
Make SFT datasets usable with GRPO_TRL trainer by @nikg4 in https://github.com/oumi-ai/oumi/pull/1698
Implement Falcon H1 by @dhiaEddineRhaiem in https://github.com/oumi-ai/oumi/pull/1699
[tiny] Remove deprecated use_async_dataset from configs by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1701
Add sample inference configs for HuggingFaceTB/SmolVLM-Instruct by @nikg4 in https://github.com/oumi-ai/oumi/pull/1703

New Contributors

@REDDITARUN made their first contribution in https://github.com/oumi-ai/oumi/pull/1654
@emmanuel-ferdman made their first contribution in https://github.com/oumi-ai/oumi/pull/1673
@Radovenchyk made their first contribution in https://github.com/oumi-ai/oumi/pull/1681
@asish-kun made their first contribution in https://github.com/oumi-ai/oumi/pull/1680
@abhiramvad made their first contribution in https://github.com/oumi-ai/oumi/pull/1685
@dhiaEddineRhaiem made their first contribution in https://github.com/oumi-ai/oumi/pull/1699

Full Changelog: https://github.com/oumi-ai/oumi/compare/v0.1.12...v0.1.13

- Python
Published by nikg4 about 1 year ago

oumi - v0.1.12

What's Changed

Add vllm to gpu optional dependencies by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1614
[HallOumi] Update inference notebook by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1613
Update llama4 GCP jobs for non-dev environments. by @taenin in https://github.com/oumi-ai/oumi/pull/1621
Update transformers to 4.51.0 by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1620
Lazy load skypilot by @taenin in https://github.com/oumi-ai/oumi/pull/1622
Add additionalmodelkwargs and additionaltrainerkwargs to train function by @hommayushi3 in https://github.com/oumi-ai/oumi/pull/1624
Added 3 Pixmo vision-language datasets by @jrwana in https://github.com/oumi-ai/oumi/pull/1523
[GRPO] Add notebook to demonstrate GRPO & evaluation for letter counting by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1625
[Remote Inference] Update Default Params by @kaisopos in https://github.com/oumi-ai/oumi/pull/1630
Update trl to 0.16 by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1631
Support custom processor args in ModelParams by @nikg4 in https://github.com/oumi-ai/oumi/pull/1634
Support BerryBench evaluation by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1635
[Remote Inference] Error checking for api_key by @kaisopos in https://github.com/oumi-ai/oumi/pull/1638
Rename cnnmnistexample to cnnmnisttutorial by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1640
[Remote Inference][GCP] Constructing api_url from the Project ID and Region by @kaisopos in https://github.com/oumi-ai/oumi/pull/1636

New Contributors

@jrwana made their first contribution in https://github.com/oumi-ai/oumi/pull/1523

Full Changelog: https://github.com/oumi-ai/oumi/compare/v0.1.11...v0.1.12

- Python
Published by jgreer013 about 1 year ago

oumi - v0.1.11

Oumi v0.1.11 Release Notes 🚀

Key Highlights

Model Support 🤖

Integrated Llama 4 (Scout and Maverick variants) with complete workflow configs 🦙
Added LoRA training for Phi3, Phi4, and Qwen2.5-VL multimodal models 🖼️

Developer Experience 💻

Introduced MLflow integration for experiment tracking 📝
Enhanced CLI with convenient alias functionality ⌨️

HallOumi Framework 🧠

Added examples for Halloumi
Added dedicated inference notebooks for both generative and classifier approaches 📓

Welcome to our new contributors @hommayushi3 and @gabrielaugz! 👋

For details, see the full changelog.

- Python
Published by oelachqar about 1 year ago

oumi - v0.1.10

What's Changed

Increment pillow version for compatibility with Python 3.13 by @nikg4 in https://github.com/oumi-ai/oumi/pull/1566
[Evaluation] Bug: Multiple GPUs attempt to save in the same folder by @kaisopos in https://github.com/oumi-ai/oumi/pull/1567

Full Changelog: https://github.com/oumi-ai/oumi/compare/v0.1.9...v0.1.10

- Python
Published by nikg4 about 1 year ago

oumi - 0.1.9

What's Changed

Add QwQ full fine-tune and QLoRA configs by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1518
Update TRL to 0.15 and fix Liger/dataset code by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1507
[tiny] Remove vLLM Colab link and fix Alpaca Eval quickstart by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1530
Evaluation: Inference optimizations by @kaisopos in https://github.com/oumi-ai/oumi/pull/1522
Qwen2.5 VL: Replace "from source" install with transformers>=0.49 by @nikg4 in https://github.com/oumi-ai/oumi/pull/1528
[Evaluation] Renaming evaluation_platform → evaluation_backend by @kaisopos in https://github.com/oumi-ai/oumi/pull/1526
[tiny] Clean up datasets code by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1529
Minor logging improvements in BaseMapDataset by @nikg4 in https://github.com/oumi-ai/oumi/pull/1532
Upload scripts used in a Weekly Walkthrough by @taenin in https://github.com/oumi-ai/oumi/pull/1533
Update VisionLanguageConversationFeatureGenerator by @nikg4 in https://github.com/oumi-ai/oumi/pull/1531
[docs] add security.md by @oelachqar in https://github.com/oumi-ai/oumi/pull/1534
[Evaluation] Custom evaluation notebook: a reliability classifier by @kaisopos in https://github.com/oumi-ai/oumi/pull/1535
Multimodal: Limit max number of images per Conversation by @nikg4 in https://github.com/oumi-ai/oumi/pull/1536
Auto-populate and validate params specific to vision_language_sft collator in TrainingConfig by @nikg4 in https://github.com/oumi-ai/oumi/pull/1537
Update Oumi Env to use Rich formatting by @taenin in https://github.com/oumi-ai/oumi/pull/1541
Update oumi launch to use Rich formatting by @taenin in https://github.com/oumi-ai/oumi/pull/1543
Update oumi evaluate to use rich formatting. by @taenin in https://github.com/oumi-ai/oumi/pull/1544
Update the CLI to replace all prints with Rich prints. by @taenin in https://github.com/oumi-ai/oumi/pull/1547
Render the oumi env command as a shell block in bug reports. by @taenin in https://github.com/oumi-ai/oumi/pull/1548
Define Conversation proto bufs by @nikg4 in https://github.com/oumi-ai/oumi/pull/1550
[Evaluation] Modifying Alpaca Eval results format to be consistent with LM Harness by @kaisopos in https://github.com/oumi-ai/oumi/pull/1551
Augmenting logging training/model statistics by @optas in https://github.com/oumi-ai/oumi/pull/1545
Misc no-op code cleanups by @nikg4 in https://github.com/oumi-ai/oumi/pull/1553
Add code used for the evaluation demo. by @taenin in https://github.com/oumi-ai/oumi/pull/1556
Add OUMI_FORCE_EDITABLE_INSTALL env var to do editable Oumi install from source in job configs by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1420
Add letter counting GRPO example by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1539
Remove UV install from notebooks as this breaks colab by @taenin in https://github.com/oumi-ai/oumi/pull/1558
[Evaluation] Updates in hallucination notebook by @kaisopos in https://github.com/oumi-ai/oumi/pull/1552
[Evaluations] Custom evals: Adding support for eval_kwargs by @kaisopos in https://github.com/oumi-ai/oumi/pull/1557
Logging message update in log_number_of_model_parameters by @nikg4 in https://github.com/oumi-ai/oumi/pull/1560
[Evaluation][Custom] Removing restrictions and better error checking by @kaisopos in https://github.com/oumi-ai/oumi/pull/1561
Support text truncation (max_length) for vision_language_sft collator by @nikg4 in https://github.com/oumi-ai/oumi/pull/1559
phi 4 multimodal training version 1 ( with limitations ) by @optas in https://github.com/oumi-ai/oumi/pull/1555
Phi-4 basic inference with native/vllm by @optas in https://github.com/oumi-ai/oumi/pull/1563
[minor] phi4 train improvements by @optas in https://github.com/oumi-ai/oumi/pull/1564
Fix printing errors in oumi env for non-string values. by @taenin in https://github.com/oumi-ai/oumi/pull/1565

Full Changelog: https://github.com/oumi-ai/oumi/compare/v0.1.8...v0.1.9

- Python
Published by nikg4 about 1 year ago

oumi - v0.1.8

What's Changed

GRPO trainer: Minimal initial integration by @nikg4 in https://github.com/oumi-ai/oumi/pull/1482
Update oumi infer to fall back to interactive mode if no input path is specified. by @taenin in https://github.com/oumi-ai/oumi/pull/1483
Add sample DDP/GCP config for GRPO trainer by @nikg4 in https://github.com/oumi-ai/oumi/pull/1485
Temporary fix for chat template issue with multimodal inference w/ in-process vLLM engine by @nikg4 in https://github.com/oumi-ai/oumi/pull/1486
[tiny] Update async_eval.yaml comments to reference correct class by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1488
Fix a bug where overriding remote_params fails via the CLI (oumi infer) by @taenin in https://github.com/oumi-ai/oumi/pull/1487
Define GrpoParams under configs by @nikg4 in https://github.com/oumi-ai/oumi/pull/1490
Support more GRPO params by @nikg4 in https://github.com/oumi-ai/oumi/pull/1491
Minor updates to oumi env by @nikg4 in https://github.com/oumi-ai/oumi/pull/1492
Warn instead of error when device not found for MFU calculation by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1489
Updated all CLI endpoints to support oumi:// prefix by @Spaarsh in https://github.com/oumi-ai/oumi/pull/1468
Fix chat template issue for nested content parts used for VLMs by @nikg4 in https://github.com/oumi-ai/oumi/pull/1493
Ctseng777/judge by @ctseng777 in https://github.com/oumi-ai/oumi/pull/1474
[Evaluation] Modularization & enabling custom evaluations by @kaisopos in https://github.com/oumi-ai/oumi/pull/1484
Update documentation formatting for BaseModel by @taenin in https://github.com/oumi-ai/oumi/pull/1494
Fix log_samples not propagating from eval_kwargs by @jgreer013 in https://github.com/oumi-ai/oumi/pull/1496
[Evaluation] Adding support for logging model samples for all backends by @kaisopos in https://github.com/oumi-ai/oumi/pull/1499
Support for deprecated input param (evaluation_platform) by @kaisopos in https://github.com/oumi-ai/oumi/pull/1500
Limiting the AlpacaEval number of samples for quickstart by @kaisopos in https://github.com/oumi-ai/oumi/pull/1501
Add recurring tests to keep our test badges updated. by @taenin in https://github.com/oumi-ai/oumi/pull/1498
Add a schedule for our GPU, CPU, and doc tests by @taenin in https://github.com/oumi-ai/oumi/pull/1503
Update the GPU Tests badge to use results from main by @taenin in https://github.com/oumi-ai/oumi/pull/1504
vLLM version increment by @nikg4 in https://github.com/oumi-ai/oumi/pull/1502
Minor logging improvements by @nikg4 in https://github.com/oumi-ai/oumi/pull/1505
[Evaluation] Save Utils: Moving, fixes, and unit tests by @kaisopos in https://github.com/oumi-ai/oumi/pull/1506
Update sample GRPO script to validate num_generations by @nikg4 in https://github.com/oumi-ai/oumi/pull/1509
Resolve warning about --dispatch batches deprecated param by @nikg4 in https://github.com/oumi-ai/oumi/pull/1510
[Evaluation] Re-enabling evaluations with Math Hard (leaderboard_math_hard) by @kaisopos in https://github.com/oumi-ai/oumi/pull/1511
Update docker image and build script by @oelachqar in https://github.com/oumi-ai/oumi/pull/1508
Add Qwen QwQ Lora config by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1514
Add QwQ eval/infer configs by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1515
[Evaluation] Instantiating an inference engine (if needed) when running custom evaluations by @kaisopos in https://github.com/oumi-ai/oumi/pull/1513
Switch eval yaml configs to use evaluation_platform by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1516
Mark BaseMapDataset as typing.Sized by @nikg4 in https://github.com/oumi-ai/oumi/pull/1517
VLM collator refactor by @nikg4 in https://github.com/oumi-ai/oumi/pull/1512

Full Changelog: https://github.com/oumi-ai/oumi/compare/v0.1.7...v0.1.8

- Python
Published by kaisopos over 1 year ago

oumi - v0.1.7

What's Changed

Update the RemoteInferenceEngine to appropriately handle openai format batch prediction endpoints. by @taenin in https://github.com/oumi-ai/oumi/pull/1472
Fix local models to not break the registry. by @taenin in https://github.com/oumi-ai/oumi/pull/1476
Create an inference config for Claude Sonnet 3.7 by @taenin in https://github.com/oumi-ai/oumi/pull/1479
Add notebook for fine-tuning MiniMath-R1-1.5B by @jgreer013 in https://github.com/oumi-ai/oumi/pull/1480
[Evaluation] Migrate LM Harness integration point from simple_evaluate to evaluate by @kaisopos in https://github.com/oumi-ai/oumi/pull/1455
[tiny]Update trl to 0.14 by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1478

Full Changelog: https://github.com/oumi-ai/oumi/compare/v0.1.6...v0.1.7

- Python
Published by taenin over 1 year ago

oumi - v0.1.6

What's Changed

Update RemoteParams to no longer require an API URL. by @taenin in https://github.com/oumi-ai/oumi/pull/1452
[Tiny] Update default training params for Qwen2-VL-2B-Instruct by @optas in https://github.com/oumi-ai/oumi/pull/1454
[Tiny] Add more warnings for "special" requirements of Qwen2.5-VL by @optas in https://github.com/oumi-ai/oumi/pull/1453
Minor cleanup of oumi fetch by @taenin in https://github.com/oumi-ai/oumi/pull/1463
Support for multi-image VLM training by @nikg4 in https://github.com/oumi-ai/oumi/pull/1448
Remove a temp workaround in pad_sequences on the left side by @nikg4 in https://github.com/oumi-ai/oumi/pull/1464
[tiny] Add warning that Oumi doesn't support Intel Macs by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1467
VLM-related logging improvements by @nikg4 in https://github.com/oumi-ai/oumi/pull/1469
Fix Oumi launcher to be able to run on RunPod and Lambda by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1470
Enable pre-release install for uv in pyproject.toml by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1466

Full Changelog: https://github.com/oumi-ai/oumi/compare/v0.1.5...v0.1.6

- Python
Published by wizeng23 over 1 year ago

oumi - v0.1.5

What's Changed

Fix the remainder of our configs by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1356
Adopt new Llama 3.1 HF names by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1357
Define OUMI_USE_SPOT_VM env var and start using it to override use_spot param by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1359
Support HuggingFaceM4/Docmatix dataset by @vishwamartur in https://github.com/oumi-ai/oumi/pull/1342
[nit] update default issue names by @oelachqar in https://github.com/oumi-ai/oumi/pull/1367
Update sft_datasets.md by @penfever in https://github.com/oumi-ai/oumi/pull/1349
Have GitHub Trending image hyperlink to GitHub Trending page by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1370
Update the link for the trending banner. by @taenin in https://github.com/oumi-ai/oumi/pull/1371
Move code to disable caching in model.config to a helper function by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1378
Update transformers version to 4.48 by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1372
Update notebooks to improve their Colab experience by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1380
Add proper labels and types to new Bugs and Feature Requests. by @taenin in https://github.com/oumi-ai/oumi/pull/1383
Upgrade omegaconf to 2.4.0dev3 by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1384
Support HuggingFaceM4/the_cauldron dataset by @vishwamartur in https://github.com/oumi-ai/oumi/pull/1366
Update our FAQ for tips about installing oumi on Windows by @taenin in https://github.com/oumi-ai/oumi/pull/1385
Cleanup HuggingFaceM4/Docmatix and HuggingFaceM4/the_cauldron multimodal datasets by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1387
Remove uneeded env vars from job configs by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1390
Remove transformer version override for HuggingFaceTB/SmolVLM-Instruct in launcher script by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1388
[Small Refactor] Moving the inference engine def outside the inference config by @kaisopos in https://github.com/oumi-ai/oumi/pull/1395
Evaluation - LM Harness: Adding vLLM support by @kaisopos in https://github.com/oumi-ai/oumi/pull/1379
Remove Docmatix dataset references from docstrings VLM config examples by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1397
Fixed broken link in Oumi - A Tour.ipynb notebook by @ciaralema in https://github.com/oumi-ai/oumi/pull/1398
Fix broken links in notebooks. by @taenin in https://github.com/oumi-ai/oumi/pull/1402
Create a client for communicating with a Slurm node via SSH. by @taenin in https://github.com/oumi-ai/oumi/pull/1389
[tiny] Remove references to missing job configs in README by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1404
Train+Inference with Qwen 2.5 VL (3B) by @optas in https://github.com/oumi-ai/oumi/pull/1396
Add a Slurm cluster and cloud to the oumi launcher. by @taenin in https://github.com/oumi-ai/oumi/pull/1406
Move pretokenize script from scripts/pretokenize/ to scripts/datasets/pretokenize/ by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1412
Create a script to save Conversation-s from SFT datasets into .jsonl file by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1413
[Evaluation] LM Harness refactor by @kaisopos in https://github.com/oumi-ai/oumi/pull/1410
Update save_conversations tool by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1421
[SambaNova] Integrate SambaNova Systems to oumi inference by @ctseng777 in https://github.com/oumi-ai/oumi/pull/1415
[Μinor] Equating Qwen's 2.5 chat-template to version's 2.0 by @optas in https://github.com/oumi-ai/oumi/pull/1419
Add requirements header to configs and clean them up by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1411
Updated oumi infer to support CLI argument for system prompt by @Spaarsh in https://github.com/oumi-ai/oumi/pull/1422
[Evaluation] LM Harness remote server support by @kaisopos in https://github.com/oumi-ai/oumi/pull/1414
[Feature] Add Tulu3 SFT Mixture Dataset Support by @bwalshe in https://github.com/oumi-ai/oumi/pull/1381
Support Multimodal inference with multiple images and PDF-s in NATIVE engine by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1424
Update notebooks to run on Colab by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1423
Add calm recipe. by @taenin in https://github.com/oumi-ai/oumi/pull/1425
Update VLM sample oumi infer -i commands by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1428
Provide example show to start SGLang server using Docker by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1429
Multi-image support in SGLang inference engine by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1426
Calm readme by @emrecanacikgoz in https://github.com/oumi-ai/oumi/pull/1432
WildChat-50M Reproduction by @penfever in https://github.com/oumi-ai/oumi/pull/1433
Add WildChat support by @penfever in https://github.com/oumi-ai/oumi/pull/1348
Create pad_to_max_dim_and_stack() function in torch_utils by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1435
use deterministic by @penfever in https://github.com/oumi-ai/oumi/pull/1434
Additional HF trainer parameters for config by @penfever in https://github.com/oumi-ai/oumi/pull/1436
Set a better default for vllm inference GPU usage. by @taenin in https://github.com/oumi-ai/oumi/pull/1437
Added fetch command and modified infer command to resolve oumi:// by @Spaarsh in https://github.com/oumi-ai/oumi/pull/1439
Require an inference config for oumi infer. by @taenin in https://github.com/oumi-ai/oumi/pull/1443
Make the tulu3 unit tests hermetic. by @taenin in https://github.com/oumi-ai/oumi/pull/1446
Add 2 more sample PDF-s with 1 and 2 pages under testdata/pdfs by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1427
Enable ability to override list values in config via CLI by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1430
Renamed CALM to CoALM by @jgreer013 in https://github.com/oumi-ai/oumi/pull/1450
Add support for Docmatix dataset to multimodal training script by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1449
Update oumi launch status to show clusters with no running jobs. by @taenin in https://github.com/oumi-ai/oumi/pull/1451

New Contributors

@vishwamartur made their first contribution in https://github.com/oumi-ai/oumi/pull/1342
@ctseng777 made their first contribution in https://github.com/oumi-ai/oumi/pull/1415
@Spaarsh made their first contribution in https://github.com/oumi-ai/oumi/pull/1422
@bwalshe made their first contribution in https://github.com/oumi-ai/oumi/pull/1381
@emrecanacikgoz made their first contribution in https://github.com/oumi-ai/oumi/pull/1432

Full Changelog: https://github.com/oumi-ai/oumi/compare/v0.1.4...v0.1.5

- Python
Published by wizeng23 over 1 year ago

oumi - v0.1.4

What's Changed

Add memory cleanup calls in e2e integration tests by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1277
Set up versioning for our documentation by @taenin in https://github.com/oumi-ai/oumi/pull/1275
Make qwen2-VL evaluation job pass by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1278
Add multi-modal (vlm) notebook with Llama 11B by @optas in https://github.com/oumi-ai/oumi/pull/1258
Documentation: Inference -> List supported models by @kaisopos in https://github.com/oumi-ai/oumi/pull/1279
[tiny] update website link by @oelachqar in https://github.com/oumi-ai/oumi/pull/1280
Update all documentation links to the new doc URL by @taenin in https://github.com/oumi-ai/oumi/pull/1281
Update Oumi - A Tour.ipynb by @brragorn in https://github.com/oumi-ai/oumi/pull/1282
Documentation: Judge (minor edits) by @kaisopos in https://github.com/oumi-ai/oumi/pull/1283
Fix citation by @oelachqar in https://github.com/oumi-ai/oumi/pull/1285
Add Deepseek R1 1.5B/32B configs by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1276
Misc eval configs cleanup by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1286
[docs] Describe parallel evaluation by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1284
Update microsoft/Phi-3-vision-128k-instruct training config by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1287
Add Together Deepseek R1 inference config by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1289
[minor] vlm notebook minor updates (doc referencing, freeze visual backbone) by @optas in https://github.com/oumi-ai/oumi/pull/1288
Add missing -m oumi evaluate argument in eval config by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1291
[docs] Add more references to VL-SFT and SFT notebooks by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1293
Eval config change for deepseek-ai/DeepSeek-R1-Distill-Llama-70B by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1292
[notebooks] Update intro & installation instruction by @oelachqar in https://github.com/oumi-ai/oumi/pull/1294
Update notebook intros by @oelachqar in https://github.com/oumi-ai/oumi/pull/1296
[notebooks] Update installation instructions for colab by @oelachqar in https://github.com/oumi-ai/oumi/pull/1297
Add Apache license header to src/oumi/**/*.py by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1290
Minor updates to VLM Multimodal notebook by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1299
[docs] Add latest notebooks and update references by @oelachqar in https://github.com/oumi-ai/oumi/pull/1300
[tiny] Add docs auto-generated .rst files to gitignore by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1298
[tiny] use GitHub link for header by @oelachqar in https://github.com/oumi-ai/oumi/pull/1301
[docs][tiny] update inference engines reference by @oelachqar in https://github.com/oumi-ai/oumi/pull/1302
Update README/docs to add new DeepSeek models by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1304
[docs] Use pip install oumi over pip install . by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1305
Tune VLM SFT configs by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1306
Tune VLM configs for SmolVLM and Qwen2-VL by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1307
Update config/notebook pip installs to use PyPI by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1308
[tiny] upgrade torch version by @oelachqar in https://github.com/oumi-ai/oumi/pull/1295
Update logging and unit tests related to chat templates by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1311
fix(docs): "interested by joining" to "interested in joining" by @CharlesCNorton in https://github.com/oumi-ai/oumi/pull/1312
Add HF_TOKEN instructions to Oumi Multimodal notebook by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1313
Update configuration.md by @penfever in https://github.com/oumi-ai/oumi/pull/1314
remove duplicate keys in config example by @lucyknada in https://github.com/oumi-ai/oumi/pull/1315
[Notebooks] Update VLM notebook by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1317
Update parasailinferenceengine.py by @jgreer013 in https://github.com/oumi-ai/oumi/pull/1320
Fix typo and update warning message for OUMI trainer by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1319
[Notebooks] Add a note that a notebook kernel restart may be needed after pip install oumi by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1318
Update Phi3 to support multiple images by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1321
Add more detailed comment headers to YAML configs by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1310
[Notebooks] Add a note to Tour notebook to restart kernel after the first pip install by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1327
Tweak --mem-fraction-static param in sample SGLang configs by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1328
Disallow using DatasetParams field names as keys in DatasetParams.dataset_kwargs by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1324
Support dataset_name_override dataset_kwarg by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1188
Add an util and a test marker for HF token by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1329
Update llama3-instruct chat template to align with the original models template by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1326
chore: update launcher.sh by @eltociear in https://github.com/oumi-ai/oumi/pull/1333
[Notebooks] Minor improvements in VLM and CNN notebooks by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1335
Update VLM cluster names in sample commands by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1336
Update our README and docs with the github trending badge. by @taenin in https://github.com/oumi-ai/oumi/pull/1340
Update README.md - Add DeepSeek to supported models by @mkoukoumidis in https://github.com/oumi-ai/oumi/pull/1343
Update index.md - Add DeepSeek to supported models by @mkoukoumidis in https://github.com/oumi-ai/oumi/pull/1344
Update "GPU Tests" status badge in README page by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1345

New Contributors

@CharlesCNorton made their first contribution in https://github.com/oumi-ai/oumi/pull/1312
@lucyknada made their first contribution in https://github.com/oumi-ai/oumi/pull/1315
@eltociear made their first contribution in https://github.com/oumi-ai/oumi/pull/1333

Full Changelog: https://github.com/oumi-ai/oumi/compare/v0.1.3...v0.1.4

- Python
Published by oelachqar over 1 year ago

oumi - v0.1.3

What's Changed

Documentation: Judge | Custom Model page by @kaisopos in https://github.com/oumi-ai/oumi/pull/1195
[WIP] Add a notebook for using CNN with custom dataset by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1196
[Cherrypick for launch] Evaluate: return dict of results by @kaisopos in https://github.com/oumi-ai/oumi/pull/1197
Configs Train/Infer/Eval and Llama 3.3v (70b) by @optas in https://github.com/oumi-ai/oumi/pull/1200
Adding an integration test for evaluation fn's output (see PR-1197) by @kaisopos in https://github.com/oumi-ai/oumi/pull/1199
[docs] Add more details and cross-references related to customization by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1198
Define single_gpu test marker by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1201
Native inference: Don't set min_p, temperature in GenerationConfig if sampling is disabled by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1202
Update tests to make them runnable on GCP by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1203
Add newline before pformat(train_config) by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1204
GCP tests launcher script changes by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1205
[Evaluation] Bug: serialization by @kaisopos in https://github.com/oumi-ai/oumi/pull/1207
[docs] Add inference snippet for together.ai and DeepSeek APIs by @oelachqar in https://github.com/oumi-ai/oumi/pull/1208
Exclude multi_gpu tests from GitHub GPU tests by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1210
Update e2e tests to support multi-GPU machines by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1206
Add wrappers for remote inference engines by @oelachqar in https://github.com/oumi-ai/oumi/pull/1209
Vision-Lang & Inference (including LoRA) by @optas in https://github.com/oumi-ai/oumi/pull/1174
[BugFix] Throw a runtime error for quantized models & inference=VLLM by @kaisopos in https://github.com/oumi-ai/oumi/pull/1212
Fix most job configs by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1213
e2e tests update by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1216
[Notebook] Evaluation with Oumi by @kaisopos in https://github.com/oumi-ai/oumi/pull/1218
gpt2: move include_performance_metrics param from script to yaml by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1217
Simplify inference engine API by @oelachqar in https://github.com/oumi-ai/oumi/pull/1214
Move configs to experimental by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1215
[docs] Update index page by @oelachqar in https://github.com/oumi-ai/oumi/pull/1220
Update ConsoleLogger to write to STDOUT by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1221
Set use_spot to False in our JobConfigs by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1222
Delete oumi[optional] install target by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1224
Scaffolding and the first testcase for e2e evaluation tests by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1225
[docs] Update inference engines doc page by @oelachqar in https://github.com/oumi-ai/oumi/pull/1227
Clean-up inference engine builder by @oelachqar in https://github.com/oumi-ai/oumi/pull/1226
[VLLM Engine] Enabling BitsAndBytes quantization by @kaisopos in https://github.com/oumi-ai/oumi/pull/1223
Add example distillation notebook by @jgreer013 in https://github.com/oumi-ai/oumi/pull/1228
Add a script to pre-download models for gpu_tests by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1231
Fix multi-GPU inference integration test by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1229
[tiny][docs] Update PEFT/LoRA content by @optas in https://github.com/oumi-ai/oumi/pull/1233
[BugFix] GGUF does not work with VLLM by @kaisopos in https://github.com/oumi-ai/oumi/pull/1232
Re-enable parallel evaluation for VLM-s by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1235
Add multimodal exemplar dataset in our provided mini-datasets by @optas in https://github.com/oumi-ai/oumi/pull/1234
[Tiny] renaming a field name (init_lora_weights) by @optas in https://github.com/oumi-ai/oumi/pull/1236
Add more e2e evaluation tests by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1237
Fix pyright breakage when vllm and llama_cpp are not installed by @taenin in https://github.com/oumi-ai/oumi/pull/1240
Update our oumi launch documentation. by @taenin in https://github.com/oumi-ai/oumi/pull/1239
Update index.md title for "Join the Community!" by @mkoukoumidis in https://github.com/oumi-ai/oumi/pull/1242
Update quickstart.md - nit for Oumi support request by @mkoukoumidis in https://github.com/oumi-ai/oumi/pull/1241
[VLLM Engine] Improve support for GGUF models (incl. auto-download) by @kaisopos in https://github.com/oumi-ai/oumi/pull/1238
Update README.md title to "Join the Community!" by @mkoukoumidis in https://github.com/oumi-ai/oumi/pull/1243
Update quickstart.md by @brragorn in https://github.com/oumi-ai/oumi/pull/1251
Update quickstart.md by @brragorn in https://github.com/oumi-ai/oumi/pull/1253
Update quickstart.md by @brragorn in https://github.com/oumi-ai/oumi/pull/1252
Update quickstart.md by @brragorn in https://github.com/oumi-ai/oumi/pull/1250
[Minor refactor] Moving model caching to oumi.utils by @kaisopos in https://github.com/oumi-ai/oumi/pull/1246
Add more details to troubleshooting FAQ by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1249
Update training_methods.md - Change compute requirement suggestions by @mkoukoumidis in https://github.com/oumi-ai/oumi/pull/1245
Update train.md - nit description change by @mkoukoumidis in https://github.com/oumi-ai/oumi/pull/1244
[docs] misc docs feedback by @oelachqar in https://github.com/oumi-ai/oumi/pull/1248
[tiny] Qwen2-VL activate experimental datapipes by @optas in https://github.com/oumi-ai/oumi/pull/1247
Update Oumi - A Tour.ipynb by @brragorn in https://github.com/oumi-ai/oumi/pull/1254
[docs] more docs feedback by @oelachqar in https://github.com/oumi-ai/oumi/pull/1255
Update supported_models.md by @penfever in https://github.com/oumi-ai/oumi/pull/1256
Rename experimental_use_torch_datapipes data param by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1257
Add pypi release workflow using testpypi by @oelachqar in https://github.com/oumi-ai/oumi/pull/1259
Update workflow names by @oelachqar in https://github.com/oumi-ai/oumi/pull/1262
Update default idleminutesto_autostop to 1 hour. by @taenin in https://github.com/oumi-ai/oumi/pull/1264
update pypi release workflow to use trusted env by @oelachqar in https://github.com/oumi-ai/oumi/pull/1265
Add padding_side param to internal model config by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1260
Documentation: Updates on Evaluation/Judge (based on Manos' feedback) by @kaisopos in https://github.com/oumi-ai/oumi/pull/1261
[tiny] less strict requirements by @oelachqar in https://github.com/oumi-ai/oumi/pull/1266
Add Deepseek R1 Distill Llama 8B/70B configs by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1263
Update index.md to highlight beta stage by @mkoukoumidis in https://github.com/oumi-ai/oumi/pull/1268
Update README.md to highlight beta stage by @mkoukoumidis in https://github.com/oumi-ai/oumi/pull/1267
Disable pre-release packages by @oelachqar in https://github.com/oumi-ai/oumi/pull/1270
Update common_workflows.md - Clarify OpenAI is just an example by @mkoukoumidis in https://github.com/oumi-ai/oumi/pull/1271
Documentation: Evaluation page (update to highlight multi-modal) by @kaisopos in https://github.com/oumi-ai/oumi/pull/1269
Update launch.md by @taenin in https://github.com/oumi-ai/oumi/pull/1272
Add pypi release workflow by @oelachqar in https://github.com/oumi-ai/oumi/pull/1273
Documentation: Judge | minor edit (bold) by @kaisopos in https://github.com/oumi-ai/oumi/pull/1274

Full Changelog: https://github.com/oumi-ai/oumi/compare/v0.1.2...v0.1.3

- Python
Published by oelachqar over 1 year ago

oumi - v0.1.2.3.1

What's Changed

Re-enable parallel evaluation for VLM-s by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1235
Add multimodal exemplar dataset in our provided mini-datasets by @optas in https://github.com/oumi-ai/oumi/pull/1234
[Tiny] renaming a field name (init_lora_weights) by @optas in https://github.com/oumi-ai/oumi/pull/1236
Add more e2e evaluation tests by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1237
Fix pyright breakage when vllm and llama_cpp are not installed by @taenin in https://github.com/oumi-ai/oumi/pull/1240
Update our oumi launch documentation. by @taenin in https://github.com/oumi-ai/oumi/pull/1239
Update index.md title for "Join the Community!" by @mkoukoumidis in https://github.com/oumi-ai/oumi/pull/1242
Update quickstart.md - nit for Oumi support request by @mkoukoumidis in https://github.com/oumi-ai/oumi/pull/1241
[VLLM Engine] Improve support for GGUF models (incl. auto-download) by @kaisopos in https://github.com/oumi-ai/oumi/pull/1238
Update README.md title to "Join the Community!" by @mkoukoumidis in https://github.com/oumi-ai/oumi/pull/1243
Update quickstart.md by @brragorn in https://github.com/oumi-ai/oumi/pull/1251
Update quickstart.md by @brragorn in https://github.com/oumi-ai/oumi/pull/1253
Update quickstart.md by @brragorn in https://github.com/oumi-ai/oumi/pull/1252
Update quickstart.md by @brragorn in https://github.com/oumi-ai/oumi/pull/1250
[Minor refactor] Moving model caching to oumi.utils by @kaisopos in https://github.com/oumi-ai/oumi/pull/1246
Add more details to troubleshooting FAQ by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1249
Update training_methods.md - Change compute requirement suggestions by @mkoukoumidis in https://github.com/oumi-ai/oumi/pull/1245
Update train.md - nit description change by @mkoukoumidis in https://github.com/oumi-ai/oumi/pull/1244
[docs] misc docs feedback by @oelachqar in https://github.com/oumi-ai/oumi/pull/1248
[tiny] Qwen2-VL activate experimental datapipes by @optas in https://github.com/oumi-ai/oumi/pull/1247
Update Oumi - A Tour.ipynb by @brragorn in https://github.com/oumi-ai/oumi/pull/1254
[docs] more docs feedback by @oelachqar in https://github.com/oumi-ai/oumi/pull/1255
Update supported_models.md by @penfever in https://github.com/oumi-ai/oumi/pull/1256
Rename experimental_use_torch_datapipes data param by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1257
Add pypi release workflow using testpypi by @oelachqar in https://github.com/oumi-ai/oumi/pull/1259
Update workflow names by @oelachqar in https://github.com/oumi-ai/oumi/pull/1262
Update default idleminutesto_autostop to 1 hour. by @taenin in https://github.com/oumi-ai/oumi/pull/1264
update pypi release workflow to use trusted env by @oelachqar in https://github.com/oumi-ai/oumi/pull/1265
Add padding_side param to internal model config by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1260
Documentation: Updates on Evaluation/Judge (based on Manos' feedback) by @kaisopos in https://github.com/oumi-ai/oumi/pull/1261
[tiny] less strict requirements by @oelachqar in https://github.com/oumi-ai/oumi/pull/1266
Add Deepseek R1 Distill Llama 8B/70B configs by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1263

Full Changelog: https://github.com/oumi-ai/oumi/compare/v0.1.2.2...v0.1.2.3.1

- Python
Published by oelachqar over 1 year ago

oumi - v0.1.2.3

What's Changed

Re-enable parallel evaluation for VLM-s by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1235
Add multimodal exemplar dataset in our provided mini-datasets by @optas in https://github.com/oumi-ai/oumi/pull/1234
[Tiny] renaming a field name (init_lora_weights) by @optas in https://github.com/oumi-ai/oumi/pull/1236
Add more e2e evaluation tests by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1237
Fix pyright breakage when vllm and llama_cpp are not installed by @taenin in https://github.com/oumi-ai/oumi/pull/1240
Update our oumi launch documentation. by @taenin in https://github.com/oumi-ai/oumi/pull/1239
Update index.md title for "Join the Community!" by @mkoukoumidis in https://github.com/oumi-ai/oumi/pull/1242
Update quickstart.md - nit for Oumi support request by @mkoukoumidis in https://github.com/oumi-ai/oumi/pull/1241
[VLLM Engine] Improve support for GGUF models (incl. auto-download) by @kaisopos in https://github.com/oumi-ai/oumi/pull/1238
Update README.md title to "Join the Community!" by @mkoukoumidis in https://github.com/oumi-ai/oumi/pull/1243
Update quickstart.md by @brragorn in https://github.com/oumi-ai/oumi/pull/1251
Update quickstart.md by @brragorn in https://github.com/oumi-ai/oumi/pull/1253
Update quickstart.md by @brragorn in https://github.com/oumi-ai/oumi/pull/1252
Update quickstart.md by @brragorn in https://github.com/oumi-ai/oumi/pull/1250
[Minor refactor] Moving model caching to oumi.utils by @kaisopos in https://github.com/oumi-ai/oumi/pull/1246
Add more details to troubleshooting FAQ by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1249
Update training_methods.md - Change compute requirement suggestions by @mkoukoumidis in https://github.com/oumi-ai/oumi/pull/1245
Update train.md - nit description change by @mkoukoumidis in https://github.com/oumi-ai/oumi/pull/1244
[docs] misc docs feedback by @oelachqar in https://github.com/oumi-ai/oumi/pull/1248
[tiny] Qwen2-VL activate experimental datapipes by @optas in https://github.com/oumi-ai/oumi/pull/1247
Update Oumi - A Tour.ipynb by @brragorn in https://github.com/oumi-ai/oumi/pull/1254
[docs] more docs feedback by @oelachqar in https://github.com/oumi-ai/oumi/pull/1255
Update supported_models.md by @penfever in https://github.com/oumi-ai/oumi/pull/1256
Rename experimental_use_torch_datapipes data param by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1257
Add pypi release workflow using testpypi by @oelachqar in https://github.com/oumi-ai/oumi/pull/1259
Update workflow names by @oelachqar in https://github.com/oumi-ai/oumi/pull/1262
Update default idleminutesto_autostop to 1 hour. by @taenin in https://github.com/oumi-ai/oumi/pull/1264
update pypi release workflow to use trusted env by @oelachqar in https://github.com/oumi-ai/oumi/pull/1265

Full Changelog: https://github.com/oumi-ai/oumi/compare/v0.1.2.2...v0.1.2.3

- Python
Published by oelachqar over 1 year ago

oumi - v0.1.2.2

What's Changed

Configs Train/Infer/Eval and Llama 3.3v (70b) by @optas in https://github.com/oumi-ai/oumi/pull/1200
Adding an integration test for evaluation fn's output (see PR-1197) by @kaisopos in https://github.com/oumi-ai/oumi/pull/1199
[docs] Add more details and cross-references related to customization by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1198
Define single_gpu test marker by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1201
Native inference: Don't set min_p, temperature in GenerationConfig if sampling is disabled by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1202
Update tests to make them runnable on GCP by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1203
Add newline before pformat(train_config) by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1204
GCP tests launcher script changes by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1205
[Evaluation] Bug: serialization by @kaisopos in https://github.com/oumi-ai/oumi/pull/1207
[docs] Add inference snippet for together.ai and DeepSeek APIs by @oelachqar in https://github.com/oumi-ai/oumi/pull/1208
Exclude multi_gpu tests from GitHub GPU tests by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1210
Update e2e tests to support multi-GPU machines by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1206
Add wrappers for remote inference engines by @oelachqar in https://github.com/oumi-ai/oumi/pull/1209
Vision-Lang & Inference (including LoRA) by @optas in https://github.com/oumi-ai/oumi/pull/1174
[BugFix] Throw a runtime error for quantized models & inference=VLLM by @kaisopos in https://github.com/oumi-ai/oumi/pull/1212
Fix most job configs by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1213
e2e tests update by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1216
[Notebook] Evaluation with Oumi by @kaisopos in https://github.com/oumi-ai/oumi/pull/1218
gpt2: move include_performance_metrics param from script to yaml by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1217
Simplify inference engine API by @oelachqar in https://github.com/oumi-ai/oumi/pull/1214
Move configs to experimental by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1215
[docs] Update index page by @oelachqar in https://github.com/oumi-ai/oumi/pull/1220
Update ConsoleLogger to write to STDOUT by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1221
Set use_spot to False in our JobConfigs by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1222
Delete oumi[optional] install target by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1224
Scaffolding and the first testcase for e2e evaluation tests by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1225
[docs] Update inference engines doc page by @oelachqar in https://github.com/oumi-ai/oumi/pull/1227
Clean-up inference engine builder by @oelachqar in https://github.com/oumi-ai/oumi/pull/1226
[VLLM Engine] Enabling BitsAndBytes quantization by @kaisopos in https://github.com/oumi-ai/oumi/pull/1223
Add example distillation notebook by @jgreer013 in https://github.com/oumi-ai/oumi/pull/1228
Add a script to pre-download models for gpu_tests by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1231
Fix multi-GPU inference integration test by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1229
[tiny][docs] Update PEFT/LoRA content by @optas in https://github.com/oumi-ai/oumi/pull/1233
[BugFix] GGUF does not work with VLLM by @kaisopos in https://github.com/oumi-ai/oumi/pull/1232

Full Changelog: https://github.com/oumi-ai/oumi/compare/v0.1.2.1...v0.1.2.2

- Python
Published by oelachqar over 1 year ago

oumi - v0.1.2.1

What's Changed

[docs] Make absolute links local in our Launch + Contributing Docs by @taenin in https://github.com/oumi-ai/oumi/pull/1152
Fix broken ruff precommit. by @taenin in https://github.com/oumi-ai/oumi/pull/1153
Documentation: Judge | 1st revision (of many) by @kaisopos in https://github.com/oumi-ai/oumi/pull/1148
[docs] Put -m oumi train on single line by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1154
[docs] Add a description to the oumi env command. by @taenin in https://github.com/oumi-ai/oumi/pull/1155
[docs] Update the list of commands on Core Concepts page by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1156
Add a comment to Omegaconf yaml files with config info by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1149
Update training_methods.md by @jgreer013 in https://github.com/oumi-ai/oumi/pull/1159
Update train.md by @jgreer013 in https://github.com/oumi-ai/oumi/pull/1158
[docs] Improve docs for custom multimodal SFT datasets by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1157
Update quickstart.md Removed H1 emoji by @brragorn in https://github.com/oumi-ai/oumi/pull/1160
Update style_guide.md by @brragorn in https://github.com/oumi-ai/oumi/pull/1162
Exclude unit/integration tests explicitly marked as e2e from GitHub runs by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1164
Update train.yaml by @jgreer013 in https://github.com/oumi-ai/oumi/pull/1167
Define e2e_eternal test marker by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1169
Bugfixes in tests/integration/datasets/test_sft_vision_datasets_load_datasets.py by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1166
Update contributing.md by @brragorn in https://github.com/oumi-ai/oumi/pull/1161
Removed unnecessary cloud.md which is already covered in depth in another section. by @jgreer013 in https://github.com/oumi-ai/oumi/pull/1168
Update GitHub actions markers expression by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1171
Add MNIST as Vision SFT dataset by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1170
Documentation: Judge | Judge (main) page by @kaisopos in https://github.com/oumi-ai/oumi/pull/1165
[Documentation] Evaluation pages: fixing broken links by @kaisopos in https://github.com/oumi-ai/oumi/pull/1172
Documentation: Judge | Built-In page by @kaisopos in https://github.com/oumi-ai/oumi/pull/1173
OUMI Trainer: Support non-text models by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1175
OUMI Trainer: Improve handling of num_training_epochs by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1176
Update readme by @oelachqar in https://github.com/oumi-ai/oumi/pull/1177
Training vlm llama 90b by @optas in https://github.com/oumi-ai/oumi/pull/1179
Documentation: Judge | Custom Prompts page by @kaisopos in https://github.com/oumi-ai/oumi/pull/1183
[docs] fix misc broken links by @oelachqar in https://github.com/oumi-ai/oumi/pull/1186
Fixing judge's broken doc links by @kaisopos in https://github.com/oumi-ai/oumi/pull/1185
OUMI Trainer: Improve handling of num_training_epochs and update _get_total_training_steps() by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1178
[docs] Add initial page how to define datasets for custom data types by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1182
Add CnnClassifier custom model by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1180
Define default_dataset property for all pretraining datasets by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1189
[tiny] Raise explicit error for models not supporting infer by @oelachqar in https://github.com/oumi-ai/oumi/pull/1187
[tests] Replace external models & datasets to make unit tests hermetic by @oelachqar in https://github.com/oumi-ai/oumi/pull/1163
[tiny] Add error message to bare exceptions by @oelachqar in https://github.com/oumi-ai/oumi/pull/1190
Bugfix in CnnClassifier example by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1191
Move data mixtures tests to unit tests by @oelachqar in https://github.com/oumi-ai/oumi/pull/1192
[docs] Resolve PyRight issues in NpzDataset in custom dataset example by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1194
Add examples of Alpaca and Oumi Dataset Formats by @optas in https://github.com/oumi-ai/oumi/pull/1184
Add integration test for CNNClassifier custom model by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1193

Full Changelog: https://github.com/oumi-ai/oumi/compare/v0.1.2-alpha...v0.1.2

- Python
Published by oelachqar over 1 year ago

oumi - v0.1.2.0-alpha

What's Changed

Update README.md - Better highlight features & nits by @mkoukoumidis in https://github.com/oumi-ai/oumi/pull/995
[tiny] update docstring and cleanup by @oelachqar in https://github.com/oumi-ai/oumi/pull/1006
Qwen2-VL: minor updates by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1000
Update README.md - Describe Oumi's most common capabilities by @mkoukoumidis in https://github.com/oumi-ai/oumi/pull/996
Fix readme. by @taenin in https://github.com/oumi-ai/oumi/pull/1009
Updated our ascii logo by @taenin in https://github.com/oumi-ai/oumi/pull/1008
[docs] Update readme by @oelachqar in https://github.com/oumi-ai/oumi/pull/1010
Cleanup scripts by @oelachqar in https://github.com/oumi-ai/oumi/pull/1011
Cleanup experimental folder by @oelachqar in https://github.com/oumi-ai/oumi/pull/1012
Update lists of supported VLM-s in README and docs by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1014
Freeze Python package versions by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1007
Update blip2's chat template to use the "default" one by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1015
Add docstrings how to start vLLM and SGLang servers for Llama-3.2-11B-Vision-Instruct by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1016
Evaluation: bugfixing, corner case, unit tests by @kaisopos in https://github.com/oumi-ai/oumi/pull/1003
Configure asyncio_default_fixture_loop_scope to reduce pytest warnings by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1013
Update the registry to load registered core values upon use. by @taenin in https://github.com/oumi-ai/oumi/pull/1017
Update default installation instructions to pypi by @taenin in https://github.com/oumi-ai/oumi/pull/1018
[tiny] Update debug datasets by @oelachqar in https://github.com/oumi-ai/oumi/pull/1020
[docs] Address misc docs feedback by @oelachqar in https://github.com/oumi-ai/oumi/pull/1019
[tiny] update pre-defined judges and docs by @oelachqar in https://github.com/oumi-ai/oumi/pull/1021
Parameterize e2e training test, and add config for Qwen2-VL by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1023
Remove our docs password from the readme. by @taenin in https://github.com/oumi-ai/oumi/pull/1024
VLM docs update by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1025
Fix loading registered pretrain datasets by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1005
Update @requires_gpus test decorator to optionally specify min GPU memory requirement by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1029
[tiny] Update GitHub workflows by @oelachqar in https://github.com/oumi-ai/oumi/pull/1034
Update BaseConfig.from_yaml to also support Path by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1026
[tiny] Cleanup judge engine builder & fix circular dep by @oelachqar in https://github.com/oumi-ai/oumi/pull/1035
Create GPU GitHub Actions workflow by @oelachqar in https://github.com/oumi-ai/oumi/pull/1004
Add structured outputs support to gemini/vertex engines by @oelachqar in https://github.com/oumi-ai/oumi/pull/1022
[docs] Fix feedback on training and inference user guides by @oelachqar in https://github.com/oumi-ai/oumi/pull/1037
[docs][tiny] fix examples in inference guide by @oelachqar in https://github.com/oumi-ai/oumi/pull/1038
Add a sanity test for circular imports. by @taenin in https://github.com/oumi-ai/oumi/pull/1030
Resolve circular dependencies in Oumi by @taenin in https://github.com/oumi-ai/oumi/pull/1039
Move our circular dependency test to e2e to speed up GPU CI tests. by @taenin in https://github.com/oumi-ai/oumi/pull/1040
Add custom inference engine for gemini API by @oelachqar in https://github.com/oumi-ai/oumi/pull/1036
Define CLI in our quickstart. by @taenin in https://github.com/oumi-ai/oumi/pull/1042
Skip running GPU tests on low-risk code paths by @oelachqar in https://github.com/oumi-ai/oumi/pull/1043
Define more terms in our training docs. by @taenin in https://github.com/oumi-ai/oumi/pull/1044
Fix the broken python text snippet on the train page. by @taenin in https://github.com/oumi-ai/oumi/pull/1045
Fix the second python snippet in the train page. by @taenin in https://github.com/oumi-ai/oumi/pull/1046
[docs] Add Gemini to the list of supported inference API-s, and sort them by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1048
Fix issues in most notebooks by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1047
[docs][tiny] remove termynal from sphinx conf by @oelachqar in https://github.com/oumi-ai/oumi/pull/1041
Fix a typo in the VS Code environment page. by @taenin in https://github.com/oumi-ai/oumi/pull/1049
Define WSL in our vscode docs. by @taenin in https://github.com/oumi-ai/oumi/pull/1052
[tiny] disable unit tests on safe paths by @oelachqar in https://github.com/oumi-ai/oumi/pull/1051
[docs] Fix contributing and open issue links by @oelachqar in https://github.com/oumi-ai/oumi/pull/1050
[evaluations/generative_benchmark] Broken link by @kaisopos in https://github.com/oumi-ai/oumi/pull/1054
Remove dangling reference to jupyter in Makefile help by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1053
[evaluations/generative_benchmark] Removing notebook link by @kaisopos in https://github.com/oumi-ai/oumi/pull/1055
Support constrained decoding in SGLang inference engine by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1032
[tiny] Update tutorials page by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1056
Minor updates to Launch.md by @taenin in https://github.com/oumi-ai/oumi/pull/1059
[docs] Update docs/user_guides/infer/infer.md by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1058
Nits for common_workflows.md by @mkoukoumidis in https://github.com/oumi-ai/oumi/pull/1061
Nit fixes for acknowledgements.md by @mkoukoumidis in https://github.com/oumi-ai/oumi/pull/1057
Add sample trouble shooting for remote jobs. by @taenin in https://github.com/oumi-ai/oumi/pull/1062
Add a Github Issues selector for questions and have it redirect to Discord. by @taenin in https://github.com/oumi-ai/oumi/pull/1064
Package checking: Adding functionality for checking package versioning and fast failing by @kaisopos in https://github.com/oumi-ai/oumi/pull/1031
Fix various typos in contributing.md by @taenin in https://github.com/oumi-ai/oumi/pull/1066
SGLang inference documentation by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1065
Replace assert in NativeInferenceEngine with RuntimeError by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1068
Update dev set up instructions to use a Fork. by @taenin in https://github.com/oumi-ai/oumi/pull/1067
Define inference configs for more models by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1069
[Evaluation] HF Leaderboards yaml files by @kaisopos in https://github.com/oumi-ai/oumi/pull/1071
Specify engine: NATIVE is inference configs by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1075
Improve handling of image path and URLs by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1074
[Doc > Quickstart] Should we add links to guides for better discoverability? by @kaisopos in https://github.com/oumi-ai/oumi/pull/1076
Add e2e tests for running tutorial notebooks by @oelachqar in https://github.com/oumi-ai/oumi/pull/1079
Ignore all experimental files when running our circular dependency test. by @taenin in https://github.com/oumi-ai/oumi/pull/1081
[Super Nit Doc Update] environments.md by @kaisopos in https://github.com/oumi-ai/oumi/pull/1082
Add an env var for loading user registered values (dataset, models, clouds) when initializing the Oumi Registry by @taenin in https://github.com/oumi-ai/oumi/pull/1077
Update internal model configs to support default tokenizer_pad_token and chat_template by model type by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1078
[Minor] Notebook typo by @kaisopos in https://github.com/oumi-ai/oumi/pull/1085
Upgrade transformers to 4.47 by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1033
[tiny][docs] Update recipes page by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1072
Configure e2e integration test for Llama 3.2 Vision 11B by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1086
Nits for cli_reference.md by @mkoukoumidis in https://github.com/oumi-ai/oumi/pull/1063
[Documentation] Evaluate | Leaderboards Page by @kaisopos in https://github.com/oumi-ai/oumi/pull/1084
[Documentation] Evaluate | Main Page (revision) by @kaisopos in https://github.com/oumi-ai/oumi/pull/1089
[tiny] Fix precommit by @oelachqar in https://github.com/oumi-ai/oumi/pull/1092
Add timeout for unit & integration tests by @oelachqar in https://github.com/oumi-ai/oumi/pull/1091
Add GitHub Actions workflow for doctests by @oelachqar in https://github.com/oumi-ai/oumi/pull/1093
[docs] remove unused page, fix links by @oelachqar in https://github.com/oumi-ai/oumi/pull/1094
[Documentation] Evaluate | Main Page (small refactor) by @kaisopos in https://github.com/oumi-ai/oumi/pull/1095
Rewrite of the main Oumi Launch page. by @taenin in https://github.com/oumi-ai/oumi/pull/1087
Remove pytest.mark.skip() for basic e2e tests by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1088
[tiny] Upgrade minimum numpy version to unblock python3.12 installation by @oelachqar in https://github.com/oumi-ai/oumi/pull/1099
Update our Readme with a new header image. by @taenin in https://github.com/oumi-ai/oumi/pull/1098
[docs] Minor refresh to dataset resource pages by @oelachqar in https://github.com/oumi-ai/oumi/pull/1097
[docs] Add docs guide page by @oelachqar in https://github.com/oumi-ai/oumi/pull/1096
Add a quick unit test to ensure new dependencies are not added to the top-level CLI by @taenin in https://github.com/oumi-ai/oumi/pull/1100
Updates distributed run to handle local machine scenarios by @jgreer013 in https://github.com/oumi-ai/oumi/pull/1028
[docs] Refresh datasets resource page by @oelachqar in https://github.com/oumi-ai/oumi/pull/1101
Update e2e integration tests to allow trainer_type overrides by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1090
Temporary fix for "processing_class" error for OUMI trainer by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1102
Update training logging by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1104
Vision-Language LoRA training by @optas in https://github.com/oumi-ai/oumi/pull/1027
Update the Deploy page to use CLI and python instructions. by @taenin in https://github.com/oumi-ai/oumi/pull/1106
Set rank/world_size/device_id= params in init_process_group() to resolve warnings by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1109
[Revert] Undo package upgrades including transformers 4.47 by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1111
Move Llama 3.2 Vision 11b SFT FFT configs into subdir by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1110
[docs] Update dataset formats page by @oelachqar in https://github.com/oumi-ai/oumi/pull/1112
[docs] Remove placeholder pages by @oelachqar in https://github.com/oumi-ai/oumi/pull/1113
[docs] address mic feedback from the team by @oelachqar in https://github.com/oumi-ai/oumi/pull/1114
Update quickstart.md by @penfever in https://github.com/oumi-ai/oumi/pull/1115
Minor updates to oumi env by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1116
Update oumi train path to set TOKENIZERS_PARALLELISM variable by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1117
Reorganize oumi distributed run logic by @oelachqar in https://github.com/oumi-ai/oumi/pull/1118
[docs] Update misc Multimodal docs by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1120
[docs] Add badges, smaller logos, and nit fix to notebook by @oelachqar in https://github.com/oumi-ai/oumi/pull/1121
Rename LLAMA 8b and 11B Vision inference REMOTE_VLLM configs by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1122
Fix a circular dependency when invoking the oumi src directory via Python. by @taenin in https://github.com/oumi-ai/oumi/pull/1119
[docs] Multimodal SFT datasets: minor updates by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1123
Fix all doctests by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1107
Cleanup legacy constraints on datasets by @oelachqar in https://github.com/oumi-ai/oumi/pull/1080
[tiny] skip alpaca_eval test if not installed by @oelachqar in https://github.com/oumi-ai/oumi/pull/1125
Remove target_col: from VLM configs by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1126
Update our documentation for custom clusters by @taenin in https://github.com/oumi-ai/oumi/pull/1128
Mark targetcol and useasync_dataset as deprecated by @oelachqar in https://github.com/oumi-ai/oumi/pull/1127
Update sample VLM eval config by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1129
[Documentation] Evaluate | Standardized Benchmarks page (refactor) by @kaisopos in https://github.com/oumi-ai/oumi/pull/1131
[docs] Add doc section for MMMU benchmark by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1130
Evaluation yaml configs: standardize task name by @kaisopos in https://github.com/oumi-ai/oumi/pull/1108
[doc] Oumi Models (part-1) by @optas in https://github.com/oumi-ai/oumi/pull/1083
Enable FSDP for Llama Vision with OUMI trainer by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1105
[docs] Add an entry for "Fine-tuning a Vision-Language Model" by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1132
[tiny] [docs] troubleshooting improvements by @optas in https://github.com/oumi-ai/oumi/pull/1070
[tiny] [docs] Cleanup touches on Quickstart by @optas in https://github.com/oumi-ai/oumi/pull/1133
VLM e2e integration tests update by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1138
Reduce batch size in Qwen2 VL eval config to reduce chances of OOM by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1136
Update readme, first iteration by @oelachqar in https://github.com/oumi-ai/oumi/pull/1135
Remove BLIP2 configs temporarily by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1140
[tiny] Looser pandas version requirements by @oelachqar in https://github.com/oumi-ai/oumi/pull/1141
Add a Customizing Oumi page to our docs by @taenin in https://github.com/oumi-ai/oumi/pull/1142
Upgrade wandb version by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1143
Get all notebooks to run by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1139
Add fineweb e2e integration test by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1146
Update quickstart.md with emoji by @michaellema in https://github.com/oumi-ai/oumi/pull/1147
Remove some notebook E2E tests by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1145
Updated notebooks and quickstart documentation by @penfever in https://github.com/oumi-ai/oumi/pull/1134
updated quickstart with clearer instructions about setting up cloud by @penfever in https://github.com/oumi-ai/oumi/pull/1151
add maxmemorypergpu and devicemap params to lm eval harness by @penfever in https://github.com/oumi-ai/oumi/pull/1150
Add generic prompt-response dataset and update finetuning notebook by @jgreer013 in https://github.com/oumi-ai/oumi/pull/1144
Review default values by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1124

New Contributors

@penfever made their first contribution in https://github.com/oumi-ai/oumi/pull/1115
@michaellema made their first contribution in https://github.com/oumi-ai/oumi/pull/1147

Full Changelog: https://github.com/oumi-ai/oumi/compare/v0.1.1-alpha...v0.1.2-alpha

- Python
Published by oelachqar over 1 year ago

oumi - v0.1.1.0-alpha.1

What's Changed

Minimal SkyPilot config for blip2 and llava models for GCP with TRL_SFT by @xrdaukar in https://github.com/oumi-ai/oumi/pull/573
Inference Engine async writes by @taenin in https://github.com/oumi-ai/oumi/pull/574
Misc cleanups in JsonlinesDataset by @xrdaukar in https://github.com/oumi-ai/oumi/pull/576
Split out cloud dependencies by @taenin in https://github.com/oumi-ai/oumi/pull/575
Disable sdpa for blip2 by @xrdaukar in https://github.com/oumi-ai/oumi/pull/579
Set accelerate version to fix FSDP model saving by @wizeng23 in https://github.com/oumi-ai/oumi/pull/580
Remove AWS as a default dep by @taenin in https://github.com/oumi-ai/oumi/pull/582
Update ProfilerParams docstrings to follow the new style by @xrdaukar in https://github.com/oumi-ai/oumi/pull/583
Minor updates in scripts/benchmarks/minimal_multimodal_training.py by @xrdaukar in https://github.com/oumi-ai/oumi/pull/585
Add @override annotations to methods of few Dataset subclasses by @xrdaukar in https://github.com/oumi-ai/oumi/pull/584
Add dataset class for dolly dataset by @oelachqar in https://github.com/oumi-ai/oumi/pull/586
Refactor debugging/device utils, and add new GPU stats measurement functions by @xrdaukar in https://github.com/oumi-ai/oumi/pull/587
Add text jsonlines dataset class by @oelachqar in https://github.com/oumi-ai/oumi/pull/589
Define DataCollationParams by @xrdaukar in https://github.com/oumi-ai/oumi/pull/581
Misc updates to Polaris launcher scripts by @xrdaukar in https://github.com/oumi-ai/oumi/pull/591
Set up a new version of the Oumi CLI using Typer by @taenin in https://github.com/oumi-ai/oumi/pull/588
Update handling of GPU fan speed info by @xrdaukar in https://github.com/oumi-ai/oumi/pull/595
Add support for magpie dataset variants by @oelachqar in https://github.com/oumi-ai/oumi/pull/594
Rename GenerationConfig to GenerationParams by @wizeng23 in https://github.com/oumi-ai/oumi/pull/592
Fix cli infer test by @wizeng23 in https://github.com/oumi-ai/oumi/pull/598
Judge Notebook 1: default judge by @kaisopos in https://github.com/oumi-ai/oumi/pull/593
[Tiny] update missing dataset import by @oelachqar in https://github.com/oumi-ai/oumi/pull/599
Update training script to support data collators by @xrdaukar in https://github.com/oumi-ai/oumi/pull/590
Update accelerate version to 1.0.0 by @wizeng23 in https://github.com/oumi-ai/oumi/pull/601
Remove deprecated dataset code paths by @oelachqar in https://github.com/oumi-ai/oumi/pull/596
Refactor Aya & Ultrachat to use oumi dataset sft classes by @oelachqar in https://github.com/oumi-ai/oumi/pull/597
Add Llama train/eval/infer E2E integration test by @wizeng23 in https://github.com/oumi-ai/oumi/pull/602
Set docstring for NVidiaGpuRuntimeInfo struct by @xrdaukar in https://github.com/oumi-ai/oumi/pull/603
Add generation params to inference engines by @oelachqar in https://github.com/oumi-ai/oumi/pull/600
[bug] Fix issue loading jsonl datasets from file by @oelachqar in https://github.com/oumi-ai/oumi/pull/604
Add Llama 3B configs by @wizeng23 in https://github.com/oumi-ai/oumi/pull/605
Align pyright checks with latest Pylance version by @oelachqar in https://github.com/oumi-ai/oumi/pull/611
Fix apply_chat_template issue in VisionLanguageSftDataset by @xrdaukar in https://github.com/oumi-ai/oumi/pull/609
More robust make setup by @oelachqar in https://github.com/oumi-ai/oumi/pull/610
Fix a bug where the new CLI was improperly importing functions from top-level modules. by @taenin in https://github.com/oumi-ai/oumi/pull/613
Add support for the Launch command suite in the new CLI by @taenin in https://github.com/oumi-ai/oumi/pull/612
Support HuggingFaceH4/llava-instruct-mix-vsft dataset by @xrdaukar in https://github.com/oumi-ai/oumi/pull/608
[tiny] Fix .gitignore by @wizeng23 in https://github.com/oumi-ai/oumi/pull/616
[tiny] add gpt2 chat template, and update tests to use it by @oelachqar in https://github.com/oumi-ai/oumi/pull/617
Turn off pretty-printing exceptions in our CLI by @taenin in https://github.com/oumi-ai/oumi/pull/618
Cleanup dependencies by @oelachqar in https://github.com/oumi-ai/oumi/pull/615
Upgrade oumi dependencies by @oelachqar in https://github.com/oumi-ai/oumi/pull/606
Update makefile to use uv, add Jupyter target by @oelachqar in https://github.com/oumi-ai/oumi/pull/614
Add miniconda installation target, cleanup unused make commands by @oelachqar in https://github.com/oumi-ai/oumi/pull/620
Update several notebooks with the new EvaluationConfig format. by @taenin in https://github.com/oumi-ai/oumi/pull/621
Make sure conda env is registered by @oelachqar in https://github.com/oumi-ai/oumi/pull/622
Add Llama 3b sft/lora/qlora configs for Polaris by @wizeng23 in https://github.com/oumi-ai/oumi/pull/626
Add check if installation is successful by @oelachqar in https://github.com/oumi-ai/oumi/pull/625
Initial Cambrian integration by @xrdaukar in https://github.com/oumi-ai/oumi/pull/557
[tiny] alpaca - minor reproducibility boost by @optas in https://github.com/oumi-ai/oumi/pull/619
explicitly specify the model's dtype in LMH by @optas in https://github.com/oumi-ai/oumi/pull/607
[tiny] Add flops for T4 GPU by @wizeng23 in https://github.com/oumi-ai/oumi/pull/628
Use a timestamp for job directories on Polaris by @taenin in https://github.com/oumi-ai/oumi/pull/627
[tiny] Fix bug with Polaris job num by @wizeng23 in https://github.com/oumi-ai/oumi/pull/629
Update two VLLM configs. by @xrdaukar in https://github.com/oumi-ai/oumi/pull/624
Add pip install -U uv; to make setup for existing envs by @xrdaukar in https://github.com/oumi-ai/oumi/pull/630
Disable MFU logging for non-packed datasets by @wizeng23 in https://github.com/oumi-ai/oumi/pull/632
Add config example for long context fine-tuning by @oelachqar in https://github.com/oumi-ai/oumi/pull/631
Add distribution mode flag to llama_tune by @wizeng23 in https://github.com/oumi-ai/oumi/pull/635
Judge Notebook 2: Custom Judge by @kaisopos in https://github.com/oumi-ai/oumi/pull/623
Bugfixes for LLAVA by @xrdaukar in https://github.com/oumi-ai/oumi/pull/634
Update sphinx config and docs to fix misc errors and warnings by @oelachqar in https://github.com/oumi-ai/oumi/pull/639
Factor out OUMITOTALNUM_GPUS env var by @wizeng23 in https://github.com/oumi-ai/oumi/pull/636
Remove bitsandbytes from train dependencies by @oelachqar in https://github.com/oumi-ai/oumi/pull/643
Enable intershinx to allow linking to external documentation pages by @oelachqar in https://github.com/oumi-ai/oumi/pull/640
Tune few training params for LLAVA and blip2 models by @xrdaukar in https://github.com/oumi-ai/oumi/pull/642
Added support for specifying the inference engine via the InferenceConfig by @taenin in https://github.com/oumi-ai/oumi/pull/638
Add popular pre-training dataset classes by @oelachqar in https://github.com/oumi-ai/oumi/pull/641
Remove openai dependency by @oelachqar in https://github.com/oumi-ai/oumi/pull/644
Update our documentation to point to the new CLI. by @taenin in https://github.com/oumi-ai/oumi/pull/645
Enable dataloaders for VLLM-s (llava and blip2) by @xrdaukar in https://github.com/oumi-ai/oumi/pull/646
Allow gradient clipping to be optional by @optas in https://github.com/oumi-ai/oumi/pull/649
Add support for add_generation_prompt in LLAVA chat template by @xrdaukar in https://github.com/oumi-ai/oumi/pull/648
Add a description to the Launch CLI by @taenin in https://github.com/oumi-ai/oumi/pull/651
Add all Llama FSDP GCP configs by @wizeng23 in https://github.com/oumi-ai/oumi/pull/637
Coerce model params to correct dtype for QLoRA FSDP by @wizeng23 in https://github.com/oumi-ai/oumi/pull/652
Use uv for pip install commands by @wizeng23 in https://github.com/oumi-ai/oumi/pull/653
Update sphinx docs by @oelachqar in https://github.com/oumi-ai/oumi/pull/654
[Docs] Refactor docs pipeline by @oelachqar in https://github.com/oumi-ai/oumi/pull/655
[docs] swap and configure sphinx theme by @oelachqar in https://github.com/oumi-ai/oumi/pull/656
[Docs] Add documentation placeholders by @oelachqar in https://github.com/oumi-ai/oumi/pull/658
[Docs] Add sphinx-bibtex by @oelachqar in https://github.com/oumi-ai/oumi/pull/659
[Docs] fix rendering issues by @oelachqar in https://github.com/oumi-ai/oumi/pull/660
[docs] fix broken links by @oelachqar in https://github.com/oumi-ai/oumi/pull/661
Fix broken link in readme (dev_setup) by @kaisopos in https://github.com/oumi-ai/oumi/pull/662
[docs][tiny] fix minor doc typos by @oelachqar in https://github.com/oumi-ai/oumi/pull/666
[docs] add autodoc2 template by @oelachqar in https://github.com/oumi-ai/oumi/pull/665
[docs] Add content links and references by @oelachqar in https://github.com/oumi-ai/oumi/pull/668
[docs] switch to myst-nb for rendering notebooks by @oelachqar in https://github.com/oumi-ai/oumi/pull/669
[docs] Add script to generate module summaries by @oelachqar in https://github.com/oumi-ai/oumi/pull/670
[docs] Include cli reference by @oelachqar in https://github.com/oumi-ai/oumi/pull/671
Add dataset submodules by @oelachqar in https://github.com/oumi-ai/oumi/pull/667
Update notebooks to include a descriptive title by @oelachqar in https://github.com/oumi-ai/oumi/pull/664
Update tests/utils/testdeviceutils.py by @xrdaukar in https://github.com/oumi-ai/oumi/pull/672
[Inference] Bug in generation config stop tokens by @kaisopos in https://github.com/oumi-ai/oumi/pull/663
Support rewriting special label values to -100 (ignore_index) to exclude from loss by @xrdaukar in https://github.com/oumi-ai/oumi/pull/657
Rename emails and website url to Oumi by @wizeng23 in https://github.com/oumi-ai/oumi/pull/675
Update scripts/benchmarks/minimalmultimodaltraining.py to be able to test 3 other models by @xrdaukar in https://github.com/oumi-ai/oumi/pull/677
Update data collators TextCollatorWithPadding by @xrdaukar in https://github.com/oumi-ai/oumi/pull/674
Add image support to llama3-instruct.jinja chat template by @xrdaukar in https://github.com/oumi-ai/oumi/pull/676
Cleanup vscode launch.json by @oelachqar in https://github.com/oumi-ai/oumi/pull/679
Add Llama 3B DDP configs by @wizeng23 in https://github.com/oumi-ai/oumi/pull/681
Add cff file for citations by @oelachqar in https://github.com/oumi-ai/oumi/pull/682
Small edits to Llama configs by @wizeng23 in https://github.com/oumi-ai/oumi/pull/683
Increase llava.7b.sft max length from 512 to 1024 by @xrdaukar in https://github.com/oumi-ai/oumi/pull/687
Rename turn.py to conversation.py by @oelachqar in https://github.com/oumi-ai/oumi/pull/685
Pad token fix by @kaisopos in https://github.com/oumi-ai/oumi/pull/678
Add basic validation for data collators by @xrdaukar in https://github.com/oumi-ai/oumi/pull/688
Support --image CLI param in infer_interactive by @xrdaukar in https://github.com/oumi-ai/oumi/pull/686
Move sft and preference tuning datasets to their own module by @oelachqar in https://github.com/oumi-ai/oumi/pull/691
Set model.tokenizer_pad_token for gpt2 in Oumi Inference example in vsconfig by @xrdaukar in https://github.com/oumi-ai/oumi/pull/692
Update our dev set up instructions to not conflict with the new CLI by @taenin in https://github.com/oumi-ai/oumi/pull/698
Add back the polling spinner to the Launch CLI by @taenin in https://github.com/oumi-ai/oumi/pull/695
Add a dummy sft dataset for unit tests by @oelachqar in https://github.com/oumi-ai/oumi/pull/697
Include judge commands into main CLI by @oelachqar in https://github.com/oumi-ai/oumi/pull/690
Replace logger.exception() in TextCollatorWithPadding with logger.error() by @xrdaukar in https://github.com/oumi-ai/oumi/pull/700
[Clean-up] Remove dead code (custom eval) by @kaisopos in https://github.com/oumi-ai/oumi/pull/696
Define oumi.core.constants file for global constants by @xrdaukar in https://github.com/oumi-ai/oumi/pull/694
Add serialization helper methods for Conversation object by @oelachqar in https://github.com/oumi-ai/oumi/pull/701
Create pullrequesttemplate.md by @taenin in https://github.com/oumi-ai/oumi/pull/693
Use -100 as ignore_index in VisionLanguageSftDataset and in build_data_collator() by default by @xrdaukar in https://github.com/oumi-ai/oumi/pull/702
Add main entrypoint, update vscode launch.json to use new cli by @oelachqar in https://github.com/oumi-ai/oumi/pull/704
Update experimental async dataset feature by @oelachqar in https://github.com/oumi-ai/oumi/pull/705
Update new image CLI to support input image for VLLMs by @xrdaukar in https://github.com/oumi-ai/oumi/pull/699
Update sft jsonl dataset to support json files and alpaca format by @oelachqar in https://github.com/oumi-ai/oumi/pull/703
Make interactive inference continually loop by @wizeng23 in https://github.com/oumi-ai/oumi/pull/689
Update vscode launch inference example by @xrdaukar in https://github.com/oumi-ai/oumi/pull/706
Training configurations llama3.1 + ChatML-Template by @optas in https://github.com/oumi-ai/oumi/pull/684
Add help strings for oumi train/evaluate/infer by @wizeng23 in https://github.com/oumi-ai/oumi/pull/708
Log peak gpu memory at the end of training by @oelachqar in https://github.com/oumi-ai/oumi/pull/710
Log training git commit and tag by @oelachqar in https://github.com/oumi-ai/oumi/pull/709
Add github issue templates. by @taenin in https://github.com/oumi-ai/oumi/pull/711
Enable more ruff rules by @oelachqar in https://github.com/oumi-ai/oumi/pull/712
Add Llama inference configs by @wizeng23 in https://github.com/oumi-ai/oumi/pull/713
Enable pathlib rule by @oelachqar in https://github.com/oumi-ai/oumi/pull/714
Update index.rst - switch expanded name Open Universal Machine Intelligence and replace with "Oumi" by @ciaralema in https://github.com/oumi-ai/oumi/pull/715
Make sure registry is case agnostic by @oelachqar in https://github.com/oumi-ai/oumi/pull/717
Move config files to new directory structure by @wizeng23 in https://github.com/oumi-ai/oumi/pull/720
Replace np.array(x) with to x.numpy() to resolve test warnings by @xrdaukar in https://github.com/oumi-ai/oumi/pull/724
Update index.rst by @ciaralema in https://github.com/oumi-ai/oumi/pull/719
Add judge cli tests and bugfixes by @oelachqar in https://github.com/oumi-ai/oumi/pull/722
Update tests/builders/test_lr_schedules.py by @xrdaukar in https://github.com/oumi-ai/oumi/pull/723
Add dataset_path parameter by @oelachqar in https://github.com/oumi-ai/oumi/pull/716
[tiny] Update judge configs by @oelachqar in https://github.com/oumi-ai/oumi/pull/725
Rename --generation.(input/output)_filepath flags to --(input/output)_path and rename detach flag by @wizeng23 in https://github.com/oumi-ai/oumi/pull/707
Rename LLAVA sftskyjob.yaml for consistency with blip2 by @xrdaukar in https://github.com/oumi-ai/oumi/pull/729
Remove deprecated preprocessing function by @oelachqar in https://github.com/oumi-ai/oumi/pull/726
Add DPO base class, initial dataset, integration tests by @oelachqar in https://github.com/oumi-ai/oumi/pull/728
[tiny] Add datasets import to enable registry discovery by @oelachqar in https://github.com/oumi-ai/oumi/pull/731
Add a helper function is_image_text_llm() to detect supported image+text LLMs by @xrdaukar in https://github.com/oumi-ai/oumi/pull/730
Add dpo debug dataset and integration test by @oelachqar in https://github.com/oumi-ai/oumi/pull/732
Add inference engine for vertex GCP by @oelachqar in https://github.com/oumi-ai/oumi/pull/734
[docs] Cleanup placeholder documentation pages by @oelachqar in https://github.com/oumi-ai/oumi/pull/736
Native inference with images (model.generate), and Processor abstraction by @xrdaukar in https://github.com/oumi-ai/oumi/pull/721
[docs] update faq section by @oelachqar in https://github.com/oumi-ai/oumi/pull/739
[docs] update advanced topics section by @oelachqar in https://github.com/oumi-ai/oumi/pull/737
[docs] update datasets section by @oelachqar in https://github.com/oumi-ai/oumi/pull/738
[docs] Update inference section by @oelachqar in https://github.com/oumi-ai/oumi/pull/744
[docs] Add evaluate and judge user guides by @oelachqar in https://github.com/oumi-ai/oumi/pull/745
[docs] Update training section by @oelachqar in https://github.com/oumi-ai/oumi/pull/741
Support merve/vqav2-small dataset by @xrdaukar in https://github.com/oumi-ai/oumi/pull/742
Reorder IMAGE and TEXT messages in coco and Flick datasets by @xrdaukar in https://github.com/oumi-ai/oumi/pull/743
Rename base datasets modules by @oelachqar in https://github.com/oumi-ai/oumi/pull/735
[tests] Move unit tests to unit folder by @oelachqar in https://github.com/oumi-ai/oumi/pull/753
Update training params for LLAVA and BLIP2 models by @xrdaukar in https://github.com/oumi-ai/oumi/pull/754
[docs] Add script to summarize configs by @oelachqar in https://github.com/oumi-ai/oumi/pull/751
[docs] Enable termynal cli and add example by @oelachqar in https://github.com/oumi-ai/oumi/pull/750
[docs] Update apidoc rst files with latest modules by @oelachqar in https://github.com/oumi-ai/oumi/pull/749
[docs] Add logo, analytics, and update sphinx config by @oelachqar in https://github.com/oumi-ai/oumi/pull/747
[docs] Update about section by @oelachqar in https://github.com/oumi-ai/oumi/pull/748
[docs] Add page for using skypilot by @oelachqar in https://github.com/oumi-ai/oumi/pull/746
Update base dataset class to use dataset_path by @oelachqar in https://github.com/oumi-ai/oumi/pull/757
Unit tests with sanity checks for oumi chat templates by @xrdaukar in https://github.com/oumi-ai/oumi/pull/755
[docs] Misc docs cleanup tasks by @oelachqar in https://github.com/oumi-ai/oumi/pull/756
And WandB integration to eval by @wizeng23 in https://github.com/oumi-ai/oumi/pull/727
Add 3 new PNG images to testdata by @xrdaukar in https://github.com/oumi-ai/oumi/pull/758
Update LLAVA integration tests to use different test images by @xrdaukar in https://github.com/oumi-ai/oumi/pull/761
[tests] Add shared mock tokenizer mixture, requires_gpus marker by @oelachqar in https://github.com/oumi-ai/oumi/pull/759
Update vllm engine to allow for continuous batching by @oelachqar in https://github.com/oumi-ai/oumi/pull/762
Added text completions collator hard-coded for Llama 3.1 support by @jgreer013 in https://github.com/oumi-ai/oumi/pull/752
Update remote inference engines to show progress, reduce native engine verbosity by @oelachqar in https://github.com/oumi-ai/oumi/pull/760
Add sample oumi launcher configs for LLAVA and blip2 by @xrdaukar in https://github.com/oumi-ai/oumi/pull/763
Create 2 other Launcher configs for LLAVA and blip2 using OUMI training loop by @xrdaukar in https://github.com/oumi-ai/oumi/pull/765
Update CI to include all dependencies, fix newly introduced vllm bugs by @oelachqar in https://github.com/oumi-ai/oumi/pull/764
Suggest using the git root directory when launching jobs via a CLI in dev-build. by @taenin in https://github.com/oumi-ai/oumi/pull/766
[bug] enable logger propagation so that pytest can capture them by @oelachqar in https://github.com/oumi-ai/oumi/pull/769
Update liger kernel integration by @oelachqar in https://github.com/oumi-ai/oumi/pull/768
Add BaseInferenceEngine supported_params to keep track of inference engine parameters by @oelachqar in https://github.com/oumi-ai/oumi/pull/770
Add READMEs for new config structure by @wizeng23 in https://github.com/oumi-ai/oumi/pull/767
Fix *_no_cuda() tests in tests/unit/utils/test_device_utils.py by @xrdaukar in https://github.com/oumi-ai/oumi/pull/775
Remove support for python 3.8, update to 3.9 by @oelachqar in https://github.com/oumi-ai/oumi/pull/772
Multimodal improvements, and basic support for meta-llama/Llama-3.2-11B-Vision-Instruct by @xrdaukar in https://github.com/oumi-ai/oumi/pull/771
Set vLLM max LoRA rank based on LoRA adapter by @wizeng23 in https://github.com/oumi-ai/oumi/pull/776
[WIP] Llama 3.2 Vision 11B configs by @xrdaukar in https://github.com/oumi-ai/oumi/pull/777
Tune MLLMs configs for Llama Vision and LLAVA by @xrdaukar in https://github.com/oumi-ai/oumi/pull/779
Ιnference vLLM: towards fitting big models in GPU memory by @optas in https://github.com/oumi-ai/oumi/pull/774
Adds control for an explicit quantization-compute dtype by @optas in https://github.com/oumi-ai/oumi/pull/680
Correct layer name in Llama vision accelerate config by @xrdaukar in https://github.com/oumi-ai/oumi/pull/782
Clean up configs by @wizeng23 in https://github.com/oumi-ai/oumi/pull/781
Save (pre-)processor configs by @xrdaukar in https://github.com/oumi-ai/oumi/pull/780
Tiny: force batch_size of generation parameters >= 1 by @optas in https://github.com/oumi-ai/oumi/pull/783
Switch to the regular pip temporarily in the old Polaris launcher by @xrdaukar in https://github.com/oumi-ai/oumi/pull/787
Define PAD token for Llama 2B model (fineweb pretrain) by @xrdaukar in https://github.com/oumi-ai/oumi/pull/785
Rename oumi launch stop to oumi launch cancel by @wizeng23 in https://github.com/oumi-ai/oumi/pull/786
[tiny] move collator test to unit folder by @oelachqar in https://github.com/oumi-ai/oumi/pull/790
Add oumi launch stop subcommand by @wizeng23 in https://github.com/oumi-ai/oumi/pull/789
Add sft packed dataset and tests by @oelachqar in https://github.com/oumi-ai/oumi/pull/788
CLI documentation updates. by @taenin in https://github.com/oumi-ai/oumi/pull/791
Updated default temperature from 1.0 to 0.0 for generation params by @jgreer013 in https://github.com/oumi-ai/oumi/pull/792
Add a prominent link to our docs. by @taenin in https://github.com/oumi-ai/oumi/pull/793
Fixing 2 broken links in README.md by @kaisopos in https://github.com/oumi-ai/oumi/pull/794
Switch from writing individual conversations to all conversations by @jgreer013 in https://github.com/oumi-ai/oumi/pull/796
Update pullrequesttemplate.md by @taenin in https://github.com/oumi-ai/oumi/pull/798
Fix polling jobs and job status in the CLI by @taenin in https://github.com/oumi-ai/oumi/pull/795
Update base SFT dataset to add support for training on assistant turns only by @oelachqar in https://github.com/oumi-ai/oumi/pull/797
Small fix for building docs by @taenin in https://github.com/oumi-ai/oumi/pull/799
Update polaris user regex to handle usernames with dashes by @jgreer013 in https://github.com/oumi-ai/oumi/pull/801
Simplify TextCollatorWithPadding by @xrdaukar in https://github.com/oumi-ai/oumi/pull/800
Minor updates to docstrings and error messages related to text_with_padding collator by @xrdaukar in https://github.com/oumi-ai/oumi/pull/802
Adding more control/parameters for text generation by @optas in https://github.com/oumi-ai/oumi/pull/784
Pre-training HSDP configs for Llama 8B for A100-40GB by @xrdaukar in https://github.com/oumi-ai/oumi/pull/778
Fix pyproject.toml merge error by @oelachqar in https://github.com/oumi-ai/oumi/pull/805
Update llama_tune script to support 8B pretraining by @xrdaukar in https://github.com/oumi-ai/oumi/pull/804
Add e2e fine-tuning test by @oelachqar in https://github.com/oumi-ai/oumi/pull/806
Create Polaris and GCP launcher configs for Llama 8B pretraining by @xrdaukar in https://github.com/oumi-ai/oumi/pull/807
Enable sphinx-tabs, update OOM faq page to use it by @oelachqar in https://github.com/oumi-ai/oumi/pull/808
[docs] Start adding doctests by @oelachqar in https://github.com/oumi-ai/oumi/pull/809
Fixed broken skyssh and skycode commands by @jgreer013 in https://github.com/oumi-ai/oumi/pull/810
Enable Accelerate FSDP runs without using the Accelerate launcher by @wizeng23 in https://github.com/oumi-ai/oumi/pull/803
Disable uv in polaris launcher by @xrdaukar in https://github.com/oumi-ai/oumi/pull/812
Migrate configs to use Oumi instead of Accelerate configs by @wizeng23 in https://github.com/oumi-ai/oumi/pull/811
Update pretest.yaml by @taenin in https://github.com/oumi-ai/oumi/pull/817
Update installation.md by @taenin in https://github.com/oumi-ai/oumi/pull/814
Add smollm config by @oelachqar in https://github.com/oumi-ai/oumi/pull/815
Add extra logging in Polaris client by @xrdaukar in https://github.com/oumi-ai/oumi/pull/813
Fix a small typo in quickstart by @taenin in https://github.com/oumi-ai/oumi/pull/818
Correct names of few env vars in llama_tune.sh by @xrdaukar in https://github.com/oumi-ai/oumi/pull/819
[tiny] move collator to builders by @oelachqar in https://github.com/oumi-ai/oumi/pull/821
Update sample commands to run Llama 8B model on Polaris by @xrdaukar in https://github.com/oumi-ai/oumi/pull/823
[tiny] update device cleanup by @oelachqar in https://github.com/oumi-ai/oumi/pull/822
Update quickstart to use the new CLI arg format. by @taenin in https://github.com/oumi-ai/oumi/pull/820
Mock out slow tests. by @taenin in https://github.com/oumi-ai/oumi/pull/825
More changes for Llama 8B Polaris by @xrdaukar in https://github.com/oumi-ai/oumi/pull/826
Minor updates to Polaris scripts by @xrdaukar in https://github.com/oumi-ai/oumi/pull/827
Misc documentation updates for installation, quickstart, and tutorials. by @taenin in https://github.com/oumi-ai/oumi/pull/828
Add image utils to load image bytes for messages with IMAGE_URL, IMAGE_PATH types by @xrdaukar in https://github.com/oumi-ai/oumi/pull/829
Update our documentation to include modules for directories that aren't packages. by @taenin in https://github.com/oumi-ai/oumi/pull/830
Add Llama405B Qlora/Lora/FFT configs by @wizeng23 in https://github.com/oumi-ai/oumi/pull/833
Add SGLangInferenceEngine by @xrdaukar in https://github.com/oumi-ai/oumi/pull/834
Switch from 8-bit AdamW to AdamW for FSDP Lora/QLora by @wizeng23 in https://github.com/oumi-ai/oumi/pull/835
Add options for specifying log levels in the CLI. by @taenin in https://github.com/oumi-ai/oumi/pull/836
Request user feedback after training. by @taenin in https://github.com/oumi-ai/oumi/pull/837
Documentation fix for training. by @taenin in https://github.com/oumi-ai/oumi/pull/840
[tiny] pass dataset_path to constructor by @oelachqar in https://github.com/oumi-ai/oumi/pull/841
Move chatqa to projects, add details by @oelachqar in https://github.com/oumi-ai/oumi/pull/816
[tiny] fix typo in docs/quickstart by @oelachqar in https://github.com/oumi-ai/oumi/pull/843
Add unit tests for builders/data module by @oelachqar in https://github.com/oumi-ai/oumi/pull/842
Additional improvements to 405B by @wizeng23 in https://github.com/oumi-ai/oumi/pull/838
Add constrained decoding to vllm engine by @oelachqar in https://github.com/oumi-ai/oumi/pull/844
Add support for remote vllm inference engine by @oelachqar in https://github.com/oumi-ai/oumi/pull/845
Add @override annotations to misc inference engines methods where missing by @xrdaukar in https://github.com/oumi-ai/oumi/pull/846
Add a debug command for dumping environment information. by @taenin in https://github.com/oumi-ai/oumi/pull/839
[tiny] fix openai remote engine auth bug by @oelachqar in https://github.com/oumi-ai/oumi/pull/848
Update RemoteInferenceEngine to support guided decoding by @oelachqar in https://github.com/oumi-ai/oumi/pull/849
Support multimodal inference with remote engines by @xrdaukar in https://github.com/oumi-ai/oumi/pull/847
Fixes for quickstart and launch commands. by @taenin in https://github.com/oumi-ai/oumi/pull/851
Support Qwen2-VL model for training by @xrdaukar in https://github.com/oumi-ai/oumi/pull/850
Fix our config parsing test by @taenin in https://github.com/oumi-ai/oumi/pull/855
Move remote_params from GenerationParams to InferenceConfig by @xrdaukar in https://github.com/oumi-ai/oumi/pull/853
Tune PyTorch CUDA allocation params for Llama 405B qlora by @xrdaukar in https://github.com/oumi-ai/oumi/pull/854
Update Llama 405B qlora config to use adamw_torch_fused optimizer by @xrdaukar in https://github.com/oumi-ai/oumi/pull/856
Update vllm config path in Polaris multi-node inference notebook by @xrdaukar in https://github.com/oumi-ai/oumi/pull/857
Change config dir structure for LLAVA and blip2 models by @xrdaukar in https://github.com/oumi-ai/oumi/pull/859
Enable LM-harness eval for multimodal, and add eval config for Llama 3.2 Vision 11B by @xrdaukar in https://github.com/oumi-ai/oumi/pull/858
Update inference engine docs. by @taenin in https://github.com/oumi-ai/oumi/pull/862
Initial port of zigzag attention by @xrdaukar in https://github.com/oumi-ai/oumi/pull/861
Add integration tests for visual SFT datasets (VisionLanguageSftDataset) by @xrdaukar in https://github.com/oumi-ai/oumi/pull/863
Update base config to_yaml signature by @oelachqar in https://github.com/oumi-ai/oumi/pull/865
Add support for batch inference by @oelachqar in https://github.com/oumi-ai/oumi/pull/864
Parallelize data preprocessing in BaseMapDataset by @xrdaukar in https://github.com/oumi-ai/oumi/pull/868
Tweak writer batch size computation in BaseMapDataset by @xrdaukar in https://github.com/oumi-ai/oumi/pull/870
Refresh fine-tuning notebook by @oelachqar in https://github.com/oumi-ai/oumi/pull/872
Support freezing nested model layers by @xrdaukar in https://github.com/oumi-ai/oumi/pull/873
Update dev_setup.md with minor clarification for use of SSH in GitHub by @mkoukoumidis in https://github.com/oumi-ai/oumi/pull/866
Rewrite CONTRIBUTING.md with new content, fixes and nits by @mkoukoumidis in https://github.com/oumi-ai/oumi/pull/867
Phi3 Vision (microsoft/Phi-3-vision-128k-instruct): Basic training support by @xrdaukar in https://github.com/oumi-ai/oumi/pull/871
Update main readme (part 1) by @oelachqar in https://github.com/oumi-ai/oumi/pull/874
Update readme sections (part 2) by @oelachqar in https://github.com/oumi-ai/oumi/pull/875
Refresh documentation pages by @oelachqar in https://github.com/oumi-ai/oumi/pull/877
Qwen2-VL-2B-Instruct: basic training config by @xrdaukar in https://github.com/oumi-ai/oumi/pull/876
Update recipes list in readme by @oelachqar in https://github.com/oumi-ai/oumi/pull/878
Update the list of supported VLM-s in README by @xrdaukar in https://github.com/oumi-ai/oumi/pull/879
Update documentation landing page by @oelachqar in https://github.com/oumi-ai/oumi/pull/881
Cap TRL library version to <0.12.2 by @jgreer013 in https://github.com/oumi-ai/oumi/pull/882
[tiny] fix remote inference bug for messages with image urls by @oelachqar in https://github.com/oumi-ai/oumi/pull/883
Adding support for Alpaca Eval by @kaisopos in https://github.com/oumi-ai/oumi/pull/880
Improve error messaging for the remote inference engine. by @taenin in https://github.com/oumi-ai/oumi/pull/884
Move multimodal configs from configs/examples/vision to configs/recipes/vision by @xrdaukar in https://github.com/oumi-ai/oumi/pull/886
Define internal VLM model configs by @xrdaukar in https://github.com/oumi-ai/oumi/pull/885
Alpaca Eval 2.0 notebook by @kaisopos in https://github.com/oumi-ai/oumi/pull/887
Freeze torchdata>=0.8.0,<0.10.0 by @xrdaukar in https://github.com/oumi-ai/oumi/pull/890
Create torchrun, accelerate CLI wrapper by @xrdaukar in https://github.com/oumi-ai/oumi/pull/888
Update VLM configs to use oumi distributed torchrun by @xrdaukar in https://github.com/oumi-ai/oumi/pull/891
Allow None batch size in GenerationParams by @wizeng23 in https://github.com/oumi-ai/oumi/pull/895
Evaluation: Saving metadata for future repro by @kaisopos in https://github.com/oumi-ai/oumi/pull/892
Fix missing import precommits when you create a fresh install of Oumi by @taenin in https://github.com/oumi-ai/oumi/pull/898
Update inference documentation by @wizeng23 in https://github.com/oumi-ai/oumi/pull/893
Allow simpler method of specifying LoRA Adapter for eval/inference by @wizeng23 in https://github.com/oumi-ai/oumi/pull/889
Introduce complex Message.content with sub-items - Phase 1 by @xrdaukar in https://github.com/oumi-ai/oumi/pull/897
Enable Message.content with sub-items - phase 2 by @xrdaukar in https://github.com/oumi-ai/oumi/pull/901
Update configs/notebooks to use typer CLI instead of oumi.train/eval/infer modules by @wizeng23 in https://github.com/oumi-ai/oumi/pull/902
Evaluation Notebooks: MT Bench by @kaisopos in https://github.com/oumi-ai/oumi/pull/899
Update chat template/conversation format - Phase 3 by @xrdaukar in https://github.com/oumi-ai/oumi/pull/903
Update make skycode to use Oumi launcher by @wizeng23 in https://github.com/oumi-ai/oumi/pull/894
Delete main() functions in train/evaluate/infer.py by @wizeng23 in https://github.com/oumi-ai/oumi/pull/900
Update Inference links in oumi README by @xrdaukar in https://github.com/oumi-ai/oumi/pull/904
Define qwen2-vl-instruct chat template by @xrdaukar in https://github.com/oumi-ai/oumi/pull/907
Set max_pixels/min_pixels params for Qwen2 VL model by @xrdaukar in https://github.com/oumi-ai/oumi/pull/908
Update chat template/conversation format - Phase 4 by @xrdaukar in https://github.com/oumi-ai/oumi/pull/906
Update configs to use oumi distributed torchrun by @wizeng23 in https://github.com/oumi-ai/oumi/pull/905
[tiny] minor docs updates by @oelachqar in https://github.com/oumi-ai/oumi/pull/913
[docs] clean-up docs pages by @oelachqar in https://github.com/oumi-ai/oumi/pull/914
Initial support for HuggingFaceTB/SmolVLM-Instruct by @xrdaukar in https://github.com/oumi-ai/oumi/pull/910
Catch exceptions for missing LoRA adapters by @taenin in https://github.com/oumi-ai/oumi/pull/918
Exclude custom models from Vision-Language model checks. by @taenin in https://github.com/oumi-ai/oumi/pull/919
Refactor is_custom_model() check into a function + Minor tests updates by @xrdaukar in https://github.com/oumi-ai/oumi/pull/921
Disable FSDP's use_orig_params for PEFT training by @wizeng23 in https://github.com/oumi-ai/oumi/pull/869
Mark Message as frozen by @xrdaukar in https://github.com/oumi-ai/oumi/pull/923
Rename MessageContentItem to ContentItem for brevity. by @xrdaukar in https://github.com/oumi-ai/oumi/pull/925
Delete configs/examples/llama3_1 by @wizeng23 in https://github.com/oumi-ai/oumi/pull/922
Update OOM FAQ with instructions to tune CUDA alloc by @xrdaukar in https://github.com/oumi-ai/oumi/pull/926
Update oumi launch documentation by @wizeng23 in https://github.com/oumi-ai/oumi/pull/930
90% speed up for the Oumi CLI by @taenin in https://github.com/oumi-ai/oumi/pull/929
Update docs/datasets/vl_sft.md and docs/datasets/local_datasets.md reflecting the latest changes in Conversation format and VL-dataset setup by @xrdaukar in https://github.com/oumi-ai/oumi/pull/928
Remove remaining references to MessageContentItem by @xrdaukar in https://github.com/oumi-ai/oumi/pull/931
Extend dev setup guide by @wizeng23 in https://github.com/oumi-ai/oumi/pull/932
Clean up configs by @wizeng23 in https://github.com/oumi-ai/oumi/pull/933
Add ascii art to all oumi commands in the CLI. by @taenin in https://github.com/oumi-ai/oumi/pull/934
[docs] Reorganize doc pages by @oelachqar in https://github.com/oumi-ai/oumi/pull/935
Fix broken pip targets by @wizeng23 in https://github.com/oumi-ai/oumi/pull/927
[docs] Enable doctest directive and add make doctests by @oelachqar in https://github.com/oumi-ai/oumi/pull/937
Evaluation refactoring by @kaisopos in https://github.com/oumi-ai/oumi/pull/912
Migrate Sky configs to Oumi job configs by @wizeng23 in https://github.com/oumi-ai/oumi/pull/938
[docs] fix misc doc build errors / warnings by @oelachqar in https://github.com/oumi-ai/oumi/pull/936
Update fineweb config by @wizeng23 in https://github.com/oumi-ai/oumi/pull/940
Evaluation refactoring (2nd) by @kaisopos in https://github.com/oumi-ai/oumi/pull/939
Update CLI parser to support = separated arguments by @xrdaukar in https://github.com/oumi-ai/oumi/pull/942
Integrating with AlpacaEval (PR 1 of 2) by @kaisopos in https://github.com/oumi-ai/oumi/pull/941
[Bug Fix] AlpacaEval by @kaisopos in https://github.com/oumi-ai/oumi/pull/943
Configure logging for oumi.telemetry in train.py by @xrdaukar in https://github.com/oumi-ai/oumi/pull/944
Only print ASCII log on main process by @xrdaukar in https://github.com/oumi-ai/oumi/pull/946
[bug] Fix issue with registered dataset loading by @oelachqar in https://github.com/oumi-ai/oumi/pull/947
Alpaca Eval: integration test by @kaisopos in https://github.com/oumi-ai/oumi/pull/945
Print telemetry stats in more compact format by @xrdaukar in https://github.com/oumi-ai/oumi/pull/950
Refactor conversation_utils by @xrdaukar in https://github.com/oumi-ai/oumi/pull/951
Improve rank detection for logging by @xrdaukar in https://github.com/oumi-ai/oumi/pull/953
[docs] Add core concepts page by @oelachqar in https://github.com/oumi-ai/oumi/pull/952
[docs][tiny] Update acknowledgement page by @oelachqar in https://github.com/oumi-ai/oumi/pull/954
[docs][tiny] remove unused pages by @oelachqar in https://github.com/oumi-ai/oumi/pull/955
[docs][bug] Fix rendering issue in installation page by @oelachqar in https://github.com/oumi-ai/oumi/pull/956
[docs] Refresh evaluate user guide by @oelachqar in https://github.com/oumi-ai/oumi/pull/961
[docs] Refresh datasets resource section by @oelachqar in https://github.com/oumi-ai/oumi/pull/960
[docs] Refresh models resource section by @oelachqar in https://github.com/oumi-ai/oumi/pull/959
[docs] Refresh infer user guide by @oelachqar in https://github.com/oumi-ai/oumi/pull/957
[docs] Refresh training user guide by @oelachqar in https://github.com/oumi-ai/oumi/pull/958
[docs][tiny] Update links by @oelachqar in https://github.com/oumi-ai/oumi/pull/962
[docs] Update doc generation config by @oelachqar in https://github.com/oumi-ai/oumi/pull/963
[docs] Fix misc docstring rendering issues by @oelachqar in https://github.com/oumi-ai/oumi/pull/964
[docs] Refresh judge user guide by @oelachqar in https://github.com/oumi-ai/oumi/pull/965
[docs] Add documentation page for local training by @oelachqar in https://github.com/oumi-ai/oumi/pull/966
[docs] Update documentation index page by @oelachqar in https://github.com/oumi-ai/oumi/pull/967
[Evaluation] Refactor repro info & add repro info in AlpacaEval by @kaisopos in https://github.com/oumi-ai/oumi/pull/949
[docs] Add documentation page for notebook environment by @oelachqar in https://github.com/oumi-ai/oumi/pull/969
Add more options for saving LoRA models by @wizeng23 in https://github.com/oumi-ai/oumi/pull/968
[docs] Add vscode environment doc page by @oelachqar in https://github.com/oumi-ai/oumi/pull/970
[docs] Cleanup evaluation guide page by @oelachqar in https://github.com/oumi-ai/oumi/pull/971
[docs] Update recipes page by @oelachqar in https://github.com/oumi-ai/oumi/pull/972
[docs] Update tutorials page by @oelachqar in https://github.com/oumi-ai/oumi/pull/973
[docs] Add doc page for generative evals by @oelachqar in https://github.com/oumi-ai/oumi/pull/974
[docs] misc clarifications in train, judge, dataset pages by @oelachqar in https://github.com/oumi-ai/oumi/pull/976
[docs][tiny] Remove train/common_workflows page by @oelachqar in https://github.com/oumi-ai/oumi/pull/977
[docs] Add infer common workflows page by @oelachqar in https://github.com/oumi-ai/oumi/pull/978
Update notebooks setup documentation by @wizeng23 in https://github.com/oumi-ai/oumi/pull/975
[docs] Add supported models page by @oelachqar in https://github.com/oumi-ai/oumi/pull/979
[docs] update inference engine user guide by @oelachqar in https://github.com/oumi-ai/oumi/pull/980
Refactor conversation JSON builders by @xrdaukar in https://github.com/oumi-ai/oumi/pull/981
[docs] Update CLI reference page by @oelachqar in https://github.com/oumi-ai/oumi/pull/983
[docs] Minor updates to datasets intro page by @oelachqar in https://github.com/oumi-ai/oumi/pull/984
Remove internal GCS mentions by @wizeng23 in https://github.com/oumi-ai/oumi/pull/982
Update vision/smollm config names by @wizeng23 in https://github.com/oumi-ai/oumi/pull/987
Update VLLMInferenceEngine to support multimodal conversations by @xrdaukar in https://github.com/oumi-ai/oumi/pull/986
Image utils refactorings by @xrdaukar in https://github.com/oumi-ai/oumi/pull/985
[docs] Add training monitoring documentation page by @oelachqar in https://github.com/oumi-ai/oumi/pull/988
Add torchvision dependency by @xrdaukar in https://github.com/oumi-ai/oumi/pull/989
Define configs/recipes/vision/phi3/sft/trl_gcp_job.yaml for Phi3 Vision by @xrdaukar in https://github.com/oumi-ai/oumi/pull/990
data.train.text_col cleanup for VLM-s by @xrdaukar in https://github.com/oumi-ai/oumi/pull/991
[Evaluation] Updating Documentation (1st pass) by @kaisopos in https://github.com/oumi-ai/oumi/pull/993
Fix saving merged lora model by @wizeng23 in https://github.com/oumi-ai/oumi/pull/992
Add an option to return custom datasets as IterableDataset by @xrdaukar in https://github.com/oumi-ai/oumi/pull/994
Fix regression in CLI speed by @taenin in https://github.com/oumi-ai/oumi/pull/997
Create alias for json/jsonl datasets by @oelachqar in https://github.com/oumi-ai/oumi/pull/998
[Evaluation] Updating Documentation (Generative Benchmarks) by @kaisopos in https://github.com/oumi-ai/oumi/pull/999
[docs][tiny] Fix broken links, update sft dataset examples by @oelachqar in https://github.com/oumi-ai/oumi/pull/1002

New Contributors

@ciaralema made their first contribution in https://github.com/oumi-ai/oumi/pull/715

Full Changelog: https://github.com/oumi-ai/oumi/compare/v0.1-alpha...v0.1.1-alpha

- Python
Published by oelachqar over 1 year ago

oumi - Initial release

What's Changed

Add python project configs by @oelachqar in https://github.com/oumi-ai/oumi/pull/1
Add repo skeleton by @oelachqar in https://github.com/oumi-ai/oumi/pull/2
Export lema entrypoint scripts by @oelachqar in https://github.com/oumi-ai/oumi/pull/3
Update static type checking config by @oelachqar in https://github.com/oumi-ai/oumi/pull/5
Add example jupyter / colab notebook by @oelachqar in https://github.com/oumi-ai/oumi/pull/4
Refactor config parsing to use omegaconf by @oelachqar in https://github.com/oumi-ai/oumi/pull/6
Updating documentation (Dev Environment Setup) by @kaisopos in https://github.com/oumi-ai/oumi/pull/7
Add tests and vscode config by @oelachqar in https://github.com/oumi-ai/oumi/pull/8
Added DPOTrainer example to repo, as well as cuda device cleanup to training loop by @jgreer013 in https://github.com/oumi-ai/oumi/pull/9
Adding torch as top-level module dependency by @optas in https://github.com/oumi-ai/oumi/pull/10
Add configs for specific hardware requirements by @jgreer013 in https://github.com/oumi-ai/oumi/pull/11
Sort pre-commit hooks lexicographically by @xrdaukar in https://github.com/oumi-ai/oumi/pull/12
Add logging config by @oelachqar in https://github.com/oumi-ai/oumi/pull/13
Lema inference by @xrdaukar in https://github.com/oumi-ai/oumi/pull/14
Panos dev by @optas in https://github.com/oumi-ai/oumi/pull/16
Add job launcher by @oelachqar in https://github.com/oumi-ai/oumi/pull/15
Making split of data a flexible variable by @optas in https://github.com/oumi-ai/oumi/pull/17
Configure max file size in precommit hooks by @xrdaukar in https://github.com/oumi-ai/oumi/pull/18
Minor bugfix and documentation update by @oelachqar in https://github.com/oumi-ai/oumi/pull/19
adding pynvml to train env by @kaisopos in https://github.com/oumi-ai/oumi/pull/20
Panos dev by @optas in https://github.com/oumi-ai/oumi/pull/22
Augmenting Types for training hyperparams by @optas in https://github.com/oumi-ai/oumi/pull/23
Train refactoring (config file visibility) + a few minor changes by @kaisopos in https://github.com/oumi-ai/oumi/pull/21
Minimal test for train function by @xrdaukar in https://github.com/oumi-ai/oumi/pull/25
Fix leftover 'torchdtype' in 'ModelParams' by @xrdaukar in https://github.com/oumi-ai/oumi/pull/26
Update GPU types list in the default SkyPilot config by @xrdaukar in https://github.com/oumi-ai/oumi/pull/27
Add a missing lema-infer command under [project.scripts] by @xrdaukar in https://github.com/oumi-ai/oumi/pull/28
add basic pytests for evaluate and infer by @xrdaukar in https://github.com/oumi-ai/oumi/pull/29
Update README and pyproject.toml by @wizeng23 in https://github.com/oumi-ai/oumi/pull/30
A helper function to print info about available CUDA devices by @xrdaukar in https://github.com/oumi-ai/oumi/pull/31
Update SkyPilot cconfig to start using torchrun by @xrdaukar in https://github.com/oumi-ai/oumi/pull/32
Support basic single-node, multi-gpu training by @xrdaukar in https://github.com/oumi-ai/oumi/pull/33
Run all precommit hooks on the repo by @xrdaukar in https://github.com/oumi-ai/oumi/pull/35
Add experimental code for llama cpp inference by @jgreer013 in https://github.com/oumi-ai/oumi/pull/37
Create skeleton of STYLE_GUIDE.md by @xrdaukar in https://github.com/oumi-ai/oumi/pull/36
Adding support for training custom models (for now just a dummy model). by @kaisopos in https://github.com/oumi-ai/oumi/pull/38
Fix custom model name in test_train.py by @xrdaukar in https://github.com/oumi-ai/oumi/pull/39
Configure pyright (static type checker) and resolve existing type errors to make it pass by @xrdaukar in https://github.com/oumi-ai/oumi/pull/41
fix trailing whitespace warning in STYLE_GUIDE.md by @xrdaukar in https://github.com/oumi-ai/oumi/pull/43
Configure initial GitHub Actions workflow to run pre-commits and tests by @xrdaukar in https://github.com/oumi-ai/oumi/pull/44
A variety of proposed extensions to finetune a chat-based model (starting with Zephyr) by @optas in https://github.com/oumi-ai/oumi/pull/34
Fix syntax error in ultrachat by @xrdaukar in https://github.com/oumi-ai/oumi/pull/48
Create initial version of CONTRIBUTING.md by @xrdaukar in https://github.com/oumi-ai/oumi/pull/46
Reduce the number of training steps from 5 to 3 to make test_train.py faster by @xrdaukar in https://github.com/oumi-ai/oumi/pull/49
Adding registry for custom models. by @kaisopos in https://github.com/oumi-ai/oumi/pull/42
Add config and streaming args to DataParams by @wizeng23 in https://github.com/oumi-ai/oumi/pull/47
Update Pre-review Tests to only run on pull_request by @xrdaukar in https://github.com/oumi-ai/oumi/pull/50
Add training flags to computes tokens-based stats by @xrdaukar in https://github.com/oumi-ai/oumi/pull/51
reduce test training steps in another test which I missed before by @xrdaukar in https://github.com/oumi-ai/oumi/pull/53
Rename var names of *Params classes by @wizeng23 in https://github.com/oumi-ai/oumi/pull/52
Make some NVIDIA-specific dependencies optional by @xrdaukar in https://github.com/oumi-ai/oumi/pull/54
fix trl version as 0.8.6 by @xrdaukar in https://github.com/oumi-ai/oumi/pull/56
Remove reference to torch.cuda.clock_rate by @xrdaukar in https://github.com/oumi-ai/oumi/pull/57
Update inference to support non-interactive batch mode. by @kaisopos in https://github.com/oumi-ai/oumi/pull/58
Update README.md to include Linux/WSL specific instructions by @xrdaukar in https://github.com/oumi-ai/oumi/pull/59
Minor formatting improvements in README.md by @xrdaukar in https://github.com/oumi-ai/oumi/pull/60
Minor: Updating Lora Params by @optas in https://github.com/oumi-ai/oumi/pull/55
Support dataset packing by @wizeng23 in https://github.com/oumi-ai/oumi/pull/63
Disallow relative imports in LeMa by @xrdaukar in https://github.com/oumi-ai/oumi/pull/65
Add text_col param that's required for SFTTrainer by @wizeng23 in https://github.com/oumi-ai/oumi/pull/66
Refactor common config parsing logic (YAML, arg_list) into a common util by @xrdaukar in https://github.com/oumi-ai/oumi/pull/68
Standardize test naming convention by @wizeng23 in https://github.com/oumi-ai/oumi/pull/69
Adding support for a hardcoded evaluation with MMLU. by @kaisopos in https://github.com/oumi-ai/oumi/pull/67
Minor changes to the default configs/skypilot/sky.yaml config by @xrdaukar in https://github.com/oumi-ai/oumi/pull/71
Prototype to pass config.model.model_max_length to Trainers by @xrdaukar in https://github.com/oumi-ai/oumi/pull/70
[Inference] Remove the prepended prompts from model responses. by @kaisopos in https://github.com/oumi-ai/oumi/pull/73
Add a util to print versioning info by @xrdaukar in https://github.com/oumi-ai/oumi/pull/74
Switch to tempfile.TemporaryDirectory() in test_train.py by @xrdaukar in https://github.com/oumi-ai/oumi/pull/75
Update docstring verbs to descriptive form by @wizeng23 in https://github.com/oumi-ai/oumi/pull/76
Add sample accelerate and fsdp configs by @xrdaukar in https://github.com/oumi-ai/oumi/pull/77
Refactor code to get device rank and world size into a helper function by @xrdaukar in https://github.com/oumi-ai/oumi/pull/79
Add a simple util to print model summary e.g., layer names, architecture summary by @xrdaukar in https://github.com/oumi-ai/oumi/pull/80
Freeze numpy to pre 2.0 version by @xrdaukar in https://github.com/oumi-ai/oumi/pull/81
Adding inference support for next logit probability. by @kaisopos in https://github.com/oumi-ai/oumi/pull/78
Create FSDP configs for Phi3 by @xrdaukar in https://github.com/oumi-ai/oumi/pull/82
Auto-format pyproject.toml with "Even Better TOML" by @xrdaukar in https://github.com/oumi-ai/oumi/pull/83
Minor cleanup updates to SkyPilot configs by @xrdaukar in https://github.com/oumi-ai/oumi/pull/84
Mixed Precision Training, Flash-Attention-2, Print-trainable-params by @optas in https://github.com/oumi-ai/oumi/pull/85
Update README.md to include basic instructions for multi-GPU training (DDP, FSDP) by @xrdaukar in https://github.com/oumi-ai/oumi/pull/86
Start using $SKYPILOTNUMGPUSPERNODE in SkyPilot config by @xrdaukar in https://github.com/oumi-ai/oumi/pull/90
Add configs for FineWeb Llama2 pretraining by @wizeng23 in https://github.com/oumi-ai/oumi/pull/89
Quantization by @optas in https://github.com/oumi-ai/oumi/pull/87
Update the default SkyPilot config to print more debug/context info by @xrdaukar in https://github.com/oumi-ai/oumi/pull/92
Add license by @oelachqar in https://github.com/oumi-ai/oumi/pull/93
Initial version of SkyPilot config for multi-node training (num_nodes: N) by @xrdaukar in https://github.com/oumi-ai/oumi/pull/94
MMLU eval refactor. by @kaisopos in https://github.com/oumi-ai/oumi/pull/88
Remove comparison between LOCAL_RANK and RANK by @xrdaukar in https://github.com/oumi-ai/oumi/pull/96
Handling the loading of peft adapters and other minor issues (e.g., adding more logging parameters) by @optas in https://github.com/oumi-ai/oumi/pull/91
Update configs/skypilot/skyllama2b.yaml to start using skyinit.sh by @xrdaukar in https://github.com/oumi-ai/oumi/pull/97
Add bool param to resume training from the last known checkpoint (if exists) by @xrdaukar in https://github.com/oumi-ai/oumi/pull/99
Inference: save/restore probabilities to/from file. by @kaisopos in https://github.com/oumi-ai/oumi/pull/98
Add support for dataset mixtures during training by @taenin in https://github.com/oumi-ai/oumi/pull/95
Add train, test, and validation splits to the LeMa config. by @taenin in https://github.com/oumi-ai/oumi/pull/101
nanoGPT (GPT2) pretraining recipe by @wizeng23 in https://github.com/oumi-ai/oumi/pull/103
Minor: Updates on Zephyr-Config by @optas in https://github.com/oumi-ai/oumi/pull/106
Update pre-commit config by @oelachqar in https://github.com/oumi-ai/oumi/pull/108
Add integration tests that verify all configs load properly. by @taenin in https://github.com/oumi-ai/oumi/pull/102
Handling Gradient Checkpointing by @optas in https://github.com/oumi-ai/oumi/pull/107
Update skypilot/sky_gpt2.yaml to include an example how to mount GCS dir by @xrdaukar in https://github.com/oumi-ai/oumi/pull/111
Rename datasetparams.datasetconfig to dataset_params.subset by @oelachqar in https://github.com/oumi-ai/oumi/pull/109
Refactor SFT dataset preprocessing by @oelachqar in https://github.com/oumi-ai/oumi/pull/112
Support shuffling and random seeds for dataset sampling by @taenin in https://github.com/oumi-ai/oumi/pull/113
Split types file into module by @oelachqar in https://github.com/oumi-ai/oumi/pull/114
Add GCP deps to lema[cloud] by @xrdaukar in https://github.com/oumi-ai/oumi/pull/117
Add llama3-instruct jinja template by @jgreer013 in https://github.com/oumi-ai/oumi/pull/118
Update sky_init.sh to print current dir by @xrdaukar in https://github.com/oumi-ai/oumi/pull/120
Add prompt response sft preprocessor factory for aya dataset by @jgreer013 in https://github.com/oumi-ai/oumi/pull/121
Add configs for chatqa model by @oelachqar in https://github.com/oumi-ai/oumi/pull/110
Saving inference probs in parquet format. by @kaisopos in https://github.com/oumi-ai/oumi/pull/115
Refactor model registry by @oelachqar in https://github.com/oumi-ai/oumi/pull/122
Define BaseTrainer abstraction by @xrdaukar in https://github.com/oumi-ai/oumi/pull/116
Add a registry for metric functions that we can run during training. by @taenin in https://github.com/oumi-ai/oumi/pull/126
Update trainingparams.py so HF trainer uses numtrain_epochs by @optas in https://github.com/oumi-ai/oumi/pull/125
Add native PyTorch model training by @oelachqar in https://github.com/oumi-ai/oumi/pull/123
[Quick fix] Handle pynvml being misconfigured by @taenin in https://github.com/oumi-ai/oumi/pull/128
Enable DP for inference by @kaisopos in https://github.com/oumi-ai/oumi/pull/100
Add configs for training llama3-8b with aya finetune by @jgreer013 in https://github.com/oumi-ai/oumi/pull/130
Update HF save_model() to only save on master replica by @xrdaukar in https://github.com/oumi-ai/oumi/pull/131
Pipe MetricsFunction from our config to train.py by @taenin in https://github.com/oumi-ai/oumi/pull/129
Fixing broken eval. by @kaisopos in https://github.com/oumi-ai/oumi/pull/132
Minor updates in SkyPilot docstrings by @xrdaukar in https://github.com/oumi-ai/oumi/pull/133
Fix bug with DP evaluation by @oelachqar in https://github.com/oumi-ai/oumi/pull/134
[MMLU custom eval] removing hardcoded subject, samples, num-shots. by @kaisopos in https://github.com/oumi-ai/oumi/pull/135
Add an initial config for async evaluations by @taenin in https://github.com/oumi-ai/oumi/pull/137
Add a new top level command: evaluate_async by @taenin in https://github.com/oumi-ai/oumi/pull/138
Minor bug fix in writing evaluations by @taenin in https://github.com/oumi-ai/oumi/pull/140
Support full GPT2 run by @wizeng23 in https://github.com/oumi-ai/oumi/pull/141
Upload sample configs for running async evals on GPT2 by @taenin in https://github.com/oumi-ai/oumi/pull/139
Apply torch.distributed.barrier() in save_model by @xrdaukar in https://github.com/oumi-ai/oumi/pull/136
Create an experimental util to generate pre-tokenized datasets (Parquet files) with token_ids column by @xrdaukar in https://github.com/oumi-ai/oumi/pull/144
Created a new dataset class with async loading & tokenization by @jgreer013 in https://github.com/oumi-ai/oumi/pull/142
Remove private debug dir from configs/skypilot/sky_gpt2.yaml by @xrdaukar in https://github.com/oumi-ai/oumi/pull/145
Define dataloadernumworkers and dataloaderprefetchfactor params by @xrdaukar in https://github.com/oumi-ai/oumi/pull/146
[Evaluations] Integration with LM Evaluation Harness by @kaisopos in https://github.com/oumi-ai/oumi/pull/143
Support model compilation by @wizeng23 in https://github.com/oumi-ai/oumi/pull/147
Multiple cleanup changes in configs/skypilot/sky_gpt2.yaml by @xrdaukar in https://github.com/oumi-ai/oumi/pull/148
Update SkyPilot training configs to include run_name by @xrdaukar in https://github.com/oumi-ai/oumi/pull/149
Update async eval to properly parse eval configs by @taenin in https://github.com/oumi-ai/oumi/pull/150
Zephyr Configs [full-model, skypilot] by @optas in https://github.com/oumi-ai/oumi/pull/152
Disable model.compile in gpt2 config by @xrdaukar in https://github.com/oumi-ai/oumi/pull/154
Update sky_init.sh to print task id and cluster info by @xrdaukar in https://github.com/oumi-ai/oumi/pull/156
[bug] Include jinja templates in build by @oelachqar in https://github.com/oumi-ai/oumi/pull/158
Add basic scaffolding for torch profiler around training loop by @xrdaukar in https://github.com/oumi-ai/oumi/pull/157
[Minor] Adding attn_implementation arg in LM Harness. by @kaisopos in https://github.com/oumi-ai/oumi/pull/160
Update Trainer.savemodel to start using the public HF savemodel() method (except for PEFT) by @xrdaukar in https://github.com/oumi-ai/oumi/pull/161
Update the vanilla eval config for gpt2 to run hellaswag evals. by @taenin in https://github.com/oumi-ai/oumi/pull/165
Add Dataset base class & API by @oelachqar in https://github.com/oumi-ai/oumi/pull/151
Add experimental notebook to run Nvidia's ChatRAG-Bench evaluation by @oelachqar in https://github.com/oumi-ai/oumi/pull/166
Update ChatQA training configs by @oelachqar in https://github.com/oumi-ai/oumi/pull/159
Update async dataset class to support pre-tokenized datasets by @oelachqar in https://github.com/oumi-ai/oumi/pull/162
Create a launcher script for Polaris jobs (ALCF) by @taenin in https://github.com/oumi-ai/oumi/pull/164
Update pre-tokenized column name to be input_ids in tokenize_dataset tool by @xrdaukar in https://github.com/oumi-ai/oumi/pull/167
Replacing EvaluationConfig's DataParams with DatasetSplitParams by @kaisopos in https://github.com/oumi-ai/oumi/pull/168
Submit config to create Custom IAM role for SkyPilot Service Accounts on GCP by @xrdaukar in https://github.com/oumi-ai/oumi/pull/169
Remove GCP project reference by @xrdaukar in https://github.com/oumi-ai/oumi/pull/172
Make sure output training dir exists by @xrdaukar in https://github.com/oumi-ai/oumi/pull/171
Improve launcher usability via command line arguments. by @taenin in https://github.com/oumi-ai/oumi/pull/170
Add a source directory to the Polaris launcher and clean up rsync copies. by @taenin in https://github.com/oumi-ai/oumi/pull/173
Introduce LEMARUNNAME env var to SkyPilot training configs by @xrdaukar in https://github.com/oumi-ai/oumi/pull/174
Minor changes: 1. Remove hardcoded HF_TOKEN 2. Log effective training config by @xrdaukar in https://github.com/oumi-ai/oumi/pull/175
Tweak default params in gpt2 scripts by @xrdaukar in https://github.com/oumi-ai/oumi/pull/177
LM Harness optimizations by @kaisopos in https://github.com/oumi-ai/oumi/pull/176
No longer ignore .git. in Polaris Needed for venv. by @taenin in https://github.com/oumi-ai/oumi/pull/179
A hack for running jobs on Polaris. by @taenin in https://github.com/oumi-ai/oumi/pull/180
[Polaris] Move venv creation from worker to launcher. by @taenin in https://github.com/oumi-ai/oumi/pull/181
Update README.md to include sky launch - 10 ... example by @xrdaukar in https://github.com/oumi-ai/oumi/pull/182
[Evaluations] Adding support for HuggingFace's leaderboard v1 benchmarks by @kaisopos in https://github.com/oumi-ai/oumi/pull/183
Llama 3 Aya Fine-Tuning Updates by @jgreer013 in https://github.com/oumi-ai/oumi/pull/163
Remove logger propagation by @wizeng23 in https://github.com/oumi-ai/oumi/pull/185
[Evaluations] HF leaderboard v1 configs by @kaisopos in https://github.com/oumi-ai/oumi/pull/186
Move logging.py to utils by @wizeng23 in https://github.com/oumi-ai/oumi/pull/187
Create the Jobs config for the lema launcher. by @taenin in https://github.com/oumi-ai/oumi/pull/188
Initial abstract base classes for the lema launcher. by @taenin in https://github.com/oumi-ai/oumi/pull/189
Added mfu calculation and tests by @jgreer013 in https://github.com/oumi-ai/oumi/pull/190
Introduce two new training params: savemodel and saveepoch by @xrdaukar in https://github.com/oumi-ai/oumi/pull/191
Update FineWeb ablation model configs by @xrdaukar in https://github.com/oumi-ai/oumi/pull/196
Added MFU telemetry by @jgreer013 in https://github.com/oumi-ai/oumi/pull/193
Update Polaris script by @wizeng23 in https://github.com/oumi-ai/oumi/pull/192
Rename training.save_model param to training.save_final_model for clarity by @xrdaukar in https://github.com/oumi-ai/oumi/pull/197
Support disabling dropout by @wizeng23 in https://github.com/oumi-ai/oumi/pull/184
Update actual mfu calculation by @jgreer013 in https://github.com/oumi-ai/oumi/pull/199
Implement a client for talking to SkyPilot. by @taenin in https://github.com/oumi-ai/oumi/pull/201
Fixed miscalculation of second step start time by @jgreer013 in https://github.com/oumi-ai/oumi/pull/202
Update ablation-model-fineweb-v1 config to start using grad checkpointing by @xrdaukar in https://github.com/oumi-ai/oumi/pull/198
Add distributed operations by @oelachqar in https://github.com/oumi-ai/oumi/pull/194
Add pre-commit hooks for credential scanning + new checks by @oelachqar in https://github.com/oumi-ai/oumi/pull/195
Sample job for multi-node training by @xrdaukar in https://github.com/oumi-ai/oumi/pull/203
Update Polaris multi-node launcher by @xrdaukar in https://github.com/oumi-ai/oumi/pull/204
Multi-node config improvements for llama2b model (HuggingFaceFW/ablation-model-fineweb-v1) by @xrdaukar in https://github.com/oumi-ai/oumi/pull/205
Minor updates to Polaris launcher script by @xrdaukar in https://github.com/oumi-ai/oumi/pull/206
Update Lema FSDP configs by @xrdaukar in https://github.com/oumi-ai/oumi/pull/207
[tiny] add default formatter for markdown by @oelachqar in https://github.com/oumi-ai/oumi/pull/210
Preparations for Lema custom pre-training loop by @oelachqar in https://github.com/oumi-ai/oumi/pull/208
Update MFU callback to support Lema trainer by @oelachqar in https://github.com/oumi-ai/oumi/pull/209
Configure llama2b model to use FSDP HYBRID_SHARD by @xrdaukar in https://github.com/oumi-ai/oumi/pull/213
Implement a Cluster resource manager around Sky Pilot. by @taenin in https://github.com/oumi-ai/oumi/pull/214
Add utils to setup distributed training by @oelachqar in https://github.com/oumi-ai/oumi/pull/211
Add example notebook to train NanoGPT model with Lema by @oelachqar in https://github.com/oumi-ai/oumi/pull/212
[tiny] update sky pilot ssh config by @oelachqar in https://github.com/oumi-ai/oumi/pull/215
Implement a Cloud resource manager around Sky Pilot by @taenin in https://github.com/oumi-ai/oumi/pull/216
Sanitize run name by @xrdaukar in https://github.com/oumi-ai/oumi/pull/217
Use "cluster_name" instead of "name" in the Sky client. by @taenin in https://github.com/oumi-ai/oumi/pull/218
Minor logging improvements in Polaris sample job scripts by @xrdaukar in https://github.com/oumi-ai/oumi/pull/219
Update shell scripts to point to local dataset by @jgreer013 in https://github.com/oumi-ai/oumi/pull/221
Support FSDP on Polaris using accelerate by @xrdaukar in https://github.com/oumi-ai/oumi/pull/220
Add telemetry manager by @oelachqar in https://github.com/oumi-ai/oumi/pull/222
Switch to the latest transformers=4.43.1 by @xrdaukar in https://github.com/oumi-ai/oumi/pull/223
Re-enable model compilation for llama2b model by @xrdaukar in https://github.com/oumi-ai/oumi/pull/224
Increase llama2b batch size from 2 to 3 by @xrdaukar in https://github.com/oumi-ai/oumi/pull/225
Add makefile with common local commands by @oelachqar in https://github.com/oumi-ai/oumi/pull/227
Add DeepSpeed config for Llama2b by @wizeng23 in https://github.com/oumi-ai/oumi/pull/228
MFU Improvements for Llama 2B on Polaris by @jgreer013 in https://github.com/oumi-ai/oumi/pull/229
FSDP config updates by @xrdaukar in https://github.com/oumi-ai/oumi/pull/231
Rename accelerate configs to be in line with other configs by @wizeng23 in https://github.com/oumi-ai/oumi/pull/232
[tiny] Update logger format to include rank, pid and threadname by @oelachqar in https://github.com/oumi-ai/oumi/pull/235
Set model.config.use_cache = False by @xrdaukar in https://github.com/oumi-ai/oumi/pull/233
Experimental training loop for pre-training by @oelachqar in https://github.com/oumi-ai/oumi/pull/230
Disable gradient checkpointing in SkyPilot llama2b config by @xrdaukar in https://github.com/oumi-ai/oumi/pull/236
Implement a client for communicating with Polaris via python. by @taenin in https://github.com/oumi-ai/oumi/pull/234
Add SkyPilot config for experimental/pretokenize/tokenize_dataset.py by @xrdaukar in https://github.com/oumi-ai/oumi/pull/237
Update Fabric.run() calls to use the "warn" flag. by @taenin in https://github.com/oumi-ai/oumi/pull/239
Update pretokenize tool to support input datasets by @xrdaukar in https://github.com/oumi-ai/oumi/pull/238
Add optimizers builder function by @oelachqar in https://github.com/oumi-ai/oumi/pull/240
Add a "put" method in the Polaris client for writing remote files. by @taenin in https://github.com/oumi-ai/oumi/pull/242
Add deepspeed (DS) config to support hierarchical partitioning by @wizeng23 in https://github.com/oumi-ai/oumi/pull/244
Add support for uploading MFU in wandb by @jgreer013 in https://github.com/oumi-ai/oumi/pull/245
Create a Polaris Cluster class consuming the polaris client by @taenin in https://github.com/oumi-ai/oumi/pull/246
Add initial docker image by @oelachqar in https://github.com/oumi-ai/oumi/pull/241
Fix a string in the Polaris Cluster tests. by @taenin in https://github.com/oumi-ai/oumi/pull/249
Set training loop random seeds by @oelachqar in https://github.com/oumi-ai/oumi/pull/248
Fix bug with Polaris multi-node script by @wizeng23 in https://github.com/oumi-ai/oumi/pull/247
Add torchfix listing target by @oelachqar in https://github.com/oumi-ai/oumi/pull/250
Add training state classes by @oelachqar in https://github.com/oumi-ai/oumi/pull/251
Save and restore telemetry state during training by @oelachqar in https://github.com/oumi-ai/oumi/pull/252
Configure file logging by @oelachqar in https://github.com/oumi-ai/oumi/pull/254
Create a Polaris Cloud class consuming the polaris client by @taenin in https://github.com/oumi-ai/oumi/pull/253
Define a registry for cloud builders. by @taenin in https://github.com/oumi-ai/oumi/pull/255
Add logging to tensor board, wandb in custom training loop by @oelachqar in https://github.com/oumi-ai/oumi/pull/256
Add a get_all utility method to the LeMa Registry by @taenin in https://github.com/oumi-ai/oumi/pull/257
Update the BaseCloud up_cluster definition to return a job status. by @taenin in https://github.com/oumi-ai/oumi/pull/258
Create a launcher class for the LeMa Launcher. by @taenin in https://github.com/oumi-ai/oumi/pull/261
Add script to benchmark datasets and data loader params by @oelachqar in https://github.com/oumi-ai/oumi/pull/260
[Follow-up] data loader benchmarking script by @oelachqar in https://github.com/oumi-ai/oumi/pull/262
Create DDP configs for accelerate by @xrdaukar in https://github.com/oumi-ai/oumi/pull/259
Switch from nightly to stable version of SkyPilot by @xrdaukar in https://github.com/oumi-ai/oumi/pull/264
Make all tests green by @xrdaukar in https://github.com/oumi-ai/oumi/pull/265
Set dataloader_pin_memory=True to be intentional by @xrdaukar in https://github.com/oumi-ai/oumi/pull/266
Move torch_profiler_utils from lema.utils to lema.perfomance by @xrdaukar in https://github.com/oumi-ai/oumi/pull/267
Add BaseIterableDataset, refactor DataLoader to use DataPipes by @oelachqar in https://github.com/oumi-ai/oumi/pull/263
Add a dataset_kwargs attribute, tests by @oelachqar in https://github.com/oumi-ai/oumi/pull/268
Use stateful dataloader by @oelachqar in https://github.com/oumi-ai/oumi/pull/269
Update the polaris client / cluster to work e2e by @taenin in https://github.com/oumi-ai/oumi/pull/270
Update package structure for the launcher by @taenin in https://github.com/oumi-ai/oumi/pull/273
[tiny] Register debug datasets by @oelachqar in https://github.com/oumi-ai/oumi/pull/272
Update several of our launcher base fields to use strings instead of ints. by @taenin in https://github.com/oumi-ai/oumi/pull/274
Configure data loader sampling strategy for map-style datasets by @oelachqar in https://github.com/oumi-ai/oumi/pull/271
Ensure we CD into the working DIR before submitting polaris jobs. by @taenin in https://github.com/oumi-ai/oumi/pull/276
Compute the number of dataloader workers per node by @xrdaukar in https://github.com/oumi-ai/oumi/pull/277
Introduce BaseTokenizer alias by @xrdaukar in https://github.com/oumi-ai/oumi/pull/280
Cache getdevicerank_info by @xrdaukar in https://github.com/oumi-ai/oumi/pull/279
Adding initial scripts for running polaris jobs. by @taenin in https://github.com/oumi-ai/oumi/pull/275
Update the polaris client to automatically set execute permissions for copied files. by @taenin in https://github.com/oumi-ai/oumi/pull/286
Deprecate building models data parallel by @oelachqar in https://github.com/oumi-ai/oumi/pull/282
Switch to using safetensors when saving models by @oelachqar in https://github.com/oumi-ai/oumi/pull/281
Add ability to validate configs and params after init by @oelachqar in https://github.com/oumi-ai/oumi/pull/285
Some updates to Polaris launcher script by @xrdaukar in https://github.com/oumi-ai/oumi/pull/287
Upgrade to latest TRL version, remove numpy version condition by @oelachqar in https://github.com/oumi-ai/oumi/pull/283
Add learning rate builder function by @oelachqar in https://github.com/oumi-ai/oumi/pull/284
Remove patchwork as a dep. by @taenin in https://github.com/oumi-ai/oumi/pull/290
Set up initial demo launcher jobs for GCP. by @taenin in https://github.com/oumi-ai/oumi/pull/288
[tiny] cleanup pyproject.toml dependencies by @oelachqar in https://github.com/oumi-ai/oumi/pull/292
Make dataset data backend attribute read-only by @oelachqar in https://github.com/oumi-ai/oumi/pull/291
Optimize Github actions by @oelachqar in https://github.com/oumi-ai/oumi/pull/289
Misc minor changes by @xrdaukar in https://github.com/oumi-ai/oumi/pull/293
[tiny] Update GitHub action cache version by @oelachqar in https://github.com/oumi-ai/oumi/pull/295
Rename 'NodeParams' -> 'JobResources' by @taenin in https://github.com/oumi-ai/oumi/pull/296
Disable compilation for DDP accelerate launch config by @xrdaukar in https://github.com/oumi-ai/oumi/pull/297
Export top level launcher functions and instantiate a default launcher. by @taenin in https://github.com/oumi-ai/oumi/pull/298
Prevent HF version bump by @taenin in https://github.com/oumi-ai/oumi/pull/300
Add dtype/mixed precision configs to Lema trainer by @wizeng23 in https://github.com/oumi-ai/oumi/pull/278
Create a notebook tutorial for running remote training. by @taenin in https://github.com/oumi-ai/oumi/pull/299
Increase the default value of ProfilerParams.row_limit from 20 to 50 by @xrdaukar in https://github.com/oumi-ai/oumi/pull/304
Mini guide on using basic lema functionality by @oelachqar in https://github.com/oumi-ai/oumi/pull/303
Compute MFU based of HF total_flos (alternative way to compute MFU) by @xrdaukar in https://github.com/oumi-ai/oumi/pull/301
Support GPT2 training with Lema trainer by @wizeng23 in https://github.com/oumi-ai/oumi/pull/302
Add a client for running local jobs via the launcher. by @taenin in https://github.com/oumi-ai/oumi/pull/305
Add a local cluster for running local jobs. by @taenin in https://github.com/oumi-ai/oumi/pull/306
Support llama2b with lema trainer by @wizeng23 in https://github.com/oumi-ai/oumi/pull/308
Add a convenience method for listing all registered clouds. by @taenin in https://github.com/oumi-ai/oumi/pull/310
[ALCF] Reverse Polaris GPU order to match CPU/GPU affinities by @xrdaukar in https://github.com/oumi-ai/oumi/pull/307
Create a local cloud for the LeMa launcher. by @taenin in https://github.com/oumi-ai/oumi/pull/309
Remove some leftover occurrences of builtin_ prefix in HF MFU callback by @xrdaukar in https://github.com/oumi-ai/oumi/pull/312
Clean up mixed precision params by @wizeng23 in https://github.com/oumi-ai/oumi/pull/311
Add finetuning tutorial by @oelachqar in https://github.com/oumi-ai/oumi/pull/313
Fix interpolation when loading lema configs. by @taenin in https://github.com/oumi-ai/oumi/pull/314
[bugfix] GPU workers not waiting for global leader to save final checkpoint by @oelachqar in https://github.com/oumi-ai/oumi/pull/315
Add simple benchmark script for distributed operations by @oelachqar in https://github.com/oumi-ai/oumi/pull/316
Add a 'done' field to the LeMa job status object. by @taenin in https://github.com/oumi-ai/oumi/pull/317
Fix a small typo in Lema README by @xrdaukar in https://github.com/oumi-ai/oumi/pull/318
Add pytorch profiler (-p) option to multinode_example_worker.sh script by @xrdaukar in https://github.com/oumi-ai/oumi/pull/319
Create a simpler tutorial for running jobs. by @taenin in https://github.com/oumi-ai/oumi/pull/320
Minor cleanups in Lema training loop by @xrdaukar in https://github.com/oumi-ai/oumi/pull/322
Remove unbalanced call to barrier() in HuggingFaceTrainer.save_model by @xrdaukar in https://github.com/oumi-ai/oumi/pull/323
Create a tutorial for custom clouds. by @taenin in https://github.com/oumi-ai/oumi/pull/321
Add support for logging stdout and stderr for Local runs. by @taenin in https://github.com/oumi-ai/oumi/pull/324
Fix nanoGPT notebook by @wizeng23 in https://github.com/oumi-ai/oumi/pull/325
Add more pytorch profiler instrumentations in Lema training loop by @xrdaukar in https://github.com/oumi-ai/oumi/pull/327
Add training param: dataloader_main_process_only by @xrdaukar in https://github.com/oumi-ai/oumi/pull/326
fix synchronization issues in LEMA training loop by @xrdaukar in https://github.com/oumi-ai/oumi/pull/328
Update LEMA training loop to count tokens on CPU by @xrdaukar in https://github.com/oumi-ai/oumi/pull/330
Update README.md by @taenin in https://github.com/oumi-ai/oumi/pull/331
Add various improvements to Lema trainer by @wizeng23 in https://github.com/oumi-ai/oumi/pull/329
Add PyTorch profiler annotation for each step/micro-step by @xrdaukar in https://github.com/oumi-ai/oumi/pull/333
Enable HfMfuTrainerCallback if supported by @xrdaukar in https://github.com/oumi-ai/oumi/pull/332
Add support for PyTorch profiling schedule by @xrdaukar in https://github.com/oumi-ai/oumi/pull/334
Set up Sphinx-based doc generation for LeMa by @taenin in https://github.com/oumi-ai/oumi/pull/335
Fix dataclass strings to be parsable by our docs generator. by @taenin in https://github.com/oumi-ai/oumi/pull/337
Update ProfilerStepCallback to add microstep profiler annotations by @xrdaukar in https://github.com/oumi-ai/oumi/pull/338
Add include_alternative_mfu_metrics param to control if HF MFU is enabled by @xrdaukar in https://github.com/oumi-ai/oumi/pull/336
Minor doc formatting updates. by @taenin in https://github.com/oumi-ai/oumi/pull/340
Add 8-bit Adam optimizer to Lema trainer by @wizeng23 in https://github.com/oumi-ai/oumi/pull/339
Enable gradient scaling for fp16 mixed-precision training by @wizeng23 in https://github.com/oumi-ai/oumi/pull/342
Add a link to our documentation via the readme. by @taenin in https://github.com/oumi-ai/oumi/pull/344
Disable weight decay for layernorm/biases in Lema trainer by @wizeng23 in https://github.com/oumi-ai/oumi/pull/341
Polaris: Enable NCCL debug logging at WARNING level by @xrdaukar in https://github.com/oumi-ai/oumi/pull/347
Add a new notebook for getting started. by @taenin in https://github.com/oumi-ai/oumi/pull/345
Create TelemetryCallback by @xrdaukar in https://github.com/oumi-ai/oumi/pull/343
Various improvements for our autogenerated docs by @taenin in https://github.com/oumi-ai/oumi/pull/349
Polaris: update sample tail command to use -n200 by @xrdaukar in https://github.com/oumi-ai/oumi/pull/348
Fix a minor bug in TelemetryCallback.on_train_end by @xrdaukar in https://github.com/oumi-ai/oumi/pull/350
Update LEMA training loop to log wandb url by @xrdaukar in https://github.com/oumi-ai/oumi/pull/351
Update model dtype for DeepSpeed to make it work with SkyPilot and Polaris by @xrdaukar in https://github.com/oumi-ai/oumi/pull/352
Enable the launcher via the CLI by @taenin in https://github.com/oumi-ai/oumi/pull/353
Update Polaris init script to print nodelist by @xrdaukar in https://github.com/oumi-ai/oumi/pull/354
Minor logging updates in Polaris scripts by @xrdaukar in https://github.com/oumi-ai/oumi/pull/355
Define ddp1gpu Polaris mode: Spawn 1 torchrun process per GPU (4 torchrun-s per node) by @xrdaukar in https://github.com/oumi-ai/oumi/pull/356
Add a helper util to query GPU temperatures by @xrdaukar in https://github.com/oumi-ai/oumi/pull/359
Add Llama 8B config by @wizeng23 in https://github.com/oumi-ai/oumi/pull/358
Add another bareer() call before train() by @xrdaukar in https://github.com/oumi-ai/oumi/pull/360
Add Llama70B FSDP config by @wizeng23 in https://github.com/oumi-ai/oumi/pull/361
Minor improvements in logging and instrumentations in train.py by @xrdaukar in https://github.com/oumi-ai/oumi/pull/362
Refactor our core directory to logically organize our classes. by @taenin in https://github.com/oumi-ai/oumi/pull/357
Basic plumbing for GPU temperature telemetry by @xrdaukar in https://github.com/oumi-ai/oumi/pull/363
Minor update to Llama70B by @wizeng23 in https://github.com/oumi-ai/oumi/pull/365
Reorder model compilation and DDP/FSDP wrapping by @xrdaukar in https://github.com/oumi-ai/oumi/pull/364
Mini tutorial for Llama3.1-70b inference on Polaris. by @taenin in https://github.com/oumi-ai/oumi/pull/367
jgreer013/vllm-inference by @jgreer013 in https://github.com/oumi-ai/oumi/pull/366
Fix interpolation when using the launcher CLI for various sky configs. by @taenin in https://github.com/oumi-ai/oumi/pull/369
Add Llama8B Lora config for GCP/Polaris by @wizeng23 in https://github.com/oumi-ai/oumi/pull/368
Add vllm parallel inference to improve throughput by @jgreer013 in https://github.com/oumi-ai/oumi/pull/370
Set TOKENIZERS_PARALLELISM: false for llama8b model by @xrdaukar in https://github.com/oumi-ai/oumi/pull/371
Disable MFU computation for PEFT by @xrdaukar in https://github.com/oumi-ai/oumi/pull/372
Add empty_device_cache_steps param and configure it for Llama8b model by @xrdaukar in https://github.com/oumi-ai/oumi/pull/373
Add TelemetryCallback.include_timer_metrics param: False by default by @xrdaukar in https://github.com/oumi-ai/oumi/pull/378
Update llama8b GCP launcher script to allow Spot VMs by @xrdaukar in https://github.com/oumi-ai/oumi/pull/380
Minimal Llama8B LoRA eval config by @xrdaukar in https://github.com/oumi-ai/oumi/pull/376
Add Llama 8b SFT config by @wizeng23 in https://github.com/oumi-ai/oumi/pull/379
Move common NCCL variables initialization into polaris_init.sh by @xrdaukar in https://github.com/oumi-ai/oumi/pull/377
Minor tuning of llama8b configs by @xrdaukar in https://github.com/oumi-ai/oumi/pull/382
Update eval script to use Meta-Llama-3.1-8B-Instruct model version by @xrdaukar in https://github.com/oumi-ai/oumi/pull/381
Initial notebook for llama 8b LoRa tuning. by @taenin in https://github.com/oumi-ai/oumi/pull/374
Update SkyPilot GCP script to download the right model version by @xrdaukar in https://github.com/oumi-ai/oumi/pull/385
Clean up Sky configs by @wizeng23 in https://github.com/oumi-ai/oumi/pull/383
Update main makefile to generate docs by @oelachqar in https://github.com/oumi-ai/oumi/pull/386
Add docs-serve makefile command by @oelachqar in https://github.com/oumi-ai/oumi/pull/387
Fix missing new line at the end of Makefile by @xrdaukar in https://github.com/oumi-ai/oumi/pull/390
Raise NOT_IMPLEMENTED if adapter_model is configured for LM_HARNESS eval by @xrdaukar in https://github.com/oumi-ai/oumi/pull/391
Update Llama8B LoRA eval script to use built-in LEMA evaluator by @xrdaukar in https://github.com/oumi-ai/oumi/pull/389
Add Llama 70b lora config by @wizeng23 in https://github.com/oumi-ai/oumi/pull/388
Enable markdown docs by @oelachqar in https://github.com/oumi-ai/oumi/pull/394
Check ignored docstring rules by @oelachqar in https://github.com/oumi-ai/oumi/pull/395
Remove special case for saving PEFT models by @xrdaukar in https://github.com/oumi-ai/oumi/pull/384
Move shared code into polaris_init by @wizeng23 in https://github.com/oumi-ai/oumi/pull/392
Update Llama notebook to include 8B SFT by @wizeng23 in https://github.com/oumi-ai/oumi/pull/393
Update sample commands to point to the preemptable queue by @taenin in https://github.com/oumi-ai/oumi/pull/396
Update lm_harness to support LoRA adapters by @jgreer013 in https://github.com/oumi-ai/oumi/pull/397
Fix FSDP model initialization by @wizeng23 in https://github.com/oumi-ai/oumi/pull/398
Add vscode launch config for accelerate distributed training by @oelachqar in https://github.com/oumi-ai/oumi/pull/400
Update trainer save model by @oelachqar in https://github.com/oumi-ai/oumi/pull/399
Increase from 2 to 3 nodes for Llama 70B Lora by @wizeng23 in https://github.com/oumi-ai/oumi/pull/402
Add param to customize NCCL timeout by @oelachqar in https://github.com/oumi-ai/oumi/pull/401
Add docs and gpu install targets by @oelachqar in https://github.com/oumi-ai/oumi/pull/403
Significant improvements for the Polaris launcher by @taenin in https://github.com/oumi-ai/oumi/pull/404
Ensure that jobs are queued on existing clusters when users call UP by @taenin in https://github.com/oumi-ai/oumi/pull/406
Autostop sky clusters after 30 min of no activity by @taenin in https://github.com/oumi-ai/oumi/pull/407
Add support for triton kernels from Liger Kernel by @oelachqar in https://github.com/oumi-ai/oumi/pull/405
Add support for including notebooks in the docs by @oelachqar in https://github.com/oumi-ai/oumi/pull/408
Update sphinx comments to docstrings by @oelachqar in https://github.com/oumi-ai/oumi/pull/411
Add missing docstrings to TrainingParams by @oelachqar in https://github.com/oumi-ai/oumi/pull/409
Capped model max length for Llama tuning by @wizeng23 in https://github.com/oumi-ai/oumi/pull/413
Fix a deadlock in the Polaris launcher for users with 500+ jobs. by @taenin in https://github.com/oumi-ai/oumi/pull/412
Script to run inference with Llama/GPT judges. by @kaisopos in https://github.com/oumi-ai/oumi/pull/414
Add missing docstrings to top-level configs by @oelachqar in https://github.com/oumi-ai/oumi/pull/410
[tiny] sphinx conf update by @oelachqar in https://github.com/oumi-ai/oumi/pull/416
Improve launcher polling by running tasks in a subprocess. by @taenin in https://github.com/oumi-ai/oumi/pull/417
Add missing package docstrings by @oelachqar in https://github.com/oumi-ai/oumi/pull/415
[tiny] Enable D104 rule by @oelachqar in https://github.com/oumi-ai/oumi/pull/419
Fix bug with 70B Lora by @wizeng23 in https://github.com/oumi-ai/oumi/pull/421
Update the CLI to look for open SSH tunnels as a way of preserving Polaris state by @taenin in https://github.com/oumi-ai/oumi/pull/418
Update the polaris launcher to always update the lema installation on job creation. by @taenin in https://github.com/oumi-ai/oumi/pull/422
Cleanup doc RSTs by @oelachqar in https://github.com/oumi-ai/oumi/pull/420
Add sphinx api doc template for packages by @oelachqar in https://github.com/oumi-ai/oumi/pull/425
Add automatically generated apidoc RSTs by @oelachqar in https://github.com/oumi-ai/oumi/pull/424
[tiny] Move apidocs into their own folder by @oelachqar in https://github.com/oumi-ai/oumi/pull/426
Add docs-rebuild command to Makefile by @oelachqar in https://github.com/oumi-ai/oumi/pull/427
Refresh markdown docs by @oelachqar in https://github.com/oumi-ai/oumi/pull/429
Reorganize our test structure by @taenin in https://github.com/oumi-ai/oumi/pull/431
Add Llama 70B SFT config by @wizeng23 in https://github.com/oumi-ai/oumi/pull/428
Script to generate judge prompts. by @kaisopos in https://github.com/oumi-ai/oumi/pull/423
[tiny] Breakdown main Readme into multiple docs by @oelachqar in https://github.com/oumi-ai/oumi/pull/430
Update main readme file by @oelachqar in https://github.com/oumi-ai/oumi/pull/432
Add GitHub badges, readme typos by @oelachqar in https://github.com/oumi-ai/oumi/pull/434
Fix markdown lint errors by @oelachqar in https://github.com/oumi-ai/oumi/pull/433
Update documentation index by @oelachqar in https://github.com/oumi-ai/oumi/pull/436
[tiny] Only log to console on global leader by @wizeng23 in https://github.com/oumi-ai/oumi/pull/435
Tune sphinx config by @oelachqar in https://github.com/oumi-ai/oumi/pull/437
Enable Liger for Llama 8B SFT by @wizeng23 in https://github.com/oumi-ai/oumi/pull/439
Updated Parallel Inference job by @jgreer013 in https://github.com/oumi-ai/oumi/pull/438
Add a mkdir to polaris init. by @taenin in https://github.com/oumi-ai/oumi/pull/440
[tiny] Fix lema loop performance gap by @oelachqar in https://github.com/oumi-ai/oumi/pull/441
[tiny] update trainer benchmark script and minor updates by @oelachqar in https://github.com/oumi-ai/oumi/pull/443
Add Llama 8B eval script by @wizeng23 in https://github.com/oumi-ai/oumi/pull/442
Add dataset remote code param by @oelachqar in https://github.com/oumi-ai/oumi/pull/445
[docs] Update format + add missing docs to data_params.py by @oelachqar in https://github.com/oumi-ai/oumi/pull/444
Update Polaris Llama8b eval script to enable data-parallel evals for LM_HARNESS by @xrdaukar in https://github.com/oumi-ai/oumi/pull/446
Copy changes from PR-446 into Polaris launcher config by @xrdaukar in https://github.com/oumi-ai/oumi/pull/448
Copy changes from PR-446 into GCP launcher config by @xrdaukar in https://github.com/oumi-ai/oumi/pull/449
Minor fixes in llama8B eval scripts by @xrdaukar in https://github.com/oumi-ai/oumi/pull/450
Add Llama 70B eval script by @wizeng23 in https://github.com/oumi-ai/oumi/pull/447
[bugfix] add isusingaccelerate_fsdp util by @oelachqar in https://github.com/oumi-ai/oumi/pull/453
[tiny] Fix inference notebook by @wizeng23 in https://github.com/oumi-ai/oumi/pull/451
Simplify record_function annotation in LEMA training loop by @xrdaukar in https://github.com/oumi-ai/oumi/pull/454
[tiny] enable ruff format on save with notebooks by @oelachqar in https://github.com/oumi-ai/oumi/pull/455
[tiny] Add missing default value to hf_trainer by @oelachqar in https://github.com/oumi-ai/oumi/pull/458
Judge inference script for Polaris by @kaisopos in https://github.com/oumi-ai/oumi/pull/452
Add the base classes for inference. Pull out logic from infer to a native text inference engine. by @taenin in https://github.com/oumi-ai/oumi/pull/456
Telemetry improvements for tracking GPU temperature and in general by @xrdaukar in https://github.com/oumi-ai/oumi/pull/457
Add integration tests for native inference (not using the CLI). by @taenin in https://github.com/oumi-ai/oumi/pull/460
Update README.md by @mkoukoumidis in https://github.com/oumi-ai/oumi/pull/462
Update README to make installation steps more prominent by @taenin in https://github.com/oumi-ai/oumi/pull/464
Fix several broken links and update installation instructions by @taenin in https://github.com/oumi-ai/oumi/pull/465
Update inference to pass the generation config to inference engines. by @taenin in https://github.com/oumi-ai/oumi/pull/466
Update README.md by @taenin in https://github.com/oumi-ai/oumi/pull/467
Fixed issue with metadata extraction failure by @jgreer013 in https://github.com/oumi-ai/oumi/pull/469
Add fsdp support to lema loop by @oelachqar in https://github.com/oumi-ai/oumi/pull/463
Combine telemetry from all ranks by @xrdaukar in https://github.com/oumi-ai/oumi/pull/468
Add sample for full fine-tuned and LoRA-tuned model inference using vLLM by @wizeng23 in https://github.com/oumi-ai/oumi/pull/470
Update chattemplatebuilder by @oelachqar in https://github.com/oumi-ai/oumi/pull/472
Removed duplicate task_done call by @jgreer013 in https://github.com/oumi-ai/oumi/pull/473
Add flag to enable experimental torch data pipes processing pipeline by @oelachqar in https://github.com/oumi-ai/oumi/pull/474
Vision-languange datasets & fine-tuning MVP by @oelachqar in https://github.com/oumi-ai/oumi/pull/459
Rebuild docs, add multi-modal tutorial by @oelachqar in https://github.com/oumi-ai/oumi/pull/475
Add test coverage target, update pyproject.toml metadata by @oelachqar in https://github.com/oumi-ai/oumi/pull/476
Create a local inference engine for vLLM by @taenin in https://github.com/oumi-ai/oumi/pull/471
Add llava chat template, QoL improvement to multimodal testing script by @oelachqar in https://github.com/oumi-ai/oumi/pull/478
[Polaris Judge Inference] Adjusting script for Llama 70B quantized by @kaisopos in https://github.com/oumi-ai/oumi/pull/461
Add example for running inference using vLLM on GCP, single-node multi-gpu by @oelachqar in https://github.com/oumi-ai/oumi/pull/479
[tiny] Remove deepspeed from required dependencies by @oelachqar in https://github.com/oumi-ai/oumi/pull/482
Update train path to save meta-info as files under telemetry sub-dir by @xrdaukar in https://github.com/oumi-ai/oumi/pull/480
Add inference engine applychattemplate helper, update example notebook by @oelachqar in https://github.com/oumi-ai/oumi/pull/481
Update arg names for vLLM inference job by @wizeng23 in https://github.com/oumi-ai/oumi/pull/477
Remove device_map for model init from config by @wizeng23 in https://github.com/oumi-ai/oumi/pull/484
Add log_model_summary call back by @xrdaukar in https://github.com/oumi-ai/oumi/pull/485
Small typo fix in the vllm notebook by @taenin in https://github.com/oumi-ai/oumi/pull/483
Cleanup FSDP wrap class auto guesser by @oelachqar in https://github.com/oumi-ai/oumi/pull/486
Add missing documentation for model_params by @oelachqar in https://github.com/oumi-ai/oumi/pull/487
Add callback builder function by @oelachqar in https://github.com/oumi-ai/oumi/pull/490
Minor fixes in DISTRIBUTED_TRAINING.md by @xrdaukar in https://github.com/oumi-ai/oumi/pull/488
Switch to using official UV action with dependency caching by @oelachqar in https://github.com/oumi-ai/oumi/pull/491
Introduce BaseTrainerCallback alias by @xrdaukar in https://github.com/oumi-ai/oumi/pull/492
Add documentation to peft_params by @oelachqar in https://github.com/oumi-ai/oumi/pull/493
Update TelemetryCallback to save final metrics to JSON by @xrdaukar in https://github.com/oumi-ai/oumi/pull/494
Increase the rsync timeout from 40s to 300s by @taenin in https://github.com/oumi-ai/oumi/pull/495
[tiny] fix missing import by @oelachqar in https://github.com/oumi-ai/oumi/pull/497
Rename builddataset -> builddataset_mixture by @oelachqar in https://github.com/oumi-ai/oumi/pull/498
Define a simple callback to detect NaN/INF-s during training by @xrdaukar in https://github.com/oumi-ai/oumi/pull/496
Replace pip install flash-attn with .[gpu] target by @wizeng23 in https://github.com/oumi-ai/oumi/pull/502
Add simpler builder for single dataset use cases by @oelachqar in https://github.com/oumi-ai/oumi/pull/499
Use HF's built-in gradient checkpointing argument by @wizeng23 in https://github.com/oumi-ai/oumi/pull/500
[Draft] Example changes to support 70B single-node inference by @jgreer013 in https://github.com/oumi-ai/oumi/pull/503
Various updates to Llama 2b configs by @wizeng23 in https://github.com/oumi-ai/oumi/pull/489
Add Llama 2B FSDP config by @wizeng23 in https://github.com/oumi-ai/oumi/pull/505
Update TelemetryCallback to write JSON with GPU temperature summary by @xrdaukar in https://github.com/oumi-ai/oumi/pull/501
Rename src/lema to src/oumi by @wizeng23 in https://github.com/oumi-ai/oumi/pull/506
OpenAI Chat Engine - Custom servers by @taenin in https://github.com/oumi-ai/oumi/pull/504
Rename configs/lema to configs/oumi by @wizeng23 in https://github.com/oumi-ai/oumi/pull/507
Rename all relevant lema references in codebase by @wizeng23 in https://github.com/oumi-ai/oumi/pull/508
Re-generate Sphinx docs by @wizeng23 in https://github.com/oumi-ai/oumi/pull/509
Update conf.py by @taenin in https://github.com/oumi-ai/oumi/pull/510
Rename remaining lema references in docs/ by @wizeng23 in https://github.com/oumi-ai/oumi/pull/511
Update final lema references by @wizeng23 in https://github.com/oumi-ai/oumi/pull/512
Update dev setup guide by @wizeng23 in https://github.com/oumi-ai/oumi/pull/513
Update TOTALNUMGPUS compare commands in SkyPilot configs by @xrdaukar in https://github.com/oumi-ai/oumi/pull/514
[Minor] Issues arose by "newcomer" exploration [1/K] by @optas in https://github.com/oumi-ai/oumi/pull/518
Freeze lm-eval and torch versions as a workaround for OPE-390 by @xrdaukar in https://github.com/oumi-ai/oumi/pull/516
1. Write wandb telemetery 2. Reorder training callbacks by @xrdaukar in https://github.com/oumi-ai/oumi/pull/519
Multiple updates to Llama 2B by @wizeng23 in https://github.com/oumi-ai/oumi/pull/515
Rename OUMI to Oumi by @wizeng23 in https://github.com/oumi-ai/oumi/pull/520
Add llama.cpp Inference Engine by @oelachqar in https://github.com/oumi-ai/oumi/pull/524
Rename website references to oumi.ai by @wizeng23 in https://github.com/oumi-ai/oumi/pull/522
Add anthropic inference engine by @oelachqar in https://github.com/oumi-ai/oumi/pull/523
Update name typo by @oelachqar in https://github.com/oumi-ai/oumi/pull/526
Add a batch inference job runnable via the Oumi Launcher by @taenin in https://github.com/oumi-ai/oumi/pull/527
Auto-format pyproject and pre-commit configs by @xrdaukar in https://github.com/oumi-ai/oumi/pull/530
Update Makefile by @taenin in https://github.com/oumi-ai/oumi/pull/529
Fix failing tests after a new install. by @taenin in https://github.com/oumi-ai/oumi/pull/531
Fix a small bug in infer_interactive(): only prints the first character by @xrdaukar in https://github.com/oumi-ai/oumi/pull/532
Boosting User-friendliness by @optas in https://github.com/oumi-ai/oumi/pull/521
[tiny] add override from typing_extentions by @oelachqar in https://github.com/oumi-ai/oumi/pull/534
Create CODEOFCONDUCT.md by @taenin in https://github.com/oumi-ai/oumi/pull/536
Add conversation helper methods by @oelachqar in https://github.com/oumi-ai/oumi/pull/535
[tiny] cleanup multimodal benchmark script by @oelachqar in https://github.com/oumi-ai/oumi/pull/537
Auto-format shell scripts under scripts by @xrdaukar in https://github.com/oumi-ai/oumi/pull/539
Add builder function for data collators by @oelachqar in https://github.com/oumi-ai/oumi/pull/538
Make tokenizer optional by @oelachqar in https://github.com/oumi-ai/oumi/pull/540
Add an optional -t flag to scripts/polaris/jobs/llama2bptworker.sh by @xrdaukar in https://github.com/oumi-ai/oumi/pull/541
Fix initial issues found by shellcheck by @xrdaukar in https://github.com/oumi-ai/oumi/pull/542
[tiny] fix small typo by @oelachqar in https://github.com/oumi-ai/oumi/pull/544
Minor changes in scripts/benchmarks/minimal_multimodal_training.py by @xrdaukar in https://github.com/oumi-ai/oumi/pull/543
[tiny] Add util to get install folder root dir by @oelachqar in https://github.com/oumi-ai/oumi/pull/545
[tiny] Add fp paged_adam optimizer option by @oelachqar in https://github.com/oumi-ai/oumi/pull/547
[tiny] Allow conversation metadata to contain values other than str by @oelachqar in https://github.com/oumi-ai/oumi/pull/546
Switch from Flash Attention 2 to PyTorch SDPA by @wizeng23 in https://github.com/oumi-ai/oumi/pull/533
Use local_rank to query GPU temperature by @xrdaukar in https://github.com/oumi-ai/oumi/pull/550
Fix a bug for handling stopped sky clusters in the oumi launcher. by @taenin in https://github.com/oumi-ai/oumi/pull/549
Remove flash attention validation check by @wizeng23 in https://github.com/oumi-ai/oumi/pull/551
Add support for AWS and Azure jobs in Oumi by @taenin in https://github.com/oumi-ai/oumi/pull/552
Pass split param to datasets.load_dataset() by @xrdaukar in https://github.com/oumi-ai/oumi/pull/553
Implement Judge API MVP by @oelachqar in https://github.com/oumi-ai/oumi/pull/548
Log dataset info: shape, columns, other metainfo by @xrdaukar in https://github.com/oumi-ai/oumi/pull/555
Update experimental pretokenize_dataset tool by @xrdaukar in https://github.com/oumi-ai/oumi/pull/554
Various improvements to Llama eval scripts by @wizeng23 in https://github.com/oumi-ai/oumi/pull/556
Add a couple of gc.collect() calls by @xrdaukar in https://github.com/oumi-ai/oumi/pull/560
[tiny] Fix Makefile setup command by @wizeng23 in https://github.com/oumi-ai/oumi/pull/561
Support datasets generated by dataset.save_to_disk() by @xrdaukar in https://github.com/oumi-ai/oumi/pull/559
Add support for LoRA adapters in vLLM inference engine by @wizeng23 in https://github.com/oumi-ai/oumi/pull/562
Updates in VisionLanguageCollator and in coco_captions by @xrdaukar in https://github.com/oumi-ai/oumi/pull/563
Update DEV_SETUP.md with Windows instructions by @taenin in https://github.com/oumi-ai/oumi/pull/566
Make the remote inference engine runnable in jupyter notebooks. by @taenin in https://github.com/oumi-ai/oumi/pull/565
Configure freezelayer map in `minimalmultimodal_training.py` by @xrdaukar in https://github.com/oumi-ai/oumi/pull/569
Clean up legacy evaluate_oumi code paths by @taenin in https://github.com/oumi-ai/oumi/pull/568
Update model builder to use default_chat_template if available by @xrdaukar in https://github.com/oumi-ai/oumi/pull/571
Add package build and deployment workflow to google artifact registry by @oelachqar in https://github.com/oumi-ai/oumi/pull/570

New Contributors

@oelachqar made their first contribution in https://github.com/oumi-ai/oumi/pull/1
@kaisopos made their first contribution in https://github.com/oumi-ai/oumi/pull/7
@jgreer013 made their first contribution in https://github.com/oumi-ai/oumi/pull/9
@optas made their first contribution in https://github.com/oumi-ai/oumi/pull/10
@xrdaukar made their first contribution in https://github.com/oumi-ai/oumi/pull/12
@wizeng23 made their first contribution in https://github.com/oumi-ai/oumi/pull/30
@taenin made their first contribution in https://github.com/oumi-ai/oumi/pull/95
@mkoukoumidis made their first contribution in https://github.com/oumi-ai/oumi/pull/462

Full Changelog: https://github.com/oumi-ai/oumi/commits/v0.1-alpha

- Python
Published by oelachqar over 1 year ago

Recent Releases of oumi

oumi - v0.4.0

What's Changed

New Contributors

oumi - v0.3.0

Oumi v0.3 Changelog

🔧 Model Quantization (NEW)

Quick start - quantize TinyLlama to 4-bit

With configuration file

⚖️ Judge API V2 (MAJOR UPDATE)

Built-in truthfulness judge

🎯 Adaptive Inference (NEW)

🛠️ Developer Experience

New Contributors

oumi - v0.3.0

Oumi v0.3 Changelog

🔧 Model Quantization (NEW)

Quick start - quantize TinyLlama to 4-bit

With configuration file

⚖️ Judge API V2 (MAJOR UPDATE)

Built-in truthfulness judge

🎯 Adaptive Inference (NEW)

🛠️ Developer Experience

New Contributors

oumi - v0.2.1

What's Changed

New Contributors

oumi - v0.2.0

Highlights

GRPO support for trl and verl trainers

Models built with Oumi: HallOumi and CoALM

New model support: Llama 4, Qwen3, Falcon H1, and more

Support for Slurm and Frontier clusters

What's Changed

oumi - v0.1.14

What's Changed

New Contributors

oumi - v0.1.13

What's Changed

New Contributors

oumi - v0.1.12

What's Changed

New Contributors

oumi - v0.1.11

Oumi v0.1.11 Release Notes 🚀

Key Highlights

Model Support 🤖

Developer Experience 💻

HallOumi Framework 🧠

oumi - v0.1.10

What's Changed

oumi - 0.1.9

What's Changed

oumi - v0.1.8

What's Changed

oumi - v0.1.7

What's Changed

oumi - v0.1.6

What's Changed

oumi - v0.1.5

What's Changed

New Contributors

oumi - v0.1.4

What's Changed

New Contributors

oumi - v0.1.3

What's Changed

oumi - v0.1.2.3.1

What's Changed

oumi - v0.1.2.3

What's Changed

oumi - v0.1.2.2

What's Changed

oumi - v0.1.2.1

What's Changed

oumi - v0.1.2.0-alpha

What's Changed

New Contributors

oumi - v0.1.1.0-alpha.1

What's Changed