Recent Releases of oumi

oumi - v0.4.0

What's Changed

  • Update README.md with v0.3 release by @stefanwebb in https://github.com/oumi-ai/oumi/pull/1904
  • Rename BaseExperimentalDpoDataset to BaseDpoDataset by @oelachqar in https://github.com/oumi-ai/oumi/pull/1903
  • [tiny] Pin exact omegaconf version by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1905
  • Add gpt-oss-120b Together AI inference config by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1907
  • feat: Add basic hugging face model management functionality in CLI by @aniruddh-alt in https://github.com/oumi-ai/oumi/pull/1897
  • [Breaking change] Change default model dtype from fp32 to "auto" by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1910
  • Add inference config for gpt-oss by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1909
  • Update dataset planner to use dependency injection by @jgreer013 in https://github.com/oumi-ai/oumi/pull/1913
  • Modify document ingestion to read bytes by @jgreer013 in https://github.com/oumi-ai/oumi/pull/1917
  • Update dependencies by @oelachqar in https://github.com/oumi-ai/oumi/pull/1908
  • Update dependencies [part 2] by @oelachqar in https://github.com/oumi-ai/oumi/pull/1912
  • Add processor support to AnalyzeConfig for vision-language datasets by @ryan-arman in https://github.com/oumi-ai/oumi/pull/1916
  • Unpin flash-attn version by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1918
  • Add august webinar to the readme. by @taenin in https://github.com/oumi-ai/oumi/pull/1919
  • Add Conversation-Level Analysis by @ryan-arman in https://github.com/oumi-ai/oumi/pull/1914
  • Deprecate experimental ring attention support by @oelachqar in https://github.com/oumi-ai/oumi/pull/1921
  • [tiny] Cleanup redundant specifications of lora_dropout 0.0 by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1926
  • Update README.md to highlight gpt-oss support by @stefanwebb in https://github.com/oumi-ai/oumi/pull/1927
  • Add lora target_parameters field and upgrade peft library by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1925
  • Improve e2e eval tests by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1924
  • Update README.md - updated after Aug webinar by @stefanwebb in https://github.com/oumi-ai/oumi/pull/1930
  • refactor: Build tokenizer inside analysis_utils based on input config by @ryan-arman in https://github.com/oumi-ai/oumi/pull/1928
  • Replace Luma link with YouTube recording for August webinar by @stefanwebb in https://github.com/oumi-ai/oumi/pull/1931
  • Ryan arman add analysis summary by @ryan-arman in https://github.com/oumi-ai/oumi/pull/1929
  • Add KTO support for preference tuning by @efsiatras in https://github.com/oumi-ai/oumi/pull/1538
  • [tiny] Rename mac to macos in config names by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1932
  • GGUF configs, MacOS LlamaCPP configs by @penfever in https://github.com/oumi-ai/oumi/pull/1923
  • Enable gpt-oss training and vLLM inference by @penfever in https://github.com/oumi-ai/oumi/pull/1920
  • Judge CLI | Display Overall Score by @kaisopos in https://github.com/oumi-ai/oumi/pull/1934
  • Replace mentions of deprecated huggingface-cli tool with hf by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1933
  • Add support for custom datasets in Analyze by @ryan-arman in https://github.com/oumi-ai/oumi/pull/1935
  • Judge | Validate Input Dataset by @kaisopos in https://github.com/oumi-ai/oumi/pull/1938
  • Add Pre-Populated GitHub Issue Link On Failures by @rlehman221 in https://github.com/oumi-ai/oumi/pull/1936
  • Adding Verbose Flag by @rlehman221 in https://github.com/oumi-ai/oumi/pull/1940
  • [tiny] Update .gitignore by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1944
  • Upgrade verl to 0.5.0 by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1941
  • Add shell script for launching jobs on NERSC Perlmutter HPC cluster by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1945
  • Update llama_cpp_inference_engine.py by @penfever in https://github.com/oumi-ai/oumi/pull/1946
  • Simplify github issue creation in cli by @rlehman221 in https://github.com/oumi-ai/oumi/pull/1949
  • Add a parsable enum state field to the launcher to convey job status by @taenin in https://github.com/oumi-ai/oumi/pull/1950
  • Penfever/api and macos configs by @penfever in https://github.com/oumi-ai/oumi/pull/1947
  • Penfever/deepspeed by @penfever in https://github.com/oumi-ai/oumi/pull/1886
  • Run GitHub workflows in merge queue instead of on PR events by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1952
  • Dependency Upgrade For V4 by @rlehman221 in https://github.com/oumi-ai/oumi/pull/1942
  • Allow initializing DatasetAnalyzer directly with a dataset object by @ryan-arman in https://github.com/oumi-ai/oumi/pull/1953
  • Follow-up updates to DeepSpeed PR by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1954
  • Add vision dpo support, refactor dpo by @oelachqar in https://github.com/oumi-ai/oumi/pull/1922
  • Enable users to log data samples during training for debugging by @shanghongsim in https://github.com/oumi-ai/oumi/pull/1943
  • [tiny] Remove add_special_tokens kwarg from vision DPO dataset processor by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1957
  • Fix CPU tests by @taenin in https://github.com/oumi-ai/oumi/pull/1958
  • Penfever/llamacpp role fix by @penfever in https://github.com/oumi-ai/oumi/pull/1960
  • Update synthesis with usability feedback by @jgreer013 in https://github.com/oumi-ai/oumi/pull/1939

New Contributors

  • @efsiatras made their first contribution in https://github.com/oumi-ai/oumi/pull/1538
  • @rlehman221 made their first contribution in https://github.com/oumi-ai/oumi/pull/1936

Full Changelog: https://github.com/oumi-ai/oumi/compare/v0.3.0...v0.4.0

- Python
Published by wizeng23 6 months ago

oumi - v0.3.0

Oumi v0.3 Changelog

🔧 Model Quantization (NEW)

Quantization is an important family of methods for reducing model size, for example before deployment. Oumi now supports applying Activation-aware Weight Quantization (AWQ) to all models. See how in our notebook.

Usage Example:

```

# Quick start - quantize TinyLlama to 4-bit
oumi quantize --method awq_q4_0 --model "TinyLlama/TinyLlama-1.1B-Chat-v1.0" --output quantized_model

# With configuration file
oumi quantize --config quantization_config.yaml
```
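To put "reducing model size" in rough numbers, the sketch below estimates weight storage at different bit widths. It is back-of-the-envelope arithmetic, not Oumi code: the helper name `weight_size_gb` is hypothetical, the 1.1B parameter count is read off the TinyLlama model name, and the estimate ignores quantization scales, zero points, and any layers left unquantized, so a real AWQ checkpoint will be somewhat larger.

```python
def weight_size_gb(num_params: float, bits_per_weight: float) -> float:
    """Approximate weight storage in gigabytes (1 GB = 1e9 bytes)."""
    return num_params * bits_per_weight / 8 / 1e9


# Assumption: roughly 1.1 billion parameters, as the model name suggests.
num_params = 1.1e9

fp16_gb = weight_size_gb(num_params, 16)  # 16-bit weights
int4_gb = weight_size_gb(num_params, 4)   # 4-bit weights

print(f"fp16: {fp16_gb:.2f} GB, 4-bit: {int4_gb:.2f} GB")
```

Under these assumptions, going from 16-bit to 4-bit weights cuts weight storage to a quarter.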

⚖️ Judge API V2 (MAJOR UPDATE)

LLM-as-a-Judge is a method for using foundation models to reliably evaluate other foundation models. We’ve overhauled Oumi’s LLM-as-Judge interface for ease-of-use and flexibility. Check out our notebook here.

Usage Example:

```
from oumi.judges.simple_judge import SimpleJudge

# Built-in truthfulness judge
simple_judge = SimpleJudge(judge_config="oumi://configs/projects/judges/generic/truthfulness.yaml")

dataset = [{"request": "What is the capital of France?", "response": "Rome"}]
outputs = simple_judge.judge(dataset)
```

🎯 Adaptive Inference (NEW)

💪 Adaptive Inference, as we term it, refers to new features in Oumi for resuming training (or any task) when a job has crashed, as well as optimizing inference parallelization to maximize bandwidth. Learn more in our notebook.
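As a rough illustration of tuning parallelism at run time, here is a minimal additive-increase/multiplicative-decrease (AIMD) controller. This is a generic sketch of the idea, not Oumi's implementation: the class name `AdaptiveConcurrency`, its parameters, and the choice of AIMD itself are assumptions made for the example.

```python
class AdaptiveConcurrency:
    """Hypothetical AIMD controller: grow slowly on success, shrink fast on failure."""

    def __init__(self, start=4, minimum=1, maximum=64, backoff=0.5):
        self.limit = start        # current number of allowed in-flight requests
        self.minimum = minimum    # never drop below this
        self.maximum = maximum    # never grow beyond this
        self.backoff = backoff    # multiplicative factor applied on failure

    def record(self, success: bool) -> int:
        """Update the limit after one request outcome and return the new limit."""
        if success:
            self.limit = min(self.maximum, self.limit + 1)
        else:
            self.limit = max(self.minimum, int(self.limit * self.backoff))
        return self.limit


controller = AdaptiveConcurrency(start=4)
# Two successes, one failure (for example a rate-limit error), then a success.
history = [controller.record(ok) for ok in (True, True, False, True)]
print(history)
```

The limit creeps up while requests succeed and is halved on a failure, which lets a client probe for the highest throughput a remote endpoint will tolerate.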

🛠️ Developer Experience

  • Updated contributing guidelines
  • Enhanced documentation
  • Tutorial notebook fixes
  • Improved error handling and testing
  • MLflow integration improvements
  • Multi-node verl Slurm job support
  • Rich logging handler option

New Contributors

  • @amarpal made their first contribution in https://github.com/oumi-ai/oumi/pull/1831
  • @42Shawn made their first contribution in https://github.com/oumi-ai/oumi/pull/1837

Full Changelog: https://github.com/oumi-ai/oumi/compare/v0.2.1...v0.3.0

- Python
Published by wizeng23 7 months ago

oumi - v0.2.1

What's Changed

  • Set infer_online and infer_from_file to private by @jgreer013 in https://github.com/oumi-ai/oumi/pull/1745
  • Update launch.md by @shanghongsim in https://github.com/oumi-ai/oumi/pull/1781
  • Add adaptive semaphore to enable future adaptive throughput scenarios by @jgreer013 in https://github.com/oumi-ai/oumi/pull/1780
  • Fix a pyright regression by @taenin in https://github.com/oumi-ai/oumi/pull/1783
  • Judge API V2 | Fix judge config from repo path by @kaisopos in https://github.com/oumi-ai/oumi/pull/1782
  • Add permutable attributes and combination sampling for data synthesis by @jgreer013 in https://github.com/oumi-ai/oumi/pull/1773
  • Removed collator in finetuning tutorial notebook by @shanghongsim in https://github.com/oumi-ai/oumi/pull/1788
  • Update our contributing guidelines. by @taenin in https://github.com/oumi-ai/oumi/pull/1789
  • Add adaptive concurrency controller in preparation for adaptive inference by @jgreer013 in https://github.com/oumi-ai/oumi/pull/1784
  • Fixed issue with final conversations not consistently being saved by @jgreer013 in https://github.com/oumi-ai/oumi/pull/1795
  • Add support for ingesting datasets for synthesis by @jgreer013 in https://github.com/oumi-ai/oumi/pull/1790
  • Add support for adaptive inference by @jgreer013 in https://github.com/oumi-ai/oumi/pull/1791
  • Add support for Example Sources in Synthesis by @jgreer013 in https://github.com/oumi-ai/oumi/pull/1797
  • Webinar announcement and other news by @stefanwebb in https://github.com/oumi-ai/oumi/pull/1800
  • Added utm_source parameters by @stefanwebb in https://github.com/oumi-ai/oumi/pull/1802
  • Add code to handle document ingestion by @jgreer013 in https://github.com/oumi-ai/oumi/pull/1796
  • Add code for handling basic document segmentation by @jgreer013 in https://github.com/oumi-ai/oumi/pull/1803
  • Update mflow support in oumi trainer by @oelachqar in https://github.com/oumi-ai/oumi/pull/1804
  • Add multi-node verl SLURM job by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1798
  • Fixed various tutorial notebooks by @shanghongsim in https://github.com/oumi-ai/oumi/pull/1792
  • Add parameter logging to oumi trainer by @oelachqar in https://github.com/oumi-ai/oumi/pull/1807
  • Judge API V2 | Enable prompt variable replacement by YAML by @kaisopos in https://github.com/oumi-ai/oumi/pull/1805
  • [tiny] Update train config comment header by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1809
  • Add experimental option to use the rich logging handler by @oelachqar in https://github.com/oumi-ai/oumi/pull/1810

New Contributors

  • @shanghongsim made their first contribution in https://github.com/oumi-ai/oumi/pull/1781

Full Changelog: https://github.com/oumi-ai/oumi/compare/v0.2.0...v0.2.1

- Python
Published by kaisopos 7 months ago

oumi - v0.2.0

Highlights

GRPO support for trl and verl trainers

Oumi now supports GRPO training for both the trl and verl libraries! This allows you to run GRPO training with no/low code using Oumi's configs. You can also benefit from other features of the Oumi platform, such as custom evaluation and launching remote jobs.

Running GRPO training in Oumi is as simple as:

  1. Create a reward function, and register it to Oumi's reward function registry using @register("<my_reward_fn>", RegistryType.REWARD_FUNCTION).
  2. Create a dataset class to process your HF dataset into the format needed for your target framework, and register it to Oumi's dataset registry using @register_dataset("@hf-org-name/my-dataset-name").
  3. Create an Oumi training config with your model, dataset, reward function, and hyperparameters. For specific details on setting up the config for GRPO, see our documentation.
  4. Launch the training job locally using the oumi train CLI, or launch a remote job using the oumi launch CLI.
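Step 1 can be sketched as follows. This is a toy example rather than code from the Oumi repository: the function name `count_letters_reward` and the `target_counts` column are hypothetical, and the signature assumes the keyword-argument convention used by trl's GRPO trainer, where each reward function receives the batch of completions and returns one float per completion. The registration decorator is left as a comment so the snippet runs without Oumi installed.

```python
import re

# In Oumi this function would carry the decorator from step 1, for example:
# @register("count_letters", RegistryType.REWARD_FUNCTION)
def count_letters_reward(completions, target_counts, **kwargs):
    """Return one reward per completion.

    1.0 if the last integer in the completion equals the target,
    0.0 if it is a different number, -1.0 if no number is present.
    """
    rewards = []
    for completion, target in zip(completions, target_counts):
        numbers = re.findall(r"\d+", completion)
        if not numbers:
            rewards.append(-1.0)
        elif int(numbers[-1]) == target:
            rewards.append(1.0)
        else:
            rewards.append(0.0)
    return rewards


rewards = count_letters_reward(
    completions=["There are 3 r's, so the answer is 3", "I think 2", "not sure"],
    target_counts=[3, 3, 3],
)
print(rewards)
```

Penalizing completions that contain no number at all gives the policy a signal to produce a parseable answer before it learns to produce a correct one.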

For an end-to-end example using Oumi + trl, check out our notebook walkthrough. For verl, check out our multi-modal Geometry3K config. Finally, check out our blog post for more information.

Models built with Oumi: HallOumi and CoALM

We’re proud to announce the release of two models built with Oumi: HallOumi and CoALM! Both of these were trained on Oumi, and we provide recipes to reproduce their training from scratch.

  • 🧀 HallOumi: A truly open-source claim verification (hallucination detection) model developed by Oumi, outperforming Claude Sonnet, OpenAI o1, DeepSeek R1, Llama 405B, and Gemini Pro at only 8B parameters. Check out the Oumi recipe to train the model here.
  • 🤖 CoALM: Conversational Agentic Language Model (CoALM) is a unified approach that integrates both conversational and agentic capabilities. It includes an instruction tuning dataset and three trained models (8B, 70B, 405B). The project was a partnership between the ConvAI Lab at UIUC and Oumi, and the paper was accepted to ACL. Check out the Oumi recipes to train the models here.

New model support: Llama 4, Qwen3, Falcon H1, and more

We’ve added support for many recent models to Oumi, with tested recipes that work out-of-the-box!

Support for Slurm and Frontier clusters

At Oumi, we want to unify and simplify the processes for running jobs on remote clusters. We have now added support for launching jobs on Slurm clusters, and on Frontier, a supercomputer at the Oak Ridge Leadership Computing Facility.

What's Changed

  • [bugfix] Allow prerelease when building docker image by @oelachqar in https://github.com/oumi-ai/oumi/pull/1753
  • Update link to Oumi banner image in README by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1752
  • docs: add a badge and link to the social network Twitter by @Radovenchyk in https://github.com/oumi-ai/oumi/pull/1751
  • Support OLCF (Oak Ridge Leadership Computing Facility) Frontier HPC cluster in Oumi launcher by @nikg4 in https://github.com/oumi-ai/oumi/pull/1721
  • Judge API V2 | Core Functionality by @kaisopos in https://github.com/oumi-ai/oumi/pull/1717
  • Update oumi distributed torchrun to fallback to oumi train -c cfg.yaml .... on a single-node with 1 GPU by @nikg4 in https://github.com/oumi-ai/oumi/pull/1755
  • deps: Upgrade verl to 0.4.0 by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1749
  • add DCVLR logo to readme by @penfever in https://github.com/oumi-ai/oumi/pull/1754
  • Judge API V2 | Few-Shots by @kaisopos in https://github.com/oumi-ai/oumi/pull/1746
  • Update infer.md to fix a broken link by @ryan-arman in https://github.com/oumi-ai/oumi/pull/1756
  • Judge API V2 | minor nit by @kaisopos in https://github.com/oumi-ai/oumi/pull/1757
  • [Evaluation] Disabling flaky MMMU test by @kaisopos in https://github.com/oumi-ai/oumi/pull/1758
  • Automatically tail SkyPilot logs by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1761
  • Enable vLLM for trl GRPO jobs by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1760
  • Judge API V2 | Implement CLI by @kaisopos in https://github.com/oumi-ai/oumi/pull/1759
  • Updates to Oumi news for May, June by @stefanwebb in https://github.com/oumi-ai/oumi/pull/1763
  • Additional news items by @stefanwebb in https://github.com/oumi-ai/oumi/pull/1764
  • Judge API V2 | Support for built-in judges by @kaisopos in https://github.com/oumi-ai/oumi/pull/1762
  • [bug] safetensors v0.6.0rc0 is causing a regression, prevent upgrading by @oelachqar in https://github.com/oumi-ai/oumi/pull/1772
  • [verl] Support resuming from checkpoint by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1766
  • Upgrade accelerate and peft by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1774
  • [tiny] Pin flash-attn version by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1775
  • Pin the version of lm_eval to prevent a breaking change in the 4.9 release by @taenin in https://github.com/oumi-ai/oumi/pull/1777
  • Update inference to resume from temporary result file when possible by @jgreer013 in https://github.com/oumi-ai/oumi/pull/1734
  • [tiny] Fix gradient checkpointing for Oumi trainer by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1778
  • [tiny] Remove use_liger argument by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1779
  • Judge API V2 | Merge Judge and Inference configs by @kaisopos in https://github.com/oumi-ai/oumi/pull/1776

Full Changelog: https://github.com/oumi-ai/oumi/compare/v0.1.14...v0.2.0

- Python
Published by wizeng23 8 months ago

oumi - v0.1.14

What's Changed

  • Record latency histograms in base inference engine by @nikg4 in https://github.com/oumi-ai/oumi/pull/1702
  • Feat: add falcon-e integration by @younesbelkada in https://github.com/oumi-ai/oumi/pull/1705
  • [tiny] Minor update to fix the failing pre-commit checks by @oelachqar in https://github.com/oumi-ai/oumi/pull/1707
  • Add collator kwargs field to DataParams by @oelachqar in https://github.com/oumi-ai/oumi/pull/1708
  • [vision] Add option to process images individually by @oelachqar in https://github.com/oumi-ai/oumi/pull/1706
  • Update dev_setup.md to correct the order of steps by @ryan-arman in https://github.com/oumi-ai/oumi/pull/1709
  • Add configs for molmo support by @oelachqar in https://github.com/oumi-ai/oumi/pull/1710
  • [tiny] fix pre-commits checks on a fresh install by @oelachqar in https://github.com/oumi-ai/oumi/pull/1711
  • Add config for the Molmo O variant by @oelachqar in https://github.com/oumi-ai/oumi/pull/1712
  • Add experimental molmo grpo config and train aliases by @oelachqar in https://github.com/oumi-ai/oumi/pull/1713
  • Update installation.md to fix subversion handling by adding required … by @ryan-arman in https://github.com/oumi-ai/oumi/pull/1715
  • Frontier: Fix -n param in launcher script by @nikg4 in https://github.com/oumi-ai/oumi/pull/1720
  • Fix Falcon H1 dependency setup by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1723
  • letter count notebook improvements by @penfever in https://github.com/oumi-ai/oumi/pull/1697
  • [vision] Update vision feature generator to support training on completions only by @oelachqar in https://github.com/oumi-ai/oumi/pull/1722
  • [tiny] fix bug with vl collator by @oelachqar in https://github.com/oumi-ai/oumi/pull/1725
  • Add data synthesis config, params, and unit tests by @jgreer013 in https://github.com/oumi-ai/oumi/pull/1700
  • Add support for additional exception types for remote inference engine, as well as fast failing for non-retryable status codes. by @jgreer013 in https://github.com/oumi-ai/oumi/pull/1704
  • Adds DPO + QLoRA example for Falcon-H1 by @stefanwebb in https://github.com/oumi-ai/oumi/pull/1719
  • Update inference to always write intermediate results to file. by @jgreer013 in https://github.com/oumi-ai/oumi/pull/1724
  • Added doc for new QLoRA param by @stefanwebb in https://github.com/oumi-ai/oumi/pull/1727
  • Readme for Falcon-E and note on extra dependencies required by @stefanwebb in https://github.com/oumi-ai/oumi/pull/1729
  • Add generic vision dataset by @oelachqar in https://github.com/oumi-ai/oumi/pull/1726
  • [tiny][bug] make git cmd optional by @oelachqar in https://github.com/oumi-ai/oumi/pull/1730
  • [tiny][bug] Add missing molmo feature by @oelachqar in https://github.com/oumi-ai/oumi/pull/1731
  • [tiny] Update phi3-vision configs to use oumi trainer by @oelachqar in https://github.com/oumi-ai/oumi/pull/1733
  • Minor bugfixes for 2 clouds in launcher code by @nikg4 in https://github.com/oumi-ai/oumi/pull/1728
  • Update dev_setup.md to add additional instructions by @ryan-arman in https://github.com/oumi-ai/oumi/pull/1736
  • Update trl to 0.18 by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1693
  • Update Verl trainer to export models in HF format by @nikg4 in https://github.com/oumi-ai/oumi/pull/1714
  • Add lmms-lab/multimodal-open-r1-8k-verified dataset by @oelachqar in https://github.com/oumi-ai/oumi/pull/1732
  • Add placeholders for DCVLR by @oelachqar in https://github.com/oumi-ai/oumi/pull/1738
  • add debug logging capabilities to collators by @aniruddh-alt in https://github.com/oumi-ai/oumi/pull/1678
  • [bug] update trainer to save processor when training with fsdp by @oelachqar in https://github.com/oumi-ai/oumi/pull/1742
  • Add model revision param by @oelachqar in https://github.com/oumi-ai/oumi/pull/1740
  • Add ability to customize HF model config via model.model_kwargs by @oelachqar in https://github.com/oumi-ai/oumi/pull/1741
  • Add docker release workflow by @oelachqar in https://github.com/oumi-ai/oumi/pull/1743
  • [bug] fix rank/local rank parsing for docker env by @oelachqar in https://github.com/oumi-ai/oumi/pull/1747
  • deps: Update vLLM to 0.8.3 by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1739
  • [docs] update dcvlr readme by @oelachqar in https://github.com/oumi-ai/oumi/pull/1748
  • Dcvlr by @penfever in https://github.com/oumi-ai/oumi/pull/1750

New Contributors

  • @younesbelkada made their first contribution in https://github.com/oumi-ai/oumi/pull/1705
  • @ryan-arman made their first contribution in https://github.com/oumi-ai/oumi/pull/1709
  • @stefanwebb made their first contribution in https://github.com/oumi-ai/oumi/pull/1719
  • @aniruddh-alt made their first contribution in https://github.com/oumi-ai/oumi/pull/1678

Full Changelog: https://github.com/oumi-ai/oumi/compare/v0.1.13...v0.1.14

- Python
Published by wizeng23 9 months ago

oumi - v0.1.13

What's Changed

  • Update dev_setup.md by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1641
  • [tiny] Remove vllm install commands by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1643
  • Support for custom processor args: misc improvements by @nikg4 in https://github.com/oumi-ai/oumi/pull/1642
  • Add Countdown dataset and reward function by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1645
  • Adding LoRA train config for Qwen-VL 2.0 by @optas in https://github.com/oumi-ai/oumi/pull/1637
  • [Evaluation] Convenience function for standard config retrieval by @kaisopos in https://github.com/oumi-ai/oumi/pull/1644
  • Add demo script by @oelachqar in https://github.com/oumi-ai/oumi/pull/1647
  • [bug] fix build errors by @oelachqar in https://github.com/oumi-ai/oumi/pull/1649
  • Adding LoRA train config for SmolVLM by @optas in https://github.com/oumi-ai/oumi/pull/1639
  • [tiny] Update cli help shorthand by @oelachqar in https://github.com/oumi-ai/oumi/pull/1648
  • Oelachqar/update hooks by @oelachqar in https://github.com/oumi-ai/oumi/pull/1650
  • Add verl PPO trainer by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1646
  • Fix a missing dependency in the verl trainer. by @taenin in https://github.com/oumi-ai/oumi/pull/1651
  • Integrate verl GRPO trainer into train script by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1652
  • Update e2e tests to run on lambda by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1653
  • Add Qwen3 32B configs by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1661
  • Add Qwen3 30B A3B configs by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1665
  • [verl] Populate verl config from Oumi config by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1659
  • Provide option to configure label_ignore_index in training config by @nikg4 in https://github.com/oumi-ai/oumi/pull/1666
  • [Documentation] Custom Evaluations (PR 1-of-2) by @kaisopos in https://github.com/oumi-ai/oumi/pull/1664
  • InterVL-3.0 SFT with limited training capabilities by @optas in https://github.com/oumi-ai/oumi/pull/1663
  • Add verl GRPO Countdown configs by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1668
  • Set explicit permissions for our test workflows. by @taenin in https://github.com/oumi-ai/oumi/pull/1670
  • Add support for repetition_penalty in GrpoParams by @REDDITARUN in https://github.com/oumi-ai/oumi/pull/1654
  • Fix broken tests due to precommit violations by @taenin in https://github.com/oumi-ai/oumi/pull/1671
  • [Documentation] Custom Evaluations (PR 2-of-2) by @kaisopos in https://github.com/oumi-ai/oumi/pull/1669
  • Migrate to logger.warning usage by @emmanuel-ferdman in https://github.com/oumi-ai/oumi/pull/1673
  • Update the Oumi launcher and e2e tests to support runpod. by @taenin in https://github.com/oumi-ai/oumi/pull/1672
  • Switch back to using GCP for e2e tests. by @taenin in https://github.com/oumi-ai/oumi/pull/1675
  • Mark an e2e test as is_lora by @taenin in https://github.com/oumi-ai/oumi/pull/1676
  • Add Phi4 reasoning plus configs by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1674
  • Fix a test breakage caused by a new Click version (8.2.0) by @taenin in https://github.com/oumi-ai/oumi/pull/1679
  • chore: edited the link to the stars badge by @Radovenchyk in https://github.com/oumi-ai/oumi/pull/1681
  • Update verl GRPO countdown configs by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1682
  • [very nit] center oumi logo in the cli by @oelachqar in https://github.com/oumi-ai/oumi/pull/1683
  • [tiny] Update training environments doc by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1686
  • Add Geometry3K VLM dataset by @nikg4 in https://github.com/oumi-ai/oumi/pull/1687
  • Add torchao version to pyproject.toml by @nikg4 in https://github.com/oumi-ai/oumi/pull/1688
  • [Feature] Save evaluation config as YAML in output_dir #1546 by @asish-kun in https://github.com/oumi-ai/oumi/pull/1680
  • Create a script to calculate memory used during training by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1441
  • Support VLM-s with VERL_GRPO trainer by @nikg4 in https://github.com/oumi-ai/oumi/pull/1689
  • docs: Add GRPO/verl documentation by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1690
  • Update GRPO letter counting reward function and hparams for stability by @jgreer013 in https://github.com/oumi-ai/oumi/pull/1692
  • [GRPO] Update letter counting notebook by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1694
  • Add Lambda Inference Engine by @oelachqar in https://github.com/oumi-ai/oumi/pull/1695
  • Basic shell script for launching jobs on OLCF Frontier HPC cluster by @nikg4 in https://github.com/oumi-ai/oumi/pull/1691
  • Add CoALM dataset class by @oelachqar in https://github.com/oumi-ai/oumi/pull/1696
  • Added exponential backoff and content-type error handling in remote inference engine by @abhiramvad in https://github.com/oumi-ai/oumi/pull/1685
  • Make SFT datasets usable with GRPO_TRL trainer by @nikg4 in https://github.com/oumi-ai/oumi/pull/1698
  • Implement Falcon H1 by @dhiaEddineRhaiem in https://github.com/oumi-ai/oumi/pull/1699
  • [tiny] Remove deprecated use_async_dataset from configs by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1701
  • Add sample inference configs for HuggingFaceTB/SmolVLM-Instruct by @nikg4 in https://github.com/oumi-ai/oumi/pull/1703

New Contributors

  • @REDDITARUN made their first contribution in https://github.com/oumi-ai/oumi/pull/1654
  • @emmanuel-ferdman made their first contribution in https://github.com/oumi-ai/oumi/pull/1673
  • @Radovenchyk made their first contribution in https://github.com/oumi-ai/oumi/pull/1681
  • @asish-kun made their first contribution in https://github.com/oumi-ai/oumi/pull/1680
  • @abhiramvad made their first contribution in https://github.com/oumi-ai/oumi/pull/1685
  • @dhiaEddineRhaiem made their first contribution in https://github.com/oumi-ai/oumi/pull/1699

Full Changelog: https://github.com/oumi-ai/oumi/compare/v0.1.12...v0.1.13

- Python
Published by nikg4 9 months ago

oumi - v0.1.12

What's Changed

  • Add vllm to gpu optional dependencies by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1614
  • [HallOumi] Update inference notebook by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1613
  • Update llama4 GCP jobs for non-dev environments. by @taenin in https://github.com/oumi-ai/oumi/pull/1621
  • Update transformers to 4.51.0 by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1620
  • Lazy load skypilot by @taenin in https://github.com/oumi-ai/oumi/pull/1622
  • Add additional_model_kwargs and additional_trainer_kwargs to train function by @hommayushi3 in https://github.com/oumi-ai/oumi/pull/1624
  • Added 3 Pixmo vision-language datasets by @jrwana in https://github.com/oumi-ai/oumi/pull/1523
  • [GRPO] Add notebook to demonstrate GRPO & evaluation for letter counting by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1625
  • [Remote Inference] Update Default Params by @kaisopos in https://github.com/oumi-ai/oumi/pull/1630
  • Update trl to 0.16 by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1631
  • Support custom processor args in ModelParams by @nikg4 in https://github.com/oumi-ai/oumi/pull/1634
  • Support BerryBench evaluation by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1635
  • [Remote Inference] Error checking for api_key by @kaisopos in https://github.com/oumi-ai/oumi/pull/1638
  • Rename cnn_mnist_example to cnn_mnist_tutorial by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1640
  • [Remote Inference][GCP] Constructing api_url from the Project ID and Region by @kaisopos in https://github.com/oumi-ai/oumi/pull/1636

New Contributors

  • @jrwana made their first contribution in https://github.com/oumi-ai/oumi/pull/1523

Full Changelog: https://github.com/oumi-ai/oumi/compare/v0.1.11...v0.1.12

- Python
Published by jgreer013 10 months ago

oumi - v0.1.11

Oumi v0.1.11 Release Notes 🚀

Key Highlights

Model Support 🤖

  • Integrated Llama 4 (Scout and Maverick variants) with complete workflow configs 🦙
  • Added LoRA training for Phi3, Phi4, and Qwen2.5-VL multimodal models 🖼️

Developer Experience 💻

  • Introduced MLflow integration for experiment tracking 📝
  • Enhanced CLI with convenient alias functionality ⌨️

HallOumi Framework 🧠

  • Added examples for HallOumi
  • Added dedicated inference notebooks for both generative and classifier approaches 📓

Welcome to our new contributors @hommayushi3 and @gabrielaugz! 👋

For details, see the full changelog.

- Python
Published by oelachqar 11 months ago

oumi - v0.1.10

What's Changed

  • Increment pillow version for compatibility with Python 3.13 by @nikg4 in https://github.com/oumi-ai/oumi/pull/1566
  • [Evaluation] Bug: Multiple GPUs attempt to save in the same folder by @kaisopos in https://github.com/oumi-ai/oumi/pull/1567

Full Changelog: https://github.com/oumi-ai/oumi/compare/v0.1.9...v0.1.10

- Python
Published by nikg4 11 months ago

oumi - 0.1.9

What's Changed

  • Add QwQ full fine-tune and QLoRA configs by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1518
  • Update TRL to 0.15 and fix Liger/dataset code by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1507
  • [tiny] Remove vLLM Colab link and fix Alpaca Eval quickstart by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1530
  • Evaluation: Inference optimizations by @kaisopos in https://github.com/oumi-ai/oumi/pull/1522
  • Qwen2.5 VL: Replace "from source" install with transformers>=0.49 by @nikg4 in https://github.com/oumi-ai/oumi/pull/1528
  • [Evaluation] Renaming evaluation_platform → evaluation_backend by @kaisopos in https://github.com/oumi-ai/oumi/pull/1526
  • [tiny] Clean up datasets code by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1529
  • Minor logging improvements in BaseMapDataset by @nikg4 in https://github.com/oumi-ai/oumi/pull/1532
  • Upload scripts used in a Weekly Walkthrough by @taenin in https://github.com/oumi-ai/oumi/pull/1533
  • Update VisionLanguageConversationFeatureGenerator by @nikg4 in https://github.com/oumi-ai/oumi/pull/1531
  • [docs] add security.md by @oelachqar in https://github.com/oumi-ai/oumi/pull/1534
  • [Evaluation] Custom evaluation notebook: a reliability classifier by @kaisopos in https://github.com/oumi-ai/oumi/pull/1535
  • Multimodal: Limit max number of images per Conversation by @nikg4 in https://github.com/oumi-ai/oumi/pull/1536
  • Auto-populate and validate params specific to vision_language_sft collator in TrainingConfig by @nikg4 in https://github.com/oumi-ai/oumi/pull/1537
  • Update Oumi Env to use Rich formatting by @taenin in https://github.com/oumi-ai/oumi/pull/1541
  • Update oumi launch to use Rich formatting by @taenin in https://github.com/oumi-ai/oumi/pull/1543
  • Update oumi evaluate to use rich formatting. by @taenin in https://github.com/oumi-ai/oumi/pull/1544
  • Update the CLI to replace all prints with Rich prints. by @taenin in https://github.com/oumi-ai/oumi/pull/1547
  • Render the oumi env command as a shell block in bug reports. by @taenin in https://github.com/oumi-ai/oumi/pull/1548
  • Define Conversation proto bufs by @nikg4 in https://github.com/oumi-ai/oumi/pull/1550
  • [Evaluation] Modifying Alpaca Eval results format to be consistent with LM Harness by @kaisopos in https://github.com/oumi-ai/oumi/pull/1551
  • Augmenting logging training/model statistics by @optas in https://github.com/oumi-ai/oumi/pull/1545
  • Misc no-op code cleanups by @nikg4 in https://github.com/oumi-ai/oumi/pull/1553
  • Add code used for the evaluation demo. by @taenin in https://github.com/oumi-ai/oumi/pull/1556
  • Add OUMI_FORCE_EDITABLE_INSTALL env var to do editable Oumi install from source in job configs by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1420
  • Add letter counting GRPO example by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1539
  • Remove UV install from notebooks as this breaks colab by @taenin in https://github.com/oumi-ai/oumi/pull/1558
  • [Evaluation] Updates in hallucination notebook by @kaisopos in https://github.com/oumi-ai/oumi/pull/1552
  • [Evaluations] Custom evals: Adding support for eval_kwargs by @kaisopos in https://github.com/oumi-ai/oumi/pull/1557
  • Logging message update in log_number_of_model_parameters by @nikg4 in https://github.com/oumi-ai/oumi/pull/1560
  • [Evaluation][Custom] Removing restrictions and better error checking by @kaisopos in https://github.com/oumi-ai/oumi/pull/1561
  • Support text truncation (max_length) for vision_language_sft collator by @nikg4 in https://github.com/oumi-ai/oumi/pull/1559
  • phi 4 multimodal training version 1 ( with limitations ) by @optas in https://github.com/oumi-ai/oumi/pull/1555
  • Phi-4 basic inference with native/vllm by @optas in https://github.com/oumi-ai/oumi/pull/1563
  • [minor] phi4 train improvements by @optas in https://github.com/oumi-ai/oumi/pull/1564
  • Fix printing errors in oumi env for non-string values. by @taenin in https://github.com/oumi-ai/oumi/pull/1565

Full Changelog: https://github.com/oumi-ai/oumi/compare/v0.1.8...v0.1.9

- Python
Published by nikg4 11 months ago

oumi - v0.1.8

What's Changed

  • GRPO trainer: Minimal initial integration by @nikg4 in https://github.com/oumi-ai/oumi/pull/1482
  • Update oumi infer to fall back to interactive mode if no input path is specified. by @taenin in https://github.com/oumi-ai/oumi/pull/1483
  • Add sample DDP/GCP config for GRPO trainer by @nikg4 in https://github.com/oumi-ai/oumi/pull/1485
  • Temporary fix for chat template issue with multimodal inference w/ in-process vLLM engine by @nikg4 in https://github.com/oumi-ai/oumi/pull/1486
  • [tiny] Update async_eval.yaml comments to reference correct class by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1488
  • Fix a bug where overriding remote_params fails via the CLI (oumi infer) by @taenin in https://github.com/oumi-ai/oumi/pull/1487
  • Define GrpoParams under configs by @nikg4 in https://github.com/oumi-ai/oumi/pull/1490
  • Support more GRPO params by @nikg4 in https://github.com/oumi-ai/oumi/pull/1491
  • Minor updates to oumi env by @nikg4 in https://github.com/oumi-ai/oumi/pull/1492
  • Warn instead of error when device not found for MFU calculation by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1489
  • Updated all CLI endpoints to support oumi:// prefix by @Spaarsh in https://github.com/oumi-ai/oumi/pull/1468
  • Fix chat template issue for nested content parts used for VLMs by @nikg4 in https://github.com/oumi-ai/oumi/pull/1493
  • Ctseng777/judge by @ctseng777 in https://github.com/oumi-ai/oumi/pull/1474
  • [Evaluation] Modularization & enabling custom evaluations by @kaisopos in https://github.com/oumi-ai/oumi/pull/1484
  • Update documentation formatting for BaseModel by @taenin in https://github.com/oumi-ai/oumi/pull/1494
  • Fix log_samples not propagating from eval_kwargs by @jgreer013 in https://github.com/oumi-ai/oumi/pull/1496
  • [Evaluation] Adding support for logging model samples for all backends by @kaisopos in https://github.com/oumi-ai/oumi/pull/1499
  • Support for deprecated input param (evaluation_platform) by @kaisopos in https://github.com/oumi-ai/oumi/pull/1500
  • Limiting the AlpacaEval number of samples for quickstart by @kaisopos in https://github.com/oumi-ai/oumi/pull/1501
  • Add recurring tests to keep our test badges updated. by @taenin in https://github.com/oumi-ai/oumi/pull/1498
  • Add a schedule for our GPU, CPU, and doc tests by @taenin in https://github.com/oumi-ai/oumi/pull/1503
  • Update the GPU Tests badge to use results from main by @taenin in https://github.com/oumi-ai/oumi/pull/1504
  • vLLM version increment by @nikg4 in https://github.com/oumi-ai/oumi/pull/1502
  • Minor logging improvements by @nikg4 in https://github.com/oumi-ai/oumi/pull/1505
  • [Evaluation] Save Utils: Moving, fixes, and unit tests by @kaisopos in https://github.com/oumi-ai/oumi/pull/1506
  • Update sample GRPO script to validate num_generations by @nikg4 in https://github.com/oumi-ai/oumi/pull/1509
  • Resolve warning about --dispatch batches deprecated param by @nikg4 in https://github.com/oumi-ai/oumi/pull/1510
  • [Evaluation] Re-enabling evaluations with Math Hard (leaderboard_math_hard) by @kaisopos in https://github.com/oumi-ai/oumi/pull/1511
  • Update docker image and build script by @oelachqar in https://github.com/oumi-ai/oumi/pull/1508
  • Add Qwen QwQ Lora config by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1514
  • Add QwQ eval/infer configs by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1515
  • [Evaluation] Instantiating an inference engine (if needed) when running custom evaluations by @kaisopos in https://github.com/oumi-ai/oumi/pull/1513
  • Switch eval yaml configs to use evaluation_platform by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1516
  • Mark BaseMapDataset as typing.Sized by @nikg4 in https://github.com/oumi-ai/oumi/pull/1517
  • VLM collator refactor by @nikg4 in https://github.com/oumi-ai/oumi/pull/1512

Full Changelog: https://github.com/oumi-ai/oumi/compare/v0.1.7...v0.1.8

- Python
Published by kaisopos 12 months ago

oumi - v0.1.7

What's Changed

  • Update the RemoteInferenceEngine to appropriately handle openai format batch prediction endpoints. by @taenin in https://github.com/oumi-ai/oumi/pull/1472
  • Fix local models to not break the registry. by @taenin in https://github.com/oumi-ai/oumi/pull/1476
  • Create an inference config for Claude Sonnet 3.7 by @taenin in https://github.com/oumi-ai/oumi/pull/1479
  • Add notebook for fine-tuning MiniMath-R1-1.5B by @jgreer013 in https://github.com/oumi-ai/oumi/pull/1480
  • [Evaluation] Migrate LM Harness integration point from simple_evaluate to evaluate by @kaisopos in https://github.com/oumi-ai/oumi/pull/1455
  • [tiny]Update trl to 0.14 by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1478

Full Changelog: https://github.com/oumi-ai/oumi/compare/v0.1.6...v0.1.7

- Python
Published by taenin 12 months ago

oumi - v0.1.6

What's Changed

  • Update RemoteParams to no longer require an API URL. by @taenin in https://github.com/oumi-ai/oumi/pull/1452
  • [Tiny] Update default training params for Qwen2-VL-2B-Instruct by @optas in https://github.com/oumi-ai/oumi/pull/1454
  • [Tiny] Add more warnings for "special" requirements of Qwen2.5-VL by @optas in https://github.com/oumi-ai/oumi/pull/1453
  • Minor cleanup of oumi fetch by @taenin in https://github.com/oumi-ai/oumi/pull/1463
  • Support for multi-image VLM training by @nikg4 in https://github.com/oumi-ai/oumi/pull/1448
  • Remove a temp workaround in pad_sequences on the left side by @nikg4 in https://github.com/oumi-ai/oumi/pull/1464
  • [tiny] Add warning that Oumi doesn't support Intel Macs by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1467
  • VLM-related logging improvements by @nikg4 in https://github.com/oumi-ai/oumi/pull/1469
  • Fix Oumi launcher to be able to run on RunPod and Lambda by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1470
  • Enable pre-release install for uv in pyproject.toml by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1466

Full Changelog: https://github.com/oumi-ai/oumi/compare/v0.1.5...v0.1.6

- Python
Published by wizeng23 12 months ago

oumi - v0.1.5

What's Changed

  • Fix the remainder of our configs by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1356
  • Adopt new Llama 3.1 HF names by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1357
  • Define OUMI_USE_SPOT_VM env var and start using it to override use_spot param by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1359
  • Support HuggingFaceM4/Docmatix dataset by @vishwamartur in https://github.com/oumi-ai/oumi/pull/1342
  • [nit] update default issue names by @oelachqar in https://github.com/oumi-ai/oumi/pull/1367
  • Update sft_datasets.md by @penfever in https://github.com/oumi-ai/oumi/pull/1349
  • Have GitHub Trending image hyperlink to GitHub Trending page by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1370
  • Update the link for the trending banner. by @taenin in https://github.com/oumi-ai/oumi/pull/1371
  • Move code to disable caching in model.config to a helper function by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1378
  • Update transformers version to 4.48 by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1372
  • Update notebooks to improve their Colab experience by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1380
  • Add proper labels and types to new Bugs and Feature Requests. by @taenin in https://github.com/oumi-ai/oumi/pull/1383
  • Upgrade omegaconf to 2.4.0dev3 by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1384
  • Support HuggingFaceM4/the_cauldron dataset by @vishwamartur in https://github.com/oumi-ai/oumi/pull/1366
  • Update our FAQ for tips about installing oumi on Windows by @taenin in https://github.com/oumi-ai/oumi/pull/1385
  • Cleanup HuggingFaceM4/Docmatix and HuggingFaceM4/the_cauldron multimodal datasets by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1387
  • Remove unneeded env vars from job configs by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1390
  • Remove transformer version override for HuggingFaceTB/SmolVLM-Instruct in launcher script by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1388
  • [Small Refactor] Moving the inference engine def outside the inference config by @kaisopos in https://github.com/oumi-ai/oumi/pull/1395
  • Evaluation - LM Harness: Adding vLLM support by @kaisopos in https://github.com/oumi-ai/oumi/pull/1379
  • Remove Docmatix dataset references from docstrings VLM config examples by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1397
  • Fixed broken link in Oumi - A Tour.ipynb notebook by @ciaralema in https://github.com/oumi-ai/oumi/pull/1398
  • Fix broken links in notebooks. by @taenin in https://github.com/oumi-ai/oumi/pull/1402
  • Create a client for communicating with a Slurm node via SSH. by @taenin in https://github.com/oumi-ai/oumi/pull/1389
  • [tiny] Remove references to missing job configs in README by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1404
  • Train+Inference with Qwen 2.5 VL (3B) by @optas in https://github.com/oumi-ai/oumi/pull/1396
  • Add a Slurm cluster and cloud to the oumi launcher. by @taenin in https://github.com/oumi-ai/oumi/pull/1406
  • Move pretokenize script from scripts/pretokenize/ to scripts/datasets/pretokenize/ by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1412
  • Create a script to save Conversation-s from SFT datasets into .jsonl file by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1413
  • [Evaluation] LM Harness refactor by @kaisopos in https://github.com/oumi-ai/oumi/pull/1410
  • Update save_conversations tool by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1421
  • [SambaNova] Integrate SambaNova Systems to oumi inference by @ctseng777 in https://github.com/oumi-ai/oumi/pull/1415
  • [Minor] Equating Qwen's 2.5 chat-template to version's 2.0 by @optas in https://github.com/oumi-ai/oumi/pull/1419
  • Add requirements header to configs and clean them up by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1411
  • Updated oumi infer to support CLI argument for system prompt by @Spaarsh in https://github.com/oumi-ai/oumi/pull/1422
  • [Evaluation] LM Harness remote server support by @kaisopos in https://github.com/oumi-ai/oumi/pull/1414
  • [Feature] Add Tulu3 SFT Mixture Dataset Support by @bwalshe in https://github.com/oumi-ai/oumi/pull/1381
  • Support Multimodal inference with multiple images and PDF-s in NATIVE engine by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1424
  • Update notebooks to run on Colab by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1423
  • Add calm recipe. by @taenin in https://github.com/oumi-ai/oumi/pull/1425
  • Update VLM sample oumi infer -i commands by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1428
  • Provide example how to start SGLang server using Docker by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1429
  • Multi-image support in SGLang inference engine by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1426
  • Calm readme by @emrecanacikgoz in https://github.com/oumi-ai/oumi/pull/1432
  • WildChat-50M Reproduction by @penfever in https://github.com/oumi-ai/oumi/pull/1433
  • Add WildChat support by @penfever in https://github.com/oumi-ai/oumi/pull/1348
  • Create pad_to_max_dim_and_stack() function in torch_utils by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1435
  • use deterministic by @penfever in https://github.com/oumi-ai/oumi/pull/1434
  • Additional HF trainer parameters for config by @penfever in https://github.com/oumi-ai/oumi/pull/1436
  • Set a better default for vllm inference GPU usage. by @taenin in https://github.com/oumi-ai/oumi/pull/1437
  • Added fetch command and modified infer command to resolve oumi:// by @Spaarsh in https://github.com/oumi-ai/oumi/pull/1439
  • Require an inference config for oumi infer. by @taenin in https://github.com/oumi-ai/oumi/pull/1443
  • Make the tulu3 unit tests hermetic. by @taenin in https://github.com/oumi-ai/oumi/pull/1446
  • Add 2 more sample PDF-s with 1 and 2 pages under testdata/pdfs by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1427
  • Enable ability to override list values in config via CLI by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1430
  • Renamed CALM to CoALM by @jgreer013 in https://github.com/oumi-ai/oumi/pull/1450
  • Add support for Docmatix dataset to multimodal training script by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1449
  • Update oumi launch status to show clusters with no running jobs. by @taenin in https://github.com/oumi-ai/oumi/pull/1451

New Contributors

  • @vishwamartur made their first contribution in https://github.com/oumi-ai/oumi/pull/1342
  • @ctseng777 made their first contribution in https://github.com/oumi-ai/oumi/pull/1415
  • @Spaarsh made their first contribution in https://github.com/oumi-ai/oumi/pull/1422
  • @bwalshe made their first contribution in https://github.com/oumi-ai/oumi/pull/1381
  • @emrecanacikgoz made their first contribution in https://github.com/oumi-ai/oumi/pull/1432

Full Changelog: https://github.com/oumi-ai/oumi/compare/v0.1.4...v0.1.5

- Python
Published by wizeng23 about 1 year ago

oumi - v0.1.4

What's Changed

  • Add memory cleanup calls in e2e integration tests by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1277
  • Set up versioning for our documentation by @taenin in https://github.com/oumi-ai/oumi/pull/1275
  • Make qwen2-VL evaluation job pass by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1278
  • Add multi-modal (vlm) notebook with Llama 11B by @optas in https://github.com/oumi-ai/oumi/pull/1258
  • Documentation: Inference -> List supported models by @kaisopos in https://github.com/oumi-ai/oumi/pull/1279
  • [tiny] update website link by @oelachqar in https://github.com/oumi-ai/oumi/pull/1280
  • Update all documentation links to the new doc URL by @taenin in https://github.com/oumi-ai/oumi/pull/1281
  • Update Oumi - A Tour.ipynb by @brragorn in https://github.com/oumi-ai/oumi/pull/1282
  • Documentation: Judge (minor edits) by @kaisopos in https://github.com/oumi-ai/oumi/pull/1283
  • Fix citation by @oelachqar in https://github.com/oumi-ai/oumi/pull/1285
  • Add Deepseek R1 1.5B/32B configs by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1276
  • Misc eval configs cleanup by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1286
  • [docs] Describe parallel evaluation by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1284
  • Update microsoft/Phi-3-vision-128k-instruct training config by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1287
  • Add Together Deepseek R1 inference config by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1289
  • [minor] vlm notebook minor updates (doc referencing, freeze visual backbone) by @optas in https://github.com/oumi-ai/oumi/pull/1288
  • Add missing -m oumi evaluate argument in eval config by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1291
  • [docs] Add more references to VL-SFT and SFT notebooks by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1293
  • Eval config change for deepseek-ai/DeepSeek-R1-Distill-Llama-70B by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1292
  • [notebooks] Update intro & installation instruction by @oelachqar in https://github.com/oumi-ai/oumi/pull/1294
  • Update notebook intros by @oelachqar in https://github.com/oumi-ai/oumi/pull/1296
  • [notebooks] Update installation instructions for colab by @oelachqar in https://github.com/oumi-ai/oumi/pull/1297
  • Add Apache license header to src/oumi/**/*.py by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1290
  • Minor updates to VLM Multimodal notebook by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1299
  • [docs] Add latest notebooks and update references by @oelachqar in https://github.com/oumi-ai/oumi/pull/1300
  • [tiny] Add docs auto-generated .rst files to gitignore by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1298
  • [tiny] use GitHub link for header by @oelachqar in https://github.com/oumi-ai/oumi/pull/1301
  • [docs][tiny] update inference engines reference by @oelachqar in https://github.com/oumi-ai/oumi/pull/1302
  • Update README/docs to add new DeepSeek models by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1304
  • [docs] Use pip install oumi over pip install . by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1305
  • Tune VLM SFT configs by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1306
  • Tune VLM configs for SmolVLM and Qwen2-VL by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1307
  • Update config/notebook pip installs to use PyPI by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1308
  • [tiny] upgrade torch version by @oelachqar in https://github.com/oumi-ai/oumi/pull/1295
  • Update logging and unit tests related to chat templates by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1311
  • fix(docs): "interested by joining" to "interested in joining" by @CharlesCNorton in https://github.com/oumi-ai/oumi/pull/1312
  • Add HF_TOKEN instructions to Oumi Multimodal notebook by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1313
  • Update configuration.md by @penfever in https://github.com/oumi-ai/oumi/pull/1314
  • remove duplicate keys in config example by @lucyknada in https://github.com/oumi-ai/oumi/pull/1315
  • [Notebooks] Update VLM notebook by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1317
  • Update parasail_inference_engine.py by @jgreer013 in https://github.com/oumi-ai/oumi/pull/1320
  • Fix typo and update warning message for OUMI trainer by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1319
  • [Notebooks] Add a note that a notebook kernel restart may be needed after pip install oumi by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1318
  • Update Phi3 to support multiple images by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1321
  • Add more detailed comment headers to YAML configs by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1310
  • [Notebooks] Add a note to Tour notebook to restart kernel after the first pip install by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1327
  • Tweak --mem-fraction-static param in sample SGLang configs by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1328
  • Disallow using DatasetParams field names as keys in DatasetParams.dataset_kwargs by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1324
  • Support dataset_name_override dataset_kwarg by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1188
  • Add an util and a test marker for HF token by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1329
  • Update llama3-instruct chat template to align with the original models template by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1326
  • chore: update launcher.sh by @eltociear in https://github.com/oumi-ai/oumi/pull/1333
  • [Notebooks] Minor improvements in VLM and CNN notebooks by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1335
  • Update VLM cluster names in sample commands by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1336
  • Update our README and docs with the github trending badge. by @taenin in https://github.com/oumi-ai/oumi/pull/1340
  • Update README.md - Add DeepSeek to supported models by @mkoukoumidis in https://github.com/oumi-ai/oumi/pull/1343
  • Update index.md - Add DeepSeek to supported models by @mkoukoumidis in https://github.com/oumi-ai/oumi/pull/1344
  • Update "GPU Tests" status badge in README page by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1345

New Contributors

  • @CharlesCNorton made their first contribution in https://github.com/oumi-ai/oumi/pull/1312
  • @lucyknada made their first contribution in https://github.com/oumi-ai/oumi/pull/1315
  • @eltociear made their first contribution in https://github.com/oumi-ai/oumi/pull/1333

Full Changelog: https://github.com/oumi-ai/oumi/compare/v0.1.3...v0.1.4

- Python
Published by oelachqar about 1 year ago

oumi - v0.1.3

What's Changed

  • Documentation: Judge | Custom Model page by @kaisopos in https://github.com/oumi-ai/oumi/pull/1195
  • [WIP] Add a notebook for using CNN with custom dataset by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1196
  • [Cherrypick for launch] Evaluate: return dict of results by @kaisopos in https://github.com/oumi-ai/oumi/pull/1197
  • Configs Train/Infer/Eval and Llama 3.3v (70b) by @optas in https://github.com/oumi-ai/oumi/pull/1200
  • Adding an integration test for evaluation fn's output (see PR-1197) by @kaisopos in https://github.com/oumi-ai/oumi/pull/1199
  • [docs] Add more details and cross-references related to customization by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1198
  • Define single_gpu test marker by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1201
  • Native inference: Don't set min_p, temperature in GenerationConfig if sampling is disabled by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1202
  • Update tests to make them runnable on GCP by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1203
  • Add newline before pformat(train_config) by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1204
  • GCP tests launcher script changes by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1205
  • [Evaluation] Bug: serialization by @kaisopos in https://github.com/oumi-ai/oumi/pull/1207
  • [docs] Add inference snippet for together.ai and DeepSeek APIs by @oelachqar in https://github.com/oumi-ai/oumi/pull/1208
  • Exclude multi_gpu tests from GitHub GPU tests by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1210
  • Update e2e tests to support multi-GPU machines by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1206
  • Add wrappers for remote inference engines by @oelachqar in https://github.com/oumi-ai/oumi/pull/1209
  • Vision-Lang & Inference (including LoRA) by @optas in https://github.com/oumi-ai/oumi/pull/1174
  • [BugFix] Throw a runtime error for quantized models & inference=VLLM by @kaisopos in https://github.com/oumi-ai/oumi/pull/1212
  • Fix most job configs by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1213
  • e2e tests update by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1216
  • [Notebook] Evaluation with Oumi by @kaisopos in https://github.com/oumi-ai/oumi/pull/1218
  • gpt2: move include_performance_metrics param from script to yaml by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1217
  • Simplify inference engine API by @oelachqar in https://github.com/oumi-ai/oumi/pull/1214
  • Move configs to experimental by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1215
  • [docs] Update index page by @oelachqar in https://github.com/oumi-ai/oumi/pull/1220
  • Update ConsoleLogger to write to STDOUT by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1221
  • Set use_spot to False in our JobConfigs by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1222
  • Delete oumi[optional] install target by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1224
  • Scaffolding and the first testcase for e2e evaluation tests by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1225
  • [docs] Update inference engines doc page by @oelachqar in https://github.com/oumi-ai/oumi/pull/1227
  • Clean-up inference engine builder by @oelachqar in https://github.com/oumi-ai/oumi/pull/1226
  • [VLLM Engine] Enabling BitsAndBytes quantization by @kaisopos in https://github.com/oumi-ai/oumi/pull/1223
  • Add example distillation notebook by @jgreer013 in https://github.com/oumi-ai/oumi/pull/1228
  • Add a script to pre-download models for gpu_tests by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1231
  • Fix multi-GPU inference integration test by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1229
  • [tiny][docs] Update PEFT/LoRA content by @optas in https://github.com/oumi-ai/oumi/pull/1233
  • [BugFix] GGUF does not work with VLLM by @kaisopos in https://github.com/oumi-ai/oumi/pull/1232
  • Re-enable parallel evaluation for VLM-s by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1235
  • Add multimodal exemplar dataset in our provided mini-datasets by @optas in https://github.com/oumi-ai/oumi/pull/1234
  • [Tiny] renaming a field name (init_lora_weights) by @optas in https://github.com/oumi-ai/oumi/pull/1236
  • Add more e2e evaluation tests by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1237
  • Fix pyright breakage when vllm and llama_cpp are not installed by @taenin in https://github.com/oumi-ai/oumi/pull/1240
  • Update our oumi launch documentation. by @taenin in https://github.com/oumi-ai/oumi/pull/1239
  • Update index.md title for "Join the Community!" by @mkoukoumidis in https://github.com/oumi-ai/oumi/pull/1242
  • Update quickstart.md - nit for Oumi support request by @mkoukoumidis in https://github.com/oumi-ai/oumi/pull/1241
  • [VLLM Engine] Improve support for GGUF models (incl. auto-download) by @kaisopos in https://github.com/oumi-ai/oumi/pull/1238
  • Update README.md title to "Join the Community!" by @mkoukoumidis in https://github.com/oumi-ai/oumi/pull/1243
  • Update quickstart.md by @brragorn in https://github.com/oumi-ai/oumi/pull/1251
  • Update quickstart.md by @brragorn in https://github.com/oumi-ai/oumi/pull/1253
  • Update quickstart.md by @brragorn in https://github.com/oumi-ai/oumi/pull/1252
  • Update quickstart.md by @brragorn in https://github.com/oumi-ai/oumi/pull/1250
  • [Minor refactor] Moving model caching to oumi.utils by @kaisopos in https://github.com/oumi-ai/oumi/pull/1246
  • Add more details to troubleshooting FAQ by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1249
  • Update training_methods.md - Change compute requirement suggestions by @mkoukoumidis in https://github.com/oumi-ai/oumi/pull/1245
  • Update train.md - nit description change by @mkoukoumidis in https://github.com/oumi-ai/oumi/pull/1244
  • [docs] misc docs feedback by @oelachqar in https://github.com/oumi-ai/oumi/pull/1248
  • [tiny] Qwen2-VL activate experimental datapipes by @optas in https://github.com/oumi-ai/oumi/pull/1247
  • Update Oumi - A Tour.ipynb by @brragorn in https://github.com/oumi-ai/oumi/pull/1254
  • [docs] more docs feedback by @oelachqar in https://github.com/oumi-ai/oumi/pull/1255
  • Update supported_models.md by @penfever in https://github.com/oumi-ai/oumi/pull/1256
  • Rename experimental_use_torch_datapipes data param by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1257
  • Add pypi release workflow using testpypi by @oelachqar in https://github.com/oumi-ai/oumi/pull/1259
  • Update workflow names by @oelachqar in https://github.com/oumi-ai/oumi/pull/1262
  • Update default idle_minutes_to_autostop to 1 hour. by @taenin in https://github.com/oumi-ai/oumi/pull/1264
  • update pypi release workflow to use trusted env by @oelachqar in https://github.com/oumi-ai/oumi/pull/1265
  • Add padding_side param to internal model config by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1260
  • Documentation: Updates on Evaluation/Judge (based on Manos' feedback) by @kaisopos in https://github.com/oumi-ai/oumi/pull/1261
  • [tiny] less strict requirements by @oelachqar in https://github.com/oumi-ai/oumi/pull/1266
  • Add Deepseek R1 Distill Llama 8B/70B configs by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1263
  • Update index.md to highlight beta stage by @mkoukoumidis in https://github.com/oumi-ai/oumi/pull/1268
  • Update README.md to highlight beta stage by @mkoukoumidis in https://github.com/oumi-ai/oumi/pull/1267
  • Disable pre-release packages by @oelachqar in https://github.com/oumi-ai/oumi/pull/1270
  • Update common_workflows.md - Clarify OpenAI is just an example by @mkoukoumidis in https://github.com/oumi-ai/oumi/pull/1271
  • Documentation: Evaluation page (update to highlight multi-modal) by @kaisopos in https://github.com/oumi-ai/oumi/pull/1269
  • Update launch.md by @taenin in https://github.com/oumi-ai/oumi/pull/1272
  • Add pypi release workflow by @oelachqar in https://github.com/oumi-ai/oumi/pull/1273
  • Documentation: Judge | minor edit (bold) by @kaisopos in https://github.com/oumi-ai/oumi/pull/1274

Full Changelog: https://github.com/oumi-ai/oumi/compare/v0.1.2...v0.1.3

- Python
Published by oelachqar about 1 year ago

oumi - v0.1.2.3.1

What's Changed

  • Re-enable parallel evaluation for VLM-s by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1235
  • Add multimodal exemplar dataset in our provided mini-datasets by @optas in https://github.com/oumi-ai/oumi/pull/1234
  • [Tiny] renaming a field name (init_lora_weights) by @optas in https://github.com/oumi-ai/oumi/pull/1236
  • Add more e2e evaluation tests by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1237
  • Fix pyright breakage when vllm and llama_cpp are not installed by @taenin in https://github.com/oumi-ai/oumi/pull/1240
  • Update our oumi launch documentation. by @taenin in https://github.com/oumi-ai/oumi/pull/1239
  • Update index.md title for "Join the Community!" by @mkoukoumidis in https://github.com/oumi-ai/oumi/pull/1242
  • Update quickstart.md - nit for Oumi support request by @mkoukoumidis in https://github.com/oumi-ai/oumi/pull/1241
  • [VLLM Engine] Improve support for GGUF models (incl. auto-download) by @kaisopos in https://github.com/oumi-ai/oumi/pull/1238
  • Update README.md title to "Join the Community!" by @mkoukoumidis in https://github.com/oumi-ai/oumi/pull/1243
  • Update quickstart.md by @brragorn in https://github.com/oumi-ai/oumi/pull/1251
  • Update quickstart.md by @brragorn in https://github.com/oumi-ai/oumi/pull/1253
  • Update quickstart.md by @brragorn in https://github.com/oumi-ai/oumi/pull/1252
  • Update quickstart.md by @brragorn in https://github.com/oumi-ai/oumi/pull/1250
  • [Minor refactor] Moving model caching to oumi.utils by @kaisopos in https://github.com/oumi-ai/oumi/pull/1246
  • Add more details to troubleshooting FAQ by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1249
  • Update training_methods.md - Change compute requirement suggestions by @mkoukoumidis in https://github.com/oumi-ai/oumi/pull/1245
  • Update train.md - nit description change by @mkoukoumidis in https://github.com/oumi-ai/oumi/pull/1244
  • [docs] misc docs feedback by @oelachqar in https://github.com/oumi-ai/oumi/pull/1248
  • [tiny] Qwen2-VL activate experimental datapipes by @optas in https://github.com/oumi-ai/oumi/pull/1247
  • Update Oumi - A Tour.ipynb by @brragorn in https://github.com/oumi-ai/oumi/pull/1254
  • [docs] more docs feedback by @oelachqar in https://github.com/oumi-ai/oumi/pull/1255
  • Update supported_models.md by @penfever in https://github.com/oumi-ai/oumi/pull/1256
  • Rename experimental_use_torch_datapipes data param by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1257
  • Add pypi release workflow using testpypi by @oelachqar in https://github.com/oumi-ai/oumi/pull/1259
  • Update workflow names by @oelachqar in https://github.com/oumi-ai/oumi/pull/1262
  • Update default idle_minutes_to_autostop to 1 hour. by @taenin in https://github.com/oumi-ai/oumi/pull/1264
  • update pypi release workflow to use trusted env by @oelachqar in https://github.com/oumi-ai/oumi/pull/1265
  • Add padding_side param to internal model config by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1260
  • Documentation: Updates on Evaluation/Judge (based on Manos' feedback) by @kaisopos in https://github.com/oumi-ai/oumi/pull/1261
  • [tiny] less strict requirements by @oelachqar in https://github.com/oumi-ai/oumi/pull/1266
  • Add Deepseek R1 Distill Llama 8B/70B configs by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1263

Full Changelog: https://github.com/oumi-ai/oumi/compare/v0.1.2.2...v0.1.2.3.1

- Python
Published by oelachqar about 1 year ago

oumi - v0.1.2.3

What's Changed

  • Re-enable parallel evaluation for VLM-s by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1235
  • Add multimodal exemplar dataset in our provided mini-datasets by @optas in https://github.com/oumi-ai/oumi/pull/1234
  • [Tiny] renaming a field name (init_lora_weights) by @optas in https://github.com/oumi-ai/oumi/pull/1236
  • Add more e2e evaluation tests by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1237
  • Fix pyright breakage when vllm and llama_cpp are not installed by @taenin in https://github.com/oumi-ai/oumi/pull/1240
  • Update our oumi launch documentation. by @taenin in https://github.com/oumi-ai/oumi/pull/1239
  • Update index.md title for "Join the Community!" by @mkoukoumidis in https://github.com/oumi-ai/oumi/pull/1242
  • Update quickstart.md - nit for Oumi support request by @mkoukoumidis in https://github.com/oumi-ai/oumi/pull/1241
  • [VLLM Engine] Improve support for GGUF models (incl. auto-download) by @kaisopos in https://github.com/oumi-ai/oumi/pull/1238
  • Update README.md title to "Join the Community!" by @mkoukoumidis in https://github.com/oumi-ai/oumi/pull/1243
  • Update quickstart.md by @brragorn in https://github.com/oumi-ai/oumi/pull/1251
  • Update quickstart.md by @brragorn in https://github.com/oumi-ai/oumi/pull/1253
  • Update quickstart.md by @brragorn in https://github.com/oumi-ai/oumi/pull/1252
  • Update quickstart.md by @brragorn in https://github.com/oumi-ai/oumi/pull/1250
  • [Minor refactor] Moving model caching to oumi.utils by @kaisopos in https://github.com/oumi-ai/oumi/pull/1246
  • Add more details to troubleshooting FAQ by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1249
  • Update training_methods.md - Change compute requirement suggestions by @mkoukoumidis in https://github.com/oumi-ai/oumi/pull/1245
  • Update train.md - nit description change by @mkoukoumidis in https://github.com/oumi-ai/oumi/pull/1244
  • [docs] misc docs feedback by @oelachqar in https://github.com/oumi-ai/oumi/pull/1248
  • [tiny] Qwen2-VL activate experimental datapipes by @optas in https://github.com/oumi-ai/oumi/pull/1247
  • Update Oumi - A Tour.ipynb by @brragorn in https://github.com/oumi-ai/oumi/pull/1254
  • [docs] more docs feedback by @oelachqar in https://github.com/oumi-ai/oumi/pull/1255
  • Update supported_models.md by @penfever in https://github.com/oumi-ai/oumi/pull/1256
  • Rename experimental_use_torch_datapipes data param by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1257
  • Add pypi release workflow using testpypi by @oelachqar in https://github.com/oumi-ai/oumi/pull/1259
  • Update workflow names by @oelachqar in https://github.com/oumi-ai/oumi/pull/1262
  • Update default idle_minutes_to_autostop to 1 hour. by @taenin in https://github.com/oumi-ai/oumi/pull/1264
  • update pypi release workflow to use trusted env by @oelachqar in https://github.com/oumi-ai/oumi/pull/1265

Full Changelog: https://github.com/oumi-ai/oumi/compare/v0.1.2.2...v0.1.2.3

- Python
Published by oelachqar about 1 year ago

oumi - v0.1.2.2

What's Changed

  • Configs Train/Infer/Eval and Llama 3.3v (70b) by @optas in https://github.com/oumi-ai/oumi/pull/1200
  • Adding an integration test for evaluation fn's output (see PR-1197) by @kaisopos in https://github.com/oumi-ai/oumi/pull/1199
  • [docs] Add more details and cross-references related to customization by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1198
  • Define single_gpu test marker by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1201
  • Native inference: Don't set min_p, temperature in GenerationConfig if sampling is disabled by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1202
  • Update tests to make them runnable on GCP by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1203
  • Add newline before pformat(train_config) by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1204
  • GCP tests launcher script changes by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1205
  • [Evaluation] Bug: serialization by @kaisopos in https://github.com/oumi-ai/oumi/pull/1207
  • [docs] Add inference snippet for together.ai and DeepSeek APIs by @oelachqar in https://github.com/oumi-ai/oumi/pull/1208
  • Exclude multi_gpu tests from GitHub GPU tests by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1210
  • Update e2e tests to support multi-GPU machines by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1206
  • Add wrappers for remote inference engines by @oelachqar in https://github.com/oumi-ai/oumi/pull/1209
  • Vision-Lang & Inference (including LoRA) by @optas in https://github.com/oumi-ai/oumi/pull/1174
  • [BugFix] Throw a runtime error for quantized models & inference=VLLM by @kaisopos in https://github.com/oumi-ai/oumi/pull/1212
  • Fix most job configs by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1213
  • e2e tests update by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1216
  • [Notebook] Evaluation with Oumi by @kaisopos in https://github.com/oumi-ai/oumi/pull/1218
  • gpt2: move include_performance_metrics param from script to yaml by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1217
  • Simplify inference engine API by @oelachqar in https://github.com/oumi-ai/oumi/pull/1214
  • Move configs to experimental by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1215
  • [docs] Update index page by @oelachqar in https://github.com/oumi-ai/oumi/pull/1220
  • Update ConsoleLogger to write to STDOUT by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1221
  • Set use_spot to False in our JobConfigs by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1222
  • Delete oumi[optional] install target by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1224
  • Scaffolding and the first testcase for e2e evaluation tests by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1225
  • [docs] Update inference engines doc page by @oelachqar in https://github.com/oumi-ai/oumi/pull/1227
  • Clean-up inference engine builder by @oelachqar in https://github.com/oumi-ai/oumi/pull/1226
  • [VLLM Engine] Enabling BitsAndBytes quantization by @kaisopos in https://github.com/oumi-ai/oumi/pull/1223
  • Add example distillation notebook by @jgreer013 in https://github.com/oumi-ai/oumi/pull/1228
  • Add a script to pre-download models for gpu_tests by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1231
  • Fix multi-GPU inference integration test by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1229
  • [tiny][docs] Update PEFT/LoRA content by @optas in https://github.com/oumi-ai/oumi/pull/1233
  • [BugFix] GGUF does not work with VLLM by @kaisopos in https://github.com/oumi-ai/oumi/pull/1232

Full Changelog: https://github.com/oumi-ai/oumi/compare/v0.1.2.1...v0.1.2.2

- Python
Published by oelachqar about 1 year ago

oumi - v0.1.2.1

What's Changed

  • [docs] Make absolute links local in our Launch + Contributing Docs by @taenin in https://github.com/oumi-ai/oumi/pull/1152
  • Fix broken ruff precommit. by @taenin in https://github.com/oumi-ai/oumi/pull/1153
  • Documentation: Judge | 1st revision (of many) by @kaisopos in https://github.com/oumi-ai/oumi/pull/1148
  • [docs] Put -m oumi train on single line by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1154
  • [docs] Add a description to the oumi env command. by @taenin in https://github.com/oumi-ai/oumi/pull/1155
  • [docs] Update the list of commands on Core Concepts page by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1156
  • Add a comment to Omegaconf yaml files with config info by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1149
  • Update training_methods.md by @jgreer013 in https://github.com/oumi-ai/oumi/pull/1159
  • Update train.md by @jgreer013 in https://github.com/oumi-ai/oumi/pull/1158
  • [docs] Improve docs for custom multimodal SFT datasets by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1157
  • Update quickstart.md Removed H1 emoji by @brragorn in https://github.com/oumi-ai/oumi/pull/1160
  • Update style_guide.md by @brragorn in https://github.com/oumi-ai/oumi/pull/1162
  • Exclude unit/integration tests explicitly marked as e2e from GitHub runs by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1164
  • Update train.yaml by @jgreer013 in https://github.com/oumi-ai/oumi/pull/1167
  • Define e2e_eternal test marker by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1169
  • Bugfixes in tests/integration/datasets/test_sft_vision_datasets_load_datasets.py by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1166
  • Update contributing.md by @brragorn in https://github.com/oumi-ai/oumi/pull/1161
  • Removed unnecessary cloud.md which is already covered in depth in another section. by @jgreer013 in https://github.com/oumi-ai/oumi/pull/1168
  • Update GitHub actions markers expression by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1171
  • Add MNIST as Vision SFT dataset by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1170
  • Documentation: Judge | Judge (main) page by @kaisopos in https://github.com/oumi-ai/oumi/pull/1165
  • [Documentation] Evaluation pages: fixing broken links by @kaisopos in https://github.com/oumi-ai/oumi/pull/1172
  • Documentation: Judge | Built-In page by @kaisopos in https://github.com/oumi-ai/oumi/pull/1173
  • OUMI Trainer: Support non-text models by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1175
  • OUMI Trainer: Improve handling of num_training_epochs by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1176
  • Update readme by @oelachqar in https://github.com/oumi-ai/oumi/pull/1177
  • Training vlm llama 90b by @optas in https://github.com/oumi-ai/oumi/pull/1179
  • Documentation: Judge | Custom Prompts page by @kaisopos in https://github.com/oumi-ai/oumi/pull/1183
  • [docs] fix misc broken links by @oelachqar in https://github.com/oumi-ai/oumi/pull/1186
  • Fixing judge's broken doc links by @kaisopos in https://github.com/oumi-ai/oumi/pull/1185
  • OUMI Trainer: Improve handling of num_training_epochs and update _get_total_training_steps() by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1178
  • [docs] Add initial page how to define datasets for custom data types by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1182
  • Add CnnClassifier custom model by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1180
  • Define default_dataset property for all pretraining datasets by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1189
  • [tiny] Raise explicit error for models not supporting infer by @oelachqar in https://github.com/oumi-ai/oumi/pull/1187
  • [tests] Replace external models & datasets to make unit tests hermetic by @oelachqar in https://github.com/oumi-ai/oumi/pull/1163
  • [tiny] Add error message to bare exceptions by @oelachqar in https://github.com/oumi-ai/oumi/pull/1190
  • Bugfix in CnnClassifier example by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1191
  • Move data mixtures tests to unit tests by @oelachqar in https://github.com/oumi-ai/oumi/pull/1192
  • [docs] Resolve PyRight issues in NpzDataset in custom dataset example by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1194
  • Add examples of Alpaca and Oumi Dataset Formats by @optas in https://github.com/oumi-ai/oumi/pull/1184
  • Add integration test for CNNClassifier custom model by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1193

Full Changelog: https://github.com/oumi-ai/oumi/compare/v0.1.2-alpha...v0.1.2

- Python
Published by oelachqar about 1 year ago

oumi - v0.1.2.0-alpha

What's Changed

  • Update README.md - Better highlight features & nits by @mkoukoumidis in https://github.com/oumi-ai/oumi/pull/995
  • [tiny] update docstring and cleanup by @oelachqar in https://github.com/oumi-ai/oumi/pull/1006
  • Qwen2-VL: minor updates by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1000
  • Update README.md - Describe Oumi's most common capabilities by @mkoukoumidis in https://github.com/oumi-ai/oumi/pull/996
  • Fix readme. by @taenin in https://github.com/oumi-ai/oumi/pull/1009
  • Updated our ascii logo by @taenin in https://github.com/oumi-ai/oumi/pull/1008
  • [docs] Update readme by @oelachqar in https://github.com/oumi-ai/oumi/pull/1010
  • Cleanup scripts by @oelachqar in https://github.com/oumi-ai/oumi/pull/1011
  • Cleanup experimental folder by @oelachqar in https://github.com/oumi-ai/oumi/pull/1012
  • Update lists of supported VLM-s in README and docs by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1014
  • Freeze Python package versions by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1007
  • Update blip2's chat template to use the "default" one by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1015
  • Add docstrings how to start vLLM and SGLang servers for Llama-3.2-11B-Vision-Instruct by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1016
  • Evaluation: bugfixing, corner case, unit tests by @kaisopos in https://github.com/oumi-ai/oumi/pull/1003
  • Configure asyncio_default_fixture_loop_scope to reduce pytest warnings by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1013
  • Update the registry to load registered core values upon use. by @taenin in https://github.com/oumi-ai/oumi/pull/1017
  • Update default installation instructions to pypi by @taenin in https://github.com/oumi-ai/oumi/pull/1018
  • [tiny] Update debug datasets by @oelachqar in https://github.com/oumi-ai/oumi/pull/1020
  • [docs] Address misc docs feedback by @oelachqar in https://github.com/oumi-ai/oumi/pull/1019
  • [tiny] update pre-defined judges and docs by @oelachqar in https://github.com/oumi-ai/oumi/pull/1021
  • Parameterize e2e training test, and add config for Qwen2-VL by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1023
  • Remove our docs password from the readme. by @taenin in https://github.com/oumi-ai/oumi/pull/1024
  • VLM docs update by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1025
  • Fix loading registered pretrain datasets by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1005
  • Update @requires_gpus test decorator to optionally specify min GPU memory requirement by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1029
  • [tiny] Update GitHub workflows by @oelachqar in https://github.com/oumi-ai/oumi/pull/1034
  • Update BaseConfig.from_yaml to also support Path by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1026
  • [tiny] Cleanup judge engine builder & fix circular dep by @oelachqar in https://github.com/oumi-ai/oumi/pull/1035
  • Create GPU GitHub Actions workflow by @oelachqar in https://github.com/oumi-ai/oumi/pull/1004
  • Add structured outputs support to gemini/vertex engines by @oelachqar in https://github.com/oumi-ai/oumi/pull/1022
  • [docs] Fix feedback on training and inference user guides by @oelachqar in https://github.com/oumi-ai/oumi/pull/1037
  • [docs][tiny] fix examples in inference guide by @oelachqar in https://github.com/oumi-ai/oumi/pull/1038
  • Add a sanity test for circular imports. by @taenin in https://github.com/oumi-ai/oumi/pull/1030
  • Resolve circular dependencies in Oumi by @taenin in https://github.com/oumi-ai/oumi/pull/1039
  • Move our circular dependency test to e2e to speed up GPU CI tests. by @taenin in https://github.com/oumi-ai/oumi/pull/1040
  • Add custom inference engine for gemini API by @oelachqar in https://github.com/oumi-ai/oumi/pull/1036
  • Define CLI in our quickstart. by @taenin in https://github.com/oumi-ai/oumi/pull/1042
  • Skip running GPU tests on low-risk code paths by @oelachqar in https://github.com/oumi-ai/oumi/pull/1043
  • Define more terms in our training docs. by @taenin in https://github.com/oumi-ai/oumi/pull/1044
  • Fix the broken python text snippet on the train page. by @taenin in https://github.com/oumi-ai/oumi/pull/1045
  • Fix the second python snippet in the train page. by @taenin in https://github.com/oumi-ai/oumi/pull/1046
  • [docs] Add Gemini to the list of supported inference API-s, and sort them by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1048
  • Fix issues in most notebooks by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1047
  • [docs][tiny] remove termynal from sphinx conf by @oelachqar in https://github.com/oumi-ai/oumi/pull/1041
  • Fix a typo in the VS Code environment page. by @taenin in https://github.com/oumi-ai/oumi/pull/1049
  • Define WSL in our vscode docs. by @taenin in https://github.com/oumi-ai/oumi/pull/1052
  • [tiny] disable unit tests on safe paths by @oelachqar in https://github.com/oumi-ai/oumi/pull/1051
  • [docs] Fix contributing and open issue links by @oelachqar in https://github.com/oumi-ai/oumi/pull/1050
  • [evaluations/generative_benchmark] Broken link by @kaisopos in https://github.com/oumi-ai/oumi/pull/1054
  • Remove dangling reference to jupyter in Makefile help by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1053
  • [evaluations/generative_benchmark] Removing notebook link by @kaisopos in https://github.com/oumi-ai/oumi/pull/1055
  • Support constrained decoding in SGLang inference engine by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1032
  • [tiny] Update tutorials page by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1056
  • Minor updates to Launch.md by @taenin in https://github.com/oumi-ai/oumi/pull/1059
  • [docs] Update docs/user_guides/infer/infer.md by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1058
  • Nits for common_workflows.md by @mkoukoumidis in https://github.com/oumi-ai/oumi/pull/1061
  • Nit fixes for acknowledgements.md by @mkoukoumidis in https://github.com/oumi-ai/oumi/pull/1057
  • Add sample trouble shooting for remote jobs. by @taenin in https://github.com/oumi-ai/oumi/pull/1062
  • Add a Github Issues selector for questions and have it redirect to Discord. by @taenin in https://github.com/oumi-ai/oumi/pull/1064
  • Package checking: Adding functionality for checking package versioning and fast failing by @kaisopos in https://github.com/oumi-ai/oumi/pull/1031
  • Fix various typos in contributing.md by @taenin in https://github.com/oumi-ai/oumi/pull/1066
  • SGLang inference documentation by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1065
  • Replace assert in NativeInferenceEngine with RuntimeError by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1068
  • Update dev set up instructions to use a Fork. by @taenin in https://github.com/oumi-ai/oumi/pull/1067
  • Define inference configs for more models by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1069
  • [Evaluation] HF Leaderboards yaml files by @kaisopos in https://github.com/oumi-ai/oumi/pull/1071
  • Specify engine: NATIVE in inference configs by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1075
  • Improve handling of image path and URLs by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1074
  • [Doc > Quickstart] Should we add links to guides for better discoverability? by @kaisopos in https://github.com/oumi-ai/oumi/pull/1076
  • Add e2e tests for running tutorial notebooks by @oelachqar in https://github.com/oumi-ai/oumi/pull/1079
  • Ignore all experimental files when running our circular dependency test. by @taenin in https://github.com/oumi-ai/oumi/pull/1081
  • [Super Nit Doc Update] environments.md by @kaisopos in https://github.com/oumi-ai/oumi/pull/1082
  • Add an env var for loading user registered values (dataset, models, clouds) when initializing the Oumi Registry by @taenin in https://github.com/oumi-ai/oumi/pull/1077
  • Update internal model configs to support default tokenizer_pad_token and chat_template by model type by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1078
  • [Minor] Notebook typo by @kaisopos in https://github.com/oumi-ai/oumi/pull/1085
  • Upgrade transformers to 4.47 by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1033
  • [tiny][docs] Update recipes page by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1072
  • Configure e2e integration test for Llama 3.2 Vision 11B by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1086
  • Nits for cli_reference.md by @mkoukoumidis in https://github.com/oumi-ai/oumi/pull/1063
  • [Documentation] Evaluate | Leaderboards Page by @kaisopos in https://github.com/oumi-ai/oumi/pull/1084
  • [Documentation] Evaluate | Main Page (revision) by @kaisopos in https://github.com/oumi-ai/oumi/pull/1089
  • [tiny] Fix precommit by @oelachqar in https://github.com/oumi-ai/oumi/pull/1092
  • Add timeout for unit & integration tests by @oelachqar in https://github.com/oumi-ai/oumi/pull/1091
  • Add GitHub Actions workflow for doctests by @oelachqar in https://github.com/oumi-ai/oumi/pull/1093
  • [docs] remove unused page, fix links by @oelachqar in https://github.com/oumi-ai/oumi/pull/1094
  • [Documentation] Evaluate | Main Page (small refactor) by @kaisopos in https://github.com/oumi-ai/oumi/pull/1095
  • Rewrite of the main Oumi Launch page. by @taenin in https://github.com/oumi-ai/oumi/pull/1087
  • Remove pytest.mark.skip() for basic e2e tests by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1088
  • [tiny] Upgrade minimum numpy version to unblock python3.12 installation by @oelachqar in https://github.com/oumi-ai/oumi/pull/1099
  • Update our Readme with a new header image. by @taenin in https://github.com/oumi-ai/oumi/pull/1098
  • [docs] Minor refresh to dataset resource pages by @oelachqar in https://github.com/oumi-ai/oumi/pull/1097
  • [docs] Add docs guide page by @oelachqar in https://github.com/oumi-ai/oumi/pull/1096
  • Add a quick unit test to ensure new dependencies are not added to the top-level CLI by @taenin in https://github.com/oumi-ai/oumi/pull/1100
  • Updates distributed run to handle local machine scenarios by @jgreer013 in https://github.com/oumi-ai/oumi/pull/1028
  • [docs] Refresh datasets resource page by @oelachqar in https://github.com/oumi-ai/oumi/pull/1101
  • Update e2e integration tests to allow trainer_type overrides by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1090
  • Temporary fix for "processing_class" error for OUMI trainer by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1102
  • Update training logging by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1104
  • Vision-Language LoRA training by @optas in https://github.com/oumi-ai/oumi/pull/1027
  • Update the Deploy page to use CLI and python instructions. by @taenin in https://github.com/oumi-ai/oumi/pull/1106
  • Set rank/world_size/device_id= params in init_process_group() to resolve warnings by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1109
  • [Revert] Undo package upgrades including transformers 4.47 by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1111
  • Move Llama 3.2 Vision 11b SFT FFT configs into subdir by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1110
  • [docs] Update dataset formats page by @oelachqar in https://github.com/oumi-ai/oumi/pull/1112
  • [docs] Remove placeholder pages by @oelachqar in https://github.com/oumi-ai/oumi/pull/1113
  • [docs] address misc feedback from the team by @oelachqar in https://github.com/oumi-ai/oumi/pull/1114
  • Update quickstart.md by @penfever in https://github.com/oumi-ai/oumi/pull/1115
  • Minor updates to oumi env by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1116
  • Update oumi train path to set TOKENIZERS_PARALLELISM variable by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1117
  • Reorganize oumi distributed run logic by @oelachqar in https://github.com/oumi-ai/oumi/pull/1118
  • [docs] Update misc Multimodal docs by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1120
  • [docs] Add badges, smaller logos, and nit fix to notebook by @oelachqar in https://github.com/oumi-ai/oumi/pull/1121
  • Rename LLAMA 8b and 11B Vision inference REMOTE_VLLM configs by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1122
  • Fix a circular dependency when invoking the oumi src directory via Python. by @taenin in https://github.com/oumi-ai/oumi/pull/1119
  • [docs] Multimodal SFT datasets: minor updates by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1123
  • Fix all doctests by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1107
  • Cleanup legacy constraints on datasets by @oelachqar in https://github.com/oumi-ai/oumi/pull/1080
  • [tiny] skip alpaca_eval test if not installed by @oelachqar in https://github.com/oumi-ai/oumi/pull/1125
  • Remove target_col: from VLM configs by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1126
  • Update our documentation for custom clusters by @taenin in https://github.com/oumi-ai/oumi/pull/1128
  • Mark target_col and use_async_dataset as deprecated by @oelachqar in https://github.com/oumi-ai/oumi/pull/1127
  • Update sample VLM eval config by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1129
  • [Documentation] Evaluate | Standardized Benchmarks page (refactor) by @kaisopos in https://github.com/oumi-ai/oumi/pull/1131
  • [docs] Add doc section for MMMU benchmark by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1130
  • Evaluation yaml configs: standardize task name by @kaisopos in https://github.com/oumi-ai/oumi/pull/1108
  • [doc] Oumi Models (part-1) by @optas in https://github.com/oumi-ai/oumi/pull/1083
  • Enable FSDP for Llama Vision with OUMI trainer by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1105
  • [docs] Add an entry for "Fine-tuning a Vision-Language Model" by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1132
  • [tiny] [docs] troubleshooting improvements by @optas in https://github.com/oumi-ai/oumi/pull/1070
  • [tiny] [docs] Cleanup touches on Quickstart by @optas in https://github.com/oumi-ai/oumi/pull/1133
  • VLM e2e integration tests update by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1138
  • Reduce batch size in Qwen2 VL eval config to reduce chances of OOM by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1136
  • Update readme, first iteration by @oelachqar in https://github.com/oumi-ai/oumi/pull/1135
  • Remove BLIP2 configs temporarily by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1140
  • [tiny] Looser pandas version requirements by @oelachqar in https://github.com/oumi-ai/oumi/pull/1141
  • Add a Customizing Oumi page to our docs by @taenin in https://github.com/oumi-ai/oumi/pull/1142
  • Upgrade wandb version by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1143
  • Get all notebooks to run by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1139
  • Add fineweb e2e integration test by @xrdaukar in https://github.com/oumi-ai/oumi/pull/1146
  • Update quickstart.md with emoji by @michaellema in https://github.com/oumi-ai/oumi/pull/1147
  • Remove some notebook E2E tests by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1145
  • Updated notebooks and quickstart documentation by @penfever in https://github.com/oumi-ai/oumi/pull/1134
  • updated quickstart with clearer instructions about setting up cloud by @penfever in https://github.com/oumi-ai/oumi/pull/1151
  • add max_memory_per_gpu and device_map params to lm eval harness by @penfever in https://github.com/oumi-ai/oumi/pull/1150
  • Add generic prompt-response dataset and update finetuning notebook by @jgreer013 in https://github.com/oumi-ai/oumi/pull/1144
  • Review default values by @wizeng23 in https://github.com/oumi-ai/oumi/pull/1124

New Contributors

  • @penfever made their first contribution in https://github.com/oumi-ai/oumi/pull/1115
  • @michaellema made their first contribution in https://github.com/oumi-ai/oumi/pull/1147

Full Changelog: https://github.com/oumi-ai/oumi/compare/v0.1.1-alpha...v0.1.2-alpha

- Python
Published by oelachqar about 1 year ago

oumi - v0.1.1.0-alpha.1

What's Changed

  • Minimal SkyPilot config for blip2 and llava models for GCP with TRL_SFT by @xrdaukar in https://github.com/oumi-ai/oumi/pull/573
  • Inference Engine async writes by @taenin in https://github.com/oumi-ai/oumi/pull/574
  • Misc cleanups in JsonlinesDataset by @xrdaukar in https://github.com/oumi-ai/oumi/pull/576
  • Split out cloud dependencies by @taenin in https://github.com/oumi-ai/oumi/pull/575
  • Disable sdpa for blip2 by @xrdaukar in https://github.com/oumi-ai/oumi/pull/579
  • Set accelerate version to fix FSDP model saving by @wizeng23 in https://github.com/oumi-ai/oumi/pull/580
  • Remove AWS as a default dep by @taenin in https://github.com/oumi-ai/oumi/pull/582
  • Update ProfilerParams docstrings to follow the new style by @xrdaukar in https://github.com/oumi-ai/oumi/pull/583
  • Minor updates in scripts/benchmarks/minimal_multimodal_training.py by @xrdaukar in https://github.com/oumi-ai/oumi/pull/585
  • Add @override annotations to methods of few Dataset subclasses by @xrdaukar in https://github.com/oumi-ai/oumi/pull/584
  • Add dataset class for dolly dataset by @oelachqar in https://github.com/oumi-ai/oumi/pull/586
  • Refactor debugging/device utils, and add new GPU stats measurement functions by @xrdaukar in https://github.com/oumi-ai/oumi/pull/587
  • Add text jsonlines dataset class by @oelachqar in https://github.com/oumi-ai/oumi/pull/589
  • Define DataCollationParams by @xrdaukar in https://github.com/oumi-ai/oumi/pull/581
  • Misc updates to Polaris launcher scripts by @xrdaukar in https://github.com/oumi-ai/oumi/pull/591
  • Set up a new version of the Oumi CLI using Typer by @taenin in https://github.com/oumi-ai/oumi/pull/588
  • Update handling of GPU fan speed info by @xrdaukar in https://github.com/oumi-ai/oumi/pull/595
  • Add support for magpie dataset variants by @oelachqar in https://github.com/oumi-ai/oumi/pull/594
  • Rename GenerationConfig to GenerationParams by @wizeng23 in https://github.com/oumi-ai/oumi/pull/592
  • Fix cli infer test by @wizeng23 in https://github.com/oumi-ai/oumi/pull/598
  • Judge Notebook 1: default judge by @kaisopos in https://github.com/oumi-ai/oumi/pull/593
  • [Tiny] update missing dataset import by @oelachqar in https://github.com/oumi-ai/oumi/pull/599
  • Update training script to support data collators by @xrdaukar in https://github.com/oumi-ai/oumi/pull/590
  • Update accelerate version to 1.0.0 by @wizeng23 in https://github.com/oumi-ai/oumi/pull/601
  • Remove deprecated dataset code paths by @oelachqar in https://github.com/oumi-ai/oumi/pull/596
  • Refactor Aya & Ultrachat to use oumi dataset sft classes by @oelachqar in https://github.com/oumi-ai/oumi/pull/597
  • Add Llama train/eval/infer E2E integration test by @wizeng23 in https://github.com/oumi-ai/oumi/pull/602
  • Set docstring for NVidiaGpuRuntimeInfo struct by @xrdaukar in https://github.com/oumi-ai/oumi/pull/603
  • Add generation params to inference engines by @oelachqar in https://github.com/oumi-ai/oumi/pull/600
  • [bug] Fix issue loading jsonl datasets from file by @oelachqar in https://github.com/oumi-ai/oumi/pull/604
  • Add Llama 3B configs by @wizeng23 in https://github.com/oumi-ai/oumi/pull/605
  • Align pyright checks with latest Pylance version by @oelachqar in https://github.com/oumi-ai/oumi/pull/611
  • Fix apply_chat_template issue in VisionLanguageSftDataset by @xrdaukar in https://github.com/oumi-ai/oumi/pull/609
  • More robust make setup by @oelachqar in https://github.com/oumi-ai/oumi/pull/610
  • Fix a bug where the new CLI was improperly importing functions from top-level modules. by @taenin in https://github.com/oumi-ai/oumi/pull/613
  • Add support for the Launch command suite in the new CLI by @taenin in https://github.com/oumi-ai/oumi/pull/612
  • Support HuggingFaceH4/llava-instruct-mix-vsft dataset by @xrdaukar in https://github.com/oumi-ai/oumi/pull/608
  • [tiny] Fix .gitignore by @wizeng23 in https://github.com/oumi-ai/oumi/pull/616
  • [tiny] add gpt2 chat template, and update tests to use it by @oelachqar in https://github.com/oumi-ai/oumi/pull/617
  • Turn off pretty-printing exceptions in our CLI by @taenin in https://github.com/oumi-ai/oumi/pull/618
  • Cleanup dependencies by @oelachqar in https://github.com/oumi-ai/oumi/pull/615
  • Upgrade oumi dependencies by @oelachqar in https://github.com/oumi-ai/oumi/pull/606
  • Update makefile to use uv, add Jupyter target by @oelachqar in https://github.com/oumi-ai/oumi/pull/614
  • Add miniconda installation target, cleanup unused make commands by @oelachqar in https://github.com/oumi-ai/oumi/pull/620
  • Update several notebooks with the new EvaluationConfig format. by @taenin in https://github.com/oumi-ai/oumi/pull/621
  • Make sure conda env is registered by @oelachqar in https://github.com/oumi-ai/oumi/pull/622
  • Add Llama 3b sft/lora/qlora configs for Polaris by @wizeng23 in https://github.com/oumi-ai/oumi/pull/626
  • Add check if installation is successful by @oelachqar in https://github.com/oumi-ai/oumi/pull/625
  • Initial Cambrian integration by @xrdaukar in https://github.com/oumi-ai/oumi/pull/557
  • [tiny] alpaca - minor reproducibility boost by @optas in https://github.com/oumi-ai/oumi/pull/619
  • explicitly specify the model's dtype in LMH by @optas in https://github.com/oumi-ai/oumi/pull/607
  • [tiny] Add flops for T4 GPU by @wizeng23 in https://github.com/oumi-ai/oumi/pull/628
  • Use a timestamp for job directories on Polaris by @taenin in https://github.com/oumi-ai/oumi/pull/627
  • [tiny] Fix bug with Polaris job num by @wizeng23 in https://github.com/oumi-ai/oumi/pull/629
  • Update two VLLM configs. by @xrdaukar in https://github.com/oumi-ai/oumi/pull/624
  • Add pip install -U uv; to make setup for existing envs by @xrdaukar in https://github.com/oumi-ai/oumi/pull/630
  • Disable MFU logging for non-packed datasets by @wizeng23 in https://github.com/oumi-ai/oumi/pull/632
  • Add config example for long context fine-tuning by @oelachqar in https://github.com/oumi-ai/oumi/pull/631
  • Add distribution mode flag to llama_tune by @wizeng23 in https://github.com/oumi-ai/oumi/pull/635
  • Judge Notebook 2: Custom Judge by @kaisopos in https://github.com/oumi-ai/oumi/pull/623
  • Bugfixes for LLAVA by @xrdaukar in https://github.com/oumi-ai/oumi/pull/634
  • Update sphinx config and docs to fix misc errors and warnings by @oelachqar in https://github.com/oumi-ai/oumi/pull/639
  • Factor out OUMI_TOTAL_NUM_GPUS env var by @wizeng23 in https://github.com/oumi-ai/oumi/pull/636
  • Remove bitsandbytes from train dependencies by @oelachqar in https://github.com/oumi-ai/oumi/pull/643
  • Enable intersphinx to allow linking to external documentation pages by @oelachqar in https://github.com/oumi-ai/oumi/pull/640
  • Tune few training params for LLAVA and blip2 models by @xrdaukar in https://github.com/oumi-ai/oumi/pull/642
  • Added support for specifying the inference engine via the InferenceConfig by @taenin in https://github.com/oumi-ai/oumi/pull/638
  • Add popular pre-training dataset classes by @oelachqar in https://github.com/oumi-ai/oumi/pull/641
  • Remove openai dependency by @oelachqar in https://github.com/oumi-ai/oumi/pull/644
  • Update our documentation to point to the new CLI. by @taenin in https://github.com/oumi-ai/oumi/pull/645
  • Enable dataloaders for VLLM-s (llava and blip2) by @xrdaukar in https://github.com/oumi-ai/oumi/pull/646
  • Allow gradient clipping to be optional by @optas in https://github.com/oumi-ai/oumi/pull/649
  • Add support for add_generation_prompt in LLAVA chat template by @xrdaukar in https://github.com/oumi-ai/oumi/pull/648
  • Add a description to the Launch CLI by @taenin in https://github.com/oumi-ai/oumi/pull/651
  • Add all Llama FSDP GCP configs by @wizeng23 in https://github.com/oumi-ai/oumi/pull/637
  • Coerce model params to correct dtype for QLoRA FSDP by @wizeng23 in https://github.com/oumi-ai/oumi/pull/652
  • Use uv for pip install commands by @wizeng23 in https://github.com/oumi-ai/oumi/pull/653
  • Update sphinx docs by @oelachqar in https://github.com/oumi-ai/oumi/pull/654
  • [Docs] Refactor docs pipeline by @oelachqar in https://github.com/oumi-ai/oumi/pull/655
  • [docs] swap and configure sphinx theme by @oelachqar in https://github.com/oumi-ai/oumi/pull/656
  • [Docs] Add documentation placeholders by @oelachqar in https://github.com/oumi-ai/oumi/pull/658
  • [Docs] Add sphinx-bibtex by @oelachqar in https://github.com/oumi-ai/oumi/pull/659
  • [Docs] fix rendering issues by @oelachqar in https://github.com/oumi-ai/oumi/pull/660
  • [docs] fix broken links by @oelachqar in https://github.com/oumi-ai/oumi/pull/661
  • Fix broken link in readme (dev_setup) by @kaisopos in https://github.com/oumi-ai/oumi/pull/662
  • [docs][tiny] fix minor doc typos by @oelachqar in https://github.com/oumi-ai/oumi/pull/666
  • [docs] add autodoc2 template by @oelachqar in https://github.com/oumi-ai/oumi/pull/665
  • [docs] Add content links and references by @oelachqar in https://github.com/oumi-ai/oumi/pull/668
  • [docs] switch to myst-nb for rendering notebooks by @oelachqar in https://github.com/oumi-ai/oumi/pull/669
  • [docs] Add script to generate module summaries by @oelachqar in https://github.com/oumi-ai/oumi/pull/670
  • [docs] Include cli reference by @oelachqar in https://github.com/oumi-ai/oumi/pull/671
  • Add dataset submodules by @oelachqar in https://github.com/oumi-ai/oumi/pull/667
  • Update notebooks to include a descriptive title by @oelachqar in https://github.com/oumi-ai/oumi/pull/664
  • Update tests/utils/test_device_utils.py by @xrdaukar in https://github.com/oumi-ai/oumi/pull/672
  • [Inference] Bug in generation config stop tokens by @kaisopos in https://github.com/oumi-ai/oumi/pull/663
  • Support rewriting special label values to -100 (ignore_index) to exclude from loss by @xrdaukar in https://github.com/oumi-ai/oumi/pull/657
  • Rename emails and website url to Oumi by @wizeng23 in https://github.com/oumi-ai/oumi/pull/675
  • Update scripts/benchmarks/minimal_multimodal_training.py to be able to test 3 other models by @xrdaukar in https://github.com/oumi-ai/oumi/pull/677
  • Update data collators TextCollatorWithPadding by @xrdaukar in https://github.com/oumi-ai/oumi/pull/674
  • Add image support to llama3-instruct.jinja chat template by @xrdaukar in https://github.com/oumi-ai/oumi/pull/676
  • Cleanup vscode launch.json by @oelachqar in https://github.com/oumi-ai/oumi/pull/679
  • Add Llama 3B DDP configs by @wizeng23 in https://github.com/oumi-ai/oumi/pull/681
  • Add cff file for citations by @oelachqar in https://github.com/oumi-ai/oumi/pull/682
  • Small edits to Llama configs by @wizeng23 in https://github.com/oumi-ai/oumi/pull/683
  • Increase llava.7b.sft max length from 512 to 1024 by @xrdaukar in https://github.com/oumi-ai/oumi/pull/687
  • Rename turn.py to conversation.py by @oelachqar in https://github.com/oumi-ai/oumi/pull/685
  • Pad token fix by @kaisopos in https://github.com/oumi-ai/oumi/pull/678
  • Add basic validation for data collators by @xrdaukar in https://github.com/oumi-ai/oumi/pull/688
  • Support --image CLI param in infer_interactive by @xrdaukar in https://github.com/oumi-ai/oumi/pull/686
  • Move sft and preference tuning datasets to their own module by @oelachqar in https://github.com/oumi-ai/oumi/pull/691
  • Set model.tokenizer_pad_token for gpt2 in Oumi Inference example in vsconfig by @xrdaukar in https://github.com/oumi-ai/oumi/pull/692
  • Update our dev set up instructions to not conflict with the new CLI by @taenin in https://github.com/oumi-ai/oumi/pull/698
  • Add back the polling spinner to the Launch CLI by @taenin in https://github.com/oumi-ai/oumi/pull/695
  • Add a dummy sft dataset for unit tests by @oelachqar in https://github.com/oumi-ai/oumi/pull/697
  • Include judge commands into main CLI by @oelachqar in https://github.com/oumi-ai/oumi/pull/690
  • Replace logger.exception() in TextCollatorWithPadding with logger.error() by @xrdaukar in https://github.com/oumi-ai/oumi/pull/700
  • [Clean-up] Remove dead code (custom eval) by @kaisopos in https://github.com/oumi-ai/oumi/pull/696
  • Define oumi.core.constants file for global constants by @xrdaukar in https://github.com/oumi-ai/oumi/pull/694
  • Add serialization helper methods for Conversation object by @oelachqar in https://github.com/oumi-ai/oumi/pull/701
  • Create pull_request_template.md by @taenin in https://github.com/oumi-ai/oumi/pull/693
  • Use -100 as ignore_index in VisionLanguageSftDataset and in build_data_collator() by default by @xrdaukar in https://github.com/oumi-ai/oumi/pull/702
  • Add main entrypoint, update vscode launch.json to use new cli by @oelachqar in https://github.com/oumi-ai/oumi/pull/704
  • Update experimental async dataset feature by @oelachqar in https://github.com/oumi-ai/oumi/pull/705
  • Update new image CLI to support input image for VLLMs by @xrdaukar in https://github.com/oumi-ai/oumi/pull/699
  • Update sft jsonl dataset to support json files and alpaca format by @oelachqar in https://github.com/oumi-ai/oumi/pull/703
  • Make interactive inference continually loop by @wizeng23 in https://github.com/oumi-ai/oumi/pull/689
  • Update vscode launch inference example by @xrdaukar in https://github.com/oumi-ai/oumi/pull/706
  • Training configurations llama3.1 + ChatML-Template by @optas in https://github.com/oumi-ai/oumi/pull/684
  • Add help strings for oumi train/evaluate/infer by @wizeng23 in https://github.com/oumi-ai/oumi/pull/708
  • Log peak gpu memory at the end of training by @oelachqar in https://github.com/oumi-ai/oumi/pull/710
  • Log training git commit and tag by @oelachqar in https://github.com/oumi-ai/oumi/pull/709
  • Add github issue templates. by @taenin in https://github.com/oumi-ai/oumi/pull/711
  • Enable more ruff rules by @oelachqar in https://github.com/oumi-ai/oumi/pull/712
  • Add Llama inference configs by @wizeng23 in https://github.com/oumi-ai/oumi/pull/713
  • Enable pathlib rule by @oelachqar in https://github.com/oumi-ai/oumi/pull/714
  • Update index.rst - switch expanded name Open Universal Machine Intelligence and replace with "Oumi" by @ciaralema in https://github.com/oumi-ai/oumi/pull/715
  • Make sure registry is case agnostic by @oelachqar in https://github.com/oumi-ai/oumi/pull/717
  • Move config files to new directory structure by @wizeng23 in https://github.com/oumi-ai/oumi/pull/720
  • Replace np.array(x) with x.numpy() to resolve test warnings by @xrdaukar in https://github.com/oumi-ai/oumi/pull/724
  • Update index.rst by @ciaralema in https://github.com/oumi-ai/oumi/pull/719
  • Add judge cli tests and bugfixes by @oelachqar in https://github.com/oumi-ai/oumi/pull/722
  • Update tests/builders/test_lr_schedules.py by @xrdaukar in https://github.com/oumi-ai/oumi/pull/723
  • Add dataset_path parameter by @oelachqar in https://github.com/oumi-ai/oumi/pull/716
  • [tiny] Update judge configs by @oelachqar in https://github.com/oumi-ai/oumi/pull/725
  • Rename --generation.(input/output)_filepath flags to --(input/output)_path and rename detach flag by @wizeng23 in https://github.com/oumi-ai/oumi/pull/707
  • Rename LLAVA sft_sky_job.yaml for consistency with blip2 by @xrdaukar in https://github.com/oumi-ai/oumi/pull/729
  • Remove deprecated preprocessing function by @oelachqar in https://github.com/oumi-ai/oumi/pull/726
  • Add DPO base class, initial dataset, integration tests by @oelachqar in https://github.com/oumi-ai/oumi/pull/728
  • [tiny] Add datasets import to enable registry discovery by @oelachqar in https://github.com/oumi-ai/oumi/pull/731
  • Add a helper function is_image_text_llm() to detect supported image+text LLMs by @xrdaukar in https://github.com/oumi-ai/oumi/pull/730
  • Add dpo debug dataset and integration test by @oelachqar in https://github.com/oumi-ai/oumi/pull/732
  • Add inference engine for vertex GCP by @oelachqar in https://github.com/oumi-ai/oumi/pull/734
  • [docs] Cleanup placeholder documentation pages by @oelachqar in https://github.com/oumi-ai/oumi/pull/736
  • Native inference with images (model.generate), and Processor abstraction by @xrdaukar in https://github.com/oumi-ai/oumi/pull/721
  • [docs] update faq section by @oelachqar in https://github.com/oumi-ai/oumi/pull/739
  • [docs] update advanced topics section by @oelachqar in https://github.com/oumi-ai/oumi/pull/737
  • [docs] update datasets section by @oelachqar in https://github.com/oumi-ai/oumi/pull/738
  • [docs] Update inference section by @oelachqar in https://github.com/oumi-ai/oumi/pull/744
  • [docs] Add evaluate and judge user guides by @oelachqar in https://github.com/oumi-ai/oumi/pull/745
  • [docs] Update training section by @oelachqar in https://github.com/oumi-ai/oumi/pull/741
  • Support merve/vqav2-small dataset by @xrdaukar in https://github.com/oumi-ai/oumi/pull/742
  • Reorder IMAGE and TEXT messages in coco and Flickr datasets by @xrdaukar in https://github.com/oumi-ai/oumi/pull/743
  • Rename base datasets modules by @oelachqar in https://github.com/oumi-ai/oumi/pull/735
  • [tests] Move unit tests to unit folder by @oelachqar in https://github.com/oumi-ai/oumi/pull/753
  • Update training params for LLAVA and BLIP2 models by @xrdaukar in https://github.com/oumi-ai/oumi/pull/754
  • [docs] Add script to summarize configs by @oelachqar in https://github.com/oumi-ai/oumi/pull/751
  • [docs] Enable termynal cli and add example by @oelachqar in https://github.com/oumi-ai/oumi/pull/750
  • [docs] Update apidoc rst files with latest modules by @oelachqar in https://github.com/oumi-ai/oumi/pull/749
  • [docs] Add logo, analytics, and update sphinx config by @oelachqar in https://github.com/oumi-ai/oumi/pull/747
  • [docs] Update about section by @oelachqar in https://github.com/oumi-ai/oumi/pull/748
  • [docs] Add page for using skypilot by @oelachqar in https://github.com/oumi-ai/oumi/pull/746
  • Update base dataset class to use dataset_path by @oelachqar in https://github.com/oumi-ai/oumi/pull/757
  • Unit tests with sanity checks for oumi chat templates by @xrdaukar in https://github.com/oumi-ai/oumi/pull/755
  • [docs] Misc docs cleanup tasks by @oelachqar in https://github.com/oumi-ai/oumi/pull/756
  • Add WandB integration to eval by @wizeng23 in https://github.com/oumi-ai/oumi/pull/727
  • Add 3 new PNG images to testdata by @xrdaukar in https://github.com/oumi-ai/oumi/pull/758
  • Update LLAVA integration tests to use different test images by @xrdaukar in https://github.com/oumi-ai/oumi/pull/761
  • [tests] Add shared mock tokenizer mixture, requires_gpus marker by @oelachqar in https://github.com/oumi-ai/oumi/pull/759
  • Update vllm engine to allow for continuous batching by @oelachqar in https://github.com/oumi-ai/oumi/pull/762
  • Added text completions collator hard-coded for Llama 3.1 support by @jgreer013 in https://github.com/oumi-ai/oumi/pull/752
  • Update remote inference engines to show progress, reduce native engine verbosity by @oelachqar in https://github.com/oumi-ai/oumi/pull/760
  • Add sample oumi launcher configs for LLAVA and blip2 by @xrdaukar in https://github.com/oumi-ai/oumi/pull/763
  • Create 2 other Launcher configs for LLAVA and blip2 using OUMI training loop by @xrdaukar in https://github.com/oumi-ai/oumi/pull/765
  • Update CI to include all dependencies, fix newly introduced vllm bugs by @oelachqar in https://github.com/oumi-ai/oumi/pull/764
  • Suggest using the git root directory when launching jobs via a CLI in dev-build. by @taenin in https://github.com/oumi-ai/oumi/pull/766
  • [bug] enable logger propagation so that pytest can capture them by @oelachqar in https://github.com/oumi-ai/oumi/pull/769
  • Update liger kernel integration by @oelachqar in https://github.com/oumi-ai/oumi/pull/768
  • Add BaseInferenceEngine supported_params to keep track of inference engine parameters by @oelachqar in https://github.com/oumi-ai/oumi/pull/770
  • Add READMEs for new config structure by @wizeng23 in https://github.com/oumi-ai/oumi/pull/767
  • Fix *_no_cuda() tests in tests/unit/utils/test_device_utils.py by @xrdaukar in https://github.com/oumi-ai/oumi/pull/775
  • Remove support for python 3.8, update to 3.9 by @oelachqar in https://github.com/oumi-ai/oumi/pull/772
  • Multimodal improvements, and basic support for meta-llama/Llama-3.2-11B-Vision-Instruct by @xrdaukar in https://github.com/oumi-ai/oumi/pull/771
  • Set vLLM max LoRA rank based on LoRA adapter by @wizeng23 in https://github.com/oumi-ai/oumi/pull/776
  • [WIP] Llama 3.2 Vision 11B configs by @xrdaukar in https://github.com/oumi-ai/oumi/pull/777
  • Tune MLLMs configs for Llama Vision and LLAVA by @xrdaukar in https://github.com/oumi-ai/oumi/pull/779
  • Inference vLLM: towards fitting big models in GPU memory by @optas in https://github.com/oumi-ai/oumi/pull/774
  • Adds control for an explicit quantization-compute dtype by @optas in https://github.com/oumi-ai/oumi/pull/680
  • Correct layer name in Llama vision accelerate config by @xrdaukar in https://github.com/oumi-ai/oumi/pull/782
  • Clean up configs by @wizeng23 in https://github.com/oumi-ai/oumi/pull/781
  • Save (pre-)processor configs by @xrdaukar in https://github.com/oumi-ai/oumi/pull/780
  • Tiny: force batch_size of generation parameters >= 1 by @optas in https://github.com/oumi-ai/oumi/pull/783
  • Switch to the regular pip temporarily in the old Polaris launcher by @xrdaukar in https://github.com/oumi-ai/oumi/pull/787
  • Define PAD token for Llama 2B model (fineweb pretrain) by @xrdaukar in https://github.com/oumi-ai/oumi/pull/785
  • Rename oumi launch stop to oumi launch cancel by @wizeng23 in https://github.com/oumi-ai/oumi/pull/786
  • [tiny] move collator test to unit folder by @oelachqar in https://github.com/oumi-ai/oumi/pull/790
  • Add oumi launch stop subcommand by @wizeng23 in https://github.com/oumi-ai/oumi/pull/789
  • Add sft packed dataset and tests by @oelachqar in https://github.com/oumi-ai/oumi/pull/788
  • CLI documentation updates. by @taenin in https://github.com/oumi-ai/oumi/pull/791
  • Updated default temperature from 1.0 to 0.0 for generation params by @jgreer013 in https://github.com/oumi-ai/oumi/pull/792
  • Add a prominent link to our docs. by @taenin in https://github.com/oumi-ai/oumi/pull/793
  • Fixing 2 broken links in README.md by @kaisopos in https://github.com/oumi-ai/oumi/pull/794
  • Switch from writing individual conversations to all conversations by @jgreer013 in https://github.com/oumi-ai/oumi/pull/796
  • Update pull_request_template.md by @taenin in https://github.com/oumi-ai/oumi/pull/798
  • Fix polling jobs and job status in the CLI by @taenin in https://github.com/oumi-ai/oumi/pull/795
  • Update base SFT dataset to add support for training on assistant turns only by @oelachqar in https://github.com/oumi-ai/oumi/pull/797
  • Small fix for building docs by @taenin in https://github.com/oumi-ai/oumi/pull/799
  • Update polaris user regex to handle usernames with dashes by @jgreer013 in https://github.com/oumi-ai/oumi/pull/801
  • Simplify TextCollatorWithPadding by @xrdaukar in https://github.com/oumi-ai/oumi/pull/800
  • Minor updates to docstrings and error messages related to text_with_padding collator by @xrdaukar in https://github.com/oumi-ai/oumi/pull/802
  • Adding more control/parameters for text generation by @optas in https://github.com/oumi-ai/oumi/pull/784
  • Pre-training HSDP configs for Llama 8B for A100-40GB by @xrdaukar in https://github.com/oumi-ai/oumi/pull/778
  • Fix pyproject.toml merge error by @oelachqar in https://github.com/oumi-ai/oumi/pull/805
  • Update llama_tune script to support 8B pretraining by @xrdaukar in https://github.com/oumi-ai/oumi/pull/804
  • Add e2e fine-tuning test by @oelachqar in https://github.com/oumi-ai/oumi/pull/806
  • Create Polaris and GCP launcher configs for Llama 8B pretraining by @xrdaukar in https://github.com/oumi-ai/oumi/pull/807
  • Enable sphinx-tabs, update OOM faq page to use it by @oelachqar in https://github.com/oumi-ai/oumi/pull/808
  • [docs] Start adding doctests by @oelachqar in https://github.com/oumi-ai/oumi/pull/809
  • Fixed broken skyssh and skycode commands by @jgreer013 in https://github.com/oumi-ai/oumi/pull/810
  • Enable Accelerate FSDP runs without using the Accelerate launcher by @wizeng23 in https://github.com/oumi-ai/oumi/pull/803
  • Disable uv in polaris launcher by @xrdaukar in https://github.com/oumi-ai/oumi/pull/812
  • Migrate configs to use Oumi instead of Accelerate configs by @wizeng23 in https://github.com/oumi-ai/oumi/pull/811
  • Update pretest.yaml by @taenin in https://github.com/oumi-ai/oumi/pull/817
  • Update installation.md by @taenin in https://github.com/oumi-ai/oumi/pull/814
  • Add smollm config by @oelachqar in https://github.com/oumi-ai/oumi/pull/815
  • Add extra logging in Polaris client by @xrdaukar in https://github.com/oumi-ai/oumi/pull/813
  • Fix a small typo in quickstart by @taenin in https://github.com/oumi-ai/oumi/pull/818
  • Correct names of few env vars in llama_tune.sh by @xrdaukar in https://github.com/oumi-ai/oumi/pull/819
  • [tiny] move collator to builders by @oelachqar in https://github.com/oumi-ai/oumi/pull/821
  • Update sample commands to run Llama 8B model on Polaris by @xrdaukar in https://github.com/oumi-ai/oumi/pull/823
  • [tiny] update device cleanup by @oelachqar in https://github.com/oumi-ai/oumi/pull/822
  • Update quickstart to use the new CLI arg format. by @taenin in https://github.com/oumi-ai/oumi/pull/820
  • Mock out slow tests. by @taenin in https://github.com/oumi-ai/oumi/pull/825
  • More changes for Llama 8B Polaris by @xrdaukar in https://github.com/oumi-ai/oumi/pull/826
  • Minor updates to Polaris scripts by @xrdaukar in https://github.com/oumi-ai/oumi/pull/827
  • Misc documentation updates for installation, quickstart, and tutorials. by @taenin in https://github.com/oumi-ai/oumi/pull/828
  • Add image utils to load image bytes for messages with IMAGE_URL, IMAGE_PATH types by @xrdaukar in https://github.com/oumi-ai/oumi/pull/829
  • Update our documentation to include modules for directories that aren't packages. by @taenin in https://github.com/oumi-ai/oumi/pull/830
  • Add Llama405B Qlora/Lora/FFT configs by @wizeng23 in https://github.com/oumi-ai/oumi/pull/833
  • Add SGLangInferenceEngine by @xrdaukar in https://github.com/oumi-ai/oumi/pull/834
  • Switch from 8-bit AdamW to AdamW for FSDP Lora/QLora by @wizeng23 in https://github.com/oumi-ai/oumi/pull/835
  • Add options for specifying log levels in the CLI. by @taenin in https://github.com/oumi-ai/oumi/pull/836
  • Request user feedback after training. by @taenin in https://github.com/oumi-ai/oumi/pull/837
  • Documentation fix for training. by @taenin in https://github.com/oumi-ai/oumi/pull/840
  • [tiny] pass dataset_path to constructor by @oelachqar in https://github.com/oumi-ai/oumi/pull/841
  • Move chatqa to projects, add details by @oelachqar in https://github.com/oumi-ai/oumi/pull/816
  • [tiny] fix typo in docs/quickstart by @oelachqar in https://github.com/oumi-ai/oumi/pull/843
  • Add unit tests for builders/data module by @oelachqar in https://github.com/oumi-ai/oumi/pull/842
  • Additional improvements to 405B by @wizeng23 in https://github.com/oumi-ai/oumi/pull/838
  • Add constrained decoding to vllm engine by @oelachqar in https://github.com/oumi-ai/oumi/pull/844
  • Add support for remote vllm inference engine by @oelachqar in https://github.com/oumi-ai/oumi/pull/845
  • Add @override annotations to misc inference engines methods where missing by @xrdaukar in https://github.com/oumi-ai/oumi/pull/846
  • Add a debug command for dumping environment information. by @taenin in https://github.com/oumi-ai/oumi/pull/839
  • [tiny] fix openai remote engine auth bug by @oelachqar in https://github.com/oumi-ai/oumi/pull/848
  • Update RemoteInferenceEngine to support guided decoding by @oelachqar in https://github.com/oumi-ai/oumi/pull/849
  • Support multimodal inference with remote engines by @xrdaukar in https://github.com/oumi-ai/oumi/pull/847
  • Fixes for quickstart and launch commands. by @taenin in https://github.com/oumi-ai/oumi/pull/851
  • Support Qwen2-VL model for training by @xrdaukar in https://github.com/oumi-ai/oumi/pull/850
  • Fix our config parsing test by @taenin in https://github.com/oumi-ai/oumi/pull/855
  • Move remote_params from GenerationParams to InferenceConfig by @xrdaukar in https://github.com/oumi-ai/oumi/pull/853
  • Tune PyTorch CUDA allocation params for Llama 405B qlora by @xrdaukar in https://github.com/oumi-ai/oumi/pull/854
  • Update Llama 405B qlora config to use adamw_torch_fused optimizer by @xrdaukar in https://github.com/oumi-ai/oumi/pull/856
  • Update vllm config path in Polaris multi-node inference notebook by @xrdaukar in https://github.com/oumi-ai/oumi/pull/857
  • Change config dir structure for LLAVA and blip2 models by @xrdaukar in https://github.com/oumi-ai/oumi/pull/859
  • Enable LM-harness eval for multimodal, and add eval config for Llama 3.2 Vision 11B by @xrdaukar in https://github.com/oumi-ai/oumi/pull/858
  • Update inference engine docs. by @taenin in https://github.com/oumi-ai/oumi/pull/862
  • Initial port of zigzag attention by @xrdaukar in https://github.com/oumi-ai/oumi/pull/861
  • Add integration tests for visual SFT datasets (VisionLanguageSftDataset) by @xrdaukar in https://github.com/oumi-ai/oumi/pull/863
  • Update base config to_yaml signature by @oelachqar in https://github.com/oumi-ai/oumi/pull/865
  • Add support for batch inference by @oelachqar in https://github.com/oumi-ai/oumi/pull/864
  • Parallelize data preprocessing in BaseMapDataset by @xrdaukar in https://github.com/oumi-ai/oumi/pull/868
  • Tweak writer batch size computation in BaseMapDataset by @xrdaukar in https://github.com/oumi-ai/oumi/pull/870
  • Refresh fine-tuning notebook by @oelachqar in https://github.com/oumi-ai/oumi/pull/872
  • Support freezing nested model layers by @xrdaukar in https://github.com/oumi-ai/oumi/pull/873
  • Update dev_setup.md with minor clarification for use of SSH in GitHub by @mkoukoumidis in https://github.com/oumi-ai/oumi/pull/866
  • Rewrite CONTRIBUTING.md with new content, fixes and nits by @mkoukoumidis in https://github.com/oumi-ai/oumi/pull/867
  • Phi3 Vision (microsoft/Phi-3-vision-128k-instruct): Basic training support by @xrdaukar in https://github.com/oumi-ai/oumi/pull/871
  • Update main readme (part 1) by @oelachqar in https://github.com/oumi-ai/oumi/pull/874
  • Update readme sections (part 2) by @oelachqar in https://github.com/oumi-ai/oumi/pull/875
  • Refresh documentation pages by @oelachqar in https://github.com/oumi-ai/oumi/pull/877
  • Qwen2-VL-2B-Instruct: basic training config by @xrdaukar in https://github.com/oumi-ai/oumi/pull/876
  • Update recipes list in readme by @oelachqar in https://github.com/oumi-ai/oumi/pull/878
  • Update the list of supported VLM-s in README by @xrdaukar in https://github.com/oumi-ai/oumi/pull/879
  • Update documentation landing page by @oelachqar in https://github.com/oumi-ai/oumi/pull/881
  • Cap TRL library version to <0.12.2 by @jgreer013 in https://github.com/oumi-ai/oumi/pull/882
  • [tiny] fix remote inference bug for messages with image urls by @oelachqar in https://github.com/oumi-ai/oumi/pull/883
  • Adding support for Alpaca Eval by @kaisopos in https://github.com/oumi-ai/oumi/pull/880
  • Improve error messaging for the remote inference engine. by @taenin in https://github.com/oumi-ai/oumi/pull/884
  • Move multimodal configs from configs/examples/vision to configs/recipes/vision by @xrdaukar in https://github.com/oumi-ai/oumi/pull/886
  • Define internal VLM model configs by @xrdaukar in https://github.com/oumi-ai/oumi/pull/885
  • Alpaca Eval 2.0 notebook by @kaisopos in https://github.com/oumi-ai/oumi/pull/887
  • Freeze torchdata>=0.8.0,<0.10.0 by @xrdaukar in https://github.com/oumi-ai/oumi/pull/890
  • Create torchrun, accelerate CLI wrapper by @xrdaukar in https://github.com/oumi-ai/oumi/pull/888
  • Update VLM configs to use oumi distributed torchrun by @xrdaukar in https://github.com/oumi-ai/oumi/pull/891
  • Allow None batch size in GenerationParams by @wizeng23 in https://github.com/oumi-ai/oumi/pull/895
  • Evaluation: Saving metadata for future repro by @kaisopos in https://github.com/oumi-ai/oumi/pull/892
  • Fix missing import precommits when you create a fresh install of Oumi by @taenin in https://github.com/oumi-ai/oumi/pull/898
  • Update inference documentation by @wizeng23 in https://github.com/oumi-ai/oumi/pull/893
  • Allow simpler method of specifying LoRA Adapter for eval/inference by @wizeng23 in https://github.com/oumi-ai/oumi/pull/889
  • Introduce complex Message.content with sub-items - Phase 1 by @xrdaukar in https://github.com/oumi-ai/oumi/pull/897
  • Enable Message.content with sub-items - phase 2 by @xrdaukar in https://github.com/oumi-ai/oumi/pull/901
  • Update configs/notebooks to use typer CLI instead of oumi.train/eval/infer modules by @wizeng23 in https://github.com/oumi-ai/oumi/pull/902
  • Evaluation Notebooks: MT Bench by @kaisopos in https://github.com/oumi-ai/oumi/pull/899
  • Update chat template/conversation format - Phase 3 by @xrdaukar in https://github.com/oumi-ai/oumi/pull/903
  • Update make skycode to use Oumi launcher by @wizeng23 in https://github.com/oumi-ai/oumi/pull/894
  • Delete main() functions in train/evaluate/infer.py by @wizeng23 in https://github.com/oumi-ai/oumi/pull/900
  • Update Inference links in oumi README by @xrdaukar in https://github.com/oumi-ai/oumi/pull/904
  • Define qwen2-vl-instruct chat template by @xrdaukar in https://github.com/oumi-ai/oumi/pull/907
  • Set max_pixels/min_pixels params for Qwen2 VL model by @xrdaukar in https://github.com/oumi-ai/oumi/pull/908
  • Update chat template/conversation format - Phase 4 by @xrdaukar in https://github.com/oumi-ai/oumi/pull/906
  • Update configs to use oumi distributed torchrun by @wizeng23 in https://github.com/oumi-ai/oumi/pull/905
  • [tiny] minor docs updates by @oelachqar in https://github.com/oumi-ai/oumi/pull/913
  • [docs] clean-up docs pages by @oelachqar in https://github.com/oumi-ai/oumi/pull/914
  • Initial support for HuggingFaceTB/SmolVLM-Instruct by @xrdaukar in https://github.com/oumi-ai/oumi/pull/910
  • Catch exceptions for missing LoRA adapters by @taenin in https://github.com/oumi-ai/oumi/pull/918
  • Exclude custom models from Vision-Language model checks. by @taenin in https://github.com/oumi-ai/oumi/pull/919
  • Refactor is_custom_model() check into a function + Minor tests updates by @xrdaukar in https://github.com/oumi-ai/oumi/pull/921
  • Disable FSDP's use_orig_params for PEFT training by @wizeng23 in https://github.com/oumi-ai/oumi/pull/869
  • Mark Message as frozen by @xrdaukar in https://github.com/oumi-ai/oumi/pull/923
  • Rename MessageContentItem to ContentItem for brevity. by @xrdaukar in https://github.com/oumi-ai/oumi/pull/925
  • Delete configs/examples/llama3_1 by @wizeng23 in https://github.com/oumi-ai/oumi/pull/922
  • Update OOM FAQ with instructions to tune CUDA alloc by @xrdaukar in https://github.com/oumi-ai/oumi/pull/926
  • Update oumi launch documentation by @wizeng23 in https://github.com/oumi-ai/oumi/pull/930
  • 90% speed up for the Oumi CLI by @taenin in https://github.com/oumi-ai/oumi/pull/929
  • Update docs/datasets/vl_sft.md and docs/datasets/local_datasets.md reflecting the latest changes in Conversation format and VL-dataset setup by @xrdaukar in https://github.com/oumi-ai/oumi/pull/928
  • Remove remaining references to MessageContentItem by @xrdaukar in https://github.com/oumi-ai/oumi/pull/931
  • Extend dev setup guide by @wizeng23 in https://github.com/oumi-ai/oumi/pull/932
  • Clean up configs by @wizeng23 in https://github.com/oumi-ai/oumi/pull/933
  • Add ascii art to all oumi commands in the CLI. by @taenin in https://github.com/oumi-ai/oumi/pull/934
  • [docs] Reorganize doc pages by @oelachqar in https://github.com/oumi-ai/oumi/pull/935
  • Fix broken pip targets by @wizeng23 in https://github.com/oumi-ai/oumi/pull/927
  • [docs] Enable doctest directive and add make doctests by @oelachqar in https://github.com/oumi-ai/oumi/pull/937
  • Evaluation refactoring by @kaisopos in https://github.com/oumi-ai/oumi/pull/912
  • Migrate Sky configs to Oumi job configs by @wizeng23 in https://github.com/oumi-ai/oumi/pull/938
  • [docs] fix misc doc build errors / warnings by @oelachqar in https://github.com/oumi-ai/oumi/pull/936
  • Update fineweb config by @wizeng23 in https://github.com/oumi-ai/oumi/pull/940
  • Evaluation refactoring (2nd) by @kaisopos in https://github.com/oumi-ai/oumi/pull/939
  • Update CLI parser to support = separated arguments by @xrdaukar in https://github.com/oumi-ai/oumi/pull/942
  • Integrating with AlpacaEval (PR 1 of 2) by @kaisopos in https://github.com/oumi-ai/oumi/pull/941
  • [Bug Fix] AlpacaEval by @kaisopos in https://github.com/oumi-ai/oumi/pull/943
  • Configure logging for oumi.telemetry in train.py by @xrdaukar in https://github.com/oumi-ai/oumi/pull/944
  • Only print ASCII logo on main process by @xrdaukar in https://github.com/oumi-ai/oumi/pull/946
  • [bug] Fix issue with registered dataset loading by @oelachqar in https://github.com/oumi-ai/oumi/pull/947
  • Alpaca Eval: integration test by @kaisopos in https://github.com/oumi-ai/oumi/pull/945
  • Print telemetry stats in more compact format by @xrdaukar in https://github.com/oumi-ai/oumi/pull/950
  • Refactor conversation_utils by @xrdaukar in https://github.com/oumi-ai/oumi/pull/951
  • Improve rank detection for logging by @xrdaukar in https://github.com/oumi-ai/oumi/pull/953
  • [docs] Add core concepts page by @oelachqar in https://github.com/oumi-ai/oumi/pull/952
  • [docs][tiny] Update acknowledgement page by @oelachqar in https://github.com/oumi-ai/oumi/pull/954
  • [docs][tiny] remove unused pages by @oelachqar in https://github.com/oumi-ai/oumi/pull/955
  • [docs][bug] Fix rendering issue in installation page by @oelachqar in https://github.com/oumi-ai/oumi/pull/956
  • [docs] Refresh evaluate user guide by @oelachqar in https://github.com/oumi-ai/oumi/pull/961
  • [docs] Refresh datasets resource section by @oelachqar in https://github.com/oumi-ai/oumi/pull/960
  • [docs] Refresh models resource section by @oelachqar in https://github.com/oumi-ai/oumi/pull/959
  • [docs] Refresh infer user guide by @oelachqar in https://github.com/oumi-ai/oumi/pull/957
  • [docs] Refresh training user guide by @oelachqar in https://github.com/oumi-ai/oumi/pull/958
  • [docs][tiny] Update links by @oelachqar in https://github.com/oumi-ai/oumi/pull/962
  • [docs] Update doc generation config by @oelachqar in https://github.com/oumi-ai/oumi/pull/963
  • [docs] Fix misc docstring rendering issues by @oelachqar in https://github.com/oumi-ai/oumi/pull/964
  • [docs] Refresh judge user guide by @oelachqar in https://github.com/oumi-ai/oumi/pull/965
  • [docs] Add documentation page for local training by @oelachqar in https://github.com/oumi-ai/oumi/pull/966
  • [docs] Update documentation index page by @oelachqar in https://github.com/oumi-ai/oumi/pull/967
  • [Evaluation] Refactor repro info & add repro info in AlpacaEval by @kaisopos in https://github.com/oumi-ai/oumi/pull/949
  • [docs] Add documentation page for notebook environment by @oelachqar in https://github.com/oumi-ai/oumi/pull/969
  • Add more options for saving LoRA models by @wizeng23 in https://github.com/oumi-ai/oumi/pull/968
  • [docs] Add vscode environment doc page by @oelachqar in https://github.com/oumi-ai/oumi/pull/970
  • [docs] Cleanup evaluation guide page by @oelachqar in https://github.com/oumi-ai/oumi/pull/971
  • [docs] Update recipes page by @oelachqar in https://github.com/oumi-ai/oumi/pull/972
  • [docs] Update tutorials page by @oelachqar in https://github.com/oumi-ai/oumi/pull/973
  • [docs] Add doc page for generative evals by @oelachqar in https://github.com/oumi-ai/oumi/pull/974
  • [docs] misc clarifications in train, judge, dataset pages by @oelachqar in https://github.com/oumi-ai/oumi/pull/976
  • [docs][tiny] Remove train/common_workflows page by @oelachqar in https://github.com/oumi-ai/oumi/pull/977
  • [docs] Add infer common workflows page by @oelachqar in https://github.com/oumi-ai/oumi/pull/978
  • Update notebooks setup documentation by @wizeng23 in https://github.com/oumi-ai/oumi/pull/975
  • [docs] Add supported models page by @oelachqar in https://github.com/oumi-ai/oumi/pull/979
  • [docs] update inference engine user guide by @oelachqar in https://github.com/oumi-ai/oumi/pull/980
  • Refactor conversation JSON builders by @xrdaukar in https://github.com/oumi-ai/oumi/pull/981
  • [docs] Update CLI reference page by @oelachqar in https://github.com/oumi-ai/oumi/pull/983
  • [docs] Minor updates to datasets intro page by @oelachqar in https://github.com/oumi-ai/oumi/pull/984
  • Remove internal GCS mentions by @wizeng23 in https://github.com/oumi-ai/oumi/pull/982
  • Update vision/smollm config names by @wizeng23 in https://github.com/oumi-ai/oumi/pull/987
  • Update VLLMInferenceEngine to support multimodal conversations by @xrdaukar in https://github.com/oumi-ai/oumi/pull/986
  • Image utils refactorings by @xrdaukar in https://github.com/oumi-ai/oumi/pull/985
  • [docs] Add training monitoring documentation page by @oelachqar in https://github.com/oumi-ai/oumi/pull/988
  • Add torchvision dependency by @xrdaukar in https://github.com/oumi-ai/oumi/pull/989
  • Define configs/recipes/vision/phi3/sft/trl_gcp_job.yaml for Phi3 Vision by @xrdaukar in https://github.com/oumi-ai/oumi/pull/990
  • data.train.text_col cleanup for VLM-s by @xrdaukar in https://github.com/oumi-ai/oumi/pull/991
  • [Evaluation] Updating Documentation (1st pass) by @kaisopos in https://github.com/oumi-ai/oumi/pull/993
  • Fix saving merged lora model by @wizeng23 in https://github.com/oumi-ai/oumi/pull/992
  • Add an option to return custom datasets as IterableDataset by @xrdaukar in https://github.com/oumi-ai/oumi/pull/994
  • Fix regression in CLI speed by @taenin in https://github.com/oumi-ai/oumi/pull/997
  • Create alias for json/jsonl datasets by @oelachqar in https://github.com/oumi-ai/oumi/pull/998
  • [Evaluation] Updating Documentation (Generative Benchmarks) by @kaisopos in https://github.com/oumi-ai/oumi/pull/999
  • [docs][tiny] Fix broken links, update sft dataset examples by @oelachqar in https://github.com/oumi-ai/oumi/pull/1002

New Contributors

  • @ciaralema made their first contribution in https://github.com/oumi-ai/oumi/pull/715

Full Changelog: https://github.com/oumi-ai/oumi/compare/v0.1-alpha...v0.1.1-alpha

- Python
Published by oelachqar about 1 year ago

oumi - Initial release

What's Changed

  • Add python project configs by @oelachqar in https://github.com/oumi-ai/oumi/pull/1
  • Add repo skeleton by @oelachqar in https://github.com/oumi-ai/oumi/pull/2
  • Export lema entrypoint scripts by @oelachqar in https://github.com/oumi-ai/oumi/pull/3
  • Update static type checking config by @oelachqar in https://github.com/oumi-ai/oumi/pull/5
  • Add example jupyter / colab notebook by @oelachqar in https://github.com/oumi-ai/oumi/pull/4
  • Refactor config parsing to use omegaconf by @oelachqar in https://github.com/oumi-ai/oumi/pull/6
  • Updating documentation (Dev Environment Setup) by @kaisopos in https://github.com/oumi-ai/oumi/pull/7
  • Add tests and vscode config by @oelachqar in https://github.com/oumi-ai/oumi/pull/8
  • Added DPOTrainer example to repo, as well as cuda device cleanup to training loop by @jgreer013 in https://github.com/oumi-ai/oumi/pull/9
  • Adding torch as top-level module dependency by @optas in https://github.com/oumi-ai/oumi/pull/10
  • Add configs for specific hardware requirements by @jgreer013 in https://github.com/oumi-ai/oumi/pull/11
  • Sort pre-commit hooks lexicographically by @xrdaukar in https://github.com/oumi-ai/oumi/pull/12
  • Add logging config by @oelachqar in https://github.com/oumi-ai/oumi/pull/13
  • Lema inference by @xrdaukar in https://github.com/oumi-ai/oumi/pull/14
  • Panos dev by @optas in https://github.com/oumi-ai/oumi/pull/16
  • Add job launcher by @oelachqar in https://github.com/oumi-ai/oumi/pull/15
  • Making split of data a flexible variable by @optas in https://github.com/oumi-ai/oumi/pull/17
  • Configure max file size in precommit hooks by @xrdaukar in https://github.com/oumi-ai/oumi/pull/18
  • Minor bugfix and documentation update by @oelachqar in https://github.com/oumi-ai/oumi/pull/19
  • adding pynvml to train env by @kaisopos in https://github.com/oumi-ai/oumi/pull/20
  • Panos dev by @optas in https://github.com/oumi-ai/oumi/pull/22
  • Augmenting Types for training hyperparams by @optas in https://github.com/oumi-ai/oumi/pull/23
  • Train refactoring (config file visibility) + a few minor changes by @kaisopos in https://github.com/oumi-ai/oumi/pull/21
  • Minimal test for train function by @xrdaukar in https://github.com/oumi-ai/oumi/pull/25
  • Fix leftover 'torch_dtype' in 'ModelParams' by @xrdaukar in https://github.com/oumi-ai/oumi/pull/26
  • Update GPU types list in the default SkyPilot config by @xrdaukar in https://github.com/oumi-ai/oumi/pull/27
  • Add a missing lema-infer command under [project.scripts] by @xrdaukar in https://github.com/oumi-ai/oumi/pull/28
  • add basic pytests for evaluate and infer by @xrdaukar in https://github.com/oumi-ai/oumi/pull/29
  • Update README and pyproject.toml by @wizeng23 in https://github.com/oumi-ai/oumi/pull/30
  • A helper function to print info about available CUDA devices by @xrdaukar in https://github.com/oumi-ai/oumi/pull/31
  • Update SkyPilot config to start using torchrun by @xrdaukar in https://github.com/oumi-ai/oumi/pull/32
  • Support basic single-node, multi-gpu training by @xrdaukar in https://github.com/oumi-ai/oumi/pull/33
  • Run all precommit hooks on the repo by @xrdaukar in https://github.com/oumi-ai/oumi/pull/35
  • Add experimental code for llama cpp inference by @jgreer013 in https://github.com/oumi-ai/oumi/pull/37
  • Create skeleton of STYLE_GUIDE.md by @xrdaukar in https://github.com/oumi-ai/oumi/pull/36
  • Adding support for training custom models (for now just a dummy model). by @kaisopos in https://github.com/oumi-ai/oumi/pull/38
  • Fix custom model name in test_train.py by @xrdaukar in https://github.com/oumi-ai/oumi/pull/39
  • Configure pyright (static type checker) and resolve existing type errors to make it pass by @xrdaukar in https://github.com/oumi-ai/oumi/pull/41
  • fix trailing whitespace warning in STYLE_GUIDE.md by @xrdaukar in https://github.com/oumi-ai/oumi/pull/43
  • Configure initial GitHub Actions workflow to run pre-commits and tests by @xrdaukar in https://github.com/oumi-ai/oumi/pull/44
  • A variety of proposed extensions to finetune a chat-based model (starting with Zephyr) by @optas in https://github.com/oumi-ai/oumi/pull/34
  • Fix syntax error in ultrachat by @xrdaukar in https://github.com/oumi-ai/oumi/pull/48
  • Create initial version of CONTRIBUTING.md by @xrdaukar in https://github.com/oumi-ai/oumi/pull/46
  • Reduce the number of training steps from 5 to 3 to make test_train.py faster by @xrdaukar in https://github.com/oumi-ai/oumi/pull/49
  • Adding registry for custom models. by @kaisopos in https://github.com/oumi-ai/oumi/pull/42
  • Add config and streaming args to DataParams by @wizeng23 in https://github.com/oumi-ai/oumi/pull/47
  • Update Pre-review Tests to only run on pull_request by @xrdaukar in https://github.com/oumi-ai/oumi/pull/50
  • Add training flags to compute token-based stats by @xrdaukar in https://github.com/oumi-ai/oumi/pull/51
  • reduce test training steps in another test which I missed before by @xrdaukar in https://github.com/oumi-ai/oumi/pull/53
  • Rename var names of *Params classes by @wizeng23 in https://github.com/oumi-ai/oumi/pull/52
  • Make some NVIDIA-specific dependencies optional by @xrdaukar in https://github.com/oumi-ai/oumi/pull/54
  • fix trl version as 0.8.6 by @xrdaukar in https://github.com/oumi-ai/oumi/pull/56
  • Remove reference to torch.cuda.clock_rate by @xrdaukar in https://github.com/oumi-ai/oumi/pull/57
  • Update inference to support non-interactive batch mode. by @kaisopos in https://github.com/oumi-ai/oumi/pull/58
  • Update README.md to include Linux/WSL specific instructions by @xrdaukar in https://github.com/oumi-ai/oumi/pull/59
  • Minor formatting improvements in README.md by @xrdaukar in https://github.com/oumi-ai/oumi/pull/60
  • Minor: Updating Lora Params by @optas in https://github.com/oumi-ai/oumi/pull/55
  • Support dataset packing by @wizeng23 in https://github.com/oumi-ai/oumi/pull/63
  • Disallow relative imports in LeMa by @xrdaukar in https://github.com/oumi-ai/oumi/pull/65
  • Add text_col param that's required for SFTTrainer by @wizeng23 in https://github.com/oumi-ai/oumi/pull/66
  • Refactor common config parsing logic (YAML, arg_list) into a common util by @xrdaukar in https://github.com/oumi-ai/oumi/pull/68
  • Standardize test naming convention by @wizeng23 in https://github.com/oumi-ai/oumi/pull/69
  • Adding support for a hardcoded evaluation with MMLU. by @kaisopos in https://github.com/oumi-ai/oumi/pull/67
  • Minor changes to the default configs/skypilot/sky.yaml config by @xrdaukar in https://github.com/oumi-ai/oumi/pull/71
  • Prototype to pass config.model.model_max_length to Trainers by @xrdaukar in https://github.com/oumi-ai/oumi/pull/70
  • [Inference] Remove the prepended prompts from model responses. by @kaisopos in https://github.com/oumi-ai/oumi/pull/73
  • Add a util to print versioning info by @xrdaukar in https://github.com/oumi-ai/oumi/pull/74
  • Switch to tempfile.TemporaryDirectory() in test_train.py by @xrdaukar in https://github.com/oumi-ai/oumi/pull/75
  • Update docstring verbs to descriptive form by @wizeng23 in https://github.com/oumi-ai/oumi/pull/76
  • Add sample accelerate and fsdp configs by @xrdaukar in https://github.com/oumi-ai/oumi/pull/77
  • Refactor code to get device rank and world size into a helper function by @xrdaukar in https://github.com/oumi-ai/oumi/pull/79
  • Add a simple util to print model summary e.g., layer names, architecture summary by @xrdaukar in https://github.com/oumi-ai/oumi/pull/80
  • Freeze numpy to pre 2.0 version by @xrdaukar in https://github.com/oumi-ai/oumi/pull/81
  • Adding inference support for next logit probability. by @kaisopos in https://github.com/oumi-ai/oumi/pull/78
  • Create FSDP configs for Phi3 by @xrdaukar in https://github.com/oumi-ai/oumi/pull/82
  • Auto-format pyproject.toml with "Even Better TOML" by @xrdaukar in https://github.com/oumi-ai/oumi/pull/83
  • Minor cleanup updates to SkyPilot configs by @xrdaukar in https://github.com/oumi-ai/oumi/pull/84
  • Mixed Precision Training, Flash-Attention-2, Print-trainable-params by @optas in https://github.com/oumi-ai/oumi/pull/85
  • Update README.md to include basic instructions for multi-GPU training (DDP, FSDP) by @xrdaukar in https://github.com/oumi-ai/oumi/pull/86
  • Start using $SKYPILOT_NUM_GPUS_PER_NODE in SkyPilot config by @xrdaukar in https://github.com/oumi-ai/oumi/pull/90
  • Add configs for FineWeb Llama2 pretraining by @wizeng23 in https://github.com/oumi-ai/oumi/pull/89
  • Quantization by @optas in https://github.com/oumi-ai/oumi/pull/87
  • Update the default SkyPilot config to print more debug/context info by @xrdaukar in https://github.com/oumi-ai/oumi/pull/92
  • Add license by @oelachqar in https://github.com/oumi-ai/oumi/pull/93
  • Initial version of SkyPilot config for multi-node training (num_nodes: N) by @xrdaukar in https://github.com/oumi-ai/oumi/pull/94
  • MMLU eval refactor. by @kaisopos in https://github.com/oumi-ai/oumi/pull/88
  • Remove comparison between LOCAL_RANK and RANK by @xrdaukar in https://github.com/oumi-ai/oumi/pull/96
  • Handling the loading of peft adapters and other minor issues (e.g., adding more logging parameters) by @optas in https://github.com/oumi-ai/oumi/pull/91
  • Update configs/skypilot/sky_llama2b.yaml to start using sky_init.sh by @xrdaukar in https://github.com/oumi-ai/oumi/pull/97
  • Add bool param to resume training from the last known checkpoint (if exists) by @xrdaukar in https://github.com/oumi-ai/oumi/pull/99
  • Inference: save/restore probabilities to/from file. by @kaisopos in https://github.com/oumi-ai/oumi/pull/98
  • Add support for dataset mixtures during training by @taenin in https://github.com/oumi-ai/oumi/pull/95
  • Add train, test, and validation splits to the LeMa config. by @taenin in https://github.com/oumi-ai/oumi/pull/101
  • nanoGPT (GPT2) pretraining recipe by @wizeng23 in https://github.com/oumi-ai/oumi/pull/103
  • Minor: Updates on Zephyr-Config by @optas in https://github.com/oumi-ai/oumi/pull/106
  • Update pre-commit config by @oelachqar in https://github.com/oumi-ai/oumi/pull/108
  • Add integration tests that verify all configs load properly. by @taenin in https://github.com/oumi-ai/oumi/pull/102
  • Handling Gradient Checkpointing by @optas in https://github.com/oumi-ai/oumi/pull/107
  • Update skypilot/sky_gpt2.yaml to include an example how to mount GCS dir by @xrdaukar in https://github.com/oumi-ai/oumi/pull/111
  • Rename dataset_params.dataset_config to dataset_params.subset by @oelachqar in https://github.com/oumi-ai/oumi/pull/109
  • Refactor SFT dataset preprocessing by @oelachqar in https://github.com/oumi-ai/oumi/pull/112
  • Support shuffling and random seeds for dataset sampling by @taenin in https://github.com/oumi-ai/oumi/pull/113
  • Split types file into module by @oelachqar in https://github.com/oumi-ai/oumi/pull/114
  • Add GCP deps to lema[cloud] by @xrdaukar in https://github.com/oumi-ai/oumi/pull/117
  • Add llama3-instruct jinja template by @jgreer013 in https://github.com/oumi-ai/oumi/pull/118
  • Update sky_init.sh to print current dir by @xrdaukar in https://github.com/oumi-ai/oumi/pull/120
  • Add prompt response sft preprocessor factory for aya dataset by @jgreer013 in https://github.com/oumi-ai/oumi/pull/121
  • Add configs for chatqa model by @oelachqar in https://github.com/oumi-ai/oumi/pull/110
  • Saving inference probs in parquet format. by @kaisopos in https://github.com/oumi-ai/oumi/pull/115
  • Refactor model registry by @oelachqar in https://github.com/oumi-ai/oumi/pull/122
  • Define BaseTrainer abstraction by @xrdaukar in https://github.com/oumi-ai/oumi/pull/116
  • Add a registry for metric functions that we can run during training. by @taenin in https://github.com/oumi-ai/oumi/pull/126
  • Update training_params.py so HF trainer uses num_train_epochs by @optas in https://github.com/oumi-ai/oumi/pull/125
  • Add native PyTorch model training by @oelachqar in https://github.com/oumi-ai/oumi/pull/123
  • [Quick fix] Handle pynvml being misconfigured by @taenin in https://github.com/oumi-ai/oumi/pull/128
  • Enable DP for inference by @kaisopos in https://github.com/oumi-ai/oumi/pull/100
  • Add configs for training llama3-8b with aya finetune by @jgreer013 in https://github.com/oumi-ai/oumi/pull/130
  • Update HF save_model() to only save on master replica by @xrdaukar in https://github.com/oumi-ai/oumi/pull/131
  • Pipe MetricsFunction from our config to train.py by @taenin in https://github.com/oumi-ai/oumi/pull/129
  • Fixing broken eval. by @kaisopos in https://github.com/oumi-ai/oumi/pull/132
  • Minor updates in SkyPilot docstrings by @xrdaukar in https://github.com/oumi-ai/oumi/pull/133
  • Fix bug with DP evaluation by @oelachqar in https://github.com/oumi-ai/oumi/pull/134
  • [MMLU custom eval] removing hardcoded subject, samples, num-shots. by @kaisopos in https://github.com/oumi-ai/oumi/pull/135
  • Add an initial config for async evaluations by @taenin in https://github.com/oumi-ai/oumi/pull/137
  • Add a new top level command: evaluate_async by @taenin in https://github.com/oumi-ai/oumi/pull/138
  • Minor bug fix in writing evaluations by @taenin in https://github.com/oumi-ai/oumi/pull/140
  • Support full GPT2 run by @wizeng23 in https://github.com/oumi-ai/oumi/pull/141
  • Upload sample configs for running async evals on GPT2 by @taenin in https://github.com/oumi-ai/oumi/pull/139
  • Apply torch.distributed.barrier() in save_model by @xrdaukar in https://github.com/oumi-ai/oumi/pull/136
  • Create an experimental util to generate pre-tokenized datasets (Parquet files) with token_ids column by @xrdaukar in https://github.com/oumi-ai/oumi/pull/144
  • Created a new dataset class with async loading & tokenization by @jgreer013 in https://github.com/oumi-ai/oumi/pull/142
  • Remove private debug dir from configs/skypilot/sky_gpt2.yaml by @xrdaukar in https://github.com/oumi-ai/oumi/pull/145
  • Define dataloader_num_workers and dataloader_prefetch_factor params by @xrdaukar in https://github.com/oumi-ai/oumi/pull/146
  • [Evaluations] Integration with LM Evaluation Harness by @kaisopos in https://github.com/oumi-ai/oumi/pull/143
  • Support model compilation by @wizeng23 in https://github.com/oumi-ai/oumi/pull/147
  • Multiple cleanup changes in configs/skypilot/sky_gpt2.yaml by @xrdaukar in https://github.com/oumi-ai/oumi/pull/148
  • Update SkyPilot training configs to include run_name by @xrdaukar in https://github.com/oumi-ai/oumi/pull/149
  • Update async eval to properly parse eval configs by @taenin in https://github.com/oumi-ai/oumi/pull/150
  • Zephyr Configs [full-model, skypilot] by @optas in https://github.com/oumi-ai/oumi/pull/152
  • Disable model.compile in gpt2 config by @xrdaukar in https://github.com/oumi-ai/oumi/pull/154
  • Update sky_init.sh to print task id and cluster info by @xrdaukar in https://github.com/oumi-ai/oumi/pull/156
  • [bug] Include jinja templates in build by @oelachqar in https://github.com/oumi-ai/oumi/pull/158
  • Add basic scaffolding for torch profiler around training loop by @xrdaukar in https://github.com/oumi-ai/oumi/pull/157
  • [Minor] Adding attn_implementation arg in LM Harness. by @kaisopos in https://github.com/oumi-ai/oumi/pull/160
  • Update Trainer.save_model to start using the public HF save_model() method (except for PEFT) by @xrdaukar in https://github.com/oumi-ai/oumi/pull/161
  • Update the vanilla eval config for gpt2 to run hellaswag evals. by @taenin in https://github.com/oumi-ai/oumi/pull/165
  • Add Dataset base class & API by @oelachqar in https://github.com/oumi-ai/oumi/pull/151
  • Add experimental notebook to run Nvidia's ChatRAG-Bench evaluation by @oelachqar in https://github.com/oumi-ai/oumi/pull/166
  • Update ChatQA training configs by @oelachqar in https://github.com/oumi-ai/oumi/pull/159
  • Update async dataset class to support pre-tokenized datasets by @oelachqar in https://github.com/oumi-ai/oumi/pull/162
  • Create a launcher script for Polaris jobs (ALCF) by @taenin in https://github.com/oumi-ai/oumi/pull/164
  • Update pre-tokenized column name to be input_ids in tokenize_dataset tool by @xrdaukar in https://github.com/oumi-ai/oumi/pull/167
  • Replacing EvaluationConfig's DataParams with DatasetSplitParams by @kaisopos in https://github.com/oumi-ai/oumi/pull/168
  • Submit config to create Custom IAM role for SkyPilot Service Accounts on GCP by @xrdaukar in https://github.com/oumi-ai/oumi/pull/169
  • Remove GCP project reference by @xrdaukar in https://github.com/oumi-ai/oumi/pull/172
  • Make sure output training dir exists by @xrdaukar in https://github.com/oumi-ai/oumi/pull/171
  • Improve launcher usability via command line arguments. by @taenin in https://github.com/oumi-ai/oumi/pull/170
  • Add a source directory to the Polaris launcher and clean up rsync copies. by @taenin in https://github.com/oumi-ai/oumi/pull/173
  • Introduce LEMA_RUN_NAME env var to SkyPilot training configs by @xrdaukar in https://github.com/oumi-ai/oumi/pull/174
  • Minor changes: 1. Remove hardcoded HF_TOKEN 2. Log effective training config by @xrdaukar in https://github.com/oumi-ai/oumi/pull/175
  • Tweak default params in gpt2 scripts by @xrdaukar in https://github.com/oumi-ai/oumi/pull/177
  • LM Harness optimizations by @kaisopos in https://github.com/oumi-ai/oumi/pull/176
  • No longer ignore .git. in Polaris Needed for venv. by @taenin in https://github.com/oumi-ai/oumi/pull/179
  • A hack for running jobs on Polaris. by @taenin in https://github.com/oumi-ai/oumi/pull/180
  • [Polaris] Move venv creation from worker to launcher. by @taenin in https://github.com/oumi-ai/oumi/pull/181
  • Update README.md to include sky launch - 10 ... example by @xrdaukar in https://github.com/oumi-ai/oumi/pull/182
  • [Evaluations] Adding support for HuggingFace's leaderboard v1 benchmarks by @kaisopos in https://github.com/oumi-ai/oumi/pull/183
  • Llama 3 Aya Fine-Tuning Updates by @jgreer013 in https://github.com/oumi-ai/oumi/pull/163
  • Remove logger propagation by @wizeng23 in https://github.com/oumi-ai/oumi/pull/185
  • [Evaluations] HF leaderboard v1 configs by @kaisopos in https://github.com/oumi-ai/oumi/pull/186
  • Move logging.py to utils by @wizeng23 in https://github.com/oumi-ai/oumi/pull/187
  • Create the Jobs config for the lema launcher. by @taenin in https://github.com/oumi-ai/oumi/pull/188
  • Initial abstract base classes for the lema launcher. by @taenin in https://github.com/oumi-ai/oumi/pull/189
  • Added mfu calculation and tests by @jgreer013 in https://github.com/oumi-ai/oumi/pull/190
  • Introduce two new training params: save_model and save_epoch by @xrdaukar in https://github.com/oumi-ai/oumi/pull/191
  • Update FineWeb ablation model configs by @xrdaukar in https://github.com/oumi-ai/oumi/pull/196
  • Added MFU telemetry by @jgreer013 in https://github.com/oumi-ai/oumi/pull/193
  • Update Polaris script by @wizeng23 in https://github.com/oumi-ai/oumi/pull/192
  • Rename training.save_model param to training.save_final_model for clarity by @xrdaukar in https://github.com/oumi-ai/oumi/pull/197
  • Support disabling dropout by @wizeng23 in https://github.com/oumi-ai/oumi/pull/184
  • Update actual mfu calculation by @jgreer013 in https://github.com/oumi-ai/oumi/pull/199
  • Implement a client for talking to SkyPilot. by @taenin in https://github.com/oumi-ai/oumi/pull/201
  • Fixed miscalculation of second step start time by @jgreer013 in https://github.com/oumi-ai/oumi/pull/202
  • Update ablation-model-fineweb-v1 config to start using grad checkpointing by @xrdaukar in https://github.com/oumi-ai/oumi/pull/198
  • Add distributed operations by @oelachqar in https://github.com/oumi-ai/oumi/pull/194
  • Add pre-commit hooks for credential scanning + new checks by @oelachqar in https://github.com/oumi-ai/oumi/pull/195
  • Sample job for multi-node training by @xrdaukar in https://github.com/oumi-ai/oumi/pull/203
  • Update Polaris multi-node launcher by @xrdaukar in https://github.com/oumi-ai/oumi/pull/204
  • Multi-node config improvements for llama2b model (HuggingFaceFW/ablation-model-fineweb-v1) by @xrdaukar in https://github.com/oumi-ai/oumi/pull/205
  • Minor updates to Polaris launcher script by @xrdaukar in https://github.com/oumi-ai/oumi/pull/206
  • Update Lema FSDP configs by @xrdaukar in https://github.com/oumi-ai/oumi/pull/207
  • [tiny] add default formatter for markdown by @oelachqar in https://github.com/oumi-ai/oumi/pull/210
  • Preparations for Lema custom pre-training loop by @oelachqar in https://github.com/oumi-ai/oumi/pull/208
  • Update MFU callback to support Lema trainer by @oelachqar in https://github.com/oumi-ai/oumi/pull/209
  • Configure llama2b model to use FSDP HYBRID_SHARD by @xrdaukar in https://github.com/oumi-ai/oumi/pull/213
  • Implement a Cluster resource manager around Sky Pilot. by @taenin in https://github.com/oumi-ai/oumi/pull/214
  • Add utils to setup distributed training by @oelachqar in https://github.com/oumi-ai/oumi/pull/211
  • Add example notebook to train NanoGPT model with Lema by @oelachqar in https://github.com/oumi-ai/oumi/pull/212
  • [tiny] update sky pilot ssh config by @oelachqar in https://github.com/oumi-ai/oumi/pull/215
  • Implement a Cloud resource manager around Sky Pilot by @taenin in https://github.com/oumi-ai/oumi/pull/216
  • Sanitize run name by @xrdaukar in https://github.com/oumi-ai/oumi/pull/217
  • Use "cluster_name" instead of "name" in the Sky client. by @taenin in https://github.com/oumi-ai/oumi/pull/218
  • Minor logging improvements in Polaris sample job scripts by @xrdaukar in https://github.com/oumi-ai/oumi/pull/219
  • Update shell scripts to point to local dataset by @jgreer013 in https://github.com/oumi-ai/oumi/pull/221
  • Support FSDP on Polaris using accelerate by @xrdaukar in https://github.com/oumi-ai/oumi/pull/220
  • Add telemetry manager by @oelachqar in https://github.com/oumi-ai/oumi/pull/222
  • Switch to the latest transformers=4.43.1 by @xrdaukar in https://github.com/oumi-ai/oumi/pull/223
  • Re-enable model compilation for llama2b model by @xrdaukar in https://github.com/oumi-ai/oumi/pull/224
  • Increase llama2b batch size from 2 to 3 by @xrdaukar in https://github.com/oumi-ai/oumi/pull/225
  • Add makefile with common local commands by @oelachqar in https://github.com/oumi-ai/oumi/pull/227
  • Add DeepSpeed config for Llama2b by @wizeng23 in https://github.com/oumi-ai/oumi/pull/228
  • MFU Improvements for Llama 2B on Polaris by @jgreer013 in https://github.com/oumi-ai/oumi/pull/229
  • FSDP config updates by @xrdaukar in https://github.com/oumi-ai/oumi/pull/231
  • Rename accelerate configs to be in line with other configs by @wizeng23 in https://github.com/oumi-ai/oumi/pull/232
  • [tiny] Update logger format to include rank, pid and threadname by @oelachqar in https://github.com/oumi-ai/oumi/pull/235
  • Set model.config.use_cache = False by @xrdaukar in https://github.com/oumi-ai/oumi/pull/233
  • Experimental training loop for pre-training by @oelachqar in https://github.com/oumi-ai/oumi/pull/230
  • Disable gradient checkpointing in SkyPilot llama2b config by @xrdaukar in https://github.com/oumi-ai/oumi/pull/236
  • Implement a client for communicating with Polaris via python. by @taenin in https://github.com/oumi-ai/oumi/pull/234
  • Add SkyPilot config for experimental/pretokenize/tokenize_dataset.py by @xrdaukar in https://github.com/oumi-ai/oumi/pull/237
  • Update Fabric.run() calls to use the "warn" flag. by @taenin in https://github.com/oumi-ai/oumi/pull/239
  • Update pretokenize tool to support input datasets by @xrdaukar in https://github.com/oumi-ai/oumi/pull/238
  • Add optimizers builder function by @oelachqar in https://github.com/oumi-ai/oumi/pull/240
  • Add a "put" method in the Polaris client for writing remote files. by @taenin in https://github.com/oumi-ai/oumi/pull/242
  • Add deepspeed (DS) config to support hierarchical partitioning by @wizeng23 in https://github.com/oumi-ai/oumi/pull/244
  • Add support for uploading MFU in wandb by @jgreer013 in https://github.com/oumi-ai/oumi/pull/245
  • Create a Polaris Cluster class consuming the polaris client by @taenin in https://github.com/oumi-ai/oumi/pull/246
  • Add initial docker image by @oelachqar in https://github.com/oumi-ai/oumi/pull/241
  • Fix a string in the Polaris Cluster tests. by @taenin in https://github.com/oumi-ai/oumi/pull/249
  • Set training loop random seeds by @oelachqar in https://github.com/oumi-ai/oumi/pull/248
  • Fix bug with Polaris multi-node script by @wizeng23 in https://github.com/oumi-ai/oumi/pull/247
  • Add torchfix linting target by @oelachqar in https://github.com/oumi-ai/oumi/pull/250
  • Add training state classes by @oelachqar in https://github.com/oumi-ai/oumi/pull/251
  • Save and restore telemetry state during training by @oelachqar in https://github.com/oumi-ai/oumi/pull/252
  • Configure file logging by @oelachqar in https://github.com/oumi-ai/oumi/pull/254
  • Create a Polaris Cloud class consuming the polaris client by @taenin in https://github.com/oumi-ai/oumi/pull/253
  • Define a registry for cloud builders. by @taenin in https://github.com/oumi-ai/oumi/pull/255
  • Add logging to tensor board, wandb in custom training loop by @oelachqar in https://github.com/oumi-ai/oumi/pull/256
  • Add a get_all utility method to the LeMa Registry by @taenin in https://github.com/oumi-ai/oumi/pull/257
  • Update the BaseCloud up_cluster definition to return a job status. by @taenin in https://github.com/oumi-ai/oumi/pull/258
  • Create a launcher class for the LeMa Launcher. by @taenin in https://github.com/oumi-ai/oumi/pull/261
  • Add script to benchmark datasets and data loader params by @oelachqar in https://github.com/oumi-ai/oumi/pull/260
  • [Follow-up] data loader benchmarking script by @oelachqar in https://github.com/oumi-ai/oumi/pull/262
  • Create DDP configs for accelerate by @xrdaukar in https://github.com/oumi-ai/oumi/pull/259
  • Switch from nightly to stable version of SkyPilot by @xrdaukar in https://github.com/oumi-ai/oumi/pull/264
  • Make all tests green by @xrdaukar in https://github.com/oumi-ai/oumi/pull/265
  • Set dataloader_pin_memory=True to be intentional by @xrdaukar in https://github.com/oumi-ai/oumi/pull/266
  • Move torch_profiler_utils from lema.utils to lema.performance by @xrdaukar in https://github.com/oumi-ai/oumi/pull/267
  • Add BaseIterableDataset, refactor DataLoader to use DataPipes by @oelachqar in https://github.com/oumi-ai/oumi/pull/263
  • Add a dataset_kwargs attribute, tests by @oelachqar in https://github.com/oumi-ai/oumi/pull/268
  • Use stateful dataloader by @oelachqar in https://github.com/oumi-ai/oumi/pull/269
  • Update the polaris client / cluster to work e2e by @taenin in https://github.com/oumi-ai/oumi/pull/270
  • Update package structure for the launcher by @taenin in https://github.com/oumi-ai/oumi/pull/273
  • [tiny] Register debug datasets by @oelachqar in https://github.com/oumi-ai/oumi/pull/272
  • Update several of our launcher base fields to use strings instead of ints. by @taenin in https://github.com/oumi-ai/oumi/pull/274
  • Configure data loader sampling strategy for map-style datasets by @oelachqar in https://github.com/oumi-ai/oumi/pull/271
  • Ensure we CD into the working DIR before submitting polaris jobs. by @taenin in https://github.com/oumi-ai/oumi/pull/276
  • Compute the number of dataloader workers per node by @xrdaukar in https://github.com/oumi-ai/oumi/pull/277
  • Introduce BaseTokenizer alias by @xrdaukar in https://github.com/oumi-ai/oumi/pull/280
  • Cache get_device_rank_info by @xrdaukar in https://github.com/oumi-ai/oumi/pull/279
  • Adding initial scripts for running polaris jobs. by @taenin in https://github.com/oumi-ai/oumi/pull/275
  • Update the polaris client to automatically set execute permissions for copied files. by @taenin in https://github.com/oumi-ai/oumi/pull/286
  • Deprecate building models data parallel by @oelachqar in https://github.com/oumi-ai/oumi/pull/282
  • Switch to using safetensors when saving models by @oelachqar in https://github.com/oumi-ai/oumi/pull/281
  • Add ability to validate configs and params after init by @oelachqar in https://github.com/oumi-ai/oumi/pull/285
  • Some updates to Polaris launcher script by @xrdaukar in https://github.com/oumi-ai/oumi/pull/287
  • Upgrade to latest TRL version, remove numpy version condition by @oelachqar in https://github.com/oumi-ai/oumi/pull/283
  • Add learning rate builder function by @oelachqar in https://github.com/oumi-ai/oumi/pull/284
  • Remove patchwork as a dep. by @taenin in https://github.com/oumi-ai/oumi/pull/290
  • Set up initial demo launcher jobs for GCP. by @taenin in https://github.com/oumi-ai/oumi/pull/288
  • [tiny] cleanup pyproject.toml dependencies by @oelachqar in https://github.com/oumi-ai/oumi/pull/292
  • Make dataset data backend attribute read-only by @oelachqar in https://github.com/oumi-ai/oumi/pull/291
  • Optimize Github actions by @oelachqar in https://github.com/oumi-ai/oumi/pull/289
  • Misc minor changes by @xrdaukar in https://github.com/oumi-ai/oumi/pull/293
  • [tiny] Update GitHub action cache version by @oelachqar in https://github.com/oumi-ai/oumi/pull/295
  • Rename 'NodeParams' -> 'JobResources' by @taenin in https://github.com/oumi-ai/oumi/pull/296
  • Disable compilation for DDP accelerate launch config by @xrdaukar in https://github.com/oumi-ai/oumi/pull/297
  • Export top level launcher functions and instantiate a default launcher. by @taenin in https://github.com/oumi-ai/oumi/pull/298
  • Prevent HF version bump by @taenin in https://github.com/oumi-ai/oumi/pull/300
  • Add dtype/mixed precision configs to Lema trainer by @wizeng23 in https://github.com/oumi-ai/oumi/pull/278
  • Create a notebook tutorial for running remote training. by @taenin in https://github.com/oumi-ai/oumi/pull/299
  • Increase the default value of ProfilerParams.row_limit from 20 to 50 by @xrdaukar in https://github.com/oumi-ai/oumi/pull/304
  • Mini guide on using basic lema functionality by @oelachqar in https://github.com/oumi-ai/oumi/pull/303
  • Compute MFU based on HF total_flos (alternative way to compute MFU) by @xrdaukar in https://github.com/oumi-ai/oumi/pull/301
  • Support GPT2 training with Lema trainer by @wizeng23 in https://github.com/oumi-ai/oumi/pull/302
  • Add a client for running local jobs via the launcher. by @taenin in https://github.com/oumi-ai/oumi/pull/305
  • Add a local cluster for running local jobs. by @taenin in https://github.com/oumi-ai/oumi/pull/306
  • Support llama2b with lema trainer by @wizeng23 in https://github.com/oumi-ai/oumi/pull/308
  • Add a convenience method for listing all registered clouds. by @taenin in https://github.com/oumi-ai/oumi/pull/310
  • [ALCF] Reverse Polaris GPU order to match CPU/GPU affinities by @xrdaukar in https://github.com/oumi-ai/oumi/pull/307
  • Create a local cloud for the LeMa launcher. by @taenin in https://github.com/oumi-ai/oumi/pull/309
  • Remove some leftover occurrences of builtin_ prefix in HF MFU callback by @xrdaukar in https://github.com/oumi-ai/oumi/pull/312
  • Clean up mixed precision params by @wizeng23 in https://github.com/oumi-ai/oumi/pull/311
  • Add finetuning tutorial by @oelachqar in https://github.com/oumi-ai/oumi/pull/313
  • Fix interpolation when loading lema configs. by @taenin in https://github.com/oumi-ai/oumi/pull/314
  • [bugfix] GPU workers not waiting for global leader to save final checkpoint by @oelachqar in https://github.com/oumi-ai/oumi/pull/315
  • Add simple benchmark script for distributed operations by @oelachqar in https://github.com/oumi-ai/oumi/pull/316
  • Add a 'done' field to the LeMa job status object. by @taenin in https://github.com/oumi-ai/oumi/pull/317
  • Fix a small typo in Lema README by @xrdaukar in https://github.com/oumi-ai/oumi/pull/318
  • Add pytorch profiler (-p) option to multinode_example_worker.sh script by @xrdaukar in https://github.com/oumi-ai/oumi/pull/319
  • Create a simpler tutorial for running jobs. by @taenin in https://github.com/oumi-ai/oumi/pull/320
  • Minor cleanups in Lema training loop by @xrdaukar in https://github.com/oumi-ai/oumi/pull/322
  • Remove unbalanced call to barrier() in HuggingFaceTrainer.save_model by @xrdaukar in https://github.com/oumi-ai/oumi/pull/323
  • Create a tutorial for custom clouds. by @taenin in https://github.com/oumi-ai/oumi/pull/321
  • Add support for logging stdout and stderr for Local runs. by @taenin in https://github.com/oumi-ai/oumi/pull/324
  • Fix nanoGPT notebook by @wizeng23 in https://github.com/oumi-ai/oumi/pull/325
  • Add more pytorch profiler instrumentations in Lema training loop by @xrdaukar in https://github.com/oumi-ai/oumi/pull/327
  • Add training param: dataloader_main_process_only by @xrdaukar in https://github.com/oumi-ai/oumi/pull/326
  • fix synchronization issues in LEMA training loop by @xrdaukar in https://github.com/oumi-ai/oumi/pull/328
  • Update LEMA training loop to count tokens on CPU by @xrdaukar in https://github.com/oumi-ai/oumi/pull/330
  • Update README.md by @taenin in https://github.com/oumi-ai/oumi/pull/331
  • Add various improvements to Lema trainer by @wizeng23 in https://github.com/oumi-ai/oumi/pull/329
  • Add PyTorch profiler annotation for each step/micro-step by @xrdaukar in https://github.com/oumi-ai/oumi/pull/333
  • Enable HfMfuTrainerCallback if supported by @xrdaukar in https://github.com/oumi-ai/oumi/pull/332
  • Add support for PyTorch profiling schedule by @xrdaukar in https://github.com/oumi-ai/oumi/pull/334
  • Set up Sphinx-based doc generation for LeMa by @taenin in https://github.com/oumi-ai/oumi/pull/335
  • Fix dataclass strings to be parsable by our docs generator. by @taenin in https://github.com/oumi-ai/oumi/pull/337
  • Update ProfilerStepCallback to add microstep profiler annotations by @xrdaukar in https://github.com/oumi-ai/oumi/pull/338
  • Add include_alternative_mfu_metrics param to control if HF MFU is enabled by @xrdaukar in https://github.com/oumi-ai/oumi/pull/336
  • Minor doc formatting updates. by @taenin in https://github.com/oumi-ai/oumi/pull/340
  • Add 8-bit Adam optimizer to Lema trainer by @wizeng23 in https://github.com/oumi-ai/oumi/pull/339
  • Enable gradient scaling for fp16 mixed-precision training by @wizeng23 in https://github.com/oumi-ai/oumi/pull/342
  • Add a link to our documentation via the readme. by @taenin in https://github.com/oumi-ai/oumi/pull/344
  • Disable weight decay for layernorm/biases in Lema trainer by @wizeng23 in https://github.com/oumi-ai/oumi/pull/341
  • Polaris: Enable NCCL debug logging at WARNING level by @xrdaukar in https://github.com/oumi-ai/oumi/pull/347
  • Add a new notebook for getting started. by @taenin in https://github.com/oumi-ai/oumi/pull/345
  • Create TelemetryCallback by @xrdaukar in https://github.com/oumi-ai/oumi/pull/343
  • Various improvements for our autogenerated docs by @taenin in https://github.com/oumi-ai/oumi/pull/349
  • Polaris: update sample tail command to use -n200 by @xrdaukar in https://github.com/oumi-ai/oumi/pull/348
  • Fix a minor bug in TelemetryCallback.on_train_end by @xrdaukar in https://github.com/oumi-ai/oumi/pull/350
  • Update LEMA training loop to log wandb url by @xrdaukar in https://github.com/oumi-ai/oumi/pull/351
  • Update model dtype for DeepSpeed to make it work with SkyPilot and Polaris by @xrdaukar in https://github.com/oumi-ai/oumi/pull/352
  • Enable the launcher via the CLI by @taenin in https://github.com/oumi-ai/oumi/pull/353
  • Update Polaris init script to print nodelist by @xrdaukar in https://github.com/oumi-ai/oumi/pull/354
  • Minor logging updates in Polaris scripts by @xrdaukar in https://github.com/oumi-ai/oumi/pull/355
  • Define ddp1gpu Polaris mode: Spawn 1 torchrun process per GPU (4 torchrun-s per node) by @xrdaukar in https://github.com/oumi-ai/oumi/pull/356
  • Add a helper util to query GPU temperatures by @xrdaukar in https://github.com/oumi-ai/oumi/pull/359
  • Add Llama 8B config by @wizeng23 in https://github.com/oumi-ai/oumi/pull/358
  • Add another barrier() call before train() by @xrdaukar in https://github.com/oumi-ai/oumi/pull/360
  • Add Llama70B FSDP config by @wizeng23 in https://github.com/oumi-ai/oumi/pull/361
  • Minor improvements in logging and instrumentations in train.py by @xrdaukar in https://github.com/oumi-ai/oumi/pull/362
  • Refactor our core directory to logically organize our classes. by @taenin in https://github.com/oumi-ai/oumi/pull/357
  • Basic plumbing for GPU temperature telemetry by @xrdaukar in https://github.com/oumi-ai/oumi/pull/363
  • Minor update to Llama70B by @wizeng23 in https://github.com/oumi-ai/oumi/pull/365
  • Reorder model compilation and DDP/FSDP wrapping by @xrdaukar in https://github.com/oumi-ai/oumi/pull/364
  • Mini tutorial for Llama3.1-70b inference on Polaris. by @taenin in https://github.com/oumi-ai/oumi/pull/367
  • jgreer013/vllm-inference by @jgreer013 in https://github.com/oumi-ai/oumi/pull/366
  • Fix interpolation when using the launcher CLI for various sky configs. by @taenin in https://github.com/oumi-ai/oumi/pull/369
  • Add Llama8B Lora config for GCP/Polaris by @wizeng23 in https://github.com/oumi-ai/oumi/pull/368
  • Add vllm parallel inference to improve throughput by @jgreer013 in https://github.com/oumi-ai/oumi/pull/370
  • Set TOKENIZERS_PARALLELISM: false for llama8b model by @xrdaukar in https://github.com/oumi-ai/oumi/pull/371
  • Disable MFU computation for PEFT by @xrdaukar in https://github.com/oumi-ai/oumi/pull/372
  • Add empty_device_cache_steps param and configure it for Llama8b model by @xrdaukar in https://github.com/oumi-ai/oumi/pull/373
  • Add TelemetryCallback.include_timer_metrics param: False by default by @xrdaukar in https://github.com/oumi-ai/oumi/pull/378
  • Update llama8b GCP launcher script to allow Spot VMs by @xrdaukar in https://github.com/oumi-ai/oumi/pull/380
  • Minimal Llama8B LoRA eval config by @xrdaukar in https://github.com/oumi-ai/oumi/pull/376
  • Add Llama 8b SFT config by @wizeng23 in https://github.com/oumi-ai/oumi/pull/379
  • Move common NCCL variables initialization into polaris_init.sh by @xrdaukar in https://github.com/oumi-ai/oumi/pull/377
  • Minor tuning of llama8b configs by @xrdaukar in https://github.com/oumi-ai/oumi/pull/382
  • Update eval script to use Meta-Llama-3.1-8B-Instruct model version by @xrdaukar in https://github.com/oumi-ai/oumi/pull/381
  • Initial notebook for llama 8b LoRa tuning. by @taenin in https://github.com/oumi-ai/oumi/pull/374
  • Update SkyPilot GCP script to download the right model version by @xrdaukar in https://github.com/oumi-ai/oumi/pull/385
  • Clean up Sky configs by @wizeng23 in https://github.com/oumi-ai/oumi/pull/383
  • Update main makefile to generate docs by @oelachqar in https://github.com/oumi-ai/oumi/pull/386
  • Add docs-serve makefile command by @oelachqar in https://github.com/oumi-ai/oumi/pull/387
  • Fix missing new line at the end of Makefile by @xrdaukar in https://github.com/oumi-ai/oumi/pull/390
  • Raise NOT_IMPLEMENTED if adapter_model is configured for LM_HARNESS eval by @xrdaukar in https://github.com/oumi-ai/oumi/pull/391
  • Update Llama8B LoRA eval script to use built-in LEMA evaluator by @xrdaukar in https://github.com/oumi-ai/oumi/pull/389
  • Add Llama 70b lora config by @wizeng23 in https://github.com/oumi-ai/oumi/pull/388
  • Enable markdown docs by @oelachqar in https://github.com/oumi-ai/oumi/pull/394
  • Check ignored docstring rules by @oelachqar in https://github.com/oumi-ai/oumi/pull/395
  • Remove special case for saving PEFT models by @xrdaukar in https://github.com/oumi-ai/oumi/pull/384
  • Move shared code into polaris_init by @wizeng23 in https://github.com/oumi-ai/oumi/pull/392
  • Update Llama notebook to include 8B SFT by @wizeng23 in https://github.com/oumi-ai/oumi/pull/393
  • Update sample commands to point to the preemptable queue by @taenin in https://github.com/oumi-ai/oumi/pull/396
  • Update lm_harness to support LoRA adapters by @jgreer013 in https://github.com/oumi-ai/oumi/pull/397
  • Fix FSDP model initialization by @wizeng23 in https://github.com/oumi-ai/oumi/pull/398
  • Add vscode launch config for accelerate distributed training by @oelachqar in https://github.com/oumi-ai/oumi/pull/400
  • Update trainer save model by @oelachqar in https://github.com/oumi-ai/oumi/pull/399
  • Increase from 2 to 3 nodes for Llama 70B Lora by @wizeng23 in https://github.com/oumi-ai/oumi/pull/402
  • Add param to customize NCCL timeout by @oelachqar in https://github.com/oumi-ai/oumi/pull/401
  • Add docs and gpu install targets by @oelachqar in https://github.com/oumi-ai/oumi/pull/403
  • Significant improvements for the Polaris launcher by @taenin in https://github.com/oumi-ai/oumi/pull/404
  • Ensure that jobs are queued on existing clusters when users call UP by @taenin in https://github.com/oumi-ai/oumi/pull/406
  • Autostop sky clusters after 30 min of no activity by @taenin in https://github.com/oumi-ai/oumi/pull/407
  • Add support for triton kernels from Liger Kernel by @oelachqar in https://github.com/oumi-ai/oumi/pull/405
  • Add support for including notebooks in the docs by @oelachqar in https://github.com/oumi-ai/oumi/pull/408
  • Update sphinx comments to docstrings by @oelachqar in https://github.com/oumi-ai/oumi/pull/411
  • Add missing docstrings to TrainingParams by @oelachqar in https://github.com/oumi-ai/oumi/pull/409
  • Capped model max length for Llama tuning by @wizeng23 in https://github.com/oumi-ai/oumi/pull/413
  • Fix a deadlock in the Polaris launcher for users with 500+ jobs. by @taenin in https://github.com/oumi-ai/oumi/pull/412
  • Script to run inference with Llama/GPT judges. by @kaisopos in https://github.com/oumi-ai/oumi/pull/414
  • Add missing docstrings to top-level configs by @oelachqar in https://github.com/oumi-ai/oumi/pull/410
  • [tiny] sphinx conf update by @oelachqar in https://github.com/oumi-ai/oumi/pull/416
  • Improve launcher polling by running tasks in a subprocess. by @taenin in https://github.com/oumi-ai/oumi/pull/417
  • Add missing package docstrings by @oelachqar in https://github.com/oumi-ai/oumi/pull/415
  • [tiny] Enable D104 rule by @oelachqar in https://github.com/oumi-ai/oumi/pull/419
  • Fix bug with 70B Lora by @wizeng23 in https://github.com/oumi-ai/oumi/pull/421
  • Update the CLI to look for open SSH tunnels as a way of preserving Polaris state by @taenin in https://github.com/oumi-ai/oumi/pull/418
  • Update the polaris launcher to always update the lema installation on job creation. by @taenin in https://github.com/oumi-ai/oumi/pull/422
  • Cleanup doc RSTs by @oelachqar in https://github.com/oumi-ai/oumi/pull/420
  • Add sphinx api doc template for packages by @oelachqar in https://github.com/oumi-ai/oumi/pull/425
  • Add automatically generated apidoc RSTs by @oelachqar in https://github.com/oumi-ai/oumi/pull/424
  • [tiny] Move apidocs into their own folder by @oelachqar in https://github.com/oumi-ai/oumi/pull/426
  • Add docs-rebuild command to Makefile by @oelachqar in https://github.com/oumi-ai/oumi/pull/427
  • Refresh markdown docs by @oelachqar in https://github.com/oumi-ai/oumi/pull/429
  • Reorganize our test structure by @taenin in https://github.com/oumi-ai/oumi/pull/431
  • Add Llama 70B SFT config by @wizeng23 in https://github.com/oumi-ai/oumi/pull/428
  • Script to generate judge prompts. by @kaisopos in https://github.com/oumi-ai/oumi/pull/423
  • [tiny] Breakdown main Readme into multiple docs by @oelachqar in https://github.com/oumi-ai/oumi/pull/430
  • Update main readme file by @oelachqar in https://github.com/oumi-ai/oumi/pull/432
  • Add GitHub badges, readme typos by @oelachqar in https://github.com/oumi-ai/oumi/pull/434
  • Fix markdown lint errors by @oelachqar in https://github.com/oumi-ai/oumi/pull/433
  • Update documentation index by @oelachqar in https://github.com/oumi-ai/oumi/pull/436
  • [tiny] Only log to console on global leader by @wizeng23 in https://github.com/oumi-ai/oumi/pull/435
  • Tune sphinx config by @oelachqar in https://github.com/oumi-ai/oumi/pull/437
  • Enable Liger for Llama 8B SFT by @wizeng23 in https://github.com/oumi-ai/oumi/pull/439
  • Updated Parallel Inference job by @jgreer013 in https://github.com/oumi-ai/oumi/pull/438
  • Add a mkdir to polaris init. by @taenin in https://github.com/oumi-ai/oumi/pull/440
  • [tiny] Fix lema loop performance gap by @oelachqar in https://github.com/oumi-ai/oumi/pull/441
  • [tiny] update trainer benchmark script and minor updates by @oelachqar in https://github.com/oumi-ai/oumi/pull/443
  • Add Llama 8B eval script by @wizeng23 in https://github.com/oumi-ai/oumi/pull/442
  • Add dataset remote code param by @oelachqar in https://github.com/oumi-ai/oumi/pull/445
  • [docs] Update format + add missing docs to data_params.py by @oelachqar in https://github.com/oumi-ai/oumi/pull/444
  • Update Polaris Llama8b eval script to enable data-parallel evals for LM_HARNESS by @xrdaukar in https://github.com/oumi-ai/oumi/pull/446
  • Copy changes from PR-446 into Polaris launcher config by @xrdaukar in https://github.com/oumi-ai/oumi/pull/448
  • Copy changes from PR-446 into GCP launcher config by @xrdaukar in https://github.com/oumi-ai/oumi/pull/449
  • Minor fixes in llama8B eval scripts by @xrdaukar in https://github.com/oumi-ai/oumi/pull/450
  • Add Llama 70B eval script by @wizeng23 in https://github.com/oumi-ai/oumi/pull/447
  • [bugfix] add is_using_accelerate_fsdp util by @oelachqar in https://github.com/oumi-ai/oumi/pull/453
  • [tiny] Fix inference notebook by @wizeng23 in https://github.com/oumi-ai/oumi/pull/451
  • Simplify record_function annotation in LEMA training loop by @xrdaukar in https://github.com/oumi-ai/oumi/pull/454
  • [tiny] enable ruff format on save with notebooks by @oelachqar in https://github.com/oumi-ai/oumi/pull/455
  • [tiny] Add missing default value to hf_trainer by @oelachqar in https://github.com/oumi-ai/oumi/pull/458
  • Judge inference script for Polaris by @kaisopos in https://github.com/oumi-ai/oumi/pull/452
  • Add the base classes for inference. Pull out logic from infer to a native text inference engine. by @taenin in https://github.com/oumi-ai/oumi/pull/456
  • Telemetry improvements for tracking GPU temperature and in general by @xrdaukar in https://github.com/oumi-ai/oumi/pull/457
  • Add integration tests for native inference (not using the CLI). by @taenin in https://github.com/oumi-ai/oumi/pull/460
  • Update README.md by @mkoukoumidis in https://github.com/oumi-ai/oumi/pull/462
  • Update README to make installation steps more prominent by @taenin in https://github.com/oumi-ai/oumi/pull/464
  • Fix several broken links and update installation instructions by @taenin in https://github.com/oumi-ai/oumi/pull/465
  • Update inference to pass the generation config to inference engines. by @taenin in https://github.com/oumi-ai/oumi/pull/466
  • Update README.md by @taenin in https://github.com/oumi-ai/oumi/pull/467
  • Fixed issue with metadata extraction failure by @jgreer013 in https://github.com/oumi-ai/oumi/pull/469
  • Add fsdp support to lema loop by @oelachqar in https://github.com/oumi-ai/oumi/pull/463
  • Combine telemetry from all ranks by @xrdaukar in https://github.com/oumi-ai/oumi/pull/468
  • Add sample for full fine-tuned and LoRA-tuned model inference using vLLM by @wizeng23 in https://github.com/oumi-ai/oumi/pull/470
  • Update chat_template_builder by @oelachqar in https://github.com/oumi-ai/oumi/pull/472
  • Removed duplicate task_done call by @jgreer013 in https://github.com/oumi-ai/oumi/pull/473
  • Add flag to enable experimental torch data pipes processing pipeline by @oelachqar in https://github.com/oumi-ai/oumi/pull/474
  • Vision-language datasets & fine-tuning MVP by @oelachqar in https://github.com/oumi-ai/oumi/pull/459
  • Rebuild docs, add multi-modal tutorial by @oelachqar in https://github.com/oumi-ai/oumi/pull/475
  • Add test coverage target, update pyproject.toml metadata by @oelachqar in https://github.com/oumi-ai/oumi/pull/476
  • Create a local inference engine for vLLM by @taenin in https://github.com/oumi-ai/oumi/pull/471
  • Add llava chat template, QoL improvement to multimodal testing script by @oelachqar in https://github.com/oumi-ai/oumi/pull/478
  • [Polaris Judge Inference] Adjusting script for Llama 70B quantized by @kaisopos in https://github.com/oumi-ai/oumi/pull/461
  • Add example for running inference using vLLM on GCP, single-node multi-gpu by @oelachqar in https://github.com/oumi-ai/oumi/pull/479
  • [tiny] Remove deepspeed from required dependencies by @oelachqar in https://github.com/oumi-ai/oumi/pull/482
  • Update train path to save meta-info as files under telemetry sub-dir by @xrdaukar in https://github.com/oumi-ai/oumi/pull/480
  • Add inference engine apply_chat_template helper, update example notebook by @oelachqar in https://github.com/oumi-ai/oumi/pull/481
  • Update arg names for vLLM inference job by @wizeng23 in https://github.com/oumi-ai/oumi/pull/477
  • Remove device_map for model init from config by @wizeng23 in https://github.com/oumi-ai/oumi/pull/484
  • Add log_model_summary call back by @xrdaukar in https://github.com/oumi-ai/oumi/pull/485
  • Small typo fix in the vllm notebook by @taenin in https://github.com/oumi-ai/oumi/pull/483
  • Cleanup FSDP wrap class auto guesser by @oelachqar in https://github.com/oumi-ai/oumi/pull/486
  • Add missing documentation for model_params by @oelachqar in https://github.com/oumi-ai/oumi/pull/487
  • Add callback builder function by @oelachqar in https://github.com/oumi-ai/oumi/pull/490
  • Minor fixes in DISTRIBUTED_TRAINING.md by @xrdaukar in https://github.com/oumi-ai/oumi/pull/488
  • Switch to using official UV action with dependency caching by @oelachqar in https://github.com/oumi-ai/oumi/pull/491
  • Introduce BaseTrainerCallback alias by @xrdaukar in https://github.com/oumi-ai/oumi/pull/492
  • Add documentation to peft_params by @oelachqar in https://github.com/oumi-ai/oumi/pull/493
  • Update TelemetryCallback to save final metrics to JSON by @xrdaukar in https://github.com/oumi-ai/oumi/pull/494
  • Increase the rsync timeout from 40s to 300s by @taenin in https://github.com/oumi-ai/oumi/pull/495
  • [tiny] fix missing import by @oelachqar in https://github.com/oumi-ai/oumi/pull/497
  • Rename build_dataset -> build_dataset_mixture by @oelachqar in https://github.com/oumi-ai/oumi/pull/498
  • Define a simple callback to detect NaN/INF-s during training by @xrdaukar in https://github.com/oumi-ai/oumi/pull/496
  • Replace pip install flash-attn with .[gpu] target by @wizeng23 in https://github.com/oumi-ai/oumi/pull/502
  • Add simpler builder for single dataset use cases by @oelachqar in https://github.com/oumi-ai/oumi/pull/499
  • Use HF's built-in gradient checkpointing argument by @wizeng23 in https://github.com/oumi-ai/oumi/pull/500
  • [Draft] Example changes to support 70B single-node inference by @jgreer013 in https://github.com/oumi-ai/oumi/pull/503
  • Various updates to Llama 2b configs by @wizeng23 in https://github.com/oumi-ai/oumi/pull/489
  • Add Llama 2B FSDP config by @wizeng23 in https://github.com/oumi-ai/oumi/pull/505
  • Update TelemetryCallback to write JSON with GPU temperature summary by @xrdaukar in https://github.com/oumi-ai/oumi/pull/501
  • Rename src/lema to src/oumi by @wizeng23 in https://github.com/oumi-ai/oumi/pull/506
  • OpenAI Chat Engine - Custom servers by @taenin in https://github.com/oumi-ai/oumi/pull/504
  • Rename configs/lema to configs/oumi by @wizeng23 in https://github.com/oumi-ai/oumi/pull/507
  • Rename all relevant lema references in codebase by @wizeng23 in https://github.com/oumi-ai/oumi/pull/508
  • Re-generate Sphinx docs by @wizeng23 in https://github.com/oumi-ai/oumi/pull/509
  • Update conf.py by @taenin in https://github.com/oumi-ai/oumi/pull/510
  • Rename remaining lema references in docs/ by @wizeng23 in https://github.com/oumi-ai/oumi/pull/511
  • Update final lema references by @wizeng23 in https://github.com/oumi-ai/oumi/pull/512
  • Update dev setup guide by @wizeng23 in https://github.com/oumi-ai/oumi/pull/513
  • Update TOTAL_NUM_GPUS compare commands in SkyPilot configs by @xrdaukar in https://github.com/oumi-ai/oumi/pull/514
  • [Minor] Issues arose by "newcomer" exploration [1/K] by @optas in https://github.com/oumi-ai/oumi/pull/518
  • Freeze lm-eval and torch versions as a workaround for OPE-390 by @xrdaukar in https://github.com/oumi-ai/oumi/pull/516
  • 1. Write wandb telemetry 2. Reorder training callbacks by @xrdaukar in https://github.com/oumi-ai/oumi/pull/519
  • Multiple updates to Llama 2B by @wizeng23 in https://github.com/oumi-ai/oumi/pull/515
  • Rename OUMI to Oumi by @wizeng23 in https://github.com/oumi-ai/oumi/pull/520
  • Add llama.cpp Inference Engine by @oelachqar in https://github.com/oumi-ai/oumi/pull/524
  • Rename website references to oumi.ai by @wizeng23 in https://github.com/oumi-ai/oumi/pull/522
  • Add anthropic inference engine by @oelachqar in https://github.com/oumi-ai/oumi/pull/523
  • Update name typo by @oelachqar in https://github.com/oumi-ai/oumi/pull/526
  • Add a batch inference job runnable via the Oumi Launcher by @taenin in https://github.com/oumi-ai/oumi/pull/527
  • Auto-format pyproject and pre-commit configs by @xrdaukar in https://github.com/oumi-ai/oumi/pull/530
  • Update Makefile by @taenin in https://github.com/oumi-ai/oumi/pull/529
  • Fix failing tests after a new install. by @taenin in https://github.com/oumi-ai/oumi/pull/531
  • Fix a small bug in infer_interactive(): only prints the first character by @xrdaukar in https://github.com/oumi-ai/oumi/pull/532
  • Boosting User-friendliness by @optas in https://github.com/oumi-ai/oumi/pull/521
  • [tiny] add override from typing_extensions by @oelachqar in https://github.com/oumi-ai/oumi/pull/534
  • Create CODE_OF_CONDUCT.md by @taenin in https://github.com/oumi-ai/oumi/pull/536
  • Add conversation helper methods by @oelachqar in https://github.com/oumi-ai/oumi/pull/535
  • [tiny] cleanup multimodal benchmark script by @oelachqar in https://github.com/oumi-ai/oumi/pull/537
  • Auto-format shell scripts under scripts by @xrdaukar in https://github.com/oumi-ai/oumi/pull/539
  • Add builder function for data collators by @oelachqar in https://github.com/oumi-ai/oumi/pull/538
  • Make tokenizer optional by @oelachqar in https://github.com/oumi-ai/oumi/pull/540
  • Add an optional -t flag to scripts/polaris/jobs/llama2b_pt_worker.sh by @xrdaukar in https://github.com/oumi-ai/oumi/pull/541
  • Fix initial issues found by shellcheck by @xrdaukar in https://github.com/oumi-ai/oumi/pull/542
  • [tiny] fix small typo by @oelachqar in https://github.com/oumi-ai/oumi/pull/544
  • Minor changes in scripts/benchmarks/minimal_multimodal_training.py by @xrdaukar in https://github.com/oumi-ai/oumi/pull/543
  • [tiny] Add util to get install folder root dir by @oelachqar in https://github.com/oumi-ai/oumi/pull/545
  • [tiny] Add fp paged_adam optimizer option by @oelachqar in https://github.com/oumi-ai/oumi/pull/547
  • [tiny] Allow conversation metadata to contain values other than str by @oelachqar in https://github.com/oumi-ai/oumi/pull/546
  • Switch from Flash Attention 2 to PyTorch SDPA by @wizeng23 in https://github.com/oumi-ai/oumi/pull/533
  • Use local_rank to query GPU temperature by @xrdaukar in https://github.com/oumi-ai/oumi/pull/550
  • Fix a bug for handling stopped sky clusters in the oumi launcher. by @taenin in https://github.com/oumi-ai/oumi/pull/549
  • Remove flash attention validation check by @wizeng23 in https://github.com/oumi-ai/oumi/pull/551
  • Add support for AWS and Azure jobs in Oumi by @taenin in https://github.com/oumi-ai/oumi/pull/552
  • Pass split param to datasets.load_dataset() by @xrdaukar in https://github.com/oumi-ai/oumi/pull/553
  • Implement Judge API MVP by @oelachqar in https://github.com/oumi-ai/oumi/pull/548
  • Log dataset info: shape, columns, other metainfo by @xrdaukar in https://github.com/oumi-ai/oumi/pull/555
  • Update experimental pretokenize_dataset tool by @xrdaukar in https://github.com/oumi-ai/oumi/pull/554
  • Various improvements to Llama eval scripts by @wizeng23 in https://github.com/oumi-ai/oumi/pull/556
  • Add a couple of gc.collect() calls by @xrdaukar in https://github.com/oumi-ai/oumi/pull/560
  • [tiny] Fix Makefile setup command by @wizeng23 in https://github.com/oumi-ai/oumi/pull/561
  • Support datasets generated by dataset.save_to_disk() by @xrdaukar in https://github.com/oumi-ai/oumi/pull/559
  • Add support for LoRA adapters in vLLM inference engine by @wizeng23 in https://github.com/oumi-ai/oumi/pull/562
  • Updates in VisionLanguageCollator and in coco_captions by @xrdaukar in https://github.com/oumi-ai/oumi/pull/563
  • Update DEV_SETUP.md with Windows instructions by @taenin in https://github.com/oumi-ai/oumi/pull/566
  • Make the remote inference engine runnable in jupyter notebooks. by @taenin in https://github.com/oumi-ai/oumi/pull/565
  • Configure freeze_layer map in `minimal_multimodal_training.py` by @xrdaukar in https://github.com/oumi-ai/oumi/pull/569
  • Clean up legacy evaluate_oumi code paths by @taenin in https://github.com/oumi-ai/oumi/pull/568
  • Update model builder to use default_chat_template if available by @xrdaukar in https://github.com/oumi-ai/oumi/pull/571
  • Add package build and deployment workflow to google artifact registry by @oelachqar in https://github.com/oumi-ai/oumi/pull/570

New Contributors

  • @oelachqar made their first contribution in https://github.com/oumi-ai/oumi/pull/1
  • @kaisopos made their first contribution in https://github.com/oumi-ai/oumi/pull/7
  • @jgreer013 made their first contribution in https://github.com/oumi-ai/oumi/pull/9
  • @optas made their first contribution in https://github.com/oumi-ai/oumi/pull/10
  • @xrdaukar made their first contribution in https://github.com/oumi-ai/oumi/pull/12
  • @wizeng23 made their first contribution in https://github.com/oumi-ai/oumi/pull/30
  • @taenin made their first contribution in https://github.com/oumi-ai/oumi/pull/95
  • @mkoukoumidis made their first contribution in https://github.com/oumi-ai/oumi/pull/462

Full Changelog: https://github.com/oumi-ai/oumi/commits/v0.1-alpha

- Python
Published by oelachqar over 1 year ago