llm-recipes

Ongoing Research Project for continaual pre-training LLM(dense mode)

https://github.com/okoge-kaz/llm-recipes

Last synced: 10 months ago · JSON representation ·

Repository

Ongoing Research Project for continaual pre-training LLM(dense mode)

Basic Info

Host: GitHub
Owner: okoge-kaz
Language: Python
Default Branch: main
Homepage:
Size: 18.4 MB

Statistics

Stars: 39
Watchers: 3
Forks: 4
Open Issues: 3
Releases: 0

Created over 2 years ago · Last pushed over 1 year ago

Metadata Files

Readme Citation

llm-recipes

User-friendly tool for seamless continual pre-training of Large Language Models

llm-recipes is a tool designed to make the continual pre-training of Large Language Models (LLMs) easy and efficient. With an intuitive interface and flexible configuration options, researchers and developers can effortlessly manage training on any model or dataset. The tool supports distributed training on large GPU clusters and offers extensive customization, enabling users to leverage cutting-edge techniques with ease.

What sets llm-recipes apart is its seamless integration with Hugging Face Transformers, allowing you to continue pre-training or perform instruction tuning on Dense LLMs (non-MoE models) with minimal changes. This means there’s no need to convert checkpoints or deal with complex workflows—just focus on refining your model.

| Feature | llm-recipes | llama-recipes | torchtune | |---------------------------------|-------------|---------------|-----------| | SFT(Supervised Fine-Tuning) | ✅ | ✅ | ✅ | | Continual Pre-Training | ✅ | ✅ | ✅ | | DPO(Direct Preference Optimization) | ✅ | ❌ | ✅ | | Llama Models Support | ✅ | ✅ | ✅ | | Non-Llama Models Support | ✅ | ❌ | ✅ | | Multi-Node Support | ✅ | ✅ | ❌ |

cff-version: 1.2.0
message: "If you use this software, please cite it as below."
authors:
- family-names: "Fujii"
  given-names: "Kazuki"
- family-names: "Nakamura"
  given-names: "Taishi"
- family-names: "Yokota"
  given-names: "Rio"
title: "llm-recipes"
version: 1.0.0
date-released: 2024-5-24
url: "https://github.com/okoge-kaz/llm-recipes"

GitHub Events

Total

Issues event: 2
Watch event: 17
Push event: 15
Create event: 2

Last Year

Issues event: 2
Watch event: 17
Push event: 15
Create event: 2

Issues and Pull Requests

Last synced: over 1 year ago

All Time

Total issues: 2
Total pull requests: 24
Average time to close issues: N/A
Average time to close pull requests: 3 days
Total issue authors: 1
Total pull request authors: 1
Average comments per issue: 0.0
Average comments per pull request: 0.0
Merged pull requests: 24
Bot issues: 0
Bot pull requests: 0

Past Year

Issues: 2
Pull requests: 24
Average time to close issues: N/A
Average time to close pull requests: 3 days
Issue authors: 1
Pull request authors: 1
Average comments per issue: 0.0
Average comments per pull request: 0.0
Merged pull requests: 24
Bot issues: 0
Bot pull requests: 0

View more stats

Top Authors

Issue Authors

mchi-zg (1)
etsurin (1)
okoge-kaz (1)

Pull Request Authors

okoge-kaz (29)

Top Labels

Issue Labels

Pull Request Labels

bug (2)

Dependencies

requirements.txt pypi

accelerate *
appdirs *
bitsandbytes *
black *
datasets *
deepspeed *
fire *
flake8 *
loralib *
mpi4py *
nltk *
optimum *
peft *
py7zr *
pybind11 *
scipy *
sentencepiece *
torch ==2.1.2
transformers >=4.35.0
wandb *

tools/requirements.txt pypi

accelerate *
appdirs *
bitsandbytes *
black *
datasets *
deepspeed *
fire *
flake8 *
loralib *
mpi4py *
nltk *
optimum *
peft *
py7zr *
pybind11 *
scipy *
sentencepiece *
torch ==2.1.2
transformers >=4.35.0
wandb *

llm-recipes

Science Score: 54.0%

Repository

Basic Info

Statistics

Metadata Files

README.md

llm-recipes

User-friendly tool for seamless continual pre-training of Large Language Models

Table of Contents

Installation

Multi-node Support

FlashAttention

Usage

LLM Instruction Tuning

1. Data Preparation

2. Change Dataset Class

3. Indexing

4. Training

LLM Continual Pre-Training

1. Data Preparation

2. Tokenize Data

3. Training

LLM DPO

Checkpoint formats

llm-recipes format

PyTorch format to Hugging Face format

PyTorch distributed format to Hugging Face format

Inference

Training Speed and Scalability

Projects Using llm-recipes

Citation

Owner

Citation (CITATION.cff)

GitHub Events

Total

Last Year

Issues and Pull Requests

All Time

Past Year

Top Authors

Issue Authors

Pull Request Authors

Top Labels

Issue Labels

Pull Request Labels

Dependencies