https://github.com/daniel-furman/sft-demos

Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and open-source datasets.

Keywords

chatbot deep-learning instruction-tuning llama nlp text-generation transformers

Last synced: 5 months ago

Repository

Basic Info

Host: GitHub
Owner: daniel-furman
License: apache-2.0
Language: Jupyter Notebook
Default Branch: main
Homepage: https://huggingface.co/dfurman
Size: 9.64 MB

Statistics

Stars: 78
Watchers: 1
Forks: 9
Open Issues: 0
Releases: 0

Topics

chatbot deep-learning instruction-tuning llama nlp text-generation transformers

Created over 2 years ago · Last pushed over 1 year ago

Metadata Files

Readme License

Finetuning demos for LLMs

📚 Intro

This repo contains demos for finetuning Large Language Models (LLMs), such as Meta's Llama 3. In particular, we focus on training for short-form instruction following.

🔎 Finetunes

Note: See _peft for training runs, as organized by base model.

Some examples:

🏆 Evaluation

Note: See _eval for evaluation runs.

An example:

As of Oct 2024, dfurman/CalmeRys-78B-Orpo-v0.1 is the top-ranking model on the Open LLM Leaderboard 🏆

| Metric              | Value |
|---------------------|------:|
| Avg.                | 50.78 |
| IFEval (0-Shot)     | 81.63 |
| BBH (3-Shot)        | 61.92 |
| MATH Lvl 5 (4-Shot) | 37.92 |
| GPQA (0-shot)       | 20.02 |
| MuSR (0-shot)       | 36.37 |
| MMLU-PRO (5-shot)   | 66.80 |
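As a quick sanity check, the reported average appears to be the unweighted mean of the six benchmark scores. The sketch below recomputes it from the values in the table; the variable names are illustrative and not taken from the repo.

```python
# Benchmark scores copied from the evaluation table.
scores = {
    "IFEval (0-Shot)": 81.63,
    "BBH (3-Shot)": 61.92,
    "MATH Lvl 5 (4-Shot)": 37.92,
    "GPQA (0-shot)": 20.02,
    "MuSR (0-shot)": 36.37,
    "MMLU-PRO (5-shot)": 66.80,
}

# Unweighted mean across the six benchmarks.
average = sum(scores.values()) / len(scores)
print(f"Avg. = {average:.2f}")
```

Rounded to two decimals, this matches the "Avg." row of the table.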

💻 Usage

Note: Use the code below to get started on text generation (inference). Be sure to have a GPU-enabled cluster.

Setup

```python
!pip install -qU transformers accelerate bitsandbytes
!huggingface-cli download dfurman/CalmeRys-78B-Orpo-v0.1
```

```python
from transformers import AutoTokenizer, BitsAndBytesConfig
import transformers
import torch

if torch.cuda.get_device_capability()[0] >= 8:
    !pip install -qqq flash-attn
    attn_implementation = "flash_attention_2"
    torch_dtype = torch.bfloat16
else:
    attn_implementation = "eager"
    torch_dtype = torch.float16

# # quantize if necessary
# bnb_config = BitsAndBytesConfig(
#     load_in_4bit=True,
#     bnb_4bit_quant_type="nf4",
#     bnb_4bit_compute_dtype=torch_dtype,
#     bnb_4bit_use_double_quant=True,
# )

model = "dfurman/CalmeRys-78B-Orpo-v0.1"

tokenizer = AutoTokenizer.from_pretrained(model)
pipeline = transformers.pipeline(
    "text-generation",
    model=model,
    model_kwargs={
        "torch_dtype": torch_dtype,
        # "quantization_config": bnb_config,
        "device_map": "auto",
        "attn_implementation": attn_implementation,
    }
)
```
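To get a feel for why a GPU cluster (or the optional 4-bit quantization) is needed, here is a rough back-of-the-envelope estimate of the memory taken by the weights alone. This is a sketch under stated assumptions: the parameter count is inferred from the "78B" in the model name, and `weight_memory_gb` is a hypothetical helper, not part of the repo or of transformers. It ignores activations, the KV cache, and quantization metadata, so real usage will be higher.

```python
def weight_memory_gb(n_params: float, bits_per_param: int) -> float:
    """Approximate memory for model weights alone, in GB (1 GB = 1e9 bytes)."""
    return n_params * bits_per_param / 8 / 1e9


# Assumption: parameter count inferred from the "78B" in the model name.
N_PARAMS = 78e9

for label, bits in [("bfloat16 / float16", 16), ("4-bit (nf4)", 4)]:
    print(f"{label}: ~{weight_memory_gb(N_PARAMS, bits):.0f} GB")
```

Under these assumptions the half-precision weights come to roughly 156 GB and the 4-bit weights to roughly 39 GB, which is why `device_map="auto"` across several GPUs, or the commented-out `bnb_config`, may be needed.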

Example 1

```python
question = "Is the number 9.11 larger than 9.9?"

messages = [
    {"role": "system", "content": "You are a helpful assistant that thinks step by step."},
    {"role": "user", "content": question},
]
prompt = tokenizer.apply_chat_template(messages, tokenize=False, add_generation_prompt=True)

print("***Prompt:\n", prompt)

outputs = pipeline(
    prompt, max_new_tokens=1000, do_sample=True, temperature=0.7, top_k=50, top_p=0.95
)
print("***Generation:")
print(outputs[0]["generated_text"][len(prompt) :])
```

***Generation: To compare these two numbers, it's important to look at their decimal places after the whole number part, which is 9 in both cases. Comparing the tenths place, 9.11 has a '1' and 9.9 has a '9'. Since '9' is greater than '1', 9.9 is larger than 9.11.
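The model's answer can be checked independently. The short sketch below compares the two values with Python's `decimal` module, which avoids binary floating-point representation issues; it is only a verification aid and not part of the repo.

```python
from decimal import Decimal

a, b = Decimal("9.11"), Decimal("9.9")

# Direct numeric comparison of the two decimals.
print(f"{a} > {b}: {a > b}")
print(f"larger value: {max(a, b)}")
```

This agrees with the generation: 9.9 is the larger number.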

Example 2

```python
question = """The bakers at the Beverly Hills Bakery baked 200 loaves of bread on Monday morning. They sold 93 loaves in the morning and 39 loaves in the afternoon. A grocery store then returned 6 unsold loaves back to the bakery. How many loaves of bread did the bakery have left?
Respond as succinctly as possible. Format the response as a completion of this table:
|step|subquestion|procedure|result|
|:---|:----------|:--------|:-----:|"""

messages = [
    {"role": "system", "content": "You are a helpful assistant."},
    {"role": "user", "content": question},
]
prompt = tokenizer.apply_chat_template(messages, tokenize=False, add_generation_prompt=True)

print("***Prompt:\n", prompt)

outputs = pipeline(prompt, max_new_tokens=1000, do_sample=True, temperature=0.7, top_k=50, top_p=0.95)
print("***Generation:")
print(outputs[0]["generated_text"][len(prompt):])
```

***Generation:
|1|Calculate total sold|Add morning and afternoon sales|132|
|2|Subtract sold from total|200 - 132|68|
|3|Adjust for returns|Add returned loaves to remaining|74|
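The three steps in the generated table can be reproduced with plain arithmetic. The sketch below mirrors them one by one; the variable names are illustrative.

```python
baked = 200
sold_morning = 93
sold_afternoon = 39
returned = 6

total_sold = sold_morning + sold_afternoon  # step 1: total loaves sold
remaining = baked - total_sold              # step 2: loaves left after sales
final = remaining + returned                # step 3: add the returned loaves

print(total_sold, remaining, final)
```

The intermediate and final values match the `result` column of the generation (132, 68, 74).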

Example 3

```python
question = "What's a good recipe for a spicy margarita?"

messages = [
    {"role": "system", "content": "You are a helpful assistant."},
    {"role": "user", "content": question},
]
prompt = tokenizer.apply_chat_template(messages, tokenize=False, add_generation_prompt=True)

print("***Prompt:\n", prompt)

outputs = pipeline(prompt, max_new_tokens=1000, do_sample=True, temperature=0.7, top_k=50, top_p=0.95)
print("***Generation:")
print(outputs[0]["generated_text"][len(prompt):])
```

```
***Generation: To make a Spicy Margarita, you'll need to incorporate a chili or pepper element into your classic margarita recipe. Here’s a simple way to do it:

Ingredients:

2 oz tequila (blanco or reposado)
1 oz fresh lime juice
1/2 oz triple sec (Cointreau or Grand Marnier)
1/2 oz agave syrup or simple syrup
1-2 slices of jalapeño (or more depending on how spicy you like it)
Salt and/or chili powder for rimming the glass
Ice
Lime wheel for garnish

Instructions:

Muddle Jalapeño: In a shaker, muddle the jalapeño slices slightly. This will release the oils and heat from the peppers.
Add Remaining Ingredients: Add the tequila, lime juice, triple sec, and agave syrup or simple syrup.
Shake and Strain: Fill the shaker with ice and shake vigorously until cold. Strain into a salt and/or chili powder rimmed glass filled with ice.
Garnish and Serve: Garnish with a lime wheel and enjoy.

If you prefer a smoother spiciness that doesn't overpower the drink, you could also consider making a jalapeño-infused tequila by leaving the jalapeño slices in the bottle of tequila for several hours to a couple of days, adjusting the time based on desired level of spiciness. Then use this infused tequila instead of regular tequila in the recipe above.

Another variation is to use a spicy syrup. To make this, combine equal parts water and sugar with a few sliced jalapeños in a saucepan. Bring to a boil, stirring occasionally to dissolve the sugar. Reduce heat and simmer for about 5 minutes. Let cool, strain out the jalapeños, then store in a sealed container in the refrigerator until ready to use. Use this spicy syrup instead of regular syrup in the recipe.

As always, adjust the quantity of jalapeño or the type of chili used to suit your taste. Enjoy responsibly!
```
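All three examples print `outputs[0]["generated_text"][len(prompt):]`. The slicing is there because, by default, the text-generation pipeline returns the prompt followed by the completion, so the prompt prefix has to be dropped to show only the new text. The sketch below illustrates this with stand-in values (`fake_outputs` is hypothetical; no model is loaded).

```python
# Stand-in values: these mimic the shape of the pipeline's return value.
prompt = "<rendered chat prompt>"
fake_outputs = [{"generated_text": prompt + "This is the completion."}]

# Drop the echoed prompt so only the newly generated text remains.
completion = fake_outputs[0]["generated_text"][len(prompt):]
print(completion)
```

As an alternative, the pipeline call accepts `return_full_text=False`, which should return only the completion and make the slice unnecessary.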

🤝 References

Base models:

Datasets:

Compute providers:

Recommended venv setup

python3 -m venv .venv
source .venv/bin/activate
pip3 install -r requirements.txt
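After activating the environment, it can be useful to confirm that Python is actually running inside it before installing requirements. A minimal sketch, using only the standard library; `in_virtualenv` is a hypothetical helper name.

```python
import sys


def in_virtualenv() -> bool:
    """Return True when the interpreter runs inside a venv-style environment."""
    # Inside a venv, sys.prefix points at the environment directory,
    # while sys.base_prefix still points at the base installation.
    return sys.prefix != sys.base_prefix


print("virtual environment active:", in_virtualenv())
```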

Owner

Name: Daniel Furman
Login: daniel-furman
Kind: user
Location: San Francisco
Company: @twosixcapital

Repositories: 6
Profile: https://github.com/daniel-furman

Master’s student, UC Berkeley School of Information. University of Pennsylvania alum. DS @twosixcapital. Prev MLE @understory.ai.

GitHub Events

Total

Watch event: 11
Push event: 7
Fork event: 1

Last Year

Watch event: 11
Push event: 7
Fork event: 1

Committers

Last synced: 5 months ago

All Time

Total Commits: 254
Total Committers: 1
Avg Commits per committer: 254.0
Development Distribution Score (DDS): 0.0

Past Year

Commits: 16
Committers: 1
Avg Commits per committer: 16.0
Development Distribution Score (DDS): 0.0

Top Committers

Name	Email	Commits
daniel-furman	d**n@g**m	254

Issues and Pull Requests

Last synced: 6 months ago

All Time

Total issues: 2
Total pull requests: 30
Average time to close issues: 1 day
Average time to close pull requests: less than a minute
Total issue authors: 2
Total pull request authors: 1
Average comments per issue: 1.5
Average comments per pull request: 0.0
Merged pull requests: 30
Bot issues: 0
Bot pull requests: 0

Past Year

Issues: 0
Pull requests: 0
Average time to close issues: N/A
Average time to close pull requests: N/A
Issue authors: 0
Pull request authors: 0
Average comments per issue: 0
Average comments per pull request: 0
Merged pull requests: 0
Bot issues: 0
Bot pull requests: 0

Science Score: 13.0%
