257-chameleon-plug-and-play-compositional-reasoning-with-large-language-models

https://github.com/szu-advtech-2024/257-chameleon-plug-and-play-compositional-reasoning-with-large-language-models

Science Score: 13.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
  • DOI references
  • Academic publication links
  • Academic email domains
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (3.5%) to scientific vocabulary
Last synced: 9 months ago · JSON representation

Repository

Basic Info
  • Host: GitHub
  • Owner: SZU-AdvTech-2024
  • Default Branch: main
  • Size: 0 Bytes
Statistics
  • Stars: 0
  • Watchers: 0
  • Forks: 0
  • Open Issues: 0
  • Releases: 0
Created over 1 year ago · Last pushed over 1 year ago
Metadata Files
Citation

https://github.com/SZU-AdvTech-2024/257-Chameleon-Plug-and-Play-Compositional-Reasoning-with-Large-Language-Models/blob/main/

# README

## 1.

openai 0.27.0

```
python==3.8.10
huggingface-hub
numpy==1.23.2
openai==0.27.0
pandas==1.4.3
transformers==4.21.1
requests==2.28.1
```

## 2.OpenAI API Key

OpenAI API model.pyOpenAI API Key

utilities.py

## 3.Google Search API Key

Google Search API model.pyGoogle Search API Key

## 4.ScienceQA

gpt4

```
cd run_scienceqa

python run.py \
--model chameleon \
--label chameleon_gpt4 \
--policy_engine gpt-4 \
--kr_engine gpt-4 \
--qg_engine gpt-4 \
--sg_engine gpt-4 \
--test_split test \
--test_number -1
```

gpt3.5

```
python run.py \
--model chameleon \
--label chameleon_gpt4 \
--policy_engine gpt-3.5-turbo \
--kr_engine gpt-3.5-turbo \
--qg_engine gpt-3.5-turbo \
--sg_engine gpt-3.5-turbo \
--test_split test \
--test_number -1
```



```
python evaluate.py \
--data_file ../data/scienceqa/problems.json \
--result_root ../my_results/scienceqa \
--result_files chameleon_chatgpt_test_cache.jsonl
```

## 5.TabMWP

gpt4

```
cd run_tabmwp

python run.py \
--model chameleon \
--label chameleon_gpt4 \
--test_split test \
--policy_engine gpt-4 \
--rl_engine gpt-4 \
--cl_engine gpt-4 \
--tv_engine gpt-4 \
--kr_engine gpt-4 \
--sg_engine gpt-4 \
--pg_engine gpt-4 \
--test_number -1 \
--rl_cell_threshold 18 \
--cl_cell_threshold 18
```

gpt3.5

```
python run.py \
--model chameleon \
--label chameleon_chatgpt \
--test_split test \
--policy_engine gpt-3.5-turbo \
--rl_engine gpt-3.5-turbo \
--cl_engine gpt-3.5-turbo \
--tv_engine gpt-3.5-turbo \
--kr_engine gpt-3.5-turbo \
--sg_engine gpt-3.5-turbo \
--pg_engine gpt-3.5-turbo \
--test_number -1 \
--rl_cell_threshold 18 \
--cl_cell_threshold 18
```



```
python evaluate.py \
--data_file ../data/tabmwp/problems_test.json \
--result_root ../my_results/tabmwp \
--result_files chameleon_chatgpt_test_cache.jsonl
```

Owner

  • Name: SZU-AdvTech-2024
  • Login: SZU-AdvTech-2024
  • Kind: organization

GitHub Events

Total
  • Push event: 3
  • Create event: 3
Last Year
  • Push event: 3
  • Create event: 3