citationintentopenllm
Science Score: 26.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
○CITATION.cff file
-
✓codemeta.json file
Found codemeta.json file -
✓.zenodo.json file
Found .zenodo.json file -
○DOI references
-
○Academic publication links
-
○Academic email domains
-
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (10.0%) to scientific vocabulary
Repository
Basic Info
- Host: GitHub
- Owner: athenarc
- Language: Jupyter Notebook
- Default Branch: main
- Size: 33.5 MB
Statistics
- Stars: 3
- Watchers: 2
- Forks: 0
- Open Issues: 0
- Releases: 0
Metadata Files
README.md
Citation Intent Open LLMs
Supplementary material for paper "Can LLMs Predict Citation Intent? An Experimental Analysis of In-context Learning and Fine-tuning on Open LLMs".
Experiential evaluation
Current top results for each model
| SciCite | ACL-ARC |
|---|---|
| | Rank | Model | F1-Score | | ---- | ------------------- | -------- | | 1 | Qwen 2.5 - 14B | 78.33 | | 2 | Gemma 2 - 27B | 77.86 | | 3 | Mistral Nemo - 12B | 77.39 | | 4 | Gemma 2 - 9B | 75.12 | | 5 | Phi 3 Medium - 14B | 74.67 | | 6 | LLaMA 3 - 8B | 74.39 | | 7 | Qwen 2 - 7B | 72.89 | | 8 | LLaMA 3.1 - 8B | 72.46 | | 9 | Gemma 2 - 2B | 68.79 | | 10 | Phi 3.5 Mini - 3.8B | 68.25 | | 11 | LLaMA 3.2 - 3B | 67.99 | | 12 | LLaMA 3.2 - 1B | 45.44 | | | Rank | Model | F1-Score | | ---- | ------------------- | -------- | | 1 | Qwen 2.5 - 14B | 63.68 | | 2 | Gemma 2 - 27B | 58.95 | | 3 | Gemma 2 - 9B | 57.19 | | 4 | Qwen 2 - 7B | 51.26 | | 5 | LLaMA 3.1 - 8B | 48.45 | | 6 | Mistral Nemo - 12B | 48.11 | | 7 | Phi 3.5 Mini - 3.8B | 43.74 | | 8 | Phi 3 Medium - 14B | 43.46 | | 9 | Gemma 2 - 2B | 40.96 | | 10 | LLaMA 3.2 - 3B | 40.07 | | 11 | LLaMA 3 - 8B | 38.06 | | 12 | LLaMA 3.2 - 1B | 24.60 | |
Instructions
Prerequisites
Support for additional inference providers is under development - LM Studio (version 0.3.10 or higher) - LM Studio CLI (lms)
Setup and Configuration
Configure Models
The default configuration includes all models used in the paper
- Open
experimental-configs/models.q8.json - Select your target models and specify their context lengths
- Open
Model Installation - Choose one of these methods to download the required models:
- Use the LM Studio UI
- Run the command:
lms get <model-name>
Experiment Configuration
In the default configuration, all parameters are selected
- Open
experimental-configs\experimens-cfg.json - Select your desired evaluation parameters
- Open
Running the Evaluation
- Navigate to the root directory
- Execute the evaluation script:
bash
python citation_intent_classification_experiments.py
Fine-tuning
Prerequisites
- LLaMA-Factory (commit:
24c7842) - All LLaMA-Factory dependencies installed
LLaMA-Factory is very quick to iterate, so later versions may not be totally compatible with the current config files - although the changes are usually very minor).
The training parameters in
llama-factory-configs/{dataset}/training_args.yamlare platform-independent and can be used with any Supervised Fine-tuning system.
Dataset Preparation
Copy Dataset Files
- Source locations: ``` datasets/aplacaformatscicite/ scicitetrainalpaca.json scicitedevalpaca.json
datasets/alpacaformat/acl-arc/ aclarctrainalpaca.json aclarcdev_alpaca.json ``
- Destination:LLaMA-Factory/data/`Update Dataset Information
- Add the following to
LLaMA-Factory/data/dataset_info.json:
json "scicite": { "file_name": "scicite_train_alpaca.json", "columns": { "prompt": "instruction", "query": "input", "response": "output", "system": "system" } }, "scicite-calibration": { "file_name": "scicite_dev_alpaca.json", "columns": { "prompt": "instruction", "query": "input", "response": "output", "system": "system" } }, "aclarc": { "file_name": "aclarc_train_alpaca.json", "columns": { "prompt": "instruction", "query": "input", "response": "output", "system": "system" } }, "aclarc-calibration": { "file_name": "aclarc_dev_alpaca.json", "columns": { "prompt": "instruction", "query": "input", "response": "output", "system": "system" } }
- Add the following to
Configuration Setup
- Create a new directory:
LLaMA-Factory/config/ - Copy all configuration files from
llama-factory-configs/to the new directory
Training
For this step consult the LLaMA-Factory docs as well.
Choose one of these methods:
1. GUI Method
- Launch LLaMA Board interface
- Load your configuration
- Start training run
2. CLI Method
bash
llamafactory-cli train path/to/training_args.yaml
Model Export
Export the model using the dev set of the selected dataset (either scicite_dev_alpaca.json or aclarc_dev_alpaca.json) as a calibration dataset
Optional: GGUF Conversion
To create GGUF model versions, install llama.cpp and run:
bash
python convert_hf_to_gguf_update.py
Owner
- Name: ATHENA RC
- Login: athenarc
- Kind: organization
- Email: github@athenarc.gr
- Location: Athens, Greece
- Website: www.athenarc.gr
- Repositories: 65
- Profile: https://github.com/athenarc
ATHENA Research & Innovation Information Technologies
GitHub Events
Total
- Watch event: 4
- Member event: 2
- Push event: 8
- Public event: 1
- Create event: 2
Last Year
- Watch event: 4
- Member event: 2
- Push event: 8
- Public event: 1
- Create event: 2