citationintentopenllm

https://github.com/athenarc/citationintentopenllm

Science Score: 26.0%

This score indicates how likely this project is to be science-related based on various indicators:

○
CITATION.cff file
✓
codemeta.json file
Found codemeta.json file
✓
.zenodo.json file
Found .zenodo.json file
○
DOI references
○
Academic publication links
○
Academic email domains
○
Institutional organization owner
○
JOSS paper metadata
○
Scientific vocabulary similarity
Low similarity (10.0%) to scientific vocabulary

Last synced: 10 months ago · JSON representation

Repository

Basic Info

Host: GitHub
Owner: athenarc
Language: Jupyter Notebook
Default Branch: main
Size: 33.5 MB

Statistics

Stars: 3
Watchers: 2
Forks: 0
Open Issues: 0
Releases: 0

Created over 1 year ago · Last pushed about 1 year ago

Metadata Files

Readme Citation

Citation Intent Open LLMs

Supplementary material for paper "Can LLMs Predict Citation Intent? An Experimental Analysis of In-context Learning and Fine-tuning on Open LLMs".

Experiential evaluation

Current top results for each model

SciCite	ACL-ARC
\| Rank \| Model \| F1-Score \| \| ---- \| ------------------- \| -------- \| \| 1 \| Qwen 2.5 - 14B \| 78.33 \| \| 2 \| Gemma 2 - 27B \| 77.86 \| \| 3 \| Mistral Nemo - 12B \| 77.39 \| \| 4 \| Gemma 2 - 9B \| 75.12 \| \| 5 \| Phi 3 Medium - 14B \| 74.67 \| \| 6 \| LLaMA 3 - 8B \| 74.39 \| \| 7 \| Qwen 2 - 7B \| 72.89 \| \| 8 \| LLaMA 3.1 - 8B \| 72.46 \| \| 9 \| Gemma 2 - 2B \| 68.79 \| \| 10 \| Phi 3.5 Mini - 3.8B \| 68.25 \| \| 11 \| LLaMA 3.2 - 3B \| 67.99 \| \| 12 \| LLaMA 3.2 - 1B \| 45.44 \|	\| Rank \| Model \| F1-Score \| \| ---- \| ------------------- \| -------- \| \| 1 \| Qwen 2.5 - 14B \| 63.68 \| \| 2 \| Gemma 2 - 27B \| 58.95 \| \| 3 \| Gemma 2 - 9B \| 57.19 \| \| 4 \| Qwen 2 - 7B \| 51.26 \| \| 5 \| LLaMA 3.1 - 8B \| 48.45 \| \| 6 \| Mistral Nemo - 12B \| 48.11 \| \| 7 \| Phi 3.5 Mini - 3.8B \| 43.74 \| \| 8 \| Phi 3 Medium - 14B \| 43.46 \| \| 9 \| Gemma 2 - 2B \| 40.96 \| \| 10 \| LLaMA 3.2 - 3B \| 40.07 \| \| 11 \| LLaMA 3 - 8B \| 38.06 \| \| 12 \| LLaMA 3.2 - 1B \| 24.60 \|

SciCite

ACL-ARC

| Rank | Model | F1-Score | | ---- | ------------------- | -------- | | 1 | Qwen 2.5 - 14B | 78.33 | | 2 | Gemma 2 - 27B | 77.86 | | 3 | Mistral Nemo - 12B | 77.39 | | 4 | Gemma 2 - 9B | 75.12 | | 5 | Phi 3 Medium - 14B | 74.67 | | 6 | LLaMA 3 - 8B | 74.39 | | 7 | Qwen 2 - 7B | 72.89 | | 8 | LLaMA 3.1 - 8B | 72.46 | | 9 | Gemma 2 - 2B | 68.79 | | 10 | Phi 3.5 Mini - 3.8B | 68.25 | | 11 | LLaMA 3.2 - 3B | 67.99 | | 12 | LLaMA 3.2 - 1B | 45.44 |

| Rank | Model | F1-Score | | ---- | ------------------- | -------- | | 1 | Qwen 2.5 - 14B | 63.68 | | 2 | Gemma 2 - 27B | 58.95 | | 3 | Gemma 2 - 9B | 57.19 | | 4 | Qwen 2 - 7B | 51.26 | | 5 | LLaMA 3.1 - 8B | 48.45 | | 6 | Mistral Nemo - 12B | 48.11 | | 7 | Phi 3.5 Mini - 3.8B | 43.74 | | 8 | Phi 3 Medium - 14B | 43.46 | | 9 | Gemma 2 - 2B | 40.96 | | 10 | LLaMA 3.2 - 3B | 40.07 | | 11 | LLaMA 3 - 8B | 38.06 | | 12 | LLaMA 3.2 - 1B | 24.60 |

Instructions

Prerequisites

Support for additional inference providers is under development - LM Studio (version 0.3.10 or higher) - LM Studio CLI (lms)

Setup and Configuration

Configure Models

The default configuration includes all models used in the paper
- Open experimental-configs/models.q8.json
- Select your target models and specify their context lengths
Model Installation - Choose one of these methods to download the required models:
- Use the LM Studio UI
- Run the command: lms get <model-name>
Experiment Configuration

In the default configuration, all parameters are selected
- Open experimental-configs\experimens-cfg.json
- Select your desired evaluation parameters

Running the Evaluation

Navigate to the root directory
Execute the evaluation script:

bash python citation_intent_classification_experiments.py

Fine-tuning

Prerequisites

LLaMA-Factory (commit: 24c7842)
All LLaMA-Factory dependencies installed

LLaMA-Factory is very quick to iterate, so later versions may not be totally compatible with the current config files - although the changes are usually very minor).

The training parameters in llama-factory-configs/{dataset}/training_args.yaml are platform-independent and can be used with any Supervised Fine-tuning system.

Dataset Preparation

Copy Dataset Files
- Source locations: ``` datasets/aplacaformatscicite/ scicitetrainalpaca.json scicitedevalpaca.json
datasets/alpacaformat/acl-arc/ aclarctrainalpaca.json aclarcdev_alpaca.json ``- Destination:LLaMA-Factory/data/`
Update Dataset Information
- Add the following to LLaMA-Factory/data/dataset_info.json:
  json "scicite": { "file_name": "scicite_train_alpaca.json", "columns": { "prompt": "instruction", "query": "input", "response": "output", "system": "system" } }, "scicite-calibration": { "file_name": "scicite_dev_alpaca.json", "columns": { "prompt": "instruction", "query": "input", "response": "output", "system": "system" } }, "aclarc": { "file_name": "aclarc_train_alpaca.json", "columns": { "prompt": "instruction", "query": "input", "response": "output", "system": "system" } }, "aclarc-calibration": { "file_name": "aclarc_dev_alpaca.json", "columns": { "prompt": "instruction", "query": "input", "response": "output", "system": "system" } }

Configuration Setup

Create a new directory: LLaMA-Factory/config/
Copy all configuration files from llama-factory-configs/ to the new directory

Training

For this step consult the LLaMA-Factory docs as well.

Choose one of these methods: 1. GUI Method - Launch LLaMA Board interface - Load your configuration - Start training run 2. CLI Method bash llamafactory-cli train path/to/training_args.yaml

Model Export

Export the model using the dev set of the selected dataset (either scicite_dev_alpaca.json or aclarc_dev_alpaca.json) as a calibration dataset

Optional: GGUF Conversion

To create GGUF model versions, install llama.cpp and run: bash python convert_hf_to_gguf_update.py

Owner

Name: ATHENA RC
Login: athenarc
Kind: organization
Email: github@athenarc.gr
Location: Athens, Greece

Website: www.athenarc.gr
Repositories: 65
Profile: https://github.com/athenarc

ATHENA Research & Innovation Information Technologies

GitHub Events

Total

Watch event: 4
Member event: 2
Push event: 8
Public event: 1
Create event: 2

Last Year

Watch event: 4
Member event: 2
Push event: 8
Public event: 1
Create event: 2

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Open Source Science

citationintentopenllm

Science Score: 26.0%

Repository

Basic Info

Statistics

Metadata Files

README.md

Citation Intent Open LLMs

Experiential evaluation

Current top results for each model

Instructions

Prerequisites

Setup and Configuration

Running the Evaluation

Fine-tuning

Prerequisites

Dataset Preparation

Configuration Setup

Training

Model Export

Optional: GGUF Conversion

Owner

GitHub Events

Total

Last Year