
Citation Intent Open LLMs

Supplementary material for paper "Can LLMs Predict Citation Intent? An Experimental Analysis of In-context Learning and Fine-tuning on Open LLMs".

Experimental evaluation

Current top results for each model

SciCite

| Rank | Model | F1-Score |
|------|-------|----------|
| 1 | Qwen 2.5 - 14B | 78.33 |
| 2 | Mistral Nemo - 12B | 77.38 |
| 3 | Gemma 2 - 27B | 76.16 |
| 4 | Gemma 2 - 9B | 74.97 |
| 5 | Phi 3 Medium - 14B | 74.67 |
| 6 | LLaMA 3 - 8B | 74.39 |
| 7 | Qwen 2 - 7B | 72.89 |
| 8 | LLaMA 3.1 - 8B | 72.46 |
| 9 | Gemma 2 - 2B | 68.79 |
| 10 | Phi 3.5 Mini - 3.8B | 68.25 |
| 11 | LLaMA 3.2 - 3B | 67.99 |
| 12 | LLaMA 3.2 - 1B | 45.44 |

ACL-ARC

| Rank | Model | F1-Score |
|------|-------|----------|
| 1 | Qwen 2.5 - 14B | 61.04 |
| 2 | Gemma 2 - 9B | 57.19 |
| 3 | Gemma 2 - 27B | 57.00 |
| 4 | Mistral Nemo - 12B | 48.11 |
| 5 | Qwen 2 - 7B | 47.50 |
| 6 | LLaMA 3.1 - 8B | 46.43 |
| 7 | Phi 3.5 Mini - 3.8B | 41.82 |
| 8 | Phi 3 Medium - 14B | 39.30 |
| 9 | LLaMA 3.2 - 3B | 37.82 |
| 10 | LLaMA 3 - 8B | 37.29 |
| 11 | Gemma 2 - 2B | 36.80 |
| 12 | LLaMA 3.2 - 1B | 24.60 |
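
As a point of reference for reading the tables, below is a minimal sketch of computing a macro-averaged F1 with scikit-learn. Macro averaging and the example labels are assumptions here; the evaluation script in this repository is the authoritative metric implementation.

```python
# Minimal sketch of the metric reported above, assuming macro-averaged F1.
# Labels shown are SciCite-style intents, for illustration only.
from sklearn.metrics import f1_score

y_true = ["background", "method", "result", "background"]      # gold intents
y_pred = ["background", "method", "background", "background"]  # predicted intents

print(f"Macro F1: {f1_score(y_true, y_pred, average='macro'):.2f}")
```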

Instructions

Prerequisites

LM Studio is currently the supported inference provider (it is used below to download and serve the models). Support for additional inference providers is under development.

Setup and Configuration

  1. Configure Models

    The default configuration includes all models used in the paper.

    • Open experimental-configs/models.q8.json
    • Select your target models and specify their context lengths (see the sketch after this list for a hypothetical entry)
  2. Model Installation - Choose one of these methods to download the required models:

    • Use the LM Studio UI
    • Run the command: lms get <model-name>
  3. Experiment Configuration

    In the default configuration, all parameters are selected.

    • Open experimental-configs/experimens-cfg.json
    • Select your desired evaluation parameters
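
For orientation, a models.q8.json entry presumably pairs each model with its context length. The snippet below is a hypothetical illustration only (the field names and values are assumptions, not the shipped schema), followed by a matching LM Studio CLI download:

```json
{
  "models": [
    { "name": "qwen2.5-14b-instruct", "context_length": 32768 },
    { "name": "gemma-2-9b-instruct", "context_length": 8192 }
  ]
}
```

```bash
# Hypothetical example: download one of the evaluated models via the LM Studio CLI.
lms get qwen2.5-14b-instruct
```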

Running the Evaluation

  1. Navigate to the root directory
  2. Execute the evaluation script:
```bash
python citation_intent_classification_experiments.py
```

Fine-tuning

Prerequisites

LLaMA-Factory iterates quickly, so later versions may not be fully compatible with the config files in this repository, although the required changes are usually minor.

The training parameters in llama-factory-configs/{dataset}/training_args.yaml are platform-independent and can be used with any supervised fine-tuning framework.
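
As a rough orientation, a LLaMA-Factory SFT configuration of this shape looks like the following. This is an illustrative sketch only: the model, hyperparameters, and paths are assumptions, and the shipped training_args.yaml files are authoritative.

```yaml
# Illustrative LLaMA-Factory SFT config; all values are assumptions,
# see llama-factory-configs/{dataset}/training_args.yaml for the real ones.
model_name_or_path: meta-llama/Meta-Llama-3-8B-Instruct
stage: sft
do_train: true
finetuning_type: lora
lora_target: all
dataset: scicite
template: llama3
cutoff_len: 2048
per_device_train_batch_size: 2
gradient_accumulation_steps: 8
learning_rate: 1.0e-4
num_train_epochs: 3.0
output_dir: saves/llama3-8b-scicite
```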

Dataset Preparation

  1. Copy Dataset Files
    • Source locations:
      datasets/alpaca_format_scicite/
      ├── scicite_train_alpaca.json
      └── scicite_dev_alpaca.json

      datasets/alpaca_format/acl-arc/
      ├── aclarc_train_alpaca.json
      └── aclarc_dev_alpaca.json

    • Destination: LLaMA-Factory/data/
  2. Update Dataset Information
    • Add the following entries to LLaMA-Factory/data/dataset_info.json:

```json
"scicite": {
    "file_name": "scicite_train_alpaca.json",
    "columns": {
        "prompt": "instruction",
        "query": "input",
        "response": "output",
        "system": "system"
    }
},
"scicite-calibration": {
    "file_name": "scicite_dev_alpaca.json",
    "columns": {
        "prompt": "instruction",
        "query": "input",
        "response": "output",
        "system": "system"
    }
},
"aclarc": {
    "file_name": "aclarc_train_alpaca.json",
    "columns": {
        "prompt": "instruction",
        "query": "input",
        "response": "output",
        "system": "system"
    }
},
"aclarc-calibration": {
    "file_name": "aclarc_dev_alpaca.json",
    "columns": {
        "prompt": "instruction",
        "query": "input",
        "response": "output",
        "system": "system"
    }
}
```
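
The mapping above points LLaMA-Factory's prompt/query/response/system fields at the Alpaca-format keys used in the dataset files. A hypothetical record (contents invented for illustration; the actual prompts live in the dataset files) looks like:

```json
{
  "system": "You are a citation intent classifier.",
  "instruction": "Classify the intent of the citation in the sentence below.",
  "input": "Previous work has shown that X improves Y (Smith et al., 2019).",
  "output": "background"
}
```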

Configuration Setup

  1. Create a new directory: LLaMA-Factory/config/
  2. Copy all configuration files from llama-factory-configs/ to the new directory

Training

For this step, consult the LLaMA-Factory documentation as well.

Choose one of these methods:

  1. GUI Method
    • Launch LLaMA Board interface
    • Load your configuration
    • Start training run
  2. CLI Method
    llamafactory-cli train path/to/training_args.yaml
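
For example, with the configuration files copied into LLaMA-Factory/config/ as described above, a SciCite run would look roughly like this (the exact subdirectory layout follows llama-factory-configs/):

```bash
llamafactory-cli train config/scicite/training_args.yaml
```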

Model Export

Export the model using the dev set of the selected dataset (scicite_dev_alpaca.json or aclarc_dev_alpaca.json) as the calibration dataset.
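
In LLaMA-Factory, export is driven by a YAML config passed to llamafactory-cli export, and the dev set enters as the quantization calibration dataset. A hypothetical sketch (model, adapter, and paths are assumptions):

```yaml
# Illustrative export config; adjust model, adapter, and paths to your run.
model_name_or_path: meta-llama/Meta-Llama-3-8B-Instruct
adapter_name_or_path: saves/llama3-8b-scicite
template: llama3
finetuning_type: lora
export_dir: models/llama3-8b-scicite
export_quantization_bit: 8
export_quantization_dataset: data/scicite_dev_alpaca.json
export_size: 2
export_legacy_format: false
```

```bash
llamafactory-cli export path/to/export_args.yaml
```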

Optional: GGUF Conversion

To create GGUF versions of the exported models, install llama.cpp and run its conversion script:

python convert_hf_to_gguf.py <path_to_exported_model>
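
For instance (paths and quantization type are assumptions; q8_0 mirrors the 8-bit models evaluated above):

```bash
# Convert the exported HF model to GGUF with 8-bit quantization.
python convert_hf_to_gguf.py models/llama3-8b-scicite --outfile llama3-8b-scicite.q8_0.gguf --outtype q8_0
```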
