This repository provides tools and demonstrations for evaluating and fine-tuning large language models using a refined reward modeling framework.
- `reft_demo.py` / `reft_demo.ipynb`: Entry point for demonstrating model reward evaluation and generation.
- `reft_OG/`: Core implementation of the reward-based fine-tuning and evaluation modules.
- `examples/`: Multiple subfolders showcasing practical applications, including LoRA, ICL, DPO, and more.
- `pyreft/`: Core configuration, trainer, and model adaptation code.
- `reft_and_lora/`: Simple contrast setup between pure LoRA and integrated ReFT+LoRA.
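
As a quick orientation to the `pyreft/` pieces, here is a minimal sketch of attaching a low-rank intervention to a causal LM with `pyreft`. The model name, layer index, and rank are illustrative assumptions, not the repo's defaults.

```python
# Minimal sketch: model name, layer, and rank are illustrative choices.
import torch
import transformers
import pyreft

model_name = "meta-llama/Llama-2-7b-hf"  # assumed model; any causal LM works
model = transformers.AutoModelForCausalLM.from_pretrained(
    model_name, torch_dtype=torch.bfloat16, device_map="cuda"
)

# Attach a low-rank ReFT intervention to one layer's residual stream output.
reft_config = pyreft.ReftConfig(
    representations={
        "layer": 8,
        "component": "block_output",
        "low_rank_dimension": 4,
        "intervention": pyreft.LoreftIntervention(
            embed_dim=model.config.hidden_size, low_rank_dimension=4
        ),
    }
)
reft_model = pyreft.get_reft_model(model, reft_config)
reft_model.print_trainable_parameters()  # only the intervention params train
```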
- Modular training and evaluation of models using custom rewards
- LoRA and reward intervention compatibility
- End-to-end demo notebooks for ICL, reward tuning, memorization, safety, and more
- Plotting utilities for evaluating training performance
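
The repo's own plotting utilities are not reproduced here; the sketch below is a generic matplotlib example of the kind of training curve they produce, with toy numbers for illustration.

```python
# Illustrative plot of reward over training steps (toy data, not real results).
import matplotlib.pyplot as plt

steps = [0, 50, 100, 150, 200]
mean_reward = [0.12, 0.35, 0.48, 0.61, 0.66]  # placeholder values

plt.plot(steps, mean_reward, marker="o")
plt.xlabel("training step")
plt.ylabel("mean reward on eval prompts")
plt.title("Reward tuning progress (illustrative)")
plt.savefig("reward_curve.png", dpi=150)
```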
- Install dependencies listed in your environment or Dockerfile.
- Run one of the provided demo scripts or notebooks.
- Customize datasets, templates, or reward functions as needed.
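
For example, a custom reward can be as simple as a Python callable that scores a generation against a reference. The `(prompt, completion, target)` signature below is a hypothetical illustration, not the repo's actual interface.

```python
# Hypothetical reward function; adapt the signature to however the demos
# score generations.
def exact_match_reward(prompt: str, completion: str, target: str) -> float:
    """Toy reward: bonus for containing the target answer, minus a length penalty."""
    match_bonus = 1.0 if target.strip().lower() in completion.lower() else 0.0
    length_penalty = 0.001 * max(0, len(completion.split()) - 64)
    return match_bonus - length_penalty

print(exact_match_reward("Q: 2 + 2 = ?", "The answer is 4.", "4"))  # 1.0
```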
This project is made available for research and educational purposes.
conda create -n reft python=3.10 # python >=3.9 is required
conda activate reft
# IMPORTANT: check your CUDA version before running the following step and match w/ that instead
conda install transformers pytorch torchvision torchaudio pytorch-cuda=12.1 -c pytorch -c nvidia #I believe it is 12.4 on the lab machines
pip install nnsight pyreft
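
After installing, a quick sanity check (assuming a CUDA-capable machine) confirms the key packages import and the GPU is visible:

```python
# Post-install sanity check (assumes a CUDA GPU is present).
import torch
import transformers
import pyreft
import nnsight

print("torch:", torch.__version__, "| CUDA available:", torch.cuda.is_available())
print("transformers:", transformers.__version__)
```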
- Make a `.hf_token` file and put your HuggingFace token directly in there (see the sketch after this list for one way to use it)
- Make sure you have access to LLaMA 2
- Run the standard `condor_submit train.sub` on a CHTC machine
- You can also run the Docker container in interactive mode using the `-i` flag on the previous command. Note that it won't execute `exec.sh`, though; you'll have to do that yourself if you want it to run
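
For local use, the `.hf_token` file can be consumed with `huggingface_hub`'s `login()`; whether the repo's scripts read the file exactly this way is an assumption.

```python
# Sketch of reading .hf_token and authenticating with the Hugging Face Hub.
# (login() is a real huggingface_hub call; the file-reading convention is assumed.)
from huggingface_hub import login

with open(".hf_token") as f:
    hf_token = f.read().strip()

login(token=hf_token)  # needed for gated models such as LLaMA 2
```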