Skip to content

Latest commit

 

History

History

reward_model

Folders and files

NameName
Last commit message
Last commit date

parent directory

..
 
 
 
 

Running Scripts for the AI Feedback Module

Download The Model

Download the llama-7b-hf model from Hugging Face

Download The Dataset

cd dataset

Download the dataset for empathetic response classification from behavioral-data/Empathy-Mental-Health.

Download the dataset for strategy classification from Motivational-Interviewing-Dataset.

Process the dataset (including obtain the dataset for coherence classification) using the following command:

python data_process.py
cd ../

Instruction Tuning

python finetune.py --base_model decapoda-research/llama-7b-hf --output_dir ./lora-alpaca 

Test the Multifaceted AI Feedback Model

python test.py --base_model decapoda-research/llama-7b-hf --lora_weight ./lora-alpaca