Example application of a parameter-efficient fine-tuning (PEFT) approach to a sentence pair classification task (Stance Detection).
QLoRA (Quantized Low-Rank Adaptation) essentially combines two approaches:
- Quantization: Loading and fine-tuning the model parameters at a lower bit precision to fit more parameters into the same memory
- Low-Rank Adaptation: Freezing the pre-trained model weights while injecting much smaller rank decomposition matrices into each layer and fine-tuning only those (see the sketch below)
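A minimal sketch of how such a QLoRA setup can be assembled with `transformers`, `bitsandbytes` and `peft`. The model name, LoRA rank, alpha, dropout and target modules below are illustrative placeholders, not the exact configuration used for the results reported here:

```python
import torch
from transformers import AutoModelForSequenceClassification, BitsAndBytesConfig
from peft import LoraConfig, get_peft_model, prepare_model_for_kbit_training

# Quantization: load the base model weights in 4-bit NF4 precision
bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_compute_dtype=torch.bfloat16,
)

model = AutoModelForSequenceClassification.from_pretrained(
    "mistralai/Mistral-7B-v0.1",   # any of the four models listed below
    num_labels=3,                  # Pro, Contra, Neutral
    quantization_config=bnb_config,
    device_map="auto",
)

# Low-Rank Adaptation: freeze the quantized weights and train only small adapter matrices
model = prepare_model_for_kbit_training(model)
lora_config = LoraConfig(
    task_type="SEQ_CLS",
    r=16,                          # illustrative rank
    lora_alpha=32,
    lora_dropout=0.05,
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj"],  # illustrative choice
)
model = get_peft_model(model, lora_config)
model.print_trainable_parameters()  # reports trainable vs. total parameters
```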
Using these approaches, four language models were fine-tuned on the task for three epochs on a single RTX 4080 GPU (see the results table below).
Each model was fine-tuned on a 5,000-sentence Stance Detection corpus that I manually annotated during my Master's Thesis. Stance Detection aims to classify the stance a sentence takes towards a claim (topic) as either Pro, Contra or Neutral. The sentences originate from Reddit's r/ChangeMyView subreddit, posted between January 2013 and October 2018, as provided in the ConvoKit subreddit corpus. They cover five topics: abortion, climate change, gun control, minimum wage and veganism. The table below shows some examples; a sketch of how such a claim/sentence pair can be encoded for the model follows after the table.
| Topic (claim) | Sentence | Stance label |
|---|---|---|
| There should be more gun control. | It's the only country with a "2nd Amendment", yet 132 countries have a lower murder rate. | Pro |
| Humanity needs to combat climate change. | The overhwelming evidence could be lies and you would never know because you're content to live your life as a giant appeal to authority. | Contra |
| Vegans are right. | It's all about finding a system that works for you. | Neutral |
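For sentence pair classification, the claim and the sentence can be passed to the tokenizer as a sequence pair. The exact preprocessing used in this repository is not shown here, so the snippet below is only an assumed setup; the label mapping and column names are illustrative:

```python
from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("mistralai/Mistral-7B-v0.1")
# decoder-only models often ship without a pad token; reuse EOS as padding (assumption)
if tokenizer.pad_token is None:
    tokenizer.pad_token = tokenizer.eos_token

label2id = {"Pro": 0, "Contra": 1, "Neutral": 2}  # illustrative label mapping

def encode(example):
    # the claim (topic) and the sentence are encoded together as a sequence pair
    enc = tokenizer(example["topic"], example["sentence"],
                    truncation=True, max_length=512)
    enc["labels"] = label2id[example["stance"]]
    return enc
```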
| Model | Total parameters | Trainable parameters (PEFT) | Accuracy | Micro-F1 | Macro-F1 |
|---|---|---|---|---|---|
| Mistral-7B-v0.1 | 7,124,316,160 | 13,643,776 (0.192%) | 0.87 | 0.87 | 0.87 |
| Mistral-7B-v0.3 | 7,127,461,888 | 13,643,776 (0.191%) | 0.88 | 0.88 | 0.88 |
| Llama-3-8B | 7,518,580,736 | 13,643,776 (0.182%) | 0.85 | 0.85 | 0.85 |
| Llama-3.1-8B | 7,518,580,736 | 13,643,776 (0.182%) | 0.87 | 0.87 | 0.86 |
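The reported scores correspond to the standard scikit-learn metric functions. A minimal sketch of how they can be computed, with placeholder labels standing in for the model's test-set predictions:

```python
from sklearn.metrics import accuracy_score, f1_score

# placeholder labels; in practice these come from the fine-tuned model's test run
y_true = ["Pro", "Contra", "Neutral", "Pro"]
y_pred = ["Pro", "Contra", "Pro", "Pro"]

print("Accuracy:", accuracy_score(y_true, y_pred))
print("Micro-F1:", f1_score(y_true, y_pred, average="micro"))
print("Macro-F1:", f1_score(y_true, y_pred, average="macro"))
```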
```
pytorch==2.4.0
cudatoolkit=12.1
```

```
transformers
datasets
sentencepiece
protobuf
peft
bitsandbytes
openpyxl
scikit-learn
```
The dataset files in this repository are cut off after the first 50 rows.
The trained model files `adapter_model.safetensors`, `optimizer.pt` and `tokenizer.json` are omitted from this repository.