Unsloth training script

Introduction

Script to execute the fine-tuning of the Llama 3 (or other language) model with the artificially generated query data.

These are the steps that need to be taken in order to execute the script.

Add the generated dataset files in the "data" folder. There should be two .tsv files for the train and the dev set.
Edit the training parameters and other settings in the shell script "scripts/train_unsloth.sh"
To allow for the pushing of the trained model to huggingface, create a ".env" file and add the token via a variable "HF_TOKEN" in the unsloth-training folder
While in the unsloth-training folder, execute the shell script via the command "sh scripts/train_unsloth.sh"
The final trained model will be saved in the folder "models", while test results are stored in "test_results"

The model is now trained and can be used for benchmarking or for deployment, e.g. via huggingface.

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
data		data
logs		logs
models		models
scripts		scripts
test_results		test_results
README.md		README.md
train_unsloth.py		train_unsloth.py