Skip to content

Official repository for the paper 'Meta-Reasoning Improves Tool Use in Large Language Models'.

License

Notifications You must be signed in to change notification settings

lisaalaz/tecton

Repository files navigation

Official repository for Meta-Reasoning Improves Tool Use in Large Language Models

ToolDec

Instructions

Before running the repository, the Llama3 8B Instruct checkpoint must be obtained. See instructions here.

Installation:

git clone https://github.com/lisaalaz/tecton.git
cd tecton
python -m venv 'tecton_env'
. tecton_env/bin/activate
pip install -r requirements.txt

To fine-tune on GSM8K-XL:

CUDA_VISIBLE_DEVICES=0 python -m torch.distributed.run --nproc_per_node 1 --master_port 1200 train.py \
--ckpt_dir llama3/8B \
--tokenizer_path llama3/8B/tokenizer.model \
--input_file data/gsm8k-xl/train.json \
--lr 1e-3 \
--num_epochs 10 \
--dataset gsm8k-xl

To run inference on GSM8K-XL with TECTON-score:

CUDA_VISIBLE_DEVICES=0 python -m torch.distributed.run --nproc_per_node 1 --master_port 3000 inference.py \
--ckpt_dir llama3/8B-instruct \
--tokenizer_path llama3/8B-instruct/tokenizer.model \
--mode score \
--dataset gsm8k-xl \
--func_load_path checkpoints/best_checkpoint.pth \
--logits_bias 3.0

To run inference on GSM8K-XL with TECTON-generate:

CUDA_VISIBLE_DEVICES=0 python -m torch.distributed.run --nproc_per_node 1 --master_port 3000 inference.py \
--ckpt_dir llama3/8B-instruct \
--tokenizer_path llama3/8B-instruct/tokenizer.model \
--mode generate \
--dataset gsm8k-xl \
--func_load_path checkpoints/best_checkpoint.pth \
--logits_bias 3.0

Citation

@misc{alazraki2024metareasoningimprovestooluse,
      title={Meta-Reasoning Improves Tool Use in Large Language Models}, 
      author={Lisa Alazraki and Marek Rei},
      year={2024},
      eprint={2411.04535},
      archivePrefix={arXiv},
      primaryClass={cs.CL},
      url={https://arxiv.org/abs/2411.04535}, 
}

About

Official repository for the paper 'Meta-Reasoning Improves Tool Use in Large Language Models'.

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages