About Llama2

MaxText supports pretraining, finetuning, and decoding for the 7B and 70B variants of Llama2. To get started with decoding or finetuning, first download the model weights and the tokenizer from Meta.

The script test_llama2_7b.sh shows how to convert the PyTorch weights to Orbax checkpoint format and then use the converted checkpoint for decoding and finetuning. It also shows how to run pretraining and how to run decoding on the finetuned model checkpoint.

MaxText supports pretraining and finetuning with high performance.

Model FLOPs utilization (MFU) for training on v4, v5p, and v5e TPUs with MaxText:

| Model | v4-128 (bf16) | v5p-128 (bf16) | v5e-256 (bf16) |
| --- | --- | --- | --- |
| Llama2-70b | 57% | 65% | 57% |
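
As a rough guide to reading the utilization figures above, MFU is the ratio of the FLOPs the model needs per second to the hardware's peak FLOPs per second. The sketch below is illustrative only: it assumes the common approximation of 6 FLOPs per parameter per training token (which ignores attention FLOPs and may differ from how MaxText computes its own numbers), the function name `training_mfu` is hypothetical, the throughput value is made up, and the 275 TFLOP/s peak is an assumed bf16 figure for one v4 chip that should be checked against the current TPU documentation.

```python
def training_mfu(num_params: float,
                 tokens_per_sec_per_chip: float,
                 peak_flops_per_chip: float) -> float:
    """Approximate model FLOPs utilization for one training step.

    Uses the 6 * N FLOPs-per-token estimate (forward + backward pass),
    which is an approximation and not necessarily MaxText's exact formula.
    """
    model_flops_per_token = 6.0 * num_params
    achieved_flops_per_sec = model_flops_per_token * tokens_per_sec_per_chip
    return achieved_flops_per_sec / peak_flops_per_chip


# Hypothetical inputs, for illustration only (not measured results).
NUM_PARAMS = 70e9            # Llama2-70b parameter count
TOKENS_PER_SEC = 400.0       # made-up per-chip training throughput
PEAK_FLOPS_V4 = 275e12       # assumed bf16 peak for one v4 chip

mfu = training_mfu(NUM_PARAMS, TOKENS_PER_SEC, PEAK_FLOPS_V4)
print(f"Approximate MFU: {mfu:.1%}")
```

Passing the peak FLOPs in as a parameter keeps the same function usable for other chip generations once the corresponding peak value is substituted.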
