Convert2HF

NinedayWang

and

VHellendoorn

Update Convert2HF/README.md

Oct 18, 2022

49d8698 · Oct 18, 2022

History

Name	Name	Last commit message	Last commit date
parent directory ..
polycoder/configs	polycoder/configs	Extend to support models of different sizes, and add "argparse" and R…	Oct 14, 2022
README.md	README.md	Update Convert2HF/README.md	Oct 18, 2022
convert.sh	convert.sh	Parameterize the model size	Oct 18, 2022
convert_neox_pt_to_huggingface_neox.py	convert_neox_pt_to_huggingface_neox.py	Extend to support models of different sizes, and add "argparse" and R…	Oct 14, 2022
generate.py	generate.py	Extend to support models of different sizes, and add "argparse" and R…	Oct 14, 2022

README.md

Convert to HuggingFace

This directory contains a script convert_neox_pt_to_huggingface_neox.py to convert PolyCoder checkpoints trained by gpt-neox into HuggingFace format, and a script generate.py to load the converted model and generate code from a given prompt. Shoutout to @NinedayWang for implementing this!

Environment

transformers 4.23.1

Convert

You can use the convert.sh script to convert specified model to the HuggingFace format, using ./convert.sh 0-4B (or pass a different model size). This script in turn invokes convert_neox_pt_to_huggingface_neox.py, which you can also call directly as follows:

python convert_neox_pt_to_huggingface_neox.py \
    --checkpoint_dir ../checkpoints/checkpoints-0-4B/global_step150000 \
    --vocab_file ../Data/code-vocab.json \
    --merge_file ../Data/code-merges.txt \
    --hf_config_path ./polycoder/configs/config_0-4B.json \
    --hf_save_dir ./polycoder/0-4B

HuggingFace configuration files for different size models are provided in polycoder/configs/, including config_0-4B.json, config_2-7B.json and config_160M.json.

After running, you can get a complete HuggingFace model in the directory specified by hf_save_dir. If the directory does not exist, it can be built automatically.

Generate

The following is an example to load the converted 0.4B HuggingFace model and generate code from a given prompt:

python generate.py \
    --model_name_or_path ./polycoder/0-4B \
    --temperature 0.2 \
    --top_p 0.95 \
    --max_length 128

You can evaluate models of other sizes by specifying model_name_or_path.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Files

Convert2HF

Convert2HF

README.md

Convert to HuggingFace

Environment

Convert

Generate

Collapse file tree

Files

Convert2HF

Directory actions

More options

Directory actions

More options

Latest commit

History

Convert2HF

Folders and files

parent directory

README.md

Convert to HuggingFace

Environment

Convert

Generate