TinyGPT

A tiny GPT-2 inference implementation written from scratch in C++11, based mainly on the picoGPT project.

Accompanying blog post: Write a GPT from scratch (TinyGPT)

Core classes

Tensor: a tensor class with a NumPy-like interface.
Model: GPT-2 model implementation with reference to gpt2_pico.py.
Tokenizer: BPE tokenizer with exactly the same logic as GPT-2 encoder.py.
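
As a rough illustration of the kind of element-wise and row-wise operations a GPT-2 forward pass needs, here is a minimal C++11 sketch of GELU (tanh approximation) and a numerically stable softmax, following the formulas used in picoGPT. The function names `gelu` and `softmax` and the use of `std::vector<float>` are assumptions for illustration only; they are not taken from TinyGPT's Tensor or Model classes.

```cpp
#include <algorithm>
#include <cassert>
#include <cmath>
#include <cstddef>
#include <vector>

// GELU activation, tanh approximation (the form used by GPT-2 / picoGPT).
// Hypothetical helper, not TinyGPT's actual API.
inline float gelu(float x) {
  const float kSqrt2OverPi = 0.7978845608f;  // sqrt(2 / pi)
  return 0.5f * x * (1.0f + std::tanh(kSqrt2OverPi * (x + 0.044715f * x * x * x)));
}

// Softmax over one row of logits. Subtracting the maximum before
// exponentiating avoids overflow without changing the result.
inline std::vector<float> softmax(const std::vector<float>& logits) {
  assert(!logits.empty());
  const float maxVal = *std::max_element(logits.begin(), logits.end());
  std::vector<float> out(logits.size());
  float sum = 0.0f;
  for (std::size_t i = 0; i < logits.size(); ++i) {
    out[i] = std::exp(logits[i] - maxVal);
    sum += out[i];
  }
  for (std::size_t i = 0; i < out.size(); ++i) {
    out[i] /= sum;
  }
  return out;
}
```

A real tensor class would apply these along a chosen axis of a multi-dimensional array; the flat-vector version above only shows the arithmetic.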

Build and Run

1. Get the code

git clone --recurse-submodules https://github.com/keith2018/TinyGPT.git

2. Install Intel MKL (Math Kernel Library)

Official website: Intel®-Optimized Math Library for Numerical Computing on CPUs & GPUs

3. Download GPT-2 model file

python3 tools/download_gpt2_model.py

On success, the file model_file.data appears in the directory assets/gpt2.

4. Build and Run

mkdir build
cmake -B ./build -DCMAKE_BUILD_TYPE=Release
cmake --build ./build --config Release

This generates the executable and copies the assets to the directory app/bin. You can then run the demo:

cd app/bin
./TinyGPT_demo
[DEBUG] TIMER TinyGPT::Model::loadModelGPT2: cost: 800 ms
[DEBUG] TIMER TinyGPT::Encoder::getEncoder: cost: 191 ms
INPUT:Alan Turing theorized that computers would one day become
GPT:the most powerful machines on the planet.
INPUT:exit
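
The completion shown above is produced one token at a time. picoGPT, the reference project, picks the highest-scoring token at each step (greedy decoding); the sketch below shows that loop in C++11 under the assumption that TinyGPT does something comparable. The names `argmaxToken` and `generateGreedy`, and the `forward` callable that returns next-token logits, are hypothetical and not part of TinyGPT.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Index of the largest logit, i.e. the greedy choice for the next token id.
inline int argmaxToken(const std::vector<float>& logits) {
  assert(!logits.empty());
  std::size_t best = 0;
  for (std::size_t i = 1; i < logits.size(); ++i) {
    if (logits[i] > logits[best]) best = i;
  }
  return static_cast<int>(best);
}

// Greedy generation loop. `forward` is a hypothetical callable that takes
// the token ids seen so far and returns the logits for the next token.
// Returns only the newly generated token ids.
template <typename Forward>
std::vector<int> generateGreedy(Forward forward, std::vector<int> tokens,
                                int maxNewTokens) {
  const std::ptrdiff_t promptLen = static_cast<std::ptrdiff_t>(tokens.size());
  for (int step = 0; step < maxNewTokens; ++step) {
    const std::vector<float> logits = forward(tokens);
    tokens.push_back(argmaxToken(logits));
  }
  return std::vector<int>(tokens.begin() + promptLen, tokens.end());
}
```

In a full pipeline the prompt would first be encoded to token ids by the tokenizer, and the returned ids decoded back to text.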

Dependencies

GEMM acceleration
- intel-mkl https://www.intel.com/content/www/us/en/developer/tools/oneapi/onemkl.html
JSON parser
- json11 https://github.com/dropbox/json11
Tokenizer regular-expression matching
- re2 https://github.com/google/re2
- abseil-cpp https://github.com/abseil/abseil-cpp
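
To make the "GEMM acceleration" entry concrete: GEMM is general matrix-matrix multiplication, which dominates the cost of a transformer forward pass. The sketch below is a plain, unoptimized C++11 reference for single-precision row-major matrices; an optimized library such as MKL provides the same operation through the standard CBLAS routine `cblas_sgemm`. The function name `matmul` and the flat `std::vector<float>` layout are assumptions for illustration, not TinyGPT's code.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Reference GEMM: C (m x n) = A (m x k) * B (k x n), all row-major.
// Hypothetical helper showing the operation a BLAS library accelerates.
inline std::vector<float> matmul(const std::vector<float>& a,
                                 const std::vector<float>& b,
                                 std::size_t m, std::size_t k, std::size_t n) {
  assert(a.size() == m * k);
  assert(b.size() == k * n);
  std::vector<float> c(m * n, 0.0f);
  for (std::size_t i = 0; i < m; ++i) {
    for (std::size_t p = 0; p < k; ++p) {
      const float aip = a[i * k + p];
      // The innermost loop walks rows of B and C contiguously.
      for (std::size_t j = 0; j < n; ++j) {
        c[i * n + j] += aip * b[p * n + j];
      }
    }
  }
  return c;
}
```

The i-p-j loop order keeps memory access sequential, but a tuned BLAS adds blocking, vectorization, and threading on top, which is why the dependency is worthwhile.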

License

This code is licensed under the MIT License (see LICENSE).
