GitHub - RParedesPalacios/llama2.cpp: Inference Llama 2 in C++ to use Eigen

llama2.cpp

Forked from Karpathy repo llama2.c

I implementled a modification over llama2.c to use Eigen for fast matrix multiplication. I did it trying to minimze the modifications over the original C version.

Eigen requires C++. Then to compile use:

g++ -I. -Ofast -fopenmp run.cpp  -lm  -o run

This version is not faster in my old computer but using Eigen you can use AVX2 (among others) set of instructions that could speedup the inference.

License

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 167 Commits
Eigen		Eigen
LICENSE		LICENSE
README.md		README.md
run.cpp		run.cpp

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

llama2.cpp

License

About

Uh oh!

Releases

Packages

Languages

License

RParedesPalacios/llama2.cpp

Folders and files

Latest commit

History

Repository files navigation

llama2.cpp

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages