forked from ggerganov/llama.cpp
Model support status #6
hipudding added and removed the enhancement label on Jul 18, 2024
This column lists fp16 models:
AquilaChat2-7B
Baichuan-7b
Baichuan2-7B-Chat
bitnet_b1_58-large
bloom-560m
bloomz-alpaca-560m
c4ai-command-r-35B-v01
chatglm3-6B
chinese-alpaca-2-1.3b
CodeShell-7B
deepseek-ai_deepseek-coder-1.3B-base
deepseek-ai_DeepSeek-V2-Lite
deepseek-coder-6.7B-instruct
DeepSeek-V2-Lite-64x1.5B
falcon-7b-instruct
flan-t5-large
gemma-2-9b-it
glm-4-9B
gpt2
Gpt2-163M
granite-3B-code-instruct
GritLM-7B
internlm2_5-7b-chat
koala-7B-HF
Llama-2-7b-chat-hf
Llama-3-Smaug-8B
Llama2-Chinese-7b-Chat
Llama3-8B
Llama3-8b-chinese
mamba-130m-hf
Mistral-7B-Instruct-v0.2
Mixtral-8x7B-Instruct-v0.1
MPT-7B
OLMo-1B-hf
OpenELM-3B-Instruct
Orion-14b-base
phi: 1.6 GB
phi3
Phi-3-mini-4k-instruct
plamo-13b
pythia-70M
Qwen-7B (non-chat)
Qwen2-1.5B-Instruct
Refact-1_6B-fim
SmolLM-135M
stablelm-zephyr
stablelm-2-zephyr-1_6b
starcoderbase-1b
starcoder2-3b
vigogne-7b-chat
xverse-7b-chat
Yi-6b

This column lists q8_0 models:
AquilaChat2-7B
Baichuan-7b
Baichuan2-7B-Chat
bitnet_b1_58-large
bloom-560m
bloomz-alpaca-560m
c4ai-command-r-35B-v01
chatglm3-6B
chinese-alpaca-2-1.3b
CodeShell-7B
deepseek-ai_DeepSeek-V2-Lite
deepseek-ai_deepseek-coder-1.3B-base
deepseek-coder-6.7B-instruct
DeepSeek-V2-Lite-64x1.5B
falcon-7b-instruct
flan-t5-large
gemma-2-9b-it
glm-4-9B
gpt2
gpt2-163M-F16
granite-3B-code-instruct
GritLM-7B
internlm2_5-7b-chat
koala-7B-HF
Llama-2-7b-chat-hf
Llama-3-Smaug-8B
Llama2-Chinese-7b-Chat
Llama3-8B
Llama3-8b-chinese
mamba-130m-hf
Mistral-7B-Instruct-v0.2
Mixtral-8x7B-Instruct-v0.1
MPT-7B
OLMo-1B-hf
OpenELM-3B-Instruct
Orion-14b-base
phi1
phi2
Phi-3-mini-4k-instruct
plamo-13b
pythia-70M
Qwen-7B
Qwen2-1.5B-Instruct
Refact-1_6B-fim
SmolLM-135M
stablelm-zephyr
stablelm-2-zephyr-1_6b
starcoderbase-1b
starcoder2-3b
sea-lion
vigogne-7b-chat
xverse-7b-chat
Yi-6b-Chat
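
@wangshuai09 Please paste this table into the README. Some of these models do run, but their compute graphs are split into many subgraphs, which probably means some operators are not implemented yet. Models in this category run very slowly. Please note this in the README.
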
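To illustrate why unimplemented operators lead to many subgraphs, here is a minimal sketch. It is not the scheduler used in this repository: the function name `split_by_backend`, the backend labels, and the operator lists are hypothetical and chosen only for illustration. It assumes that any operator the accelerator backend cannot run falls back to the CPU, so each unsupported operator breaks the graph into extra segments, and each segment boundary typically costs a device-to-host transfer.

```python
from typing import List, Set, Tuple


def split_by_backend(ops: List[str], supported: Set[str]) -> List[Tuple[str, List[str]]]:
    """Group consecutive ops into segments that run on the same backend.

    Hypothetical model: an op runs on the accelerator if it is in `supported`,
    otherwise it falls back to the CPU.
    """
    segments: List[Tuple[str, List[str]]] = []
    for op in ops:
        backend = "accelerator" if op in supported else "cpu"
        if segments and segments[-1][0] == backend:
            # Same backend as the previous op: extend the current segment.
            segments[-1][1].append(op)
        else:
            # Backend changed: start a new subgraph.
            segments.append((backend, [op]))
    return segments


# Illustrative operator sequence; the names are labels only.
ops = ["MUL_MAT", "ADD", "ROPE", "SOFT_MAX", "MUL_MAT", "SSM_SCAN", "ADD"]

# Every op supported: the whole graph stays in one segment.
full = split_by_backend(ops, {"MUL_MAT", "ADD", "ROPE", "SOFT_MAX", "SSM_SCAN"})

# Two ops missing (an assumption for this example): the graph fragments.
partial = split_by_backend(ops, {"MUL_MAT", "ADD", "SOFT_MAX"})

print(len(full), len(partial))  # prints "1 5"
```

In this toy model, two missing operators turn one subgraph into five, which is consistent with the slowdown described above for models that run but are heavily split.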
This column lists q4_0 models:
AquilaChat2-7B
Baichuan-7B
Baichuan2-7B-Chat
bitnet_b1_58-large
bloom-560m
bloomz-alpaca-560m
c4ai-command-r-35B-v01
chatglm3-6B
chinese-alpaca-2-1.3b
CodeShell-7B
deepseek-ai_deepseek-coder-1.3B-base
Deepseek-ai_DeepSeek-V2-Lite
deepseek-coder-6.7B-instruct
DeepSeek-V2-Lite-64x1.5B
falcon-7b-instruct
flan-t5-large
gemma-2-9b-it
gpt2
Gpt2-163M
granite-3B-code-instruct
GritLM-7B
internlm2_5-7b-chat
koala-7B-HF
Llama-2-7b-chat-hf
Llama-3-Smaug-8B
Llama2-Chinese-7b-Chat
Llama-3-8B
Llama3-Chinese_v2
mamba-130m-hf
Mistral-7B-Instruct-v0.2
Mixtral-8x7B-Instruct-v0.1
mpt-7B
OLMo-1B-hf
OpenELM-3B-Instruct
Orion-14b-base
phi1
phi2
Phi-3-mini-4k-instruct
plamo-13b
pythia-70M
Qwen-7B
Qwen_Qwen2-1.5B-Instruct
Refact-1_6B-fim
SmolLM-135M
stablelm-2-zephyr-1.6b
stablelm-zephyr-3B
starcoderbase-1b
starcoder2-3b
vigogne-7b-chat
xverse-7b-chat
Yi-6B-Chat