Skip to content

Issues: ggerganov/llama.cpp

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Assignee
Filter by who’s assigned
Sort

Issues list

Llama-Quantize : Layers quantized in the wrong order, thus damaging the variable bits tensor quants scheme consistency. bug-unconfirmed medium severity Used to report medium severity bugs in llama.cpp (e.g. Malfunctioning Features but still useable)
#9005 opened Aug 12, 2024 by Nexesenex
cannot import name 'BaseVocab' from 'gguf' bug-unconfirmed low severity Used to report low severity bugs in llama.cpp (e.g. cosmetic issues, non critical UI glitches)
#8996 opened Aug 12, 2024 by garyyang85
Bug: Phi-3 mini 128k performance degradation with kv size > 8k (server) bug-unconfirmed high severity Used to report high severity bugs in llama.cpp (Malfunctioning hinder important workflow)
#8995 opened Aug 12, 2024 by steampunque
Feature Request: Add support for EXAONE-3.0-7.8B-Instruct model enhancement New feature or request
#8991 opened Aug 12, 2024 by chris-jaehoon
4 tasks done
Bug: GGML_ASSERT(llama_add_eos_token(model) != 1) failed llama-server critical error with flan-t5 models bug-unconfirmed medium severity Used to report medium severity bugs in llama.cpp (e.g. Malfunctioning Features but still useable)
#8990 opened Aug 12, 2024 by fabiomatricardi
Bug: Long sample times with --top-k 0 bug-unconfirmed medium severity Used to report medium severity bugs in llama.cpp (e.g. Malfunctioning Features but still useable)
#8988 opened Aug 11, 2024 by Azirine
Feature Request: MiniCPM 2.6 model support? enhancement New feature or request
#8977 opened Aug 10, 2024 by ttamoud
4 tasks done
Bug: llama-cli out "error: input is empty" and end bug-unconfirmed low severity Used to report low severity bugs in llama.cpp (e.g. cosmetic issues, non critical UI glitches)
#8976 opened Aug 10, 2024 by yanite
Bug: llama-server with --system-prompt-file stops abruptly without any error bug-unconfirmed high severity Used to report high severity bugs in llama.cpp (Malfunctioning hinder important workflow)
#8975 opened Aug 10, 2024 by pritam-dey3
Bug: llama-export-lora fails merging a T5 model with its LoRA adapter bug-unconfirmed high severity Used to report high severity bugs in llama.cpp (Malfunctioning hinder important workflow)
#8974 opened Aug 10, 2024 by cyanic-selkie
Bug: uncached prompt is not used for penalty bug-unconfirmed medium severity Used to report medium severity bugs in llama.cpp (e.g. Malfunctioning Features but still useable)
#8971 opened Aug 10, 2024 by z80maniac
Bug: Adreno740 GPU device can't load model in Android system bug-unconfirmed critical severity Used to report critical severity bugs in llama.cpp (e.g. Crashing, Corrupted, Dataloss)
#8965 opened Aug 10, 2024 by FranzKafkaYu
Feature Request: Support AWS inferentia inf2 instances enhancement New feature or request
#8954 opened Aug 9, 2024 by virajkanwade
4 tasks done
Bug: BigLlama-3.1-681B-Instruct requires llama_model_max_nodes to return a higher value bug-unconfirmed medium severity Used to report medium severity bugs in llama.cpp (e.g. Malfunctioning Features but still useable)
#8950 opened Aug 9, 2024 by nicoboss
Bug: Speed regression from early this year bug-unconfirmed medium severity Used to report medium severity bugs in llama.cpp (e.g. Malfunctioning Features but still useable)
#8945 opened Aug 9, 2024 by IndustrialOne
Feature Request: add support to LLaVA OneVision enhancement New feature or request
#8944 opened Aug 9, 2024 by alexrah
4 tasks done
Feature Request: echo=true in llama-server enhancement New feature or request
#8942 opened Aug 9, 2024 by ciaran-regan-ie
4 tasks done
Bug: BF16 is very slow bug-unconfirmed low severity Used to report low severity bugs in llama.cpp (e.g. cosmetic issues, non critical UI glitches)
#8941 opened Aug 9, 2024 by calvintwr
Feature Request: Ovis1.5-Gemma2-9B model support? enhancement New feature or request
#8940 opened Aug 9, 2024 by liuzl
4 tasks done
Feature Request: Support vulkan when building on Android enhancement New feature or request
#8933 opened Aug 8, 2024 by XinyuGroceryStore
4 tasks done
Bug: Kompute exits before loading model when offloading to GPU bug-unconfirmed medium severity Used to report medium severity bugs in llama.cpp (e.g. Malfunctioning Features but still useable)
#8932 opened Aug 8, 2024 by mi4code
Bug: exception while rasing a another exception in convert_llama_ggml_to_gguf script bug-unconfirmed low severity Used to report low severity bugs in llama.cpp (e.g. cosmetic issues, non critical UI glitches)
#8929 opened Aug 8, 2024 by farbodbj
Bug: Latest version of convert_hf_to_gguf not compatible with gguf 0.9.1 from pip bug-unconfirmed high severity Used to report high severity bugs in llama.cpp (Malfunctioning hinder important workflow)
#8925 opened Aug 8, 2024 by Ru13en
ProTip! What’s not been updated in a month: updated:<2024-07-12.