Issues: turboderp/exllamav2

Issues list

[BUG] Quantization of Qwen return garbage (label: bug)
#621 opened Sep 10, 2024 by fahadh4ilyas
3 tasks done
how can i solve this problem
#611 opened Sep 2, 2024 by Sultan0ML
Async Stream Generator?
#604 opened Aug 28, 2024 by KingBipo
Tensor parallelism issues
#598 opened Aug 24, 2024 by dirkson
Error in quant
#587 opened Aug 8, 2024 by Orion-zhen
Llama 3 speed
#585 opened Aug 4, 2024 by freQuensy23-coder
Add more docs and type annotations
#579 opened Jul 30, 2024 by Dan-wanna-M
Will it support CPU offloading?
#578 opened Jul 30, 2024 by fzyzcjy
Triton Support
#574 opened Jul 26, 2024 by rjmehta1993
orig_func Quantization error
#573 opened Jul 25, 2024 by Masterjp123
Curious about Exllama+TP
#571 opened Jul 25, 2024 by grimulkan
Manual model merges
#555 opened Jul 18, 2024 by dnhkng
Conversion error
#552 opened Jul 16, 2024 by TMust77