Pull requests: turboderp-org/exllamav2

Pull requests list

DRY: Fix Greedy Clone typo in sampler.
#801 opened Jul 28, 2025 by Ph0rk0z
model load performance tweak idea
#799 opened Jul 10, 2025 by metaclassing
Llama-3_1-Nemotron 51B support
#726 opened Jan 28, 2025 by ymcki
Adding stream to 1 kernel.
#590 opened Aug 14, 2024 by Narsil
Simple QuaRot proof of concept.
#407 opened Apr 11, 2024 by sgsdxzy
Refactor token healing initialization.
#330 opened Feb 10, 2024 by bjj
add QuiP quant support
#217 opened Dec 7, 2023 by waters222
Adding return_lowest_perplexity
#206 opened Dec 3, 2023 by ziadloo
Add copilot server example
#23 opened Sep 13, 2023 by chenhunghan