Pull requests: turboderp/exllamav2

Pull requests list

Add Exclude Top Choice (XTC) sampler
#625 opened Sep 14, 2024 by Cyrus-Hei
Improvement to enable quantization of Merge Models
#623 opened Sep 10, 2024 by PedroPareja
added option for tokenized input to dynamic generator
#613 opened Sep 2, 2024 by KT313
Adding stream to 1 kernel.
#590 opened Aug 14, 2024 by Narsil
Simple QuaRot proof of concept.
#407 opened Apr 11, 2024 by sgsdxzy
Refactor token healing initialization.
#330 opened Feb 10, 2024 by bjj
Repeat layers to create FrankenModels
#275 opened Jan 12, 2024 by dnhkng
add QuiP quant support
#217 opened Dec 7, 2023 by waters222
Adding return_lowest_perplexity
#206 opened Dec 3, 2023 by ziadloo
Add copilot server example
#23 opened Sep 13, 2023 by chenhunghan