IBM's Granite 20B Code Instruct goes off the rails when using an FP8 or Q4 cache, or n-gram speculative decoding. #478

LlamaEnjoyer started this conversation in General
Replies: 2 comments 2 replies
1 reply
@LlamaEnjoyer
1 reply
@LlamaEnjoyer
2 participants