I have been trying to use other kernels with this implementation, but none of them load the state dict without a mismatch. I don't know whether Marlin AWQ is special, but either way it would be nice to know how to quantize models. Even with this implementation, we don't have schnell.
Please post a quantization script or give some hints.
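In case it helps narrow things down, here is a minimal sketch of how the mismatch can be inspected: it compares the keys and shapes the model expects against what a checkpoint provides. The helper name `diff_state_dicts` and the tensor names and shapes are made up for illustration and are not taken from this repository. With PyTorch, the two arguments would be `model.state_dict()` and the loaded checkpoint dict.

```python
from types import SimpleNamespace


def diff_state_dicts(expected, loaded):
    """Compare two name -> tensor-like mappings by key and by shape.

    Hypothetical helper: works on anything whose values expose `.shape`.
    """
    expected_keys, loaded_keys = set(expected), set(loaded)
    # Keys the model wants but the checkpoint lacks.
    missing = sorted(expected_keys - loaded_keys)
    # Keys the checkpoint has but the model does not know.
    unexpected = sorted(loaded_keys - expected_keys)
    # Keys present on both sides whose shapes disagree.
    shape_mismatch = {
        name: (tuple(expected[name].shape), tuple(loaded[name].shape))
        for name in sorted(expected_keys & loaded_keys)
        if tuple(expected[name].shape) != tuple(loaded[name].shape)
    }
    return missing, unexpected, shape_mismatch


# Made-up stand-ins for real tensors; only `.shape` is used.
model_sd = {
    "proj.qweight": SimpleNamespace(shape=(64, 128)),
    "proj.scales": SimpleNamespace(shape=(2, 128)),
    "proj.bias": SimpleNamespace(shape=(128,)),
}
ckpt_sd = {
    "proj.qweight": SimpleNamespace(shape=(128, 64)),
    "proj.scales": SimpleNamespace(shape=(2, 128)),
    "proj.qzeros": SimpleNamespace(shape=(2, 16)),
}

missing, unexpected, shape_mismatch = diff_state_dicts(model_sd, ckpt_sd)
print("missing:", missing)
print("unexpected:", unexpected)
print("shape mismatch:", shape_mismatch)
```

A report like this at least shows whether another kernel packs weights under different names or in a different layout, which is the kind of detail a quantization script would need to match.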