Hi,
Since the model is too large for GPUs with limited memory, is it possible to compress it with a binarization or quantization method, not only to reduce the model size but also to speed up processing?