You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Hey I am just wondering is there way to load any pre-trained models with A100 on CUDA 11? It seems that deepspeed==0.4.4 and triton==0.4.2 do not work with CUDA 11 but pre-trained models require those old versions of deepspeed and triton. Thanks in advance!
The text was updated successfully, but these errors were encountered:
Hey I am just wondering is there way to load any pre-trained models with A100 on CUDA 11? It seems that deepspeed==0.4.4 and triton==0.4.2 do not work with CUDA 11 but pre-trained models require those old versions of deepspeed and triton. Thanks in advance!
The text was updated successfully, but these errors were encountered: