You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I want to use the GRPO algorithm to train Qwen7B, but I failed using 4 H20 (96GB) GPUs with the trl library. I hope to train it with the verl library and would like to know how many H20 GPUs are needed.
The text was updated successfully, but these errors were encountered:
I want to use the GRPO algorithm to train Qwen7B, but I failed using 4 H20 (96GB) GPUs with the trl library. I hope to train it with the verl library and would like to know how many H20 GPUs are needed.
The text was updated successfully, but these errors were encountered: