fix: Rename variable to control PG timeout to not refer to NCCL
The variable is not backend specific. In the future, if/when we support
other backends, this will become more evidently a problem.
Signed-off-by: Ihar Hrachyshka <[email protected]>
README.md: 1 addition, 1 deletion
@@ -375,4 +375,4 @@ run_training(
Below is a list of custom environment variables users can set in the training library.
-1. `INSTRUCTLAB_NCCL_TIMEOUT_MS`, this environment variable controls the NCCL timeout in milliseconds. Consider increasing if seeing FSDP related NCCL errors.
+1. `INSTRUCTLAB_PROCESS_GROUP_TIMEOUT_MS`, this environment variable controls the process group timeout in milliseconds. Consider increasing if seeing FSDP related errors.
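As a rough illustration of how a millisecond-valued variable like this might be consumed, the sketch below reads `INSTRUCTLAB_PROCESS_GROUP_TIMEOUT_MS` and converts it to a `datetime.timedelta`. The helper name `process_group_timeout` and its validation rules are hypothetical and are not taken from the training library; only the variable name comes from the diff above.

```python
import os
from datetime import timedelta
from typing import Mapping, Optional

# Variable name as introduced by this commit.
ENV_VAR = "INSTRUCTLAB_PROCESS_GROUP_TIMEOUT_MS"


def process_group_timeout(
    env: Optional[Mapping[str, str]] = None,
) -> Optional[timedelta]:
    """Hypothetical helper: parse the timeout (in milliseconds) from the environment.

    Returns None when the variable is unset, so the caller can fall back
    to whatever default its backend uses.
    """
    env = os.environ if env is None else env
    raw = env.get(ENV_VAR)
    if raw is None:
        return None
    ms = int(raw)  # raises ValueError on non-integer input
    if ms <= 0:
        raise ValueError(f"{ENV_VAR} must be a positive integer, got {raw!r}")
    return timedelta(milliseconds=ms)


if __name__ == "__main__":
    # Example: a 10-minute timeout expressed in milliseconds.
    print(process_group_timeout({ENV_VAR: "600000"}))
```

A value parsed this way could then be passed as the `timeout` argument of `torch.distributed.init_process_group`, which accepts a `timedelta` regardless of backend. Whether the library wires it up exactly like this is an assumption.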