Revisit and maybe optimize Collectors #1069
Labels: optimization, tentative
This assumption rests on some constraints:
These are very strong constraints. If either is not true, we can switch to a fully async rollout implementation to get better throughput, i.e., a shorter wall-clock `collector.collect` time. For example, in the RLHF case:

Originally posted by @Trinkle23897 in #1058 (comment)
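
To make the throughput argument concrete, here is a minimal toy model in plain Python. It is not Tianshou code and does not call `collector.collect`; the function names `lockstep_wall_clock` and `async_wall_clock` and the step timings are made up for illustration, and policy inference cost is assumed to be zero. It compares the wall-clock time of a lock-step rollout, where every batch step waits for the slowest environment, with a fully async rollout, where each environment advances on its own.

```python
from typing import List


def lockstep_wall_clock(step_times: List[List[float]]) -> float:
    """Wall-clock time when all envs advance together.

    step_times[e][t] is the (hypothetical) number of seconds env e
    needs for its t-th step. Each batch step takes as long as the
    slowest env at that step.
    """
    n_steps = len(step_times[0])
    return sum(max(env[t] for env in step_times) for t in range(n_steps))


def async_wall_clock(step_times: List[List[float]]) -> float:
    """Wall-clock time when every env runs independently.

    Assumes envs never wait for each other, so the total is the
    running time of the slowest single env.
    """
    return max(sum(env) for env in step_times)


# Illustrative timings: each env is slow at a different step.
step_times = [
    [1.0, 5.0, 1.0],
    [5.0, 1.0, 1.0],
    [1.0, 1.0, 5.0],
]

print(lockstep_wall_clock(step_times))  # 15.0
print(async_wall_clock(step_times))     # 7.0
```

In this model the gap grows with the variance of per-step latency across environments; when all environments take the same time per step, both functions return the same value and the async path buys nothing.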