In verl/workers/reward_model/megatron/reward_model.py:
# split into micro-batches
if self.config is not None and 'ppo_micro_batch_size_per_gpu' in self.config:
    infer_batch_size = self.config.ppo_micro_batch_size_per_gpu
else:
    infer_batch_size = data.batch.batch_size[0]
This code checks for ppo_micro_batch_size_per_gpu instead of micro_batch_size_per_gpu, so setting reward_model.micro_batch_size_per_gpu has no effect: since the reward model config does not define ppo_micro_batch_size_per_gpu, the check always fails and infer_batch_size always falls back to data.batch.batch_size[0] (the full batch size).
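A minimal sketch of the intended fallback logic, extracted into a standalone helper for illustration (the helper name and the dict-style config are assumptions, not verl's actual API): look up the key the reward-model config actually defines, micro_batch_size_per_gpu, and only fall back to the full batch size when it is absent.

```python
def resolve_infer_batch_size(config, full_batch_size):
    """Illustrative helper: pick the inference micro-batch size.

    Mirrors the fixed lookup: prefer the reward model's own
    'micro_batch_size_per_gpu' setting (not the PPO-prefixed key),
    and fall back to the full batch size when the config is missing
    or does not define it.
    """
    if config is not None and 'micro_batch_size_per_gpu' in config:
        return config['micro_batch_size_per_gpu']
    return full_batch_size


# The configured micro-batch size is honored when present...
print(resolve_infer_batch_size({'micro_batch_size_per_gpu': 4}, 32))  # 4
# ...and the full batch size is used otherwise.
print(resolve_infer_batch_size({}, 32))  # 32
print(resolve_infer_batch_size(None, 32))  # 32
```

With the original key (ppo_micro_batch_size_per_gpu), the first branch never fires for a reward-model config, which is exactly the behavior reported above.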