Skip to content

AssertionError: Expected 0 FFN layers on rank 0/8, got 1` in test_puzzletron.py -k "Qwen3-8B #1657

@danielkorzekwa

Description

@danielkorzekwa

python -m pytest tests/gpu/torch/puzzletron/test_puzzletron.py -k "Qwen3-8B" fails with AssertionError: Expected 0 FFN layers on rank 0/8, got 1

Note!

  • test_puzzletron works with 1 GPU (failing with 8), I did not test with other values.
  • this issue is different from the one described in Puzzletron README initial setup fails with number of issues #1637 (test_puzzletron.py also fails). There I used modelopt main branch, while here 0.44.0, and the exception was different from this one.

How to reproduce:

using modelopt branch: release/0.44.0

submit_job (srun wrapper) --partition interactive --time 4 --image  $EXPERIMENT_DIR/docker/nemo_26_02.sqsh --mounts $EXPERIMENT_DIR:/workspace --interactive --gpu 8

python -m pip uninstall nvidia-lm-eval -y 2>/dev/null
python -m pip install -e ".[hf,puzzletron,dev-test]"
python -m pip install -r examples/puzzletron/requirements.txt

bash-5.2# python3 -m pip list |grep modelopt
nvidia-modelopt                             0.44.1.dev0+gc897fbeaa.d20260609                /workspace/Model-Optimizer

python -m pytest tests/gpu/torch/puzzletron/test_puzzletron.py -k "Qwen3-8B"

exception:
[puzzletron_qwen_test_error.log](https://github.com/user-attachments/files/28744951/puzzletron_qwen_test_error.log)

Metadata

Metadata

Type

No type
No fields configured for issues without a type.

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions