AssertionError: Expected 0 FFN layers on rank 0/8, got 1` in test_puzzletron.py -k "Qwen3-8B

`python -m pytest tests/gpu/torch/puzzletron/test_puzzletron.py -k "Qwen3-8B"` fails with `AssertionError: Expected 0 FFN layers on rank 0/8, got 1`

Note! 
- test_puzzletron works with 1 GPU (failing with 8), I did not test with other values.
- this issue is different from the one described in https://github.com/NVIDIA/Model-Optimizer/issues/1637 (test_puzzletron.py also fails). There I used modelopt main branch, while here 0.44.0, and the exception was different from this one.

How to reproduce:

```
using modelopt branch: release/0.44.0

submit_job (srun wrapper) --partition interactive --time 4 --image  $EXPERIMENT_DIR/docker/nemo_26_02.sqsh --mounts $EXPERIMENT_DIR:/workspace --interactive --gpu 8

python -m pip uninstall nvidia-lm-eval -y 2>/dev/null
python -m pip install -e ".[hf,puzzletron,dev-test]"
python -m pip install -r examples/puzzletron/requirements.txt

bash-5.2# python3 -m pip list |grep modelopt
nvidia-modelopt                             0.44.1.dev0+gc897fbeaa.d20260609                /workspace/Model-Optimizer

python -m pytest tests/gpu/torch/puzzletron/test_puzzletron.py -k "Qwen3-8B"

exception:
[puzzletron_qwen_test_error.log](https://github.com/user-attachments/files/28744951/puzzletron_qwen_test_error.log)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

AssertionError: Expected 0 FFN layers on rank 0/8, got 1` in test_puzzletron.py -k "Qwen3-8B #1657

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

AssertionError: Expected 0 FFN layers on rank 0/8, got 1` in test_puzzletron.py -k "Qwen3-8B #1657

Description

Metadata

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Issue actions