Baseline FedNova does not work with batchnorms #4345

wittenator · 2024-10-21T14:48:01Z

Describe the bug

I am currently running the baselines of Flower with different models. I tried Fednova and it seems to work well with the standard config. Once I activate batchnorms in the VGG model, an error gets thrown:

File "/home/korjakow/.cache/pypoetry/virtualenvs/fednova-0w39Djqe-py3.10/lib/python3.10/site-packages/flwr/simulation/ray_transport/ray_client_proxy.py", line 196, in fit
    return maybe_call_fit(
  File "/home/korjakow/.cache/pypoetry/virtualenvs/fednova-0w39Djqe-py3.10/lib/python3.10/site-packages/flwr/client/client.py", line 217, in maybe_call_fit
    return client.fit(fit_ins)
  File "/home/korjakow/.cache/pypoetry/virtualenvs/fednova-0w39Djqe-py3.10/lib/python3.10/site-packages/flwr/client/app.py", line 333, in _fit
    results = self.numpy_client.fit(parameters, ins.config)  # type: ignore
  File "/home/korjakow/projects/flower-1/baselines/fednova/fednova/client.py", line 71, in fit
    self.set_parameters(parameters)
  File "/home/korjakow/projects/flower-1/baselines/fednova/fednova/client.py", line 65, in set_parameters
    self.optimizer.set_model_params(parameters)
  File "/home/korjakow/projects/flower-1/baselines/fednova/fednova/models.py", line 360, in set_model_params
    p.data.copy_(param_tensor)
RuntimeError: The size of tensor a (3) must match the size of tensor b (64) at non-singleton dimension 3

I assume that the baseline wasn't executed with the batchnorm options and the current code does not handle the running stats of the batchnorm layer well.

Steps/Code to Reproduce

Apply the fix from State dict creation bug for e.g. resnet18 #4344 since otherwise the code errors beforehands.
Activate batchnorms in

flower/baselines/fednova/fednova/models.py

Line 48 in cdc8c43

def make_layers(network_cfg, batch_norm=False):
Run the baseline: python -m fednova.main

Expected Results

The training should progress.

Actual Results

An error gets thrown as shown above.

The text was updated successfully, but these errors were encountered:

wittenator added the bug Something isn't working label Oct 21, 2024

WilliamLindskog added stale If issue/PR hasn't been updated within 3 weeks. part: baselines Add or update baseline labels Dec 11, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Baseline FedNova does not work with batchnorms #4345

Baseline FedNova does not work with batchnorms #4345

wittenator commented Oct 21, 2024

Baseline FedNova does not work with batchnorms #4345

Baseline FedNova does not work with batchnorms #4345

Comments

wittenator commented Oct 21, 2024

Describe the bug

Steps/Code to Reproduce

Expected Results

Actual Results