Does Torch.ao Support FullyShardedDataParallel?

When I wrap the model in FullyShardedDataParallel,

net_model_fsdp = FullyShardedDataParallel(net, **settings)
and then try to quantize it using:

quantize_(net_model_fsdp, int8_dynamic_activation_int8_weight())
I get the following error from torchao:

RuntimeError: CUDA error: an illegal memory access was encountered.

If I do not use FullyShardedDataParallel and directly quantize net (as shown below), there is no problem:
quantize_(net, int8_dynamic_activation_int8_weight())

Could you help me understand why this happens?
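
To help narrow this down, here is a minimal sketch of how the two paths above could be compared with synchronous CUDA launches. CUDA errors are reported asynchronously, so the traceback for an illegal memory access often points at the wrong operation; setting CUDA_LAUNCH_BLOCKING=1 usually makes it point at the call that actually failed. The helper name quantize_model and its fsdp_settings parameter are hypothetical, and the import paths assume a torchao version that still exports int8_dynamic_activation_int8_weight.

```python
import os

# Set this before CUDA is initialized (safest: before importing torch).
# It makes kernel launches synchronous so the traceback names the failing op.
os.environ["CUDA_LAUNCH_BLOCKING"] = "1"


def quantize_model(net, fsdp_settings=None):
    """Quantize `net`; if `fsdp_settings` is given, wrap it in FSDP first.

    Hypothetical helper for reproducing the report, not part of torchao.
    """
    # Local imports so the environment variable above is already in place.
    from torch.distributed.fsdp import FullyShardedDataParallel
    from torchao.quantization import (
        int8_dynamic_activation_int8_weight,
        quantize_,
    )

    model = net
    if fsdp_settings is not None:
        # This is the path that fails in the report. It needs an initialized
        # process group (for example, a script launched with torchrun).
        model = FullyShardedDataParallel(net, **fsdp_settings)

    # quantize_ modifies the model in place.
    quantize_(model, int8_dynamic_activation_int8_weight())
    return model
```

Calling quantize_model(net) exercises the path that works for me, and quantize_model(net, settings) exercises the failing one, so the two tracebacks can be compared directly.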

Does Torch.ao Support FullyShardedDataParallel? #1413
