Closed
Description
When I wrap the model in FullyShardedDataParallel,
net_model_fsdp = FullyShardedDataParallel(net, **settings)
and then try to quantize it using:
quantize_(net_model_fsdp, int8_dynamic_activation_int8_weight())
I get the following error (the quantization functions come from torchao):
RuntimeError: CUDA error: an illegal memory access was encountered.
If I do not use FullyShardedDataParallel and directly quantize net (as shown below), there is no problem:
quantize_(net, int8_dynamic_activation_int8_weight())
Could you help me understand what causes this?