I was using synflow for my resnet like model for a classification problem. I found that all gradients of the masked_parameters are zeros. Any ideas why it happens?
self.scores[id(p)] = torch.clone(p.grad * p).detach().abs_()
p is not zero but p.grad is always zero. Thanks!
I was using synflow for my resnet like model for a classification problem. I found that all gradients of the masked_parameters are zeros. Any ideas why it happens?
self.scores[id(p)] = torch.clone(p.grad * p).detach().abs_()p is not zero but p.grad is always zero. Thanks!