Why do the zeroed channels still receive gradients? #36

AlexSunNik · 2023-07-20T22:07:43Z

Theoretically, when you prune channels along the output dimension, the corresponding weights should not receive any gradient during the backward pass. How do you handle this in the code? Could you point me to the relevant section?

I do notice that BN layers are not masked. If you keep the BN bias, there will indeed be gradients through the BN bias, but this seems like a very hacky workaround.
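To make the point above concrete, here is a minimal pure-Python sketch, not taken from the repository. It assumes a toy layer with one scalar weight per output channel, a 0/1 mask multiplied into the weighted term, and an unmasked per-channel bias standing in for the BN bias. The names `forward`, `numerical_grad`, `mask`, and all values are hypothetical. It checks by finite differences that the pruned channel's weight gets a zero gradient while its bias still gets one.

```python
def forward(weights, biases, mask, x, targets):
    """Squared-error loss of a toy layer with one scalar weight per channel."""
    loss = 0.0
    for w, b, m, t in zip(weights, biases, mask, targets):
        # The mask zeroes the weighted term; the bias is left unmasked,
        # mimicking a BN bias that is not covered by the pruning mask.
        y = m * (w * x) + b
        loss += (y - t) ** 2
    return loss


def numerical_grad(f, params, eps=1e-6):
    """Central finite-difference gradient of f w.r.t. each entry of params."""
    grads = []
    for i in range(len(params)):
        up = list(params)
        down = list(params)
        up[i] += eps
        down[i] -= eps
        grads.append((f(up) - f(down)) / (2 * eps))
    return grads


# Hypothetical values: three output channels, channel 1 is pruned.
weights = [0.5, -1.2, 0.8]
biases = [0.0, 0.0, 0.0]   # bias initialised to zero
mask = [1.0, 0.0, 1.0]
x = 2.0
targets = [2.0, 1.0, 0.5]

grad_w = numerical_grad(lambda w: forward(w, biases, mask, x, targets), weights)
grad_b = numerical_grad(lambda b: forward(weights, b, mask, x, targets), biases)

print("weight gradients:", grad_w)
print("bias gradients:  ", grad_b)
```

In this sketch the gradient for the masked weight is exactly zero because the loss does not depend on it at all, whereas the unmasked bias of the same channel still receives a nonzero gradient even though it starts at zero. Whether the actual repository masks weights, activations, or BN parameters in this way is not confirmed by this thread.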

lzd19981105 · 2024-05-29T03:26:56Z

When I use the scripts to prune ResNet20 on CIFAR-10, the pruned weights do not receive gradients in the backward pass, which means they cannot be reconstructed. The same applies to the BN bias: it is set to 0 at the beginning, so it does not affect the output value of a channel that is masked to zero.
This code cannot work the way the paper it came from describes.
