
Did you improve the performance using per-channel weight quantization? #10

Open · talenz opened this issue Apr 14, 2021 · 4 comments

talenz commented Apr 14, 2021

Hi,
Great implementation! Since per-channel weight quantization is implemented in your code, I'm wondering whether it brings any improvement over per-tensor weight quantization.

zhutmost (Owner) commented Apr 14, 2021

I have tried it on ResNet/ImageNet, but I found that choosing the initial value of the hyperparameter s is very tricky.
I am not sure how I should modify the original expression (`self.s = t.nn.Parameter(x.detach().abs().mean() * 2 / (self.thd_pos ** 0.5))`). I have tried some variants, but they cannot reach an accuracy as high as the original one (i.e., without per-channel quantization).

(And I don't have enough GPUs to run many experiments. It eats a lot of my spare time. :D)
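
For reference, here is a minimal sketch of what a per-channel variant of that initialization could look like: one scale per output channel, with the LSQ gradient scale g = 1/sqrt(N·Q_p) counting only the weights of that channel. The class name, the `init_from` hook, and the `grad_scale`/`round_pass` straight-through helpers are illustrative assumptions, not necessarily the repository's actual code.

```python
import torch as t

def grad_scale(x, scale):
    # Straight-through: forward value is x, backward gradient is scaled by `scale`.
    return (x - x * scale).detach() + x * scale

def round_pass(x):
    # Straight-through rounding: forward rounds, backward passes the gradient as-is.
    return (x.round() - x).detach() + x

class LsqWeightQuan(t.nn.Module):
    """Illustrative per-channel LSQ weight quantizer (a sketch, not the repo's code)."""

    def __init__(self, bit=4, per_channel=True):
        super().__init__()
        self.thd_neg = -(2 ** (bit - 1))   # e.g. -8 for 4-bit signed weights
        self.thd_pos = 2 ** (bit - 1) - 1  # e.g. +7
        self.per_channel = per_channel
        self.s = t.nn.Parameter(t.ones(1))

    def init_from(self, x):
        if self.per_channel:
            # One scale per output channel: reduce |w| over every dim but dim 0,
            # then apply the same 2*mean(|w|)/sqrt(Q_p) rule channel-wise.
            mean_abs = x.detach().abs().mean(dim=tuple(range(1, x.dim())), keepdim=True)
            self.s = t.nn.Parameter(mean_abs * 2 / (self.thd_pos ** 0.5))
        else:
            self.s = t.nn.Parameter(x.detach().abs().mean() * 2 / (self.thd_pos ** 0.5))

    def forward(self, x):
        # Per-channel, N in g = 1/sqrt(N * Q_p) is the weight count per channel.
        n = x.numel() / x.shape[0] if self.per_channel else x.numel()
        g = 1.0 / ((n * self.thd_pos) ** 0.5)
        s = grad_scale(self.s, g)  # broadcasts over (C_out, 1, 1, 1) for conv weights
        x = t.clamp(x / s, self.thd_neg, self.thd_pos)
        return round_pass(x) * s
```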

talenz (Author) commented Apr 14, 2021

> I have tried it on ResNet/ImageNet, but I found that choosing the initial value of the hyperparameter s is very tricky.
> I am not sure how I should modify the original expression (`self.s = t.nn.Parameter(x.detach().abs().mean() * 2 / (self.thd_pos ** 0.5))`). I have tried some variants, but they cannot reach an accuracy as high as the original one (i.e., without per-channel quantization).
>
> (And I don't have enough GPUs to run many experiments. It eats a lot of my spare time. :D)

I used your implementation on MobileNetV2@ImageNet and quantized only the conv weights to 4-bit (the fc weights and activations stay in float). It didn't work well: even with per-channel quantization, the top-1 accuracy only reaches about 68% (the float baseline is 71.88%). Any advice?

zhutmost (Owner) commented

> > I have tried it on ResNet/ImageNet, but I found that choosing the initial value of the hyperparameter s is very tricky.
> > I am not sure how I should modify the original expression (`self.s = t.nn.Parameter(x.detach().abs().mean() * 2 / (self.thd_pos ** 0.5))`). I have tried some variants, but they cannot reach an accuracy as high as the original one (i.e., without per-channel quantization).
> > (And I don't have enough GPUs to run many experiments. It eats a lot of my spare time. :D)
>
> I used your implementation on MobileNetV2@ImageNet and quantized only the conv weights to 4-bit (the fc weights and activations stay in float). It didn't work well: even with per-channel quantization, the top-1 accuracy only reaches about 68% (the float baseline is 71.88%). Any advice?

You can try modifying 1) the scaling factor of the gradients, and 2) the initialization value of s. You can also read the follow-up paper, LSQ+ (https://arxiv.org/abs/2004.09576), which analyzes the shortcomings of LSQ and offers some advice.
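
For example, LSQ+ suggests a statistics-based weight-scale initialization in place of the mean-based one quoted above. A sketch of that formula, paraphrased from the paper with a hypothetical helper name, might be:

```python
import torch

def lsqplus_weight_scale_init(w: torch.Tensor, bit: int) -> torch.Tensor:
    """Weight-scale init from LSQ+ (arXiv:2004.09576), paraphrased:
    s = max(|mu - 3*sigma|, |mu + 3*sigma|) / 2^(bit - 1),
    so the roughly 3-sigma spread of the weights maps onto the signed grid."""
    mu, sigma = w.detach().mean(), w.detach().std()
    return torch.max((mu - 3 * sigma).abs(), (mu + 3 * sigma).abs()) / (2 ** (bit - 1))

# Hypothetical usage, replacing the expression quoted earlier:
# self.s = t.nn.Parameter(lsqplus_weight_scale_init(x, bit=4))
```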

talenz (Author) commented Apr 21, 2021

> > > I have tried it on ResNet/ImageNet, but I found that choosing the initial value of the hyperparameter s is very tricky.
> > > I am not sure how I should modify the original expression (`self.s = t.nn.Parameter(x.detach().abs().mean() * 2 / (self.thd_pos ** 0.5))`). I have tried some variants, but they cannot reach an accuracy as high as the original one (i.e., without per-channel quantization).
> > > (And I don't have enough GPUs to run many experiments. It eats a lot of my spare time. :D)
> >
> > I used your implementation on MobileNetV2@ImageNet and quantized only the conv weights to 4-bit (the fc weights and activations stay in float). It didn't work well: even with per-channel quantization, the top-1 accuracy only reaches about 68% (the float baseline is 71.88%). Any advice?
>
> You can try modifying 1) the scaling factor of the gradients, and 2) the initialization value of s. You can also read the follow-up paper, LSQ+ (https://arxiv.org/abs/2004.09576), which analyzes the shortcomings of LSQ and offers some advice.

Thanks~
