This repository is an official PyTorch implementation of the Paper Learning Filter Basis for Convolutional Neural Network Compression accepted by ICCV 2019.
Convolutional neural networks (CNNs) based solutions have achieved state-of-the-art performances for many computer vision tasks, including classification and super-resolution of images. Usually the success of these methods comes with a cost of millions of parameters due to stacking deep convolutional layers. Moreover, quite a large number of filters are also used for a single convolutional layer, which exaggerates the parameter burden of current methods. Thus, in this paper, we try to reduce the number of parameters of CNNs by learning a basis of the filters in convolutional layers. For the forward pass, the learned basis is used to approximate the original filters and then used as parameters for the convolutional layers. We validate our proposed solution for multiple CNN architectures on image classification and image super-resolution benchmarks and compare favorably to the existing state-of-the-art in terms of reduction of parameters and preservation of accuracy.
Comparison between different filter decomposition methods.
Illustration of the proposed basis learning method. Operations are converted to convolutions. Unlike the normal convolution, our method splits both the input feature map and the 3D filter along the channel dimension. A set of basis is learned for the ensemble of splits. Every split of the input feature map is convolved with the basis. A final 1x1 convolution generates the output.
- Python 3.7.3
- PyTorch >= 1.1.0
- numpy
- matplotlib
- tqdm
- scikit-image
- easydict
For image classfication, go to ./image_classification
For image super-resolution, go to ./super_resolution
If you find our work useful in your research of publication, please cite our work:
@inproceedings{li2019learning,
title = {Learning Filter Basis for Convolutional Neural Network Compression},
author = {Li, Yawei and Gu, Shuhang and Van Gool, Luc and Timofte, Radu},
booktitle = {Proceedings of the IEEE International Conference on Computer Vision},
year = {2019}
}
This repository is built on EDSR (PyTorch). We thank the authors for making their EDSR codes public.