This package provides the training code for CBP: constrained backpropagation based on a pseudo-Lagrange multiplier method. The defining characteristic of the CBP algorithm is that it uses a Lagrangian function (the loss function plus a constraint function) as its objective. We consider various types of weight constraints: binary, ternary, one-bit shift, and two-bit shift. In all cases, the proposed algorithm outperforms state-of-the-art methods on ImageNet, e.g., 66.6% and 74.4% top-1 accuracy for ResNet-18 and ResNet-50 with binary weights, respectively. This highlights CBP as a learning algorithm that can address diverse constraints with minimal performance loss by employing appropriate constraint functions.
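To make the objective concrete, here is a minimal sketch of one pseudo-Lagrangian training step in PyTorch. It is not the repository's implementation: the binary constraint function and the single scalar multiplier are illustrative simplifications. The key idea is that the weights descend on the Lagrangian (loss plus multiplier-weighted constraint) while the multiplier ascends at the rate `lr_lambda`.

```python
import torch

def binary_constraint(w):
    # Illustrative penalty: zero exactly when every entry of w is -1 or +1.
    return ((w - 1.0) * (w + 1.0)).pow(2).mean()

def cbp_step(model, loss_fn, x, y, lam, lr=1e-3, lr_lambda=1e-4):
    """One constrained-backprop step: gradient descent on the weights,
    gradient ascent on the (pseudo-)Lagrange multiplier `lam`."""
    constraint = sum(binary_constraint(p) for p in model.parameters())
    lagrangian = loss_fn(model(x), y) + lam * constraint
    model.zero_grad()
    lagrangian.backward()
    with torch.no_grad():
        for p in model.parameters():
            p -= lr * p.grad                    # descend on the Lagrangian
    return lam + lr_lambda * constraint.item()  # ascend on the multiplier
```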
To install requirements:

```
pip install -r requirements.txt
```
- NVIDIA-DALI: we use NVIDIA-DALI to implement fast data loaders (a sketch of such a pipeline follows the install command). You can install it as follows:

```
pip install --extra-index-url https://developer.download.nvidia.com/compute/redist --upgrade nvidia-dali-cuda110
```
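For reference, a minimal sketch of a DALI training pipeline for the ImageNet layout described below (assuming DALI 1.x; the crop size, normalization constants, and data path are illustrative defaults, not values taken from this repository):

```python
from nvidia.dali import pipeline_def, fn, types
from nvidia.dali.plugin.pytorch import DALIGenericIterator

@pipeline_def
def train_pipeline(data_dir):
    jpegs, labels = fn.readers.file(file_root=data_dir, random_shuffle=True, name="Reader")
    images = fn.decoders.image(jpegs, device="mixed")  # GPU-accelerated JPEG decoding
    images = fn.random_resized_crop(images, size=224)
    images = fn.crop_mirror_normalize(
        images,
        dtype=types.FLOAT,
        output_layout="CHW",
        mean=[0.485 * 255, 0.456 * 255, 0.406 * 255],
        std=[0.229 * 255, 0.224 * 255, 0.225 * 255],
        mirror=fn.random.coin_flip(),
    )
    return images, labels

pipe = train_pipeline(data_dir="data/imagenet/train",
                      batch_size=256, num_threads=4, device_id=0)
pipe.build()
loader = DALIGenericIterator(pipe, ["data", "label"], reader_name="Reader")
```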
Download the ImageNet dataset and decompress it into the following directory structure:

```
dir/
  data/
    imagenet/
      train/
        n01440764/
          n01440764_10026.JPEG
          ...
      val/
        n01440764/
          ILSVRC2012_val_00000001.JPEG
          ...
```
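A quick sanity check of the layout (the root path below follows the tree above; adjust it to where your copy lives):

```python
import os

root = os.path.join("data", "imagenet")
for split in ("train", "val"):
    classes = os.listdir(os.path.join(root, split))
    print(f"{split}: {len(classes)} classes")  # expect 1000 for full ImageNet
```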
You can train the model using the following command:

```
python main.py [--model] [--quant] [--batch_size] [--gpu_devices] ...
```
Optional arguments:

- `--gpu_devices`: GPU devices to use (e.g., `--gpu_devices 0 1` to use two GPUs)
- `--model`: model to be quantized (default: `resnet18`; options: `alexnet`, `resnet18`, `resnet50`)
- `--quant`: quantization constraint (default: `bin`; options: `bin`, `ter`, `1bit`, `2bit`)
- `--batch_size`: batch size (default: 256)
- `--lr`: initial learning rate for the network parameters (default: 0.001)
- `--lr_lambda`: update rate of the Lagrange multiplier (default: 0.0001)
- `--weight_decay`: L2 regularization coefficient (default: 0.0001)
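For example, to train a binary-weight ResNet-18 on two GPUs with the default hyperparameters:

```
python main.py --model resnet18 --quant bin --batch_size 256 --lr 0.001 --lr_lambda 0.0001 --gpu_devices 0 1
```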
Note that to train AlexNet, you should first download `alexnet_pretrained.pth` from the link and put it into the `model` folder. Also note that we used an initial learning rate of 1e-4 for the 1-bit shift and 2-bit shift quantization of AlexNet and ResNet-50.
You should obtain the following top-1 accuracies on ImageNet:

| Model | Binary | Ternary | 1-bit shift | 2-bit shift |
|---|---|---|---|---|
| AlexNet | 58.0% | 58.8% | 60.8% | 60.9% |
| ResNet-18 | 66.6% | 68.9% | 69.4% | 69.4% |
| ResNet-50 | 74.4% | 75.0% | 76.0% | 76.0% |
Before evaluating pre-quantized models, you should download the pre-quantized models and put them into the `model` folder.
You can evaluate a pre-quantized model using the following command:

```
python evaluate.py [--model] [--quant]
```
Optional arguments:

- `--model`: quantized model to evaluate (default: `resnet18`; options: `alexnet`, `resnet18`, `resnet50`)
- `--quant`: quantization constraint (default: `bin`; options: `bin`, `ter`, `1bit`, `2bit`)
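For example, to evaluate the ternary-weight ResNet-50:

```
python evaluate.py --model resnet50 --quant ter
```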
Disclaimer: Preliminary work. Under review by the 35th Conference on Neural Information Processing Systems (NeurIPS 2021). Do not distribute.