Skip to content

Latest commit

 

History

History
 
 

Folders and files

NameName
Last commit message
Last commit date

parent directory

..
 
 
 
 
 
 
 
 
 
 
 
 

Step-by-Step

This document is used to enable Tensorflow Keras inception_v3 model quantization and benchmark using Intel® Neural Compressor. This example can run on Intel CPUs and GPUs.

Prerequisite

1. Environment

Installation

# Install Intel® Neural Compressor
pip install neural-compressor

Install Requirements

The Tensorflow and intel-extension-for-tensorflow is mandatory to be installed to run this QAT example. The Intel Extension for Tensorflow for Intel CPUs is installed as default.

pip install -r requirements.txt

Note: Validated TensorFlow Version.

2. Prepare Pretrained model

The pretrained model is provided by Keras Applications. prepare the model, Run as follow:

python prepare_model.py --output_model=/path/to/model

--output_model the model should be saved as SavedModel format or H5 format.

3. Prepare Dataset

TensorFlow models repo provides scripts and instructions to download, process and convert the ImageNet dataset to the TF records format. We also prepared related scripts in imagenet_prepare directory. To download the raw images, the user must create an account with image-net.org. If you have downloaded the raw data and preprocessed the validation data by moving the images into the appropriate sub-directory based on the label (synset) of the image. we can use below command ro convert it to tf records format.

cd examples/tensorflow/image_recognition/keras_models/
# convert validation subset
bash prepare_dataset.sh --output_dir=/inception_v3/quantization/ptq/data --raw_dir=/PATH/TO/img_raw/val/ --subset=validation
# convert train subset
bash prepare_dataset.sh --output_dir=/inception_v3/quantization/ptq/data --raw_dir=/PATH/TO/img_raw/train/ --subset=train
cd inception_v3/quantization/ptq

Run Command

Quantization Config

The Quantization Config class has default parameters setting for running on Intel CPUs. If running this example on Intel GPUs, the 'backend' parameter should be set to 'itex' and the 'device' parameter should be set to 'gpu'.

config = PostTrainingQuantConfig(
    device="gpu",
    backend="itex",
    ...
    )

Quantization

bash run_quant.sh --input_model=./inception_v3_keras/ --output_model=./result --dataset_location=/path/to/evaluation/dataset

Benchmark

bash run_benchmark.sh --input_model=./result --dataset_location=/path/to/evaluation/dataset --mode=performance --batch_size=1
bash run_benchmark.sh --input_model=./result --dataset_location=/path/to/evaluation/dataset --mode=accuracy --batch_size=32