Step-by-Step

This document provides step-by-step instructions for reproducing the PyTorch BlendCNN distillation results (with the MRPC dataset) with Intel® Neural Compressor.

Prerequisite

1. Environment

cd examples/pytorch/nlp/blendcnn/distillation/eager
pip install "torch>=1.6.0" tqdm

2. Prepare model

Download the BERT-Base, Uncased pretrained model (uncased_L-12_H-768_A-12.zip).

mkdir models/ && mv uncased_L-12_H-768_A-12.zip models/
cd models/ && unzip uncased_L-12_H-768_A-12.zip
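
As a quick sanity check, you can verify that the unzipped directory contains the expected BERT files. This is a minimal sketch; the directory name and file names below are assumed from the standard Google BERT-Base, Uncased release:

import os

bert_dir = "models/uncased_L-12_H-768_A-12"  # assumed unzip location
# Files typically shipped with the Google BERT-Base, Uncased release
for name in ["bert_config.json", "vocab.txt"]:
    path = os.path.join(bert_dir, name)
    print(f"{path}: {'found' if os.path.exists(path) else 'MISSING'}")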

3. Prepare Datasets

Download the GLUE MRPC benchmark dataset. After downloading, place the files in ./MRPC/ so that the directory looks like this:

ls MRPC/
dev_ids.tsv  dev.tsv  test.tsv  train.tsv
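
A short sanity check for the dataset follows; it is a minimal sketch that only assumes the four TSV files listed above are present under ./MRPC/:

import os

for name in ["train.tsv", "dev.tsv", "test.tsv", "dev_ids.tsv"]:
    path = os.path.join("MRPC", name)
    with open(path, encoding="utf-8") as f:
        header = f.readline().rstrip("\n")
        num_rows = sum(1 for _ in f)
    # Print the header line and the number of remaining rows in each file
    print(f"{name}: {num_rows} rows, header: {header}")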

Distillation

1. Fine-tune the pretrained BERT-Base model on the MRPC dataset

mkdir -p models/bert/mrpc
# fine-tune the pretrained BERT-Base model
python finetune.py config/finetune/mrpc/train.json
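
If you want to review the fine-tuning hyperparameters before launching the run, the JSON config passed to finetune.py can be inspected directly. This is a minimal sketch that makes no assumption about the specific keys inside the file:

import json

# Load and print the fine-tuning configuration used above
with open("config/finetune/mrpc/train.json") as f:
    cfg = json.load(f)
for key, value in cfg.items():
    print(f"{key}: {value}")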

2. Distill the BlendCNN with BERT-Base

The fine-tuned BERT-Base model weights model_final.pt are now saved at ./models/bert/mrpc/.

mkdir -p models/blendcnn/
# distilling the BlendCNN
python distill.py --loss_weights 0.1 0.9
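
The two values passed to --loss_weights weight the student's hard-label loss and the distillation (soft-label) loss. Below is a minimal sketch of how such a weighted knowledge-distillation objective is commonly computed; the function name, temperature value, and the assumption that the first weight applies to the hard-label loss are illustrative, not the exact implementation in distill.py:

import torch
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, labels,
                      loss_weights=(0.1, 0.9), temperature=1.0):
    # Hard-label loss: cross-entropy against the ground-truth labels
    ce_loss = F.cross_entropy(student_logits, labels)
    # Soft-label loss: KL divergence between softened teacher and student distributions
    kd_loss = F.kl_div(
        F.log_softmax(student_logits / temperature, dim=-1),
        F.softmax(teacher_logits / temperature, dim=-1),
        reduction="batchmean",
    ) * (temperature ** 2)
    # --loss_weights 0.1 0.9 -> 0.1 * hard-label loss + 0.9 * distillation loss
    return loss_weights[0] * ce_loss + loss_weights[1] * kd_loss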

After following the above steps, you will find the distilled BlendCNN model weights best_model_weights.pt in ./models/blendcnn/.