DisturbLabel

I ran into the paper DisturbLabel: Regularizing CNN on the Loss Layer from CVPR 2016, which basically claims that noisy labels give you better performance. Like many, I didn't believe the method or the results.

This is a simple MNIST training script with DisturbLabel. It uses the simple architecture from the paper and the hyperparameters from my original MNIST example. The results surprised me: clean labels give the worst accuracy:

[Figure: MNIST error for different values of p]

Experiments were repeated 15 times for p=0, 10 times for p=0.02 and p=0.05, and 5 times for other values of p. All experiments were run for 100 epochs with learning rate decay, which is enough for them to converge.

I suppose the disturbing acts as random noise that can prevent SGD from getting stuck when the training data is too easy to fit or too scarce. The method didn't work on slightly harder problems such as SVHN:

[Figure: SVHN error for different values of p]

The SVHN experiments used the same model & hyperparameters as my original SVHN example. All experiments were repeated 10 times to obtain the error bars.

And I don't believe it will work for ImageNet either.