MixMatch

PyTorch implementation of the MixMatch algorithm for image classication.

Semi-Supervised Learning leverages unlabeled images to learn representations and allows us to train with a very reduced labeled dataset. Results are surprisingly close to fully supervised learning.

Papers

MixMatch: A Holistic Approach to Semi-Supervised Learning

Environment creation

$ conda env create --file environment.yml

Change the desired parameters in config.yml and run:

$ python3 main.py

Datasets

CIFAR-10: The dataset gets downloaded automatically in data/cifar-10-batches-py, if it has not been downloaded before.

Results

Accuracy (%)	250 Labels	1000 labels	4000 labels	Fully supervised
This code	86.52	90.28	93.33	94.39
MixMatch paper	88.92 ± 0.87	92.25 ± 0.32	93.76 ± 0.06	95.87

Beyond MixMatch

In this project, we attempt to improve MixMatch using pseudo labels. The basic idea is to use the most confident predictions by the model trained with MixMatch as one-hot labels. This allows unlabeled images to be inlcuded in the labeled dataset. Training then resumes with the extended labeled dataset.

Results

We don't see a significant improvement with the use of pseudo labels. An interesing results was to find out that the model makes wrong guesses even in very confident predictions (>99% confidence), possibly as a side effect of MixMatch's entropy minimization. Results are for only 200,000 update steps.

Accuracy (%)	4000 labels
Plain MixMatch	92.75
Pseudo-labels (threshold = 0.95)	92.25
Pseudo-labels (threshold = 0.99)	92.74
Pseudo-labels (top 10% of each class)	92.63

References

@article{berthelot2019mixmatch,
  title={MixMatch: A Holistic Approach to Semi-Supervised Learning},
  author={Berthelot, David and Carlini, Nicholas and Goodfellow, Ian and Papernot, Nicolas and Oliver, Avital and Raffel, Colin},
  journal={arXiv preprint arXiv:1905.02249},
  year={2019}
}

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

MixMatch

Papers

Environment creation

Datasets

Results

Beyond MixMatch

Results

References

Files

README.md

Latest commit

History

README.md

File metadata and controls

MixMatch

Papers

Environment creation

Datasets

Results

Beyond MixMatch

Results

References