Skip to content

Latest commit

 

History

History
37 lines (24 loc) · 4.23 KB

README.md

File metadata and controls

37 lines (24 loc) · 4.23 KB

Zero-shot Knowledge Transfer via Adversarial Belief Matching (ZSKT)

Zero-shot Knowledge Transfer via Adversarial Belief Matching

Abstract

Performing knowledge transfer from a large teacher network to a smaller student is a popular task in modern deep learning applications. However, due to growing dataset sizes and stricter privacy regulations, it is increasingly common not to have access to the data that was used to train the teacher. We propose a novel method which trains a student to match the predictions of its teacher without using any data or metadata. We achieve this by training an adversarial generator to search for images on which the student poorly matches the teacher, and then using them to train the student. Our resulting student closely approximates its teacher for simple datasets like SVHN, and on CIFAR10 we improve on the state-of-the-art for few-shot distillation (with 100 images per class), despite using no data. Finally, we also propose a metric to quantify the degree of belief matching between teacher and student in the vicinity of decision boundaries, and observe a significantly higher match between our zero-shot student and the teacher, than between a student distilled with real data and the teacher. Code available at: https://github.com/polo5/ZeroShotKnowledgeTransfer

The teacher and student decision boundaries

distribution

Pseudo images sampled from the generator

synthesis

Results and models

Classification

Location Dataset Teacher Student Acc Acc(T) Acc(S) Config Download
backbone & logits Cifar10 resnet34 resnet18 93.05 95.34 94.82 config teacher |model | log

Citation

@article{micaelli2019zero,
  title={Zero-shot knowledge transfer via adversarial belief matching},
  author={Micaelli, Paul and Storkey, Amos J},
  journal={Advances in Neural Information Processing Systems},
  volume={32},
  year={2019}
}