This repository contains the code for StolenEncoder, a model stealing attack that extracts the functionality of a deployed self-supervised learning encoder with only black-box access to it.
Pre-trained encoders are general-purpose feature extractors that can be used for many downstream tasks. Recent progress in self-supervised learning makes it possible to pre-train highly effective encoders on large volumes of unlabeled data, giving rise to encoder as a service (EaaS). A pre-trained encoder may be deemed confidential because training it often requires a large amount of data and computation resources, and because its public release may facilitate misuse of AI, e.g., deepfake generation. In this work, we propose the first attack, called StolenEncoder, to steal pre-trained image encoders.
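To give a concrete sense of the black-box setting, below is a minimal sketch of the core idea: the attacker queries the target encoder for feature vectors on surrogate images and trains a local "stolen" encoder to reproduce those features. Everything in the sketch is illustrative and not the repository's actual implementation: a pretrained ResNet-50 stands in for the service encoder, a ResNet-18 is used as the stolen encoder, CIFAR10 serves as the surrogate data, and a cosine-similarity loss is used as one possible feature-matching objective.

```python
# Minimal sketch of encoder stealing (illustrative only, not the paper's exact code).
import torch
import torch.nn as nn
import torch.nn.functional as F
import torchvision
import torchvision.transforms as T

device = "cuda" if torch.cuda.is_available() else "cpu"

# Stand-in for the black-box target encoder exposed by the service.
target_encoder = torchvision.models.resnet50(pretrained=True)
target_encoder.fc = nn.Identity()          # expose 2048-d feature vectors
target_encoder.eval().to(device)

def query_target_encoder(images):
    # In a real attack this would be an API call; the attacker only sees features.
    with torch.no_grad():
        return target_encoder(images.to(device))

# Surrogate (attack) dataset: any unlabeled images; labels are ignored.
transform = T.Compose([T.Resize(224), T.ToTensor()])
surrogate_data = torchvision.datasets.CIFAR10(root="./data", train=True,
                                              download=True, transform=transform)
loader = torch.utils.data.DataLoader(surrogate_data, batch_size=64, shuffle=True)

# Local encoder trained to mimic the target encoder's features.
stolen_encoder = torchvision.models.resnet18(num_classes=2048).to(device)
optimizer = torch.optim.SGD(stolen_encoder.parameters(), lr=0.1, momentum=0.9)

for epoch in range(10):
    for images, _ in loader:
        images = images.to(device)
        target_features = query_target_encoder(images)   # black-box queries
        stolen_features = stolen_encoder(images)
        # Train the stolen encoder so its features align with the target's,
        # here by maximizing cosine similarity.
        loss = -F.cosine_similarity(stolen_features, target_features, dim=-1).mean()
        optimizer.zero_grad()
        loss.backward()
        optimizer.step()
```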
The figure below illustrates the difference between our attack and traditional model stealing attacks on supervised learning models:
If you use this code, please cite the following paper:
@inproceedings{liu2022stolenencoder,
  title={StolenEncoder: Stealing Pre-trained Encoders in Self-supervised Learning},
  author={Liu, Yupei and Jia, Jinyuan and Liu, Hongbin and Gong, Neil Zhenqiang},
  booktitle={ACM Conference on Computer and Communications Security (CCS)},
  year={2022}
}
Our code is tested under the following environment: Ubuntu 18.04.5 LTS, Python 3.8.5, torch 1.7.0, torchvision 0.8.1, numpy 1.18.5, pandas 1.1.5, pillow 7.2.0, and tqdm 4.47.0.
Please refer to this link (Google Drive) for the models extracted from encoders pre-trained on CIFAR10, STL10, and Food101.
If you have any questions, please feel free to open an issue and ask there. If you need the data used in our experiments (including the intermediate feature vectors) or the extracted CLIP or ImageNet models, feel free to send me a request as well.