Reference: "Benchmarking Zero-Shot Robustness of Multimodal Foundation Models: A Pilot Study".
Dependencies: Python 3.7 and the packages listed in requirements.txt
conda create --name roz python=3.7
conda activate roz
pip install -r requirements.txt
We run our experiments on 4 GeForce GTX 1080 GPUs. An Internet connection is required to download the ImageNet and CIFAR-10 datasets.
https://image-net.org/download.php
python download_cifar.py
After downloading ImageNet, you may need to update the dataset path in scripts/common_adversarial_attack/load_utils.py. The dataset location is indicated there by ImageNetPATH.
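Before launching a long evaluation, it can help to confirm that the ImageNet directory you configured actually exists and is not empty. The following is a minimal sketch, not part of this repository: IMAGENET_PATH and dataset_path_problem are hypothetical names, the path is a placeholder, and the script neither reads nor modifies load_utils.py. It makes no assumption about the internal folder layout the code expects.

```python
from pathlib import Path

# Hypothetical placeholder: replace with the directory that holds your ImageNet copy.
IMAGENET_PATH = Path("/path/to/imagenet")


def dataset_path_problem(path):
    """Return a short description of what is wrong with `path`, or None if it looks usable."""
    path = Path(path)
    if not path.exists():
        return f"{path} does not exist"
    if not path.is_dir():
        return f"{path} is not a directory"
    # An existing but empty directory usually means the download or extraction failed.
    if not any(path.iterdir()):
        return f"{path} is empty"
    return None


if __name__ == "__main__":
    problem = dataset_path_problem(IMAGENET_PATH)
    if problem is None:
        print("ImageNet path looks usable")
    else:
        print(f"Problem: {problem}")
```

If the check reports a problem, fix the download or the configured path before running the evaluation scripts.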
bash eval_natural_distribution_shifts.sh
bash eval_synthetic_distribution_shifts.sh
bash eval_common_adversarial_attacks.sh
Both CIFAR-10-T and ImageNet-T datasets are under the "dataset" folder.
bash eval_typographic_attacks.sh
bash create_benchmark.sh
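The four evaluation scripts listed above can also be chained from one small driver. The sketch below is an assumption about convenient usage, not something the repository provides: EVAL_SCRIPTS, build_commands, and run_all are hypothetical names, the scripts are assumed to be independent of one another and to be run from the repository root, and create_benchmark.sh is left out. By default it only prints the commands; pass dry_run=False to execute them.

```python
import subprocess

# Evaluation scripts named in this README, in the order they are listed.
EVAL_SCRIPTS = [
    "eval_natural_distribution_shifts.sh",
    "eval_synthetic_distribution_shifts.sh",
    "eval_common_adversarial_attacks.sh",
    "eval_typographic_attacks.sh",
]


def build_commands(scripts):
    """Turn script names into argument lists suitable for subprocess.run."""
    return [["bash", script] for script in scripts]


def run_all(scripts, dry_run=True):
    """Run each script with bash, stopping at the first failure.

    With dry_run=True nothing is executed; the commands are only printed.
    Returns the list of commands that were run (or would have been run).
    """
    handled = []
    for cmd in build_commands(scripts):
        if dry_run:
            print("would run:", " ".join(cmd))
        else:
            # check=True raises CalledProcessError if the script exits non-zero.
            subprocess.run(cmd, check=True)
        handled.append(cmd)
    return handled


if __name__ == "__main__":
    run_all(EVAL_SCRIPTS, dry_run=True)
```

Stopping at the first failure keeps a misconfigured dataset path from silently producing partial results across the later evaluations.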
Please refer to the paper and appendix for more results.
@article{wang2024roz,
title={Benchmarking Zero-Shot Robustness of Multimodal Foundation Models: A Pilot Study},
author={Wang, Chenguang and Jia, Ruoxi and Liu, Xin and Song, Dawn},
journal={arXiv preprint},
year={2024}
}