Failures to Find Transferable Image Jailbreaks Between Vision-Language Models Rylan Schaeffer, Dan Valentine, Luke Bailey, James Chua, Cristobal Eyzaguirre, Zane Durante, Joe Benton, Brando Miranda, Henry Sleight, John Hughes arXiv preprint Under Review July 2024 Vision-Language Models AI Safety Jailbreaking Adversarial Attacks Transfer Learning arXiv Summary Image-based jailbreaks don't transfer well between vision-language models.