
Multi-Paradigm Collaborative Adversarial Attack Against Multimodal Large Language Models

This repository is the official implementation of Multi-Paradigm Collaborative Adversarial Attack Against Multimodal Large Language Models (CVPR 2026). [Paper]

Overview of the proposed MPCAttack: (a) pipeline for generating adversarial examples; (b) pipeline for attacking MLLMs.

Requirements

To install requirements:

conda create -n MPCAttack python=3.10
conda activate MPCAttack
pip install torch==2.6.0 torchvision==0.21.0 torchaudio==2.6.0 --index-url https://download.pytorch.org/whl/cu118
pip install -U transformers
pip install hydra-core pytorch-lightning opencv-python scipy nltk timm==1.0.1 pandas
pip install git+https://github.com/openai/CLIP.git

Alternatively, install from the requirements file:

pip install -r requirements.txt
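
To verify the environment, the following sanity check should run without errors (a minimal sketch assuming the installs above succeeded; the ViT-B/32 checkpoint is only an example, not necessarily the surrogate encoder used by MPCAttack):

import torch
import torchvision
import transformers
import clip  # installed from github.com/openai/CLIP

print("torch:", torch.__version__, "| CUDA available:", torch.cuda.is_available())
print("torchvision:", torchvision.__version__)
print("transformers:", transformers.__version__)

# Loading a CLIP encoder exercises both the package install and the GPU stack.
device = "cuda" if torch.cuda.is_available() else "cpu"
model, preprocess = clip.load("ViT-B/32", device=device)
print("CLIP loaded, input resolution:", model.visual.input_resolution)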

Quick Start

  1. Prepare Data
    Download the datasets from this link.

  2. Generate Adversarial Examples

    python generate_adversarial_examples_MPCAttack.py --output ./MPCAttack
  3. Evaluation

    The evaluation is separated into two parts:

    1. generate descriptions for clean and adversarial images on target blackbox model
    2. evaluate the Attack Success Rate (ASR) and Similarity score

    For the first part, run:

     python blackbox_text_generation.py --output ./MPCAttack --model_name Qwen2.5-VL-7B-Instruct

    Note 1: On the first run of this part, text descriptions for the source image, the target image, and the adversarial example are generated together. When the description files for the source and target images already exist, they are skipped to avoid duplicate generation.
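
    The skip-on-rerun behavior in Note 1 amounts to caching on the output file, roughly as in the sketch below (hypothetical code for illustration; the actual logic lives in blackbox_text_generation.py, and describe_image stands in for the black-box model query):

     from pathlib import Path

     def cached_description(image_path, out_dir, describe_image):
         """Return the text description for an image, reusing a cached file if present."""
         out_file = Path(out_dir) / (Path(image_path).stem + ".txt")
         if out_file.exists():
             return out_file.read_text()    # already generated on an earlier run: skip
         text = describe_image(image_path)  # query the target black-box MLLM (hypothetical)
         out_file.parent.mkdir(parents=True, exist_ok=True)
         out_file.write_text(text)          # cache so later runs can skip this image
         return text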

    Note 2: All open-source MLLMs are evaluated with the VLMEvalKit toolkit. To support the latest open-source models, update the vlmeval folder from the upstream VLMEvalKit repository.

    Note 3: When the target model is closed-source, the corresponding API key must be configured. Create api_keys.yaml in the repository root following this template:

    # API Keys for different models
    # DO NOT commit this file to git!
    
    gpt4v: "your_api_key"
    claude: "your_api_key"
    claude4_5: "your_api_key"
    gemini: "your_api_key"
    gpt4o: "your_api_key"
    gpt5: "your_api_key"
    gpt-4o-mini: "your_api_key"
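
    For reference, such a file can be read with PyYAML roughly as follows (a minimal sketch; load_api_key is a hypothetical helper, and the repository's own loading code may differ):

     import yaml

     def load_api_key(model_name, path="api_keys.yaml"):
         """Look up the API key for one model in the template above."""
         with open(path) as f:
             keys = yaml.safe_load(f)  # e.g. {"gpt4o": "your_api_key", ...}
         if model_name not in keys:
             raise KeyError(f"no API key configured for '{model_name}' in {path}")
         return keys[model_name]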

    For the second part, run:

    python gpt_evaluate.py --output ./MPCAttack --model_name Qwen2.5-VL-7B-Instruct

    Note: Evaluation uses the gpt-4o-mini model, so its API key must also be configured in api_keys.yaml.
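
    For intuition, the ASR is the fraction of adversarial images that the judge deems to have misled the target model (a schematic sketch; the actual judging prompt and scoring in gpt_evaluate.py may differ):

     def attack_success_rate(judgements):
         """ASR = (# samples judged as successful attacks) / (total samples)."""
         if not judgements:
             return 0.0
         return sum(judgements) / len(judgements)

     # Example: 7 of 10 adversarial images fooled the target MLLM -> ASR = 0.7
     print(attack_success_rate([True] * 7 + [False] * 3))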

Results


Visualization

  • Visualization of adversarial images and perturbations.


  • Visualization of adversarial images in attacking commercial MLLMs.


Acknowledgments

We sincerely thank M-Attack and FoA-Attack for their outstanding work.

Citation

@article{li2026multi,
  title={Multi-Paradigm Collaborative Adversarial Attack Against Multi-Modal Large Language Models},
  author={Li, Yuanbo and Xu, Tianyang and Hu, Cong and Zhou, Tao and Wu, Xiao-Jun and Kittler, Josef},
  journal={arXiv preprint arXiv:2603.04846},
  year={2026}
}

