Pixels Versus Priors: Controlling Knowledge Priors in Vision-Language Models through Visual Counterfacts
This repository contains code for analyzing multimodal large language models (MLLMs) using counterfactual images. It includes tools for evaluating model accuracy, probing internal activations, computing attention shifts, and applying steering vector interventions.
📖 Paper | 🤗 Hugging Face Dataset
Download the dataset from HuggingFace:
```python
from datasets import load_dataset

dataset = load_dataset("mgolov/Visual-Counterfact")
df_color = dataset["color"].to_pandas()
df_size = dataset["size"].to_pandas()
```

Inference
Runs inference on MLLMs using the Visual CounterFact dataset and outputs model responses to different prompt-image combinations.
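As a rough illustration of what this step involves, here is a minimal sketch that queries an off-the-shelf MLLM on a single image-prompt pair. The model checkpoint, prompt template, and variable names are assumptions for illustration, not the repository's actual configuration.

```python
# Minimal sketch: querying an off-the-shelf MLLM on one image-prompt pair.
# The model, prompt template, and example usage are illustrative assumptions.
import torch
from transformers import AutoProcessor, LlavaForConditionalGeneration

model_id = "llava-hf/llava-1.5-7b-hf"  # example MLLM; swap in the model you evaluate
processor = AutoProcessor.from_pretrained(model_id)
model = LlavaForConditionalGeneration.from_pretrained(
    model_id, torch_dtype=torch.float16, device_map="auto"
)

def query(image, question):
    # LLaVA-1.5 chat format; other MLLMs use different templates.
    prompt = f"USER: <image>\n{question} ASSISTANT:"
    inputs = processor(images=image, text=prompt, return_tensors="pt").to(model.device)
    out = model.generate(**inputs, max_new_tokens=20, do_sample=False)
    return processor.decode(out[0], skip_special_tokens=True)

# e.g. compare the answer for a counterfactual image against the answer to the
# same question asked without the image (hypothetical example variables):
# answer = query(example_image, "What color is this strawberry?")
```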
Early decoding
Implements early decoding to track model predictions across layers. Used to visualize when the model shifts from relying on world knowledge to visual input.
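Early decoding here is in the spirit of the logit lens: intermediate hidden states are projected through the model's final norm and unembedding to see what the prediction would be at each layer. The sketch below assumes a Llama-style backbone (final norm at `model.model.norm`, unembedding at `model.lm_head`); exact module names differ across MLLMs, and the repository's script may proceed differently.

```python
# Sketch of logit-lens-style early decoding on a Llama-style causal LM.
# For an MLLM you would run the full multimodal forward pass and read
# hidden states the same way; module names are assumptions.
import torch

@torch.no_grad()
def decode_per_layer(model, tokenizer, input_ids):
    outputs = model(input_ids, output_hidden_states=True)
    per_layer = []
    # hidden_states[0] is the embedding output; later entries follow each decoder layer.
    for layer_idx, hidden in enumerate(outputs.hidden_states):
        last = hidden[:, -1, :]                          # final token position
        logits = model.lm_head(model.model.norm(last))   # project into vocabulary space
        token = tokenizer.decode(logits.argmax(dim=-1))
        per_layer.append((layer_idx, token))
    return per_layer  # inspect where the prediction flips from the WK answer to the CF answer
```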
get_hidden_states.py
Extracts hidden activations from selected layers in the model. This must be run before computing steering vectors.
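One straightforward way to do this kind of extraction is with forward hooks on the decoder layers. The sketch below assumes a Llama-style `model.model.layers` module list, which is an assumption about the backbone rather than the script's exact mechanism.

```python
# Sketch: cache hidden activations from selected decoder layers with forward hooks.
# `model.model.layers` is a Llama-style assumption; adjust for your MLLM's backbone.
import torch

def collect_hidden_states(model, input_ids, layer_indices):
    cache, handles = {}, []

    def make_hook(idx):
        def hook(module, inputs, output):
            # Decoder layers typically return a tuple; hidden states are the first element.
            hidden = output[0] if isinstance(output, tuple) else output
            cache[idx] = hidden.detach().cpu()
        return hook

    for idx in layer_indices:
        handles.append(model.model.layers[idx].register_forward_hook(make_hook(idx)))
    try:
        with torch.no_grad():
            model(input_ids)
    finally:
        for h in handles:
            h.remove()
    return cache  # {layer_index: tensor of shape (batch, seq_len, hidden_dim)}
```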
Steering (CF → WK)
Applies steering vectors to shift predictions from counterfactual (CF) answers to world-knowledge (WK) answers. Requires hidden states from `get_hidden_states.py`.
Steering (WK → CF)
Applies steering vectors to shift predictions from world-knowledge (WK) answers to counterfactual (CF) answers. Requires hidden states from `get_hidden_states.py`.
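Both steering scripts apply the same kind of intervention in opposite directions. As a hedged sketch of the general technique only (a difference-of-means vector injected through a forward hook; the repository's exact construction, layer choice, and scaling may differ):

```python
# Sketch: difference-of-means steering vector added to one layer's output via a hook.
# The construction and injection point are illustrative, not the repo's exact scripts.
import torch

def build_steering_vector(h_target, h_source):
    # h_target / h_source: (num_examples, hidden_dim) activations cached by
    # get_hidden_states.py, e.g. target = WK-answer runs, source = CF-answer runs.
    return h_target.mean(dim=0) - h_source.mean(dim=0)

def add_steering_hook(model, layer_idx, vector, alpha=4.0):
    def hook(module, inputs, output):
        hidden = output[0] if isinstance(output, tuple) else output
        hidden = hidden + alpha * vector.to(hidden.device, dtype=hidden.dtype)
        return (hidden, *output[1:]) if isinstance(output, tuple) else hidden
    # Llama-style layer list assumed; keep the handle to remove the intervention later.
    return model.model.layers[layer_idx].register_forward_hook(hook)
```

Swapping which set of activations is treated as target and which as source gives the opposite steering direction; removing the returned hook handle undoes the intervention.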
Attention analysis
Analyzes changes in attention mass between image and text tokens, comparing prompt-based versus intervention-based steering.
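One way to quantify such a shift, sketched under the assumption that attention weights are exposed via `output_attentions=True` and that the image-token positions in the input sequence are already known (identifying them is model-specific):

```python
# Sketch: per-layer fraction of last-token attention mass that lands on image tokens.
# Assumes `image_token_positions` (a list of indices) is known for the given MLLM.
# Some attention implementations only return weights when the model is loaded
# with attn_implementation="eager".
import torch

@torch.no_grad()
def image_attention_share(model, inputs, image_token_positions):
    outputs = model(**inputs, output_attentions=True)
    shares = []
    for layer_attn in outputs.attentions:          # each: (batch, heads, seq, seq)
        last_query = layer_attn[0, :, -1, :]       # attention from the final token, all heads
        total = last_query.sum()
        on_image = last_query[:, image_token_positions].sum()
        shares.append((on_image / total).item())
    return shares  # per-layer share of attention mass on image tokens
```

Comparing these per-layer shares with and without a steering prompt or a steering-vector intervention gives the kind of attention shift described above.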
Notes
- Run `get_hidden_states.py` before either of the `task_vectors_inference_*.py` scripts.
- All scripts assume access to the Visual CounterFact dataset and compatible MLLMs.