# 📚 XAI Thesis Code Repository

This repository contains the official code and experiments for the Master's thesis:

**Title:** Improving Explainability of Transformer Models
## 🛠️ Overview

This thesis proposes two novel methods to improve the faithfulness and interpretability of Transformer attention mechanisms:
### Selective Attention Rollout

- Selectively composes attention matrices across layers based on their dissimilarity, reducing noise amplification during Attention Rollout (a minimal sketch follows this list).
- Improves the Spearman correlation with input gradients by 6.5× for BERT-base and 3.1× for BERT-large.
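For intuition, here is a minimal sketch of the composition rule, assuming head-averaged per-layer attention matrices as NumPy arrays. The Frobenius-distance test and the `threshold` parameter are illustrative assumptions; see `4_01_experiments.py` for the criterion as actually implemented.

```python
import numpy as np

def selective_attention_rollout(attentions, threshold=0.1):
    """Roll out attention across layers, composing a layer only if it is
    sufficiently dissimilar (Frobenius distance) from the previous one.
    `attentions`: list of head-averaged (seq_len, seq_len) matrices.
    `threshold` is an illustrative hyperparameter, not the thesis value.
    """
    identity = np.eye(attentions[0].shape[0])

    def augment(a):
        # Fold in the residual connection, as in standard Attention Rollout,
        # and re-normalize rows to keep a stochastic matrix.
        a = 0.5 * a + 0.5 * identity
        return a / a.sum(axis=-1, keepdims=True)

    rollout = augment(attentions[0])
    prev = attentions[0]
    for attn in attentions[1:]:
        # Skipping near-duplicate layers avoids amplifying redundant noise.
        if np.linalg.norm(attn - prev, ord="fro") > threshold:
            rollout = augment(attn) @ rollout
        prev = attn
    return rollout
```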
### Network Diffusion for Attention Smoothing

- Applies a diffusion process to the attention matrices based on a singular value transformation (a rough sketch follows this list).
- Enhances the clarity of attention maps by propagating meaningful signal and suppressing noise.
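As a rough illustration of what a singular-value-based diffusion can look like, the sketch below damps small (noisy) singular values exponentially while largely preserving the dominant ones. The damping function `exp(-t * (1 - s/s_max))` and the strength `t` are assumptions of this sketch, not necessarily the transform used in the notebook.

```python
import numpy as np

def diffuse_attention(attn, t=1.0):
    """Smooth a (seq_len, seq_len) attention matrix by transforming its
    singular values with an illustrative heat-kernel-style damping."""
    u, s, vt = np.linalg.svd(attn, full_matrices=False)
    s_diffused = s * np.exp(-t * (1.0 - s / s.max()))  # damp small values
    return u @ np.diag(s_diffused) @ vt
```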
Experiments evaluate text classification on SST-2 (from the GLUE benchmark) and attention visualization on ImageNet samples for Vision Mamba models.
## 📂 Repository Contents

- `4_01_experiments.py`: Main script for running the Selective Attention Rollout experiments on BERT-base and BERT-large using SST-2 data.
- `Network Diffusion_Attention Visualisation.ipynb`: Jupyter notebook demonstrating Network Diffusion for smoothing attention maps (Vision Mamba examples).
## 🧪 Key Experiments

### Selective Attention Rollout

- Calculate the Spearman rank correlation between attention scores and input-gradient saliency (a minimal example follows this list).
- Compare standard Attention Rollout against selective layer composition based on Frobenius-norm distance.
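A minimal example of this faithfulness measure, assuming per-token attention scores (e.g. the [CLS] row of a rollout matrix) and input-gradient saliency have already been computed; all variable names are illustrative.

```python
from scipy.stats import spearmanr

def faithfulness(attn_scores, grad_saliency):
    """Spearman rank correlation between per-token attention scores and
    input-gradient saliency: higher means the rollout ranks tokens more
    like the gradients do."""
    rho, _ = spearmanr(attn_scores, grad_saliency)
    return rho

# Hypothetical comparison of standard vs. selective rollout
# (row 0 is taken to be the [CLS] token):
# rho_std = faithfulness(rollout_std[0], grad_saliency)
# rho_sel = faithfulness(rollout_sel[0], grad_saliency)
```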
### Network Diffusion

- Apply singular-value-based diffusion to attention matrices.
- Visualize before-and-after attention heatmaps for vision models (see the plotting sketch below).
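A small Matplotlib helper along these lines renders the comparison; the layout and color map are illustrative, and the notebook's plots may differ.

```python
import matplotlib.pyplot as plt

def plot_before_after(attn, attn_diffused):
    """Show an attention matrix before and after diffusion side by side."""
    fig, axes = plt.subplots(1, 2, figsize=(8, 4))
    for ax, mat, title in zip(axes, (attn, attn_diffused),
                              ("Original attention", "After diffusion")):
        ax.imshow(mat, cmap="viridis")
        ax.set_title(title)
        ax.axis("off")
    fig.tight_layout()
    plt.show()
```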
## 🚀 Getting Started

Clone the repository:

```bash
git clone https://github.com/manuragkhullar/XAI_thesis.git
cd XAI_thesis
```

Install the required Python packages:

```bash
pip install -r requirements.txt
```

Run the experiments:
- BERT Selective Rollout: `python 4_01_experiments.py`
- Vision Diffusion: open `Network Diffusion_Attention Visualisation.ipynb`
## 🙏 Acknowledgements

This repository builds upon and adapts code from the following open-source projects:

- **Attention Rollout** by Samira Abnar and Willem Zuidema (ACL 2020): "Quantifying Attention Flow in Transformers."
- **The Hidden Attention of Mamba** by Ameen Ali, Itamar Zimerman, and Lior Wolf (Tel Aviv University, 2024): official PyTorch implementation.