Sample-Based Quantum Diagonalization

This repository is a beginner-friendly walkthrough of sample-based quantum diagonalization (SQD).

If you are new to the topic, the short version is this:

a Hamiltonian matrix $H$ contains the energy information of a quantum system,
diagonalizing $H$ gives us its eigenvalues and eigenstates,
for large systems, full diagonalization becomes too expensive,
SQD uses samples to guess which basis states matter most and then solves a smaller problem inside that reduced space.

The project is written for someone who wants more than a quick demo. The notebook explains the full workflow slowly, in plain language, and with enough mathematics to make every step precise.

You can think of SQD as a two-part idea: first use samples to discover where the important physics seems to live, then solve the smaller problem inside that discovered region.

The main teaching resource is the notebook:

notebooks/sample_based_quantum_diagonalization_workflow.ipynb

If you only open one file in this repository, make it that notebook.

The notebook builds the SQD idea from first principles:

Start from a small Hamiltonian.
Solve the full problem exactly so we have a reference answer.
Turn the target eigenstate into a sampling problem.
Reconstruct a small subspace from the observed bitstrings.
Project the Hamiltonian into that subspace.
Solve a much smaller eigenvalue problem.
Compare the SQD estimate against the exact answer.

The goal is not only to run code, but to understand why each step works mathematically, what information enters at each stage, and how the final reduced diagonalization connects back to the original quantum problem.

Quick start

Clone the repository and move into the project folder:

git clone https://github.com/TSS99/Sample_Based_Quantum_Diagonalization.git
cd Sample_Based_Quantum_Diagonalization

Create the virtual environment and install the required libraries:

./setup_env.sh

This script creates .venv, installs the packages from requirements.txt, and registers a Jupyter kernel for the project.

Activate the environment whenever you return to the project:

source .venv/bin/activate

If your shell prompt changes after running that command, that is a good sign that the virtual environment is active.

Launch JupyterLab:

./start_notebook.sh

Open the notebook file from the notebooks/ folder once JupyterLab starts.

What the notebook teaches

The notebook is written for a reader who may be new to both quantum computing and numerical diagonalization.

It explains:

What an eigenvalue problem is in the language of quantum mechanics.
Why full diagonalization becomes expensive as the Hilbert space grows.
How measurement samples tell us which computational basis states matter most.
How to build a reduced subspace from those samples.
Why the reduced problem can be solved with a projected Hamiltonian.
When SQD succeeds and when it fails.

The notebook is therefore both a coding walkthrough and a mathematical tutorial.

Tiny glossary before you begin

If some of the words below feel unfamiliar, keep this section nearby while reading the notebook.

Hamiltonian: the matrix that represents the energy structure of the system.
Eigenvalue: a special number $E$ such that $H \lvert \psi \rangle = E \lvert \psi \rangle$.
Eigenstate: the state $\lvert \psi \rangle$ that belongs to that eigenvalue.
Ground state: the eigenstate with the smallest eigenvalue.
Basis state: one of the simple reference states used to describe all other states. In this notebook the basis states are bitstrings such as $000$ and $101$.
Amplitude: the coefficient in front of a basis state in a quantum superposition.
Born rule: the rule that turns an amplitude into a measurement probability through $|a_x|^2$.
Hilbert space: the vector space that contains all allowed states of the quantum system.
Subspace: a smaller vector space inside the full Hilbert space.
Projection: the act of restricting a large matrix problem to a smaller chosen subspace.
Shot: one measurement sample drawn from the probability distribution of the state.
Overlap matrix: a matrix that records how basis vectors relate to one another through inner products.

Workflow at a glance

The notebook follows the same sequence every time:

Define a Hamiltonian matrix $H$.
Solve $H \lvert \psi \rangle = E \lvert \psi \rangle$ exactly for a reference answer.
Convert the ground-state amplitudes into sampling probabilities with the Born rule.
Draw bitstring samples that imitate repeated measurements.
Keep the most important sampled configurations.
Form the projected Hamiltonian $H_{\mathrm{sub}} = B^T H B$.
Form the overlap matrix $S = B^T B$.
Solve the reduced problem $H_{\mathrm{sub}} c = E S c$.
Rebuild the approximate state in the full Hilbert space.
Compare the approximate energy with the exact one.

The key mental picture is that the full problem stays the same, but we try to solve it inside a smaller space suggested by the data. That is the main compression idea behind the workflow.

How to read the notebook

The notebook is designed to be read from top to bottom without skipping.

The best way to use it is:

Read each markdown cell slowly before running the next code cell.
Pause after every table or plot and ask what changed mathematically.
Compare the exact answer and the SQD answer instead of treating them as unrelated outputs.
Rerun the sampling sections with a different random seed or shot count to build intuition.
Keep the glossary nearby if a term appears that you have not used before.

If you are completely new, do not worry about memorizing everything on the first pass. The important thing is to follow the chain of logic from:

exact state $\rightarrow$ sampled bitstrings $\rightarrow$ reduced basis $\rightarrow$ projected Hamiltonian $\rightarrow$ approximate energy.

Reading with that chain in mind makes the notebook much easier to follow.

What you should expect to see while running it

If the notebook is working correctly, you should see all of the following along the way:

A printed Hamiltonian table indexed by three-bit basis states.
A table of exact eigenvalues and a clearly identified ground-state energy.
A support table showing which bitstrings carry the most probability.
A measurement-count table built from finite sampling.
A reduced-basis selection based on the most frequent sampled bitstrings.
A projected Hamiltonian and overlap matrix.
A comparison between the exact answer and the SQD approximation.
A convergence plot showing how error changes with shots and subspace size.

That sequence is helpful because it lets you match every mathematical idea to a concrete output. You should also expect the exact probabilities and sampled probabilities to be close, but not perfectly identical, because the samples come from finitely many shots.

That small mismatch is part of the lesson, not a bug.

Repository layout

.
├── README.md
├── requirements.txt
├── setup_env.sh
├── start_notebook.sh
├── notebooks/
│   └── sample_based_quantum_diagonalization_workflow.ipynb
└── scripts/
    ├── check_github_math.py
    └── generate_sqd_notebook.py

The scripts/ folder is there to keep the generated notebook and the notebook source in sync.

A note about the example

The notebook uses a small three-qubit Hamiltonian on purpose.

That is not because SQD only matters for small problems. It is because small problems are the best place to learn the workflow carefully:

you can see the full Hamiltonian,
you can inspect every basis state,
you can verify the exact answer,
you can see how sampling changes the result.

Once the workflow feels natural on a toy model, it becomes much easier to understand how the same structure scales to chemistry, materials, or larger fermionic systems.

In other words, this repository teaches the method on a small system so the logic is easy to see before the same ideas are applied to harder problems.

GitHub rendering note

The markdown in this repository uses GitHub-friendly math delimiters such as $...$ and $$...$$ so the equations render more reliably in both the README and the notebook preview on GitHub.

That is why the math expressions are written in simple GitHub-compatible markdown form rather than notebook-specific formatting. This keeps the repository easier to read directly on GitHub.

Regenerating the notebook

If you edit the generator script, recreate the notebook with:

python3 scripts/generate_sqd_notebook.py

That command rebuilds the .ipynb file from the generator source, so it is the right step after any notebook-content edit made in scripts/generate_sqd_notebook.py. It also helps avoid hand-editing raw notebook JSON.

You can also scan the repo for GitHub-unfriendly math delimiters with:

python3 scripts/check_github_math.py

That quick check is useful if you want the README and notebook preview to keep rendering cleanly on GitHub after small documentation edits. It is especially helpful after changing equations or markdown-heavy explanations.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Sample-Based Quantum Diagonalization

Quick start

What the notebook teaches

Tiny glossary before you begin

Workflow at a glance

How to read the notebook

What you should expect to see while running it

Repository layout

A note about the example

GitHub rendering note

Regenerating the notebook

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 141 Commits
notebooks		notebooks
scripts		scripts
.gitignore		.gitignore
README.md		README.md
requirements.txt		requirements.txt
setup_env.sh		setup_env.sh
start_notebook.sh		start_notebook.sh

Folders and files

Latest commit

History

Repository files navigation

Sample-Based Quantum Diagonalization

Quick start

What the notebook teaches

Tiny glossary before you begin

Workflow at a glance

How to read the notebook

What you should expect to see while running it

Repository layout

A note about the example

GitHub rendering note

Regenerating the notebook

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages