This repository collects my solutions and writeups for the NVIDIA SOL-ExecBench benchmark.
- Build a structured set of SOL-ExecBench solutions.
- Provide reproducible implementations with clear code comments.
- Document transferable GPU kernel optimization patterns.
Each problem writeup will typically include:
- Problem understanding and constraints
- Baseline implementation
- Optimized versions (e.g., memory access, parallel strategy, fusion)
- Performance comparison and key takeaways
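To illustrate the "baseline, then optimize, then compare" workflow at a toy scale, here is a minimal, dependency-free timing harness sketch. The `baseline` and `optimized` functions are placeholders invented for this example (not kernels from this repo); the fused list comprehension stands in for kernel fusion:

```python
import time

def bench(fn, *args, iters=100):
    """Time `fn` over several iterations; return mean seconds per call."""
    fn(*args)  # warm-up so one-time setup does not skew the measurement
    start = time.perf_counter()
    for _ in range(iters):
        fn(*args)
    return (time.perf_counter() - start) / iters

def baseline(xs):
    # Two separate passes over the data (analogue of two kernel launches).
    out = [x * 2 for x in xs]
    return [y + 1 for y in out]

def optimized(xs):
    # Fuses both passes into one, mirroring kernel fusion in miniature.
    return [x * 2 + 1 for x in xs]

data = list(range(10_000))
assert baseline(data) == optimized(data)  # verify correctness before speed
t_base = bench(baseline, data)
t_opt = bench(optimized, data)
print(f"speedup: {t_base / t_opt:.2f}x")
```

Real writeups in this repo compare GPU kernels, but the shape of the comparison is the same: establish correctness against the reference first, then measure.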
This repository is a work in progress and will be updated continuously.
- 001_attn_bwd: Backward pass for attention softmax, dropout, and value matmul.
- 002_vae_conv2d: Fused VAE residual block with Conv3x3, GroupNorm, SiLU, and residual addition.
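For context on the softmax piece of 001_attn_bwd: given probabilities p = softmax(s) and an upstream gradient g, the gradient with respect to the logits is p_i · (g_i − Σ_j g_j p_j). A dependency-free sketch of this rule (not the repo's Triton implementation), verified against finite differences:

```python
import math

def softmax(s):
    m = max(s)
    e = [math.exp(v - m) for v in s]  # subtract max for numerical stability
    z = sum(e)
    return [v / z for v in e]

def softmax_backward(p, g):
    """Gradient of the loss wrt logits, given p = softmax(s) and upstream g."""
    dot = sum(gi * pi for gi, pi in zip(g, p))
    return [pi * (gi - dot) for pi, gi in zip(p, g)]

# Finite-difference check of the analytic gradient on arbitrary inputs.
s = [0.5, -1.0, 2.0]
g = [1.0, 0.0, -0.5]
analytic = softmax_backward(softmax(s), g)
eps = 1e-6
for i in range(len(s)):
    s_hi, s_lo = s[:], s[:]
    s_hi[i] += eps
    s_lo[i] -= eps
    numeric = sum(gj * (a - b) for gj, a, b in
                  zip(g, softmax(s_hi), softmax(s_lo))) / (2 * eps)
    assert abs(numeric - analytic[i]) < 1e-5
```

The actual kernel additionally threads the dropout mask and the value matmul through this backward computation, but the softmax Jacobian-vector product above is the core identity it relies on.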
The `.claude/skills/` directory contains model-invoked skills for this project:
| Skill | Triggers when… |
|---|---|
| `new-kernel` | Creating a new kernel implementation from a torch reference |
| `b200-tuning` | Optimizing for B200/Blackwell performance (tiles, TMA, WGMMA, pipeline) |
| `kernel-testing` | Running `test.py`, diagnosing failures, or using Triton IR debug flags |
- Benchmark: https://research.nvidia.com/benchmarks/sol-execbench
- Original repository: https://github.com/nvidia/sol-execbench