add causal_upper_left mask option to scaled_dot_product_attention by mm65x · Pull Request #3254 · ml-explore/mlx

mm65x · 2026-03-14T23:06:18Z

Proposed changes

adds "causal_upper_left" and "causal_lower_right" as explicit mask options
to mx.fast.scaled_dot_product_attention. "causal" stays as an alias for
"causal_lower_right", so nothing breaks.

when S_Q != S_KV, lower-right aligns the last query with the last key,
while upper-left aligns query i with keys 0..i (matching PyTorch's
is_causal=True). when S_Q == S_KV they're identical.

on Metal, the full-attention kernels already parameterize the causal diagonal
via qL_off in AttnParams, so the change is just passing 0 instead of
kL - qL for upper-left. the vector kernels previously hardcoded the offset
inline, so a causal_offset buffer argument was added instead. on CUDA, the
cuDNN path uses set_causal_mask vs set_causal_mask_bottom_right, and the
vector kernels use a causal_offset field in AttnParams (same approach as
Metal).

Checklist

I have read the CONTRIBUTING document
I have run pre-commit run --all-files to format my code / installed pre-commit prior to committing changes
I have added tests that prove my fix is effective or that my feature works
I have updated the necessary documentation (if needed)

zcbenz · 2026-03-25T04:34:29Z

Can you fix the lint error?

mm65x force-pushed the sdpa-causal-upper-left branch from 7693beb to 6b2d587 Compare March 14, 2026 23:07

add causal_upper_left mask option to scaled_dot_product_attention

a435cd1

mm65x force-pushed the sdpa-causal-upper-left branch from 670373d to a435cd1 Compare March 20, 2026 10:33

mm65x marked this pull request as ready for review March 20, 2026 10:33

fix formatting in causal mask update

9e85fd0

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

add causal_upper_left mask option to scaled_dot_product_attention#3254

add causal_upper_left mask option to scaled_dot_product_attention#3254
mm65x wants to merge 2 commits intoml-explore:mainfrom
mm65x:sdpa-causal-upper-left

mm65x commented Mar 14, 2026 •

edited

Loading

Uh oh!

zcbenz commented Mar 25, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

mm65x commented Mar 14, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Proposed changes

Checklist

Uh oh!

zcbenz commented Mar 25, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

mm65x commented Mar 14, 2026 •

edited

Loading