[vello_common] Add CPU MSAA#1377

Draft
b0nes164 wants to merge 5 commits into main from thomas/cpu_msaa

Conversation

@b0nes164
Member

Uploading this as a potential feature; if we do want to land it, I can polish it up from here. Currently, the performance is pretty bad, about 3x the cost of analytic anti-aliasing (~4.3ms vs ~1.5ms on tiger). However, the pixel intersection logic is currently written in a GPU-ish style, and I can imagine that switching to a more serial-friendly DDA approach could yield some savings.
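For readers unfamiliar with the term, here is a minimal sketch of a serial grid DDA (Amanatides-Woo style) that walks a segment through the pixel grid one cell at a time. It is not the code in this PR: `dda_cells` and its signature are hypothetical, and the assumption is simply that pixels are unit squares on an integer grid.

```rust
/// Hypothetical helper: list every pixel cell a segment passes through, in
/// order, by stepping from one grid-line crossing to the next.
fn dda_cells(x0: f32, y0: f32, x1: f32, y1: f32) -> Vec<(i32, i32)> {
    let (dx, dy) = (x1 - x0, y1 - y0);
    let (mut cx, mut cy) = (x0.floor() as i32, y0.floor() as i32);
    let (ex, ey) = (x1.floor() as i32, y1.floor() as i32);
    let step_x = if dx > 0.0 { 1 } else { -1 };
    let step_y = if dy > 0.0 { 1 } else { -1 };

    // Parametric distance (t in 0..1 along the segment) between two
    // consecutive vertical / horizontal grid lines.
    let t_delta_x = if dx != 0.0 { 1.0 / dx.abs() } else { f32::INFINITY };
    let t_delta_y = if dy != 0.0 { 1.0 / dy.abs() } else { f32::INFINITY };

    // Parametric distance to the first vertical / horizontal grid line.
    let mut t_max_x = if dx > 0.0 {
        (cx as f32 + 1.0 - x0) * t_delta_x
    } else if dx < 0.0 {
        (x0 - cx as f32) * t_delta_x
    } else {
        f32::INFINITY
    };
    let mut t_max_y = if dy > 0.0 {
        (cy as f32 + 1.0 - y0) * t_delta_y
    } else if dy < 0.0 {
        (y0 - cy as f32) * t_delta_y
    } else {
        f32::INFINITY
    };

    // The number of grid-line crossings is known up front, so the loop
    // always terminates regardless of rounding.
    let steps = (ex - cx).abs() + (ey - cy).abs();
    let mut cells = Vec::with_capacity(steps as usize + 1);
    cells.push((cx, cy));
    for _ in 0..steps {
        if t_max_x < t_max_y {
            cx += step_x;
            t_max_x += t_delta_x;
        } else {
            cy += step_y;
            t_max_y += t_delta_y;
        }
        cells.push((cx, cy));
    }
    cells
}
```

Each iteration does one comparison and one add, with no per-pixel intersection test, which is where a serial implementation might save work compared to testing every pixel independently.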

@b0nes164
Member Author

4.3ms -> 3.6ms on tiger

@b0nes164
Member Author

b0nes164 commented Jan 25, 2026

~3.6ms -> ~2.8ms on tiger.

Things that I tried but were either the same perf or worse:

  • Morton-encoding the LUT
  • u8x32 SIMD instead of the current u32x8
  • DDA-stepping the rows
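As background on the first item, Morton (Z-order) encoding interleaves the bits of two coordinates so that entries close together in 2D end up close together in memory. A minimal sketch follows; how the LUT in this PR is actually indexed is not shown in this thread, so the 2D `(x, y)` index and the names `part1by1` and `morton2` are assumptions for illustration only.

```rust
/// Spread the low 16 bits of `v` so that bit i moves to bit 2*i.
fn part1by1(mut v: u32) -> u32 {
    v &= 0x0000_ffff;
    v = (v | (v << 8)) & 0x00ff_00ff;
    v = (v | (v << 4)) & 0x0f0f_0f0f;
    v = (v | (v << 2)) & 0x3333_3333;
    v = (v | (v << 1)) & 0x5555_5555;
    v
}

/// Interleave x (even bits) and y (odd bits) into a single Z-order index.
/// Both inputs are truncated to 16 bits.
fn morton2(x: u32, y: u32) -> u32 {
    part1by1(x) | (part1by1(y) << 1)
}
```

A table stored this way would be read as `lut[morton2(x, y) as usize]` instead of `lut[(y * width + x) as usize]`.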

The last thing that could be tried is switching from processing by row to processing by column. At the very least, this would save the transpose that is currently necessary before writing to the alpha buffer.
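To illustrate the transpose in question, here is a rough sketch under two assumptions that are not confirmed in this thread: that coverage is produced one row at a time, and that the alpha buffer wants each column of a 4-pixel-tall tile stored contiguously. `TILE_HEIGHT` and `rows_to_columns` are hypothetical names, not the PR's code.

```rust
/// Assumed tile height in pixels (an assumption for this sketch).
const TILE_HEIGHT: usize = 4;

/// Reorder row-major coverage (one array per row) into column-major order,
/// so that the TILE_HEIGHT alphas of each column are contiguous.
fn rows_to_columns<const W: usize>(rows: &[[u8; W]; TILE_HEIGHT]) -> Vec<u8> {
    let mut out = Vec::with_capacity(W * TILE_HEIGHT);
    for x in 0..W {
        for row in rows {
            out.push(row[x]);
        }
    }
    out
}
```

If coverage were instead produced column by column, each column's values could be appended to the output directly and this reshuffle would disappear.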

@b0nes164 b0nes164 marked this pull request as draft January 30, 2026 05:45