PoC: Fast-path for skipping coarse rasterization and scheduling by laurenz-canva · Pull Request #1454 · linebender/vello

laurenz-canva · 2026-02-18T10:40:32Z

Note that this was AI-generated, I haven't reviewed this fully in-depth yet and it's possible we can be smarter about the storing of strips, so no nitpicky review please. 😄 But this should demonstrate what I was imagining. And all tests seem to be passing.

As Alex rightly highlighted, this does have the disadvantage of not allowing the "if there is an opaque fill, clear all previous fill" optimization. However, it seems to me like this should be overshadowed by the improvements that come from not doing scheduling and coarse rasterization. Here are the timings for rendering 1000 frames of the GhotScript tiger:

Before (note in particular Wide::generate and Scheduler::do_scene:

After:

laurenz-canva · 2026-02-18T10:40:55Z

sparse_strips/vello_hybrid/src/schedule.rs

+/// This replicates the strip→GpuStrip conversion that normally happens across
+/// `Wide::generate` + `Scheduler::do_tile`, but for the simple case where all draws
+/// happen at depth=1 directly to the surface with no layers or blending.
+pub(crate) fn build_gpu_strips_direct(


We can probably reuse existing code here, as mentioned I haven't cleaned this up yet, just a PoC.

taj-p

There are parallels with this work and the blit pipeline.

If we go this route, I think we needn't flush the fast path in push layer. We could instead flush the paths that intersect the bounds produced in pop layer. And then, we mightn't even need to depending on the layer type (opacity layer with SRC over blending, for example).

This could also be batched. The fast path can be re-enabled after we pop layer.

In my work on the blit pipeline, there is a batching mechanism you could reuse if we think this is the right strategy to take

LaurenzV · 2026-02-18T19:26:45Z

Sounds good, looking forward to the PR! Yes you could probably optimize this even further, but this is the bare minimum that should already be a good improvement in many cases. 😄

taj-p · 2026-02-18T23:14:17Z

One thing I didn't state but that I hope is implied: AMAZING to have a POC so quickly to validate the approach. Very cool!!! 🎉

PoC: Fast-path for skipping coarse rasterization and scheduling

8c13476

laurenz-canva commented Feb 18, 2026

View reviewed changes

laurenz-canva marked this pull request as draft February 18, 2026 10:41

laurenz-canva requested review from grebmeg and taj-p February 18, 2026 10:41

taj-p reviewed Feb 18, 2026

View reviewed changes

LaurenzV mentioned this pull request Feb 20, 2026

sparse strips: Lazily push wide tile layer buffers #1414

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

PoC: Fast-path for skipping coarse rasterization and scheduling#1454

PoC: Fast-path for skipping coarse rasterization and scheduling#1454
laurenz-canva wants to merge 1 commit intomainfrom
laurenz/poc_fast_path

laurenz-canva commented Feb 18, 2026

Uh oh!

laurenz-canva Feb 18, 2026

Uh oh!

taj-p left a comment

Uh oh!

LaurenzV commented Feb 18, 2026

Uh oh!

taj-p commented Feb 18, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Comments

Conversation

laurenz-canva commented Feb 18, 2026

Uh oh!

laurenz-canva Feb 18, 2026

Choose a reason for hiding this comment

Uh oh!

taj-p left a comment

Choose a reason for hiding this comment

Uh oh!

LaurenzV commented Feb 18, 2026

Uh oh!

taj-p commented Feb 18, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Comments