`vello_hybrid` implementation #831

grebmeg · 2025-03-05T05:04:08Z

Fully based on #818, my changes simply relocate the code from the root folder to sparse_strips/vello_hybrid. The vello_hybrid code is functional and serves as a good foundation for the next refactoring step, aiming to unify vello_cpu and vello_hybrid common components under vello_common.

This brings in the cpu-sparse prototype from the piet-next branch of the piet repo. No substantive changes, but cpu-sparse is renamed vello_hybrid and piet-next is renamed vello_api. Quite a bit of editing to satisfy the lint monster. There was a half-written SIMD implementation of flattening, that's removed. It should be finished and re-added, as it's a good speedup.

Renders a simple scene to the GPU, first by doing coarse rasterization the same as cpu-sparse, then doing a single draw call.

Adds a clip method to the (CPU) render context, plus a considerable amount of mechanism in coarse and fine rasterization to support clipping. The coarse rasterization logic contains a similar set of optimizations as Vello. In particular, all-zero tiles have drawing suppressed, and all-one tiles pass drawing commands through with no additional work to clip. Not extensively validated, but it does render a simple scene with clipping correctly.

DJMcNab · 2025-03-05T09:08:52Z

sparse_strips/vello_hybrid/src/simd.rs

I'd probably tear out the SIMD code entirely from this version, to make the future refactorings clearer

DJMcNab · 2025-03-05T09:10:53Z

sparse_strips/vello_hybrid/src/strip.rs

+                            let c = b.max(0.0);
+                            let d = xmin.max(0.0);
+                            let a = (b + 0.5 * (d * d - c * c) - xmin) / (xmax - xmin);
+                            areas[x as usize][y] += a * dy;


I get a panic on this line:
index out of bounds: the len is 4 but the index is 4, just running:

> cargo run --example gpu

raphlinus and others added 9 commits March 5, 2025 13:57

Fix lints in non-aarch64 cfg's

a98f1fd

Start wiring up GPU render pipeline

b792aab

Renders a simple scene to the GPU, first by doing coarse rasterization the same as cpu-sparse, then doing a single draw call.

Add missing file, fix lints

ae79c63

Remove vello_hybrid lib.rs file

0713d06

Move vello_api and vello_hybrid to sparse_strips

75b769e

Move vello_hybrid-specific code from vello_api to vello_hybrid

c565af7

Refactor vello_hybrid to use internal API module

b3e9a6c

DJMcNab reviewed Mar 5, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

`vello_hybrid` implementation #831

`vello_hybrid` implementation #831

grebmeg commented Mar 5, 2025

DJMcNab Mar 5, 2025

DJMcNab Mar 5, 2025

vello_hybrid implementation #831

Are you sure you want to change the base?

vello_hybrid implementation #831

Conversation

grebmeg commented Mar 5, 2025

DJMcNab Mar 5, 2025

Choose a reason for hiding this comment

DJMcNab Mar 5, 2025

Choose a reason for hiding this comment

`vello_hybrid` implementation #831

`vello_hybrid` implementation #831