Initial commit of piet-next API sketch #589

raphlinus · 2024-12-03T15:55:47Z

Ideas for what an API might look like. We'll use the CPU sparse strip implementation to test those ideas.

Currently lots of TODO and very likely things will change, but this should give an idea.

Ideas for what an API might look like. We'll use the CPU sparse strip implementation to test those ideas. Currently lots of TODO and very likely things will change, but this should give an idea.

This copies the implementation from the cpu-sparse branch of compute-shader-101, which is currently the most up-to-date version (though there are still rendering flaws). We remove the bytemuck dependency and change colors the color crate (following tip of tree peniko), but otherwise unchanged. Lots of warnings as well, as basically everything is unused. There will be changes. In particular, path_id will be removed, as there will be a separate sort per path.

This is the point where the pieces start coming together. At this point, we should be doing basic coarse rasterization for path fills. Not well validated yet. Printing debug output of wide tiles. Doing checkpoint before starting in on fine raster.

This brings it to the point where it can rasterize a single triangle.

Adapt pico-svg from Vello. It now renders the tiger.

Small changes to the example, and basic stats added, to help with performance measurement. Not by any means a careful performance evaluation framework, but ok for doing experiments by hand.

Neon implementation of the 4 basic fine raster operations, all using f32 as the scratch buffer.

Optimize core render_strips implementation using Neon intrinsics.

Simpler scalar code, not designed for thread parallelism. This renders tiger but is likely to have numerical robustness issues; the robustness logic from Vello has not been ported.

CI is very scoldy and doesn't like it!

Again, feed the CI beast

Keep feeding the CI beast!

This might not be the last of them.

Also put allow(unused) on use_cpu, as whether that's used will vary by platform.

The logic for choosing whether to use the scalar or simd version of the strip kernel was backwards. This makes a pretty small performance difference; it just isn't a large part of the total time. Also optimize clamping behavior to take advantage of saturation in conversion operations.

raphlinus and others added 17 commits December 3, 2024 07:55

Initial commit of piet-next API sketch

7d90b93

Ideas for what an API might look like. We'll use the CPU sparse strip implementation to test those ideas. Currently lots of TODO and very likely things will change, but this should give an idea.

Start implementing renderer

6f38fbf

This is the point where the pieces start coming together. At this point, we should be doing basic coarse rasterization for path fills. Not well validated yet. Printing debug output of wide tiles. Doing checkpoint before starting in on fine raster.

Start fine rasterization

0632913

This brings it to the point where it can rasterize a single triangle.

Render Ghostscript tiger

ed45bc9

Adapt pico-svg from Vello. It now renders the tiger.

Tweaks for profiling

f48e1db

Small changes to the example, and basic stats added, to help with performance measurement. Not by any means a careful performance evaluation framework, but ok for doing experiments by hand.

Neon optimizations for f32 fine

5beec69

Neon implementation of the 4 basic fine raster operations, all using f32 as the scratch buffer.

Neon optimization of render_strips

19427a5

Optimize core render_strips implementation using Neon intrinsics.

Rework tiling

d5b8da6

Simpler scalar code, not designed for thread parallelism. This renders tiger but is likely to have numerical robustness issues; the robustness logic from Vello has not been ported.

Fix typo

f44dba3

CI is very scoldy and doesn't like it!

Merge branch 'main' into piet-next

fb4db4e

Commit Cargo.lock changes

4e92d2e

Again, feed the CI beast

Fix clippy lints

1f797e3

Keep feeding the CI beast!

Fix more lints

0c2a142

This might not be the last of them.

Fix broken doc link

ea98b8b

Also put allow(unused) on use_cpu, as whether that's used will vary by platform.

piet-next: Use released Peniko 0.3.0 (#592)

3468740

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Initial commit of piet-next API sketch #589

Initial commit of piet-next API sketch #589

raphlinus commented Dec 3, 2024

Initial commit of piet-next API sketch #589

Are you sure you want to change the base?

Initial commit of piet-next API sketch #589

Conversation

raphlinus commented Dec 3, 2024