Proof-of-concept: Hand-crafted optimizations to pave the way forward for code-gen #2954

jleibs · 2023-08-09T21:18:03Z

What

Demonstrating that with the right generated deserializer optimizations the new code-gen can out-perform the legacy queries.

There's a couple of performance improvements all rolled in here to really push the envelope:

Avoid dealing with Option on non-nullible components
Iterate over direct slices from arrow buffers where possible
Optimization for matched-length joining iterator (this implementation is incorrect but close enough for this profiling)
Fix silly allocations of annotationInfo in the noop case.

Using the photogrammetry example as a stress-test:

Previous Baseline (0.8)

Before (main):

After:

Checklist

I have read and agree to Contributor Guide and the Code of Conduct
I've included a screenshot or gif (if applicable)
I have tested [demo.rerun.io](https://demo.rerun.io/pr/{{ pr.number }}) (if applicable)

[PR Build Summary](https://build.rerun.io/pr/{{ pr.number }})
[Docs preview](https://rerun.io/preview/{{ "pr:%s"|format(pr.branch)|encode_uri_component }}/docs)
[Examples preview](https://rerun.io/preview/{{ "pr:%s"|format(pr.branch)|encode_uri_component }}/examples)

jleibs · 2023-08-09T21:29:17Z

crates/re_types/src/components/point3d.rs

+                    .unwrap()
+                    .values()
+                    .as_slice();
+                let data2: &[[f32; 3]] = bytemuck::cast_slice(data);


This is one of the main things we want to generate for fixed-sized-arrays of primitives.

…#2970) ### What This implements 2 optimizations: - The first is ArrowBuffer optimization returns an inner Buffer directly when we know that the type itself it just an array of primitives. This is useful for zero-copy returns for dense data such as Tensors. - The second is the optimizations from: #2954 . For this, we identify cases where we know the inner arrays are not nullable and instead of using validity-iterators map directly to slices. Significant speedups for batch queries: ![image](https://github.com/rerun-io/rerun/assets/3312232/7ea1f3a2-a45a-4813-b82c-eaee55914c32) TODO: - [x] We should be able to check that the contents don't actually contain a validity map with non-nulls and return a deserialization error in that case. - [x] Add handling for other ArrowBuffer types. ### Checklist * [x] I have read and agree to [Contributor Guide](https://github.com/rerun-io/rerun/blob/main/CONTRIBUTING.md) and the [Code of Conduct](https://github.com/rerun-io/rerun/blob/main/CODE_OF_CONDUCT.md) * [x] I've included a screenshot or gif (if applicable) * [x] I have tested [demo.rerun.io](https://demo.rerun.io/pr/2970) (if applicable) - [PR Build Summary](https://build.rerun.io/pr/2970) - [Docs preview](https://rerun.io/preview/pr%3Ajleibs%2Fcodegen_optimizations/docs) - [Examples preview](https://rerun.io/preview/pr%3Ajleibs%2Fcodegen_optimizations/examples)

jleibs · 2023-09-20T11:24:18Z

These have all now been addressed properly. Closing.

emilk and others added 7 commits August 9, 2023 15:38

Add profiling scopes around functions reading data from store

b02be17

Optimize point clouds by only reading positions once from the store

4e6df7e

Make it possible to bypass Opt / unwrap

c849070

Performance-optimized point impls

5451038

More optimizations

ca16a67

Some more scopes

e89f74f

Avoid stupid allocation of repeated annotation_info

bb3e7fc

jleibs added 🚀 performance Optimization, memory use, etc do-not-merge Do not merge this PR labels Aug 9, 2023

jleibs commented Aug 9, 2023

View reviewed changes

jleibs mentioned this pull request Aug 12, 2023

Introduce codegen optimizations for primitives and fixed-sized-arrays #2970

Merged

5 tasks

jleibs closed this Sep 20, 2023

jleibs deleted the jleibs/more_optimization branch June 14, 2024 13:03

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Proof-of-concept: Hand-crafted optimizations to pave the way forward for code-gen #2954

Proof-of-concept: Hand-crafted optimizations to pave the way forward for code-gen #2954

jleibs commented Aug 9, 2023 •

edited

Loading

jleibs Aug 9, 2023

jleibs commented Sep 20, 2023

Proof-of-concept: Hand-crafted optimizations to pave the way forward for code-gen #2954

Proof-of-concept: Hand-crafted optimizations to pave the way forward for code-gen #2954

Conversation

jleibs commented Aug 9, 2023 • edited Loading

What

Checklist

jleibs Aug 9, 2023

Choose a reason for hiding this comment

jleibs commented Sep 20, 2023

jleibs commented Aug 9, 2023 •

edited

Loading