
fix!: Prefix HeapVector with its size in ForeignCallParams#9288

Closed
aakoshh wants to merge 22 commits into master from af/9271-fix-print-slice-alt

Conversation

@aakoshh
Contributor

@aakoshh aakoshh commented Jul 23, 2025

Description

Problem*

#9287 tries to handle slices only in the printing process, which is a bit clunky and full of caveats. This is an alternative approach which prefixes slices when the Brillig VM reads them from memory.

Summary*

Changes `brillig_vm::VM::get_memory_values` to prefix `HeapVector` data with its size, which for slices should be their capacity. By doing this in `get_memory_values`, before the values are converted to `ForeignCallParam::Array`, we can easily limit the handling to slices and leave `HeapArray`, whose length is statically known, untouched; nested types are then not a problem during decoding.

This is a breaking change because any foreign call handler decoding slices would need to expect 1 extra field in the data.
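As a hypothetical illustration of what the extra field means for a handler (the function name and data layout here are made up for the sketch, not Noir's actual decoding API):

```rust
// Hypothetical sketch of the breaking change from a handler's point of view:
// a slice parameter now arrives with one extra leading field holding its size.
fn decode_slice(fields: &[u64]) -> (u64, &[u64]) {
    // Before this change: `fields` held only the slice's flattened data.
    // After: fields[0] is the size prefix and the data follows it.
    let size = fields[0];
    (size, &fields[1..])
}

fn main() {
    let param = [3, 10, 20, 30]; // size-prefixed slice [10, 20, 30]
    let (size, data) = decode_slice(&param);
    assert_eq!(size, 3);
    assert_eq!(data, &[10, 20, 30][..]);
}
```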

Additional Context

Documentation*

Check one:

  • No documentation needed.
  • Documentation included in this PR.
  • [For Experimental Features] Documentation to be submitted in a separate PR.

PR Checklist*

  • I have tested the changes locally.
  • I have formatted the changes with Prettier and/or cargo fmt on default settings.

@aakoshh aakoshh changed the title Try prefixing just slices fix!: Prefix HeapVector with its length in ForeignCallParams Jul 23, 2025
Contributor

@github-actions github-actions bot left a comment


⚠️ Performance Alert ⚠️

Possible performance regression was detected for benchmark 'Test Suite Duration'.
Benchmark result of this commit is worse than the previous benchmark result exceeding threshold 1.20.

| Benchmark suite | Current: 2c4dfd3 | Previous: ee66c98 | Ratio |
|---|---|---|---|
| test_report_AztecProtocol_aztec-packages_noir-projects_noir-protocol-circuits_crates_rollup-lib | 3 s | 2 s | 1.50 |
| test_report_zkpassport_noir_rsa_ | 2 s | 1 s | 2 |

This comment was automatically generated by workflow using github-action-benchmark.

CC: @TomAFrench

@aakoshh aakoshh marked this pull request as ready for review July 23, 2025 13:46
@aakoshh aakoshh marked this pull request as draft July 23, 2025 14:12
@aakoshh
Contributor Author

aakoshh commented Jul 23, 2025

Okay, so I misunderstood the size in `get_memory_values`. Initially I thought it was the capacity, but it turns out to be a bit more involved: for a slice of tuples it's capacity*item_size, e.g. `[(1,2), (3,4)]` has a size of 4. Then I wanted to implement `flattened_size` for the `PrintableType` and divide by it to get the number of items I need to consume; however, I didn't know what to do for `Enum`, where the number of fields depends on the tag at runtime, and it turns out it crashes if I try to print it in #9294

Then for some reason I thought the size would correspond to the number of fields in the data (or actually I inherited that approach from #9287), but that's not true: `[[1,2], [3,4]]` has a size of 2, i.e. 2 arrays in the slice, and `[(0, true, [1, 2]), (3, false, [5, 6])]` has a size of 6, despite all 8 fields being in the slice: the arrays only contribute once.

The formula I came up with to derive the number of items in the slice involves calculating a type-dependent flattened size, where tuples and structs are flattened, but an array's length and content are treated as 1. We use this size to divide the capacity inserted by the VM, which gives us the number of items we need to consume in total to clear the slice from the data.
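A minimal sketch of that type-dependent flattened size, using a simplified stand-in enum rather than Noir's actual `PrintableType`:

```rust
// Simplified stand-in for Noir's PrintableType, just enough for the sketch.
#[derive(Debug)]
enum PrintableType {
    Field,
    Array { length: usize },
    Tuple { fields: Vec<PrintableType> },
}

// Tuples (and structs) are flattened; an array contributes 1 regardless of
// its length or contents, matching the counting rule described above.
fn flattened_size(typ: &PrintableType) -> usize {
    match typ {
        PrintableType::Field => 1,
        PrintableType::Array { .. } => 1,
        PrintableType::Tuple { fields } => fields.iter().map(flattened_size).sum(),
    }
}

fn main() {
    // [(0, true, [1, 2]), (3, false, [5, 6])]: each tuple item flattens to 3
    // (two fields plus one for the array), so a capacity of 6 divided by 3
    // yields the 2 items in the slice.
    let item = PrintableType::Tuple {
        fields: vec![
            PrintableType::Field,
            PrintableType::Field, // the bool, treated as a single field
            PrintableType::Array { length: 2 },
        ],
    };
    let capacity = 6;
    println!("{}", capacity / flattened_size(&item)); // prints 2
}
```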

@github-actions
Contributor

github-actions bot commented Jul 23, 2025

Changes to Brillig bytecode sizes

Generated at commit: ed6eeb8912364a6f192bcdb8b64654d954da76ff, compared to commit: d88542d6ba7ca633f7456fb9346ac3cfcff7b70d

🧾 Summary (10% most significant diffs)

| Program | Brillig opcodes (+/-) | % |
|---|---|---|
| regression_9271_inliner_max | +420 ❌ | +88.79% |
| regression_9271_inliner_min | +420 ❌ | +88.79% |
| regression_9271_inliner_zero | +420 ❌ | +88.79% |

Full diff report 👇

| Program | Brillig opcodes (+/-) | % |
|---|---|---|
| regression_9271_inliner_max | 893 (+420) | +88.79% |
| regression_9271_inliner_min | 893 (+420) | +88.79% |
| regression_9271_inliner_zero | 893 (+420) | +88.79% |

@github-actions
Contributor

github-actions bot commented Jul 23, 2025

Changes to number of Brillig opcodes executed

Generated at commit: ed6eeb8912364a6f192bcdb8b64654d954da76ff, compared to commit: d88542d6ba7ca633f7456fb9346ac3cfcff7b70d

🧾 Summary (10% most significant diffs)

| Program | Brillig opcodes (+/-) | % |
|---|---|---|
| regression_9271_inliner_max | +422 ❌ | +94.41% |
| regression_9271_inliner_min | +422 ❌ | +94.41% |
| regression_9271_inliner_zero | +422 ❌ | +94.41% |

Full diff report 👇

| Program | Brillig opcodes (+/-) | % |
|---|---|---|
| regression_9271_inliner_max | 869 (+422) | +94.41% |
| regression_9271_inliner_min | 869 (+422) | +94.41% |
| regression_9271_inliner_zero | 869 (+422) | +94.41% |

@github-actions
Contributor

github-actions bot commented Jul 23, 2025

Changes to circuit sizes

Generated at commit: ed6eeb8912364a6f192bcdb8b64654d954da76ff, compared to commit: d88542d6ba7ca633f7456fb9346ac3cfcff7b70d

🧾 Summary (10% most significant diffs)

| Program | ACIR opcodes (+/-) | % | Circuit size (+/-) | % |
|---|---|---|---|---|
| regression_9271 | +1 ❌ | +33.33% | 0 ➖ | 0.00% |

Full diff report 👇

| Program | ACIR opcodes (+/-) | % | Circuit size (+/-) | % |
|---|---|---|---|---|
| regression_9271 | 4 (+1) | +33.33% | 18 (0) | 0.00% |

Contributor

@github-actions github-actions bot left a comment


⚠️ Performance Alert ⚠️

Possible performance regression was detected for benchmark 'Compilation Time'.
Benchmark result of this commit is worse than the previous benchmark result exceeding threshold 1.20.

| Benchmark suite | Current: e551e90 | Previous: 7408cbe | Ratio |
|---|---|---|---|
| sha512-100-bytes | 1.951 s | 1.553 s | 1.26 |

This comment was automatically generated by workflow using github-action-benchmark.

CC: @TomAFrench

Contributor

@github-actions github-actions bot left a comment


⚠️ Performance Alert ⚠️

Possible performance regression was detected for benchmark 'Opcode count'.
Benchmark result of this commit is worse than the previous benchmark result exceeding threshold 1.10.

| Benchmark suite | Current: e551e90 | Previous: 7408cbe | Ratio |
|---|---|---|---|
| sha512-100-bytes | 22169 opcodes | 13173 opcodes | 1.68 |

This comment was automatically generated by workflow using github-action-benchmark.

CC: @TomAFrench

Contributor

@vezenovm vezenovm left a comment


So it seems you ultimately decided to pass the capacity (when reading the foreign call inputs in the VM) due to the call inputs all being flattened?

```rust
Some(field_iterator.consumed() + capacity)
} else {
    None
```

```rust
let flattened_capacity = field_iterator
```
Contributor


I definitely prefer this approach to that of the parent. It didn't make sense to me that we should have to toggle with_length_prefix for both arrays and slices. As we have the types we should know that a slice always has a capacity prefix and an array never has that prefix.

Contributor Author


Let me explain the difficulties I saw with fully reasoning out which parameters need to be prefixed and which don't. It might be that there is a way to do it; I haven't fully thought it through.

So we have a number of `ForeignCallParam`s, and where one is an `Array` it might come from a `HeapVector` or a `HeapArray`; we can't tell without looking at the type. However, the `PrintableType`s we have are of a different granularity: say we have a `Tuple` as a parameter for the print, and it happens to contain a `Slice` and an `Array` field, interspersed with some others. These are going to appear as separate `ForeignCallParam`s, but to tell which one is the `Slice` and which is the `Array`, we have to unpack the `Tuple` and match it against the parameters.

Now, if one of the `Tuple` fields is itself a `Tuple`, I think it gets flattened out as well, so we can't just iterate over the parameters; we have to recursively unpack the types until they match the fields. It seemed messy.

`decode_` does it by recursing over the type tree while consuming flattened data, so I thought that if I prefixed the vectors, I would know where to expect a prefix and where not to. But that turned out to be rather artificial, as the `Array` itself can contain complex data, with only the top one prefixed.

Then I tried to prefix at the source, and doing so makes much more sense: we attach not some artificial "byte size" but an actual type-dependent vector capacity. That made me finally understand what this capacity even means, which isn't trivial, but at least it depends on the types, not the size of the data, which makes it easier to attach in the interpreter, for example.
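A hypothetical sketch of that recursive decoding scheme, with simplified stand-in types and names (not Noir's actual API), showing how a decoder recurses over a type tree while consuming flattened fields:

```rust
// Simplified stand-ins for the type and value trees, just for the sketch.
#[derive(Debug, PartialEq)]
enum Type {
    Field,
    Tuple(Vec<Type>),
}

#[derive(Debug, PartialEq)]
enum Value {
    Field(u64),
    Tuple(Vec<Value>),
}

// Decode one value of the given type, consuming as many flat fields as the
// type requires; nested tuples recurse, consuming fields in order.
fn decode(fields: &mut impl Iterator<Item = u64>, typ: &Type) -> Value {
    match typ {
        Type::Field => Value::Field(fields.next().expect("ran out of fields")),
        Type::Tuple(types) => {
            Value::Tuple(types.iter().map(|t| decode(fields, t)).collect())
        }
    }
}

fn main() {
    // (1, (2, 3)) arrives as the flat data [1, 2, 3].
    let typ = Type::Tuple(vec![Type::Field, Type::Tuple(vec![Type::Field, Type::Field])]);
    let mut data = [1u64, 2, 3].into_iter();
    let value = decode(&mut data, &typ);
    assert_eq!(
        value,
        Value::Tuple(vec![
            Value::Field(1),
            Value::Tuple(vec![Value::Field(2), Value::Field(3)])
        ])
    );
}
```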

@aakoshh
Contributor Author

aakoshh commented Jul 23, 2025

So it seems you ultimately decided to pass the capacity (when reading the foreign call inputs in the VM) due to the call inputs all being flattened?

The call inputs are not all flattened, although I kind of wish they were, because it's still a bit hard to understand why something is or isn't flattened. Maybe it follows how the variables in the SSA are?

For example, the members of a tuple are passed as separate parameters, but a slice of tuples is passed as a single array.

And then there is the revert data, which is already flattened when the printer sees it, but cannot contain slices. The output of a foreign call also comes flattened. Debug variable content is stored flattened.

I thought that if we include the capacity at the source in the VM, we can limit this special handling to slices in a way that is easy to explain, without radically rewriting the decoding part to match types with fields precisely. I'm not sure if this is an acceptable break in backwards compatibility, but I wanted to throw it out there. Doing it here also limits the scope of the change; for example, debug variables automatically get it.

@aakoshh aakoshh marked this pull request as ready for review July 23, 2025 19:39
@aakoshh aakoshh requested a review from a team July 23, 2025 19:40
@aakoshh aakoshh changed the title fix!: Prefix HeapVector with its length in ForeignCallParams fix!: Prefix HeapVector with its size in ForeignCallParams Jul 23, 2025
@aakoshh aakoshh changed the base branch from af/9271-fix-print-slice to master July 24, 2025 06:57
Contributor

@vezenovm vezenovm left a comment


I'm ok with this change, but I know @TomAFrench wanted to take a look before we merged. Either way we should make sure we do not break any Aztec oracles before merging.

I hacked together an alternative, #9321, that avoids prepending a capacity to slice foreign call inputs, which we could consider as well. It instead removes any padding from the vector after calling into `get_memory_values`.

@aakoshh
Contributor Author

aakoshh commented Jul 25, 2025

Closing in favour of #9321

@aakoshh aakoshh closed this Jul 25, 2025