Avoid emitting a block in the binary format when it has no name #4912

kripken · 2022-08-17T18:09:24Z

We already did this if the block was a child of a control flow structure, which is
the common case (see the newly added comment around that code, which clarifies
why). This does the same for all other blocks. This is simple to do and a minor
optimization, but the main benefit is to make our handling of blocks
uniform: after this, we never emit a block with no name. This will make 1a
non-nullable locals easier to handle (since they will be able to assume that
property; and not emitting such blocks avoids some work to handle non-nullable
locals in them).
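
To make the idea concrete, here is a minimal sketch on a toy IR. It is not Binaryen's actual writer: `ToyNode`, `toyLeaf`, `toyBlock`, and `emitToy` are hypothetical names invented for illustration, and the sketch ignores details the real writer has to care about (block types, unreachable code). It only shows the rule described above: a block with no name cannot be a branch target, so its children can be emitted directly.

```cpp
#include <cassert>
#include <string>
#include <utility>
#include <vector>

// Toy IR (an assumption for illustration, NOT Binaryen's classes): a node is
// either a block with an optional name and children, or a leaf instruction.
struct ToyNode {
  bool isBlock = false;
  std::string name; // Empty means "no name": nothing can branch to it.
  std::string op;   // Leaf mnemonic, e.g. "nop". Unused for blocks.
  std::vector<ToyNode> children;
};

inline ToyNode toyLeaf(std::string op) {
  ToyNode node;
  node.op = std::move(op);
  return node;
}

inline ToyNode toyBlock(std::string name, std::vector<ToyNode> children) {
  ToyNode node;
  node.isBlock = true;
  node.name = std::move(name);
  node.children = std::move(children);
  return node;
}

// Appends the emitted "instructions" to `out`. A block without a name is
// flattened: only its children are emitted, with no block/end pair.
inline void emitToy(const ToyNode& node, std::vector<std::string>& out) {
  if (!node.isBlock) {
    out.push_back(node.op);
    return;
  }
  const bool needsBlock = !node.name.empty();
  if (needsBlock) {
    out.push_back("block $" + node.name);
  }
  for (const auto& child : node.children) {
    emitToy(child, out);
  }
  if (needsBlock) {
    out.push_back("end");
  }
}
```

In this toy model, an unnamed outer block holding a `nop` and a named inner block is emitted as the `nop` followed by the inner block alone; the outer block leaves no trace in the output.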

aheejin

Do we need to keep visitPossibleBlockContents and visitBlock separate? The only difference here is sometimes we want to be more aggressive (use BranchSeeker) and sometimes we don't. Can we have a single function with a parameter? Probably we can't modify the signature of PostWalker::visitBlock, but maybe we can add a seekBranch parameter to visitPossibleBlockContents and make visitBlock just call it with the parameter false?

aheejin · 2022-08-17T20:30:39Z

src/wasm-stack.h

+  // uses, it is equivalent to not having one). This is potentially quadratic,
+  // but it is extremely rare to have recursion on this function, since it is
+  // limited by the number of non-block control flow structures (the places that
+  // call here).


> This is potentially quadratic,

Isn't this always quadratic, because we traverse all children?

> it is extremely rare to have recursion on this function, since it is limited by the number of non-block control flow structures (the places that call here).

How do we have a recursion here? Can BranchSeeker::has call back into BinaryenIRWriter::visitPossibleBlockContents or something?

Hmm, good points, my text was not accurate. Yes, it's always quadratic, just on smaller numbers. By "recursion" I really meant larger numbers (where quadratic time gets bad). I improved the comment now.

kripken · 2022-08-17T20:59:08Z

> Can we have a single function with a parameter?

Interesting, maybe... but thinking about it, I'm not sure it's worth it. visitBlock and visitPossibleBlockContents already had some code duplication, because both iterate on the children, but in slightly different ways. And while we could share more code now after this PR, it's really just one line - to check if the block has a name. Adding a parameter seekBranch whether to scan the contents would already make it about the same size, I think, and not necessarily more readable.

aheejin · 2022-08-17T21:03:02Z

src/wasm-stack.h

+  // uses, it is equivalent to not having one). Scanning the children of the
+  // block means that this takes quadratic time, but it will be N^2 for a fairly
+  // small N since the number of nested non-block control flow structures tends
+  // to be very reasonable.


I'm still not sure... Doesn't BranchSeeker::has mostly only check branches, or more precisely, name uses defined by DELEGATE_FIELD_SCOPE_NAME_USE? (Currently br, br_table, br_on, rethrow, and try (due to delegate) have this field.)

> but it will be N^2 for a fairly small N since the number of nested non-block control flow structures tends to be very reasonable.

I'm not sure what you mean by "non-block control flow structures". Do you mean branches here?

Also, it is N^2 because we need to check every child, and that N includes all children. We don't know whether it is a branch or not before checking it. So I'm also not sure what you mean by "a fairly small N".

I'm not suggesting the quadratic behavior is bad; it is preexisting anyway and it is for a reason and as you said it doesn't cause unreasonable slowdown. I'm just not very good at understanding the comments.

Non-block control flow structures are: If, Try, Loop. In each of their arms they call visitPossibleBlockContents (to try to avoid emitting a block there).

By small N I mean that there are few nested non-block control flow structures. This would be bad:

(if .. (if .. (if ..N such ifs.. .. ..innermost child..

Those nested ifs lead to the innermost child being scanned N times, where N is the number of those ifs (since each if scans all of its children). So the total time is O(N*M), where M is the total number of children of the first if. But N is small in the real world: such if stacks are very rare (unlike block stacks).
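
As a rough worked example of that count, the sketch below models only the chain of nested ifs described above. It is a hypothetical cost model, not Binaryen code, and `totalNodesScanned` is a name made up here. It assumes each if scans every node beneath it exactly once, so a chain of N ifs scans N + (N-1) + ... + 1 = N(N+1)/2 nodes in total.

```cpp
#include <cassert>
#include <cstddef>

// Hypothetical cost model (an assumption, not Binaryen code): a chain of
// `depth` nested ifs that ends in a single innermost child. Each if scans
// its whole subtree once before its arm is emitted.
inline std::size_t totalNodesScanned(std::size_t depth) {
  std::size_t scanned = 0;
  for (std::size_t level = 0; level < depth; ++level) {
    // Below the if at `level` there are (depth - level - 1) more ifs plus
    // the innermost child, so (depth - level) nodes are scanned.
    scanned += depth - level;
  }
  return scanned;
}
```

For a chain of 3 ifs this gives 3 + 2 + 1 = 6 scanned nodes, and the innermost child accounts for 3 of them, once per enclosing if.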

aheejin

> Interesting, maybe... but thinking about it, I'm not sure it's worth it. visitBlock and visitPossibleBlockContents already had some code duplication, because both iterate on the children, but in slightly different ways. And while we could share more code now after this PR, it's really just one line - to check if the block has a name. Adding a parameter seekBranch whether to scan the contents would already make it about the same size, I think, and not necessarily more readable.

Hmm, yeah, I thought they were similar, but as you said, visitBlock has additional routines for handling deeply nested blocks... Never mind then.

tlively · 2022-08-17T21:42:28Z

src/wasm-stack.h

+  // block means that this takes quadratic time, but it will be N^2 for a fairly
+  // small N since the number of nested non-block control flow structures tends
+  // to be very reasonable.


We could still make this cheaper and also improve the later case by running a pass to remove unused names in linear time, then assuming we've already taken care of it here. We already have a pass that does this, don't we?

Yes, RemoveUnusedNames is run when optimizing. So this only matters for the unoptimized case. But it's still nice to handle that since that includes a simple roundtrip with wasm-opt.

> since that includes a simple roundtrip with wasm-opt

I don't understand what you mean here. I see that this quadratic checking will only produce useful results if we haven't already optimized, but it would still be cheaper and produce better results to run the RemoveUnusedNames pass unconditionally here no matter what other optimizations we are doing.

Yeah, running the pass might be fine. We'd need to do some more work to be careful to avoid noticeable side effects, as right now binary writing does not modify the Module data, but running that pass would. But I think it could be done probably.

Looking at RemoveUnusedBrs, it looks like it does more optimizations than we would actually want for this use case. What Module data are you thinking about, though? That pass is function parallel, so I don't think it should change any Module-level state.

I meant any changes to the Module or its contents. Right now, writing a module does not modify the input in any way, but this would make changes to blocks etc. Still, it might be ok...

Oh, I see. Yeah, I think that's ok since there shouldn't really be anything happening after writing. This PR lgtm if you'd rather look at that as a follow-up.

Yeah, let's leave that as a separate topic. It's the existing behavior before this PR and I'd rather not change it here so each PR is focused.
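
For readers unfamiliar with the linear-time alternative discussed in this thread, here is a minimal sketch of the general technique on a toy IR. It is not the actual RemoveUnusedNames pass: `ToyExpr` and `removeUnusedNamesToy` are hypothetical names, and the sketch assumes labels are unique within the function. One walk collects every branch target and a second walk clears block names that were never targeted, so each node is visited a constant number of times. Note that it mutates its input, which is the side effect raised above.

```cpp
#include <cassert>
#include <string>
#include <unordered_set>
#include <vector>

// Toy IR (an assumption for illustration, NOT Binaryen's classes).
struct ToyExpr {
  enum Kind { Block, Br, Nop };
  Kind kind = Nop;
  std::string name; // Block: its label (may be empty). Br: its target.
  std::vector<ToyExpr> children;
};

// First walk: record every label that some branch targets.
inline void collectTargets(const ToyExpr& expr,
                           std::unordered_set<std::string>& used) {
  if (expr.kind == ToyExpr::Br) {
    used.insert(expr.name);
  }
  for (const auto& child : expr.children) {
    collectTargets(child, used);
  }
}

// Second walk: drop the label of any block that nothing branches to.
inline void clearUnused(ToyExpr& expr,
                        const std::unordered_set<std::string>& used) {
  if (expr.kind == ToyExpr::Block && used.count(expr.name) == 0) {
    expr.name.clear();
  }
  for (auto& child : expr.children) {
    clearUnused(child, used);
  }
}

// Two walks over the tree, assuming labels are unique in the function.
inline void removeUnusedNamesToy(ToyExpr& root) {
  std::unordered_set<std::string> used;
  collectTargets(root, used);
  clearUnused(root, used);
}
```

After such a pass, a writer could check only whether a block has a name, without rescanning its contents for branches.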

Previously the wat parser would turn this input: (block (nop) ) into something like this: (block $block17 (nop) ) It just added a name all the time, in case the block is referred to by an index later even though it doesn't have a name. This PR makes us roundtrip more precisely by not adding such names: if there was no name before, and there is no break by index, then do not add a name. In addition, this will be useful for non-nullable locals since whether a block has a name or not matters there. Like #4912, this makes us more regular in our usage of block names.
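
The naming rule in that description can be sketched as a small decision function. This is a hypothetical helper, not the real parser's API: `labelAfterParsing`, its parameters, and the counter-based fresh name are assumptions made for illustration, with the `block17` style borrowed from the example above.

```cpp
#include <cassert>
#include <string>

// Hypothetical helper (an assumption, not the real parser's API): decides
// which label a block carries after parsing text input.
inline std::string labelAfterParsing(const std::string& nameInInput,
                                     bool targetedByIndex,
                                     unsigned& freshCounter) {
  if (!nameInInput.empty()) {
    // The input already named the block, so keep that name.
    return nameInInput;
  }
  if (targetedByIndex) {
    // A break refers to this block by index, so it needs some name.
    return "block" + std::to_string(freshCounter++);
  }
  // No name before and no break by index: do not add a name.
  return "";
}
```

Under this rule the input (block (nop) ) keeps an unnamed block, and a fresh name is only spent on blocks that a break by index actually targets.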

kripken added 4 commits August 17, 2022 09:40

yolo

85e1821

yolo

9cc2ed5

fix

f1f7567

fix

3834dc1

kripken requested review from tlively and aheejin August 17, 2022 18:09

kripken added 2 commits August 17, 2022 11:10

typo

3d65da8

update test

e123cdd

aheejin reviewed Aug 17, 2022

kripken added 2 commits August 17, 2022 13:52

Merge remote-tracking branch 'origin/main' into noblock

e1b6202

improve comment

a91f903

aheejin reviewed Aug 17, 2022

aheejin approved these changes Aug 17, 2022

tlively reviewed Aug 17, 2022

kripken merged commit 613fadc into main Aug 18, 2022

kripken deleted the noblock branch August 18, 2022 16:10

kripken mentioned this pull request Aug 19, 2022

Avoid adding new unneeded names to blocks in text roundtripping #4943

Merged
