Support 5 segment source mappings #6795

osa1 · 2024-07-31T11:45:47Z

Support 5-segment source mappings.

Reference: https://github.com/tc39/source-map/blob/main/source-map-rev3.md#proposed-format

…ings

kripken · 2024-08-07T18:07:08Z

What are your plans for using this in practice? I'm not aware of other toolchains doing so, so I'm curious.

osa1 · 2024-08-08T07:10:10Z

When I created this PR I was using the name segment in dart2wasm to name generated code with the source code function that they belong.

However as you note, browsers don't use the name segments in source maps to give names to stack frames, and I'm unable to find where the name segments are used (if they actually are).

So currently I don't have a use case and this isn't a blocker for me. We may still want to merge it for completeness, the source maps spec allows this and there may be some toolchain that makes use of it.

To be able to know when we are generating a source map, make `dart compile wasm` aware of the `--no-source-maps` flag. Note: wasm-opt currently cannot handle the mappings we generate. This will be merged after WebAssembly/binaryen#6794 and WebAssembly/binaryen#6795.

kripken · 2024-08-08T17:23:29Z

I see, thanks for the update. In that case, I think I prefer not to merge this, if we can't test it against a known correct implementation, and don't have a current user. But we can leave the PR open, maybe there are other people that could benefit, and might find this and comment.

…ings

osa1 · 2024-09-16T09:02:49Z

@kripken we now have a use case for this, as described in dart-lang/sdk#56718 (comment).

The TL;DR is stack traces of crashes in optimized dart2wasm apps are logged in an external service which has access to the source map file.

The source map file is used to map code offsets in the stack trace to names.

Optimized dart2wasm apps don't have function names as names section can add ~70% overhead to binaries.

To support this use case we need to add names to mappings.

Would it be possible to merge this?

kripken · 2024-09-16T20:21:48Z

@osa1 If there is a use case then we should review and get this landed, I agree, but I don't think I understand the need there yet.

The standard solution I am aware of for that name problem is to store the name mapping on the side, which can be done by saving the name section in addition to the source map file. One way is to save the entire unstripped binary. Another approach is to save just the names themselves, for which e.g. wasm-opt --symbolmap can be used (that is what Emscripten does).

Is there a reason those existing solutions can't work for your use case?

mkustermann · 2024-09-16T20:59:37Z

In the native world this is done via DWARF debugging information obtained when stripping the production binary (i.e. one obtains a single file that can be used to decode production stack traces). For wasm we don't use DWARF (as that disables certain binaryen optimizations) - so we instead rely on source maps for e.g. line number information.

If the method names came from a symbolmap file, we'd still need to have the source map file as well to get line numbers. It seems very inconvenient to have this metadata distributed across two different files - with different formats, where one needs different tools to decode different components of the frames and then merge this information.

kripken · 2024-09-16T22:21:57Z

@mkustermann Two files does add some complexity, that is true. But in my experience on the Emscripten side that has not been much of an issue in practice. And as for needing separate tools, we have a small script that can call out to the various tools,

https://github.com/emscripten-core/emscripten/blob/main/emsymbolizer.py

That script can then handle a symbol map, a source map, or DWARF.

To be clear, I'm not saying I'm opposed to this PR. If you feel strongly that it is helpful for you then I can be convinced. I just want to make sure you have considered what I think is the simpler option of using the wasm names section as a symbol map.

osa1 · 2024-09-17T08:07:08Z

the simpler option of using the wasm names section as a symbol map

Do you mean simple for binaryen devs or end users? I think the simplicity here is for binaryen but not necessarily for the users of binaryen.

dart2wasm needs to pass the right flags to wasm-opt to generate symbol maps, then Flutter needs to pass the right flags to dart2wasm. (Flutter doesn't give users a way to pass extra compiler flags, so it needs to be updated)

End users will have to deal with 3 files instead of 2, and if they already had the implementation of mapping stack traces to source locations using a separately distributed source map file (as in our use case linked above) they won't be able to reuse much of their code.

mkustermann · 2024-09-17T08:19:59Z

@kripken Maybe as some background. The request originated due to sentry.io (a very popular crash reporting framework) to extending their flutter support to flutter web apps compiled to wasm (see sentry's bug: dart-lang/sdk#56718). They probably have existing support for uploading source maps and things just work for dart2js code but not for dart2wasm code.

So the real complexity comes due to the fact that from the lowest level of the stack (the dart2wasm compiler), to flutter tooling, to sentry tooling to sentry API server uploading to sentry API server symbolization code to all need to deal with the extra wasm symbols map. So it may require e.g. sentry's crash reporting service to change their client tooling & server APIs to allow upload 3 files (wasm, source map, wasm symbolmap), change their server code that symbolizes stacks to use different tools to get different information (function names and line numbers) and then combine that.

And this isn't dart/flutter specific. Anyone using WasmGC & Binaryen will use source maps (because DWARF disables optimizations) and would then need to have to deal with this extra symbolmap file all the way up their stack.

It seems like the source map format supports the function names for that exact reason, so why not take advantage of it and avoid this complexity?

kripken · 2024-09-17T17:10:52Z

Fair enough, as I said, if you feel strongly then I don't object. I'll find time later today to review this in detail.

kripken

Code lgtm % nits. Please add testing.

src/wasm/wasm-binary.cpp

Co-authored-by: Alon Zakai <[email protected]>

osa1 · 2024-09-19T14:08:50Z

@kripken thanks, ptal.

kripken

Code lgtm, so all that is left is testing.

…ings

osa1 · 2024-09-26T12:06:03Z

I believe this should be almost ready now, but I'm not sure how to test changes in module-utils.cpp.

@kripken do we have any tests right now that checks source map info after doing inlining, and after merging modules? The changes in module-utils.cpp will be used when inlining and merging modules, but I can't find where debug info sanity is tested after inlining and merging modules.

kripken · 2024-09-26T17:12:21Z

It looks like the code above your changes to module-utils.cpp was added in f44912b (#6372), which includes tests. See specifically the test code in test/lit/merge/sourcemap.wat* for merging.

I don't see specific tests for inlining + source maps, but one could be added alongside e.g. test/lit/debug/source-map-smearing.wast, by adding --inlining. Though perhaps the test can be even simpler, without writing a binary in the middle? Just adding ;;@ annotations to an inlining test like test/lit/passes/inlining_all-features.wast should check that we move the annotations around properly.

…ings

osa1 · 2024-09-27T09:45:53Z

Thanks @kripken.

This should be ready for review now.

I've added a separate test for inlining instead of modifying one of the existing tests, as there weren't any existing test that checks specifically source maps AFAICS.

The renumbering in lines and columns in existing tests is to make sure each source map annotation has a unique line and column number, to make sure we don't accidentally use numbers of a wrong annotation.

The TODO in the code is copy/paste from a few lines above. When writing strings to the .map files we don't escape characters according to the JSON spec. It's an existing issue.

src/binaryen-c.cpp

src/ir/module-utils.h

src/parser/contexts.h

src/ir/module-utils.cpp

src/passes/Print.cpp

src/wasm.h

…ings

osa1 · 2024-10-01T08:25:08Z

@kripken I've addressed the comments but I'm not sure how to update the C API with an optional uint32_t parameter, see my question above.

…ings

src/ir/module-utils.cpp

The order is specified in https://github.com/tc39/source-map/blob/main/source-map-rev3.md#proposed-format This was causing binaryen to fail after WebAssembly/binaryen#6795

osa1 added 2 commits July 31, 2024 11:46

Handle single-segment source mapping in source map header decoder

bc6dd69

Support 5-segment source mappings

a12e4ad

osa1 changed the title ~~5 segment source mappings~~ Implement 5 segment source mappings Jul 31, 2024

osa1 changed the title ~~Implement 5 segment source mappings~~ Support 5 segment source mappings Jul 31, 2024

osa1 added 3 commits August 7, 2024 09:11

Merge remote-tracking branch 'origin/main' into 5_segment_source_mapp…

363a03e

…ings

Revert formatting change

d8555c5

Fix formatting

3b986eb

osa1 added 2 commits September 16, 2024 10:55

Merge remote-tracking branch 'origin/main' into 5_segment_source_mapp…

2015e88

…ings

Revert debug change

9465b2e

kripken reviewed Sep 17, 2024

View reviewed changes

src/wasm/wasm-binary.cpp Outdated Show resolved Hide resolved

src/wasm/wasm-binary.cpp Outdated Show resolved Hide resolved

Apply suggestions from code review

4076b7a

Co-authored-by: Alon Zakai <[email protected]>

osa1 requested a review from kripken September 19, 2024 14:08

kripken reviewed Sep 19, 2024

View reviewed changes

osa1 added 5 commits September 20, 2024 09:54

Merge remote-tracking branch 'origin/main' into 5_segment_source_mapp…

b63ec19

…ings

Add test

6171fe9

Implement parsing and printing

0a9d251

Format

6858dac

Fix parsing

4111ce8

osa1 added 5 commits September 27, 2024 10:55

Update merge test

25e1d80

Update metadce test

95e24c7

Add inlining test

5dac139

Merge remote-tracking branch 'origin/main' into 5_segment_source_mapp…

7d6529d

…ings

Revert formatting change

836d387

osa1 marked this pull request as ready for review September 27, 2024 09:45

osa1 marked this pull request as draft September 27, 2024 09:56

Update lit output

1c4157c

osa1 marked this pull request as ready for review September 27, 2024 10:02

kripken reviewed Sep 30, 2024

View reviewed changes

osa1 added 4 commits October 1, 2024 09:31

Merge remote-tracking branch 'origin/main' into 5_segment_source_mapp…

a278429

…ings

Address some of the comments

0098182

Formatting

415bb88

Remove local

2f8cf68

osa1 added 2 commits October 1, 2024 22:03

Merge remote-tracking branch 'origin/main' into 5_segment_source_mapp…

b1dff53

…ings

Revert C API changes

8a9ee2d

osa1 requested a review from kripken October 1, 2024 20:06

kripken reviewed Oct 1, 2024

View reviewed changes

src/ir/module-utils.cpp Show resolved Hide resolved

osa1 added 2 commits October 1, 2024 22:40

Update comment

c54b226

Fix typo

a721d72

kripken approved these changes Oct 1, 2024

View reviewed changes

kripken merged commit 347fc8a into WebAssembly:main Oct 1, 2024
13 checks passed

osa1 deleted the 5_segment_source_mappings branch October 1, 2024 21:45

brendandahl mentioned this pull request Oct 2, 2024

Fix ordering of source maps fields. emscripten-core/emscripten#22670

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Support 5 segment source mappings #6795

Support 5 segment source mappings #6795

osa1 commented Jul 31, 2024 •

edited

Loading

kripken commented Aug 7, 2024

osa1 commented Aug 8, 2024

kripken commented Aug 8, 2024

osa1 commented Sep 16, 2024

kripken commented Sep 16, 2024

mkustermann commented Sep 16, 2024

kripken commented Sep 16, 2024

osa1 commented Sep 17, 2024

mkustermann commented Sep 17, 2024 •

edited

Loading

kripken commented Sep 17, 2024

kripken left a comment

osa1 commented Sep 19, 2024

kripken left a comment

osa1 commented Sep 26, 2024

kripken commented Sep 26, 2024

osa1 commented Sep 27, 2024

osa1 commented Oct 1, 2024

Support 5 segment source mappings #6795

Support 5 segment source mappings #6795

Conversation

osa1 commented Jul 31, 2024 • edited Loading

kripken commented Aug 7, 2024

osa1 commented Aug 8, 2024

kripken commented Aug 8, 2024

osa1 commented Sep 16, 2024

kripken commented Sep 16, 2024

mkustermann commented Sep 16, 2024

kripken commented Sep 16, 2024

osa1 commented Sep 17, 2024

mkustermann commented Sep 17, 2024 • edited Loading

kripken commented Sep 17, 2024

kripken left a comment

Choose a reason for hiding this comment

osa1 commented Sep 19, 2024

kripken left a comment

Choose a reason for hiding this comment

osa1 commented Sep 26, 2024

kripken commented Sep 26, 2024

osa1 commented Sep 27, 2024

osa1 commented Oct 1, 2024

osa1 commented Jul 31, 2024 •

edited

Loading

mkustermann commented Sep 17, 2024 •

edited

Loading