[wgsl-in] Implement module-level scoping #2075

SparkyPotato · 2022-10-04T19:15:57Z

Try 2 :)

This PR separates parsing from lowering, by generating an AST, which desugars as much as possible down to something like Naga IR.

The AST is then used to resolve identifiers while lowering to Naga IR.

SparkyPotato · 2022-10-04T19:24:57Z

@jimblandy Would be great if you could a look at the AST definition!

Also, you were talking about how 2 passes would be all that was necessary.
However, lowering to Naga IR requires that the dependencies of each module-level declaration are known beforehand, as we need to (reverse) topologically sort the declaration list so that all dependencies are processed before their dependents.

The only way I can think of doing this in two passes is to maintain a set of the raw identifiers referenced by each declaration during parsing, and then using those to sort the declarations, but I'm not quite sure if this will actually be faster than resolving on the same IR, and then sorting.

jimblandy · 2022-10-04T21:56:35Z

Here's how I was thinking it could work (I haven't actually tried this yet):

For the first pass, we parse mostly as we do now, except that identifiers in expressions, types, statements, etc. are represented by placeholders that just store the name. Except for the placeholders, we try to produce something as close to the crate IR as possible immediately.

The placeholders mean that we will have types that look very much like crate::Expression, etc. except that certain variants that assume knowledge of definitions (LocalVariable, obviously, but also things like AccessIndex, since we don't know struct type definitions) hold identifiers instead of handles. Some cases will need to change in more profound ways; for example, we can't tell whether S(x) is a Call or a Compose until we know the definition of S. But as much as possible, the unresolved types should resemble the crate types, so that people familiar with the rest of Naga can make assumptions.
For the resolution pass, starting with the entry points, start building true crate::{Function, EntryPoint, Type, ...} values. When we hit a placeholder, check to see if we already have a previously-resolved definition for that name. If so, use it. Otherwise, immediately recurse on resolving that name (which adds the resulting fully-resolved definition to the Module under construction), and then resume processing the placeholder, whose definition we now know. When we're done, add our completed definition to the Module.

This amounts to a depth-first traversal of the DAG of definitions, where we "mark nodes as visited" by adding resolved definitions to the Module. This ensures that definitions appear in the Module in dependency order, in the process of replacing placeholders with Handles.

After we've processed the entry points, we may want to just walk the top-level unresolved definitions and finish off anything unvisited. Such definitions could be safely dropped from the module, since they're unreachable from any entry point (and we should probably produce warnings about this), but it's very useful for testing to have everything in the input appear in the Module. Maybe this step would be optional.

Naturally, we will need to check for cyclic references at the point of recursion.

As I say, I haven't tried this, so there may be aspects of the problem I've missed - but it seems like something along these lines ought to be possible somehow.

jimblandy · 2022-10-04T23:04:56Z

You should use Arena, UniqueArena, and Handle for the unresolved representation. That is one of the tricks Naga uses for performance: broadly speaking, caches and prefetchers are much happier with arrays than heap allocation, especially if you can phrase some computation as an iteration over the array.

Otherwise, try to make the change as undisruptive and limited as you can while still preserving legibility.

SparkyPotato · 2022-10-05T07:29:23Z

Recursion was something I thought of, but processing each declaration does use quite a large amount of stack, so we might end up stack overflowing quite quick, especially on Windows, where the default stack size is 2 MB.

We can probably use stacker, which is what rustc uses.

About the warnings, I do plan to add error recovery to the frontend so we can output more than one error, and will add warnings for unused items there (I did have this implemented in my other frontend, it isn't too big of a change).

jimblandy · 2022-10-05T18:48:52Z

Using arenas instead of trees for expressions should reduce the stack depth, though, because you can just iterate over the arena directly. And with a little care we could avoid recursing on statements at all - I think only Call and Atomic need resolution. So the depth would only be determined by the number of references we're in the midst of resolving, and not so much by the interior structure of the functions themselves.

jimblandy · 2022-10-05T18:53:52Z

stacker sounds 1) amazing and 2) wildly unsafe and definitely not something we want to ship in a browser. So that's one easy solution out the window. I think stack depth is a legitimate concern here.

SparkyPotato · 2022-10-05T19:05:54Z

I think keeping a set of (textual) dependencies used by each declaration during parsing, and then using that to post-order depth-first sort would be the easiest solution, and it shouldn't really impact performance too much.

The post-order sorting with cycle detection does require recursion, however it should require much less stack space than resolving the function.

I have also implemented parsing and finalized the AST structures, would be great if you could take a look, and tell me if any changes are needed :) (I have removed parts of the code that aren't necessary for parsing, will modify and re-add them later)

jimblandy · 2022-10-05T21:11:01Z

I think keeping a set of (textual) dependencies used by each declaration during parsing, and then using that to post-order depth-first sort would be the easiest solution, and it shouldn't really impact performance too much.

If I understand what you mean here, this seems reasonable, too: basically, just treat every identifier use as an edge in a DAG constructed during parsing. It is definitely preferable to avoid the recursion. I'll take a look soon.

SparkyPotato · 2022-10-22T10:31:14Z

I did have to keep track of locals during the initial parse phase, to fix an issue with code like this:

fn a() {
    let a = b();
}

fn b() -> i32 {
    let a = 20;
    return a;
}

If we just kept a set of textual dependencies, fn a being shadowed by let a would create a cyclic dependency error.

i509VCB · 2022-10-28T04:52:48Z

I'm curious if the ast generated here is publicly accessible. I do have a case where I need to look at the shader's ast to figure out how to setup bind groups

SparkyPotato · 2022-10-28T13:33:40Z

I'm curious if the ast generated here is publicly accessible. I do have a case where I need to look at the shader's ast to figure out how to setup bind groups

It's not public at the moment, but I can make it so.

However, can you not just use the Module that the front end produces instead? It will probably have more information than the semi-AST IR used here.

SparkyPotato · 2022-11-05T09:21:21Z

@jimblandy This has reached feature parity with the existing frontend, and passes all the functionality-based tests. The diagnostics snapshots don't pass yet, but fixing those shouldn't change too much. You can start reviewing now.

Surprisingly, performance has actually improved. The current frontend consistently takes around 3.6 ms for the front/wgsl benchmark, while this does it in around 2.9 ms (a 20% decrease).

The only difference in output is that I don't initialize LocalVariables with constants since I'm not too sure if that added complexity is actually worth it. I can add it back if it's something wanted, though.

teoxoy · 2022-11-14T16:59:11Z

Hey @SparkyPotato, thanks for this PR!

There seem to be quite a lot of copy-paste "changes" (code being moved around) that make it difficult to know what actually changed. Would it be possible for you to undo as many code moves as possible to make it easier to review this?

Right now, we are looking at having to review 11943 LOC (without test snapshots) which I don't consider particularly "reviewable".

SparkyPotato · 2022-11-14T17:57:09Z

@teoxoy I've moved as much as I think possible back (parsing + errors).

construction.rs hasn't been moved because I have made quite a few changes to it, and it also needs to be in the lower module for privacy reasons.

This brings the overall diff of src/front/wgsl to +5038 -3348.

Do tell me if there's anything else I need to do to make reviewing easier!

teoxoy · 2022-11-14T18:19:36Z

Thanks for reducing the diff, but it still looks like quite a lot of changes which will make browsing the history of the files challenging.

Let's move the code in lower/mod.rs to mod.rs and lower/construction.rs to construction.rs (with existing code where it previously was).

After this, the changes should be reviewable and after the review we can see how to proceed with new files/folders (and moving code around).

SparkyPotato · 2022-11-15T11:40:59Z

Got it down to +4495 -2811, not sure if anything more is possible.

teoxoy

Thanks for reducing the diff!
I took a brief look around the code and left a few comments.

General comment
Only user defined structs should have a crate::Type.name (some of the backends now generate extra type aliases for built-in types).

benches/criterion.rs

tests/in/lexical-scopes.wgsl

tests/wgsl-errors.rs

src/front/wgsl/construction.rs

src/lib.rs

SparkyPotato · 2022-11-16T11:00:51Z

Only user defined structs should have a crate::Type.name (some of the backends now generate extra type aliases for built-in types).

Wouldn't this lead to worse validation errors, where something like Vec { kind: Float, width: 4 } would be used instead of just vec4<f32>?

teoxoy · 2022-11-16T12:02:19Z

Wouldn't this lead to worse validation errors, where something like Vec { kind: Float, width: 4 } would be used instead of just vec4<f32>?

I think for errors we can have a method that gives us the proper name. Also, considering we have other front-ends that don't use the same syntax (i.e. vec4<f32>) for types, we should avoid passing those through IR.

SparkyPotato · 2022-11-16T12:33:16Z

I think for errors we can have a method that gives us the proper name. Also, considering we have other front-ends that don't use the same syntax (i.e. vec4<f32>) for types, we should avoid passing those through IR.

Done!

teoxoy · 2022-11-16T13:09:27Z

Thanks! There seem to be a few more places where we still set the name:

5 in Lowerer.construct
1 in ExpressionContext.resolve_type

SparkyPotato · 2022-11-16T14:25:07Z

@teoxoy I have done a few things:

Added Typifier::get_handle and used that in ExpressionContext. The GLSL frontend has not been edited.
Moved ensure_type_exists to OutputContext, and used that everywhere.
Created a sort of workaround for two-stage borrows to avoid the need to clone TypeInner - this does lead to code duplication, so let me know if you can think of a better solution.

teoxoy · 2022-11-16T14:41:34Z

Added Typifier::get_handle and used that in ExpressionContext. The GLSL frontend has not been edited.

Feel free to change the GLSL frontend as well.

Moved ensure_type_exists to OutputContext, and used that everywhere.

We can make use of ensure_type_exists in a few more places:

2 in ExpressionContext.create_zero_value_constant
1 in Lowerer.construct

Created a sort of workaround for two-stage borrows to avoid the need to clone TypeInner - this does lead to code duplication, so let me know if you can think of a better solution.

Can't think of doing it any other way right now since we match on those TypeInners but will keep it in mind.

Thanks for the updates!

src/front/wgsl/mod.rs

teoxoy · 2022-11-16T14:55:42Z

The only difference in output is that I don't initialize LocalVariables with constants since I'm not too sure if that added complexity is actually worth it. I can add it back if it's something wanted, though.

from #2075 (comment)

I also noticed this by looking at the test snapshots. I think it should be fine for now considering we'll have to deal with const expressions differently soon.

SparkyPotato · 2022-11-16T15:11:57Z

All changes done.

src/front/mod.rs

jimblandy · 2023-01-10T00:13:40Z

Hmm. This drops the name from the type:

type T = i32;
const y: T = 1;

SparkyPotato · 2023-01-10T05:17:11Z

Hmm. This drops the name from the type:
type T = i32;
const y: T = 1;

Yeah, we discard type aliases as soon as they are resolved. Using type alias names when possible is quite hard, even rustc doesn't do it.

Solutions to fixing that include:

Duplicating the type with a different name. This would require TypeInner: Clone, however.
Creating a TypeInner::Alias variant. This would work but I don't think type aliases should be a part of the IR.

jimblandy · 2023-01-11T21:22:14Z

@teoxoy Could you give [edited] 48122564 a review?

teoxoy · 2023-01-12T14:06:13Z

LGTM, I think this should be ready to merge now.

jimblandy

All right, let's go.

Fixes gfx-rs#1745: Support out-of-order module scope declarations in WGSL Fixes gfx-rs#1044: Forbid local variable shadowing in WGSL Fixes gfx-rs#2076: [wgsl-in] no error for duplicated type definition Fixes gfx-rs#2071: Global item does not support 'const' Fixes gfx-rs#2105: [wgsl-in] Type aliases for a vecN<T> doesn't work when constructing vec from a single argument Fixes gfx-rs#1775: Referencing a function without a return type yields an unknown identifier error. Fixes gfx-rs#2089: Error span reported on the declaration of a variable instead of its use Fixes gfx-rs#1996: [wgsl-in] Confusing error: "expected unsigned/signed integer literal, found '1'" Separate parsing from lowering by generating an AST, which desugars as much as possible down to something like Naga IR. The AST is then used to resolve identifiers while lowering to Naga IR. Co-authored-by: Teodor Tanasoaia <[email protected]> Co-authored-by: Jim Blandy <[email protected]>

vicary · 2023-01-13T05:56:38Z

Thanks for the hard work guys!

When would be the next release date with this included? I really want to see this land in wgpu and Deno!

jimblandy · 2023-01-13T07:05:33Z

@cwfitzgerald is planning the next release. Connor?

cwfitzgerald · 2023-01-13T08:04:27Z

25th of this month!

vicary · 2023-01-13T10:31:37Z

@cwfitzgerald That's exciting, thanks!

Make changes suggested in #2075, but put off to a separate PR because they would interfere with reviewing the change: - Split the new WGSL front end into modules in a logical way. - Rename `Parser` to `Frontend`.

haoyunfeix mentioned this pull request Oct 9, 2022

[webgpu] Update shader to support non module-level scoping function tensorflow/tfjs#6918

Merged

vicary mentioned this pull request Oct 13, 2022

Global item does not support 'const' #2071

Closed

SparkyPotato changed the title ~~Implement module-level scoping~~ [wgsl-in] Implement module-level scoping Nov 5, 2022

SparkyPotato marked this pull request as ready for review November 5, 2022 09:21

teoxoy requested changes Nov 15, 2022

View reviewed changes

benches/criterion.rs Outdated Show resolved Hide resolved

tests/in/lexical-scopes.wgsl Outdated Show resolved Hide resolved

tests/wgsl-errors.rs Show resolved Hide resolved

src/front/wgsl/construction.rs Outdated Show resolved Hide resolved

src/lib.rs Outdated Show resolved Hide resolved

teoxoy reviewed Nov 16, 2022

View reviewed changes

src/front/wgsl/mod.rs Outdated Show resolved Hide resolved

src/front/wgsl/mod.rs Outdated Show resolved Hide resolved

jimblandy mentioned this pull request Dec 27, 2022

[hlsl-out] clear named_expressions inserted by duplicated blocks #2116

Merged

jimblandy reviewed Dec 27, 2022

View reviewed changes

src/front/mod.rs Outdated Show resolved Hide resolved

jimblandy requested a review from teoxoy January 11, 2023 21:20

jimblandy force-pushed the module-scope branch 2 times, most recently from 56e1acd to e1a627f Compare January 12, 2023 00:22

teoxoy approved these changes Jan 12, 2023

View reviewed changes

teoxoy requested a review from jimblandy January 12, 2023 14:06

jimblandy approved these changes Jan 12, 2023

View reviewed changes

jimblandy force-pushed the module-scope branch from 5d1650e to 1976586 Compare January 12, 2023 17:33

jimblandy enabled auto-merge (rebase) January 12, 2023 17:34

jimblandy merged commit 6035b07 into gfx-rs:master Jan 12, 2023

SparkyPotato mentioned this pull request Jan 14, 2023

[wgsl-in] Split into multiple files #2207

Merged

teoxoy mentioned this pull request Jan 23, 2023

Update test snapshots #2219

Merged

This was referenced Jan 29, 2023

LayersModel#predict() results in all zeros when using WebGPU backend in Deno tensorflow/tfjs#6842

Closed

Update wgpu for TersorflowJS WebGPU support denoland/deno#17580

Closed

SparkyPotato deleted the module-scope branch February 1, 2023 17:40

jimblandy mentioned this pull request Feb 17, 2023

Workgroup uniform load #2201

Merged

teoxoy mentioned this pull request Apr 4, 2023

LocalVariable::init is barely used gfx-rs/wgpu#4488

Open

teoxoy mentioned this pull request Oct 9, 2023

Implement const-expressions (phase 2) #2309

Merged

SparkyPotato mentioned this pull request Dec 5, 2022

Allow shadowing of predeclared types and built-in functions gfx-rs/wgpu#4406

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[wgsl-in] Implement module-level scoping #2075

[wgsl-in] Implement module-level scoping #2075

SparkyPotato commented Oct 4, 2022 •

edited by teoxoy

Loading

SparkyPotato commented Oct 4, 2022

jimblandy commented Oct 4, 2022

jimblandy commented Oct 4, 2022

SparkyPotato commented Oct 5, 2022

jimblandy commented Oct 5, 2022 •

edited

Loading

jimblandy commented Oct 5, 2022

SparkyPotato commented Oct 5, 2022 •

edited

Loading

jimblandy commented Oct 5, 2022

SparkyPotato commented Oct 22, 2022

i509VCB commented Oct 28, 2022

SparkyPotato commented Oct 28, 2022

SparkyPotato commented Nov 5, 2022

teoxoy commented Nov 14, 2022

SparkyPotato commented Nov 14, 2022 •

edited

Loading

teoxoy commented Nov 14, 2022

SparkyPotato commented Nov 15, 2022

teoxoy left a comment •

edited

Loading

SparkyPotato commented Nov 16, 2022

teoxoy commented Nov 16, 2022

SparkyPotato commented Nov 16, 2022

teoxoy commented Nov 16, 2022

SparkyPotato commented Nov 16, 2022 •

edited

Loading

teoxoy commented Nov 16, 2022

teoxoy commented Nov 16, 2022

SparkyPotato commented Nov 16, 2022

jimblandy commented Jan 10, 2023 •

edited

Loading

SparkyPotato commented Jan 10, 2023

jimblandy commented Jan 11, 2023 •

edited

Loading

teoxoy commented Jan 12, 2023

jimblandy left a comment

vicary commented Jan 13, 2023

jimblandy commented Jan 13, 2023

cwfitzgerald commented Jan 13, 2023

vicary commented Jan 13, 2023

[wgsl-in] Implement module-level scoping #2075

[wgsl-in] Implement module-level scoping #2075

Conversation

SparkyPotato commented Oct 4, 2022 • edited by teoxoy Loading

SparkyPotato commented Oct 4, 2022

jimblandy commented Oct 4, 2022

jimblandy commented Oct 4, 2022

SparkyPotato commented Oct 5, 2022

jimblandy commented Oct 5, 2022 • edited Loading

jimblandy commented Oct 5, 2022

SparkyPotato commented Oct 5, 2022 • edited Loading

jimblandy commented Oct 5, 2022

SparkyPotato commented Oct 22, 2022

i509VCB commented Oct 28, 2022

SparkyPotato commented Oct 28, 2022

SparkyPotato commented Nov 5, 2022

teoxoy commented Nov 14, 2022

SparkyPotato commented Nov 14, 2022 • edited Loading

teoxoy commented Nov 14, 2022

SparkyPotato commented Nov 15, 2022

teoxoy left a comment • edited Loading

Choose a reason for hiding this comment

SparkyPotato commented Nov 16, 2022

teoxoy commented Nov 16, 2022

SparkyPotato commented Nov 16, 2022

teoxoy commented Nov 16, 2022

SparkyPotato commented Nov 16, 2022 • edited Loading

teoxoy commented Nov 16, 2022

teoxoy commented Nov 16, 2022

SparkyPotato commented Nov 16, 2022

jimblandy commented Jan 10, 2023 • edited Loading

SparkyPotato commented Jan 10, 2023

jimblandy commented Jan 11, 2023 • edited Loading

teoxoy commented Jan 12, 2023

jimblandy left a comment

Choose a reason for hiding this comment

vicary commented Jan 13, 2023

jimblandy commented Jan 13, 2023

cwfitzgerald commented Jan 13, 2023

vicary commented Jan 13, 2023

SparkyPotato commented Oct 4, 2022 •

edited by teoxoy

Loading

jimblandy commented Oct 5, 2022 •

edited

Loading

SparkyPotato commented Oct 5, 2022 •

edited

Loading

SparkyPotato commented Nov 14, 2022 •

edited

Loading

teoxoy left a comment •

edited

Loading

SparkyPotato commented Nov 16, 2022 •

edited

Loading

jimblandy commented Jan 10, 2023 •

edited

Loading

jimblandy commented Jan 11, 2023 •

edited

Loading