Refactor type-checking to not inline everything #1557

mitchmindtree · 2022-05-16T03:23:37Z

Currently, type-checking inlines everything into the entry point of the root module. As a result a call to compile_to_ast (or more specifically, TypedParseTree::type_check) leaves us with a single TypedParseTree representing only the root module with its inlined entry point. The TypedParseTrees for individual submodules are dropped during compile_inner_dependency.

Why inline everything (and why during type-check)?

The original motivation behind inlining everything was to simplify the old code generation step which translated the AST directly to ASM. By inlining everything into the entry point during type-checking, the burden of having to think about how to represent function calls in Fuel VM assembly was removed altogether. @sezna pls correct me if I'm wrong or missing something here!

Issues with inlining everything

We are starting to see some bubbling issues arise from the current approach of inlining everything, and in particular inlining everything during type checking.

Limitations of node_at_span implementation in sway-lsp due to inlined AST #1555
Current "inline-everything" approach without context limits debugger support #1527
Sacrifices made in sway-ir implementation Future IR desiderata. #738
Support function calls in codegen #65

We also see some shoe-horning of unrelated compilation steps into type-checking as a result. E.g. currently we pass the dead_code_graph through the entirety of type-checking solely because that's the only time we have access to the fully typed submodules. As a result there are mentions of dead_code_graph everywhere throughout semantic_analysis, when ideally it could be a small, distinct step that follows type-checking.

Potential Solution

Change type-checking to return a more structured, non-inlined representation of the typed program. E.g. I'm imagining something like the following pseudo:

pub struct TypedProgram {
    root: TypedModule,
    // TypedProgramKind - script/predicate/contract/library, indicates main fn, abi entries.
}

pub struct TypedModule {
    items: Vec<TypedItem>,
    submodules: BTreeMap<DepName, TypedModule>,
}

Having something like this would help a lot with the sway-lsp issue and help with the separation of concerns like the graph construction for control flow analysis.

I'm less clear on what the implications would be for sway-ir or what this would mean for keeping function mappings in code generation that could be used by the debugger - @otrho you might have some more insight?

Even if we do end up determining that it's useful to have an inlined AST for code generation, It would be beneficial to separate this into a dedicated AST-inlining step that operates on the structured, non-inlined AST.

The text was updated successfully, but these errors were encountered:

otrho · 2022-05-16T03:57:58Z

The IR would like to organise a program into a tree. Instructions gathered into blocks, gathered into functions, gathered into modules. So having distinct functions and all their meta available to IR would be beneficial structurally and analytically.

To the best of my knowledge, the debugger is mostly interested in attributing spans to instructions, but being able to resolve full paths of symbols to instructions is important to set breakpoints (or inspect memory values). Without the absolute naming of functions this becomes impossible. @Dentosal knows more.

sezna · 2022-05-21T12:25:23Z

For some context: function declarations were originally inlined upon application to assist with a 1:1 codegen. You're given a fully typed node and you just serialize it to opcodes. You are also right in that it allows us to skip implementing a function call stack in the Fuel VM. It has certainly caused a fair share of hurdles, though.

sezna · 2022-05-21T12:27:19Z

I dig your proposed module format for sure. I'm wondering if FunctionApplications could throw away their monomorphized function copies after type checking, or store them separately and denote them as their resolved type version, so both the plugin and the type system get their respective benefits? The inlining stuff does have pretty sweet gains for type resolution and simplicity of IR for our little DSL, so if we can find an API that resolves the issues we have with it, I think it would help us avoid some churn.

emilyaherbert · 2022-06-02T18:09:04Z

This will be solved by #1821.

mitchmindtree added compiler General compiler. Should eventually become more specific as the issue is triaged code quality labels May 16, 2022

otrho added the big this task is hard and will take a while label May 16, 2022

mitchmindtree mentioned this issue May 16, 2022

Limitations of node_at_span implementation in sway-lsp due to inlined AST #1555

Closed

Dentosal mentioned this issue May 16, 2022

Gas usage flamegraph FuelLabs/fuel-debugger#3

Open

emilyaherbert mentioned this issue May 17, 2022

Consider refactoring monomorphizing into a separate step following type_check? #1267

Open

mitchmindtree mentioned this issue May 18, 2022

Parse all modules in a project before beginning the type-checking stage #1578

Closed

emilyaherbert mentioned this issue Jun 2, 2022

Create function engine #1821

Closed

1 task

emilyaherbert mentioned this issue Jun 2, 2022

Restructure how functions are handled in the IR and codegen to eliminate function inlining #1823

Open

1 task

otrho mentioned this issue Oct 25, 2022

Future IR desiderata. #738

Closed

IGI-111 closed this as completed Mar 20, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Refactor type-checking to not inline everything #1557

Refactor type-checking to not inline everything #1557

mitchmindtree commented May 16, 2022 •

edited

Loading

otrho commented May 16, 2022

sezna commented May 21, 2022 •

edited

Loading

sezna commented May 21, 2022 •

edited

Loading

emilyaherbert commented Jun 2, 2022

Refactor type-checking to not inline everything #1557

Refactor type-checking to not inline everything #1557

Comments

mitchmindtree commented May 16, 2022 • edited Loading

Why inline everything (and why during type-check)?

Issues with inlining everything

Potential Solution

otrho commented May 16, 2022

sezna commented May 21, 2022 • edited Loading

sezna commented May 21, 2022 • edited Loading

emilyaherbert commented Jun 2, 2022

mitchmindtree commented May 16, 2022 •

edited

Loading

sezna commented May 21, 2022 •

edited

Loading

sezna commented May 21, 2022 •

edited

Loading