Add optimizations for dead code removal #1444

bbannier · 2023-05-09T13:41:57Z

This PR adds optimizations related to dead code removal. To that end, we add detection of whether a function is free of side effects, and elimination of dead stores. Marking a store as dead then allows us to completely remove the function call whose result it stored, provided the function is side effect-free.

For pure functions returning void we also add a pass which simplifies their bodies.

The dead code optimizations around pure functions shouldn't be observable at runtime, but users might still be able to see them, e.g., by looking at the profiling output.
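As a rough illustration of the idea (not the code in this PR), dead store removal gated on purity could be sketched on a toy IR as follows. The `Store` struct, `removeDeadStores`, and the representation of pure functions as a set of names are all hypothetical and only serve the example.

```cpp
#include <algorithm>
#include <set>
#include <string>
#include <vector>

// Hypothetical toy IR: each statement stores the result of a call in a local.
struct Store {
    std::string target;            // local being written
    std::string callee;            // function whose result is stored
    std::vector<std::string> uses; // locals read by the call's arguments
};

// Removes stores whose target is never read afterwards, but only if the
// callee is known to be pure. Impure calls are kept for their side effects.
std::vector<Store> removeDeadStores(const std::vector<Store>& body,
                                    const std::set<std::string>& pure,
                                    const std::set<std::string>& live_out) {
    std::set<std::string> live = live_out;
    std::vector<Store> kept;

    // Walk backwards so that `live` always holds the locals read later on.
    for ( auto it = body.rbegin(); it != body.rend(); ++it ) {
        const bool dead = live.count(it->target) == 0;
        if ( dead && pure.count(it->callee) != 0 )
            continue; // drop the store together with the call

        live.erase(it->target);
        live.insert(it->uses.begin(), it->uses.end());
        kept.push_back(*it);
    }

    // Statements were collected back to front, so restore source order.
    std::reverse(kept.begin(), kept.end());
    return kept;
}
```

In this sketch a dead store of an impure call is deliberately left in place, which mirrors the conservative stance described above.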

rsmmr

This looks good overall. I can't quite judge all the cases we might run into, but I guess that's what the test suite is for: ensuring we don't remove too much. It seems the logic is a bit more pessimistic in some places than it'd need to be, but that can be extended later. In particular, we could extend the notion of purity to more expressions: e.g., an access to a global can be safe if it's read-only (though I'm not sure how much that would buy us in the end).

Have you done any performance comparisons on an actual analyzer yet (and/or an analysis of what/how much actually gets removed)?
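To make the suggestion about read-only global accesses concrete, here is a minimal sketch of a conservative purity check over a toy expression tree. The `Kind` enum, the `Expr` struct, and the `allow_global_reads` switch are assumptions made for this example and do not correspond to HILTI's AST.

```cpp
#include <memory>
#include <vector>

// Hypothetical expression kinds for a toy AST.
enum class Kind { Constant, LocalRead, GlobalRead, GlobalWrite, Call };

struct Expr {
    Kind kind;
    bool callee_is_pure = false; // only meaningful for Kind::Call
    std::vector<std::shared_ptr<Expr>> children;
};

// Conservative purity check: anything that might write state is impure.
// With `allow_global_reads` set, read-only accesses to globals are accepted.
bool isPure(const Expr& e, bool allow_global_reads) {
    switch ( e.kind ) {
        case Kind::Constant:
        case Kind::LocalRead: break;
        case Kind::GlobalRead:
            if ( ! allow_global_reads )
                return false;
            break;
        case Kind::GlobalWrite: return false;
        case Kind::Call:
            if ( ! e.callee_is_pure )
                return false;
            break;
    }

    // An expression is only pure if all of its operands are pure as well.
    for ( const auto& c : e.children )
        if ( ! isPure(*c, allow_global_reads) )
            return false;

    return true;
}
```

Flipping `allow_global_reads` shows the difference between the pessimistic and the extended notion for the same expression.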

bbannier · 2024-01-23T22:24:53Z

> Have you done any performance comparisons on an actual analyzer yet (and/or an analysis on what/how much actually gets removed?)

I benchmarked this with a big internal parser and the optimizations here make no measurable difference. I suspect this is because that parser does not contain a lot of local dead code. Doing more global dead code removal based on data and control flow analysis might be able to remove more code in that code base. That being said, the changes here should still be able to remove unneeded code if it is actually present.
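For context, one simple control-flow-based form of such global dead code removal is to drop basic blocks that cannot be reached from the entry block. The following is only a sketch under the assumption of a hypothetical CFG encoded as adjacency lists; `Cfg` and `reachableBlocks` are made-up names.

```cpp
#include <cstddef>
#include <vector>

// Hypothetical CFG: block i lists the indices of its successor blocks.
using Cfg = std::vector<std::vector<std::size_t>>;

// Returns one flag per block telling whether it is reachable from `entry`.
std::vector<bool> reachableBlocks(const Cfg& cfg, std::size_t entry) {
    std::vector<bool> seen(cfg.size(), false);
    if ( entry >= cfg.size() )
        return seen;

    std::vector<std::size_t> worklist{entry};
    seen[entry] = true;

    // Standard worklist traversal over the successor edges.
    while ( ! worklist.empty() ) {
        const auto block = worklist.back();
        worklist.pop_back();

        for ( auto succ : cfg[block] ) {
            if ( succ < cfg.size() && ! seen[succ] ) {
                seen[succ] = true;
                worklist.push_back(succ);
            }
        }
    }

    return seen;
}
```

Blocks whose flag stays `false` are candidates for removal. A data-flow pass such as liveness analysis would be needed on top of this to find dead stores across blocks.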

With that, maybe the question of whether to merge this or not depends on whether the added complexity is too much for us. WDYT?

rsmmr · 2024-01-24T08:35:48Z

> I benchmarked this with a big internal parser and the optimizations here make no measurable difference.

Can you tell how much code is actually removed (independent of performance)?

> With that maybe the question whether to merge this or not depends whether the added complexity is too much for us. WDYT?

I'm leaning towards not merging at this point if there's no immediate benefit. The optimizer is getting pretty complex and I think it'd be a good time to revisit our approach to doing these things.

bbannier · 2024-01-24T12:39:25Z

> Can you tell how much code is actually removed (independent of performance)?

From looking at the generated C++ code, it looks like the pass removed almost no actual user code in this case. There are a lot (thousands) of removals of code like:

__location__("foo.spicy:131:2");
(void());

Such (void()) code comes from the optimizer removing other code (e.g., feature-dependent code, which we optimize aggressively). Even though the code got removed, we still emit location updates (which thankfully do not seem to have a measurable performance impact).

I also saw a few instances of temporaries we emit being removed:

::hilti::rt::stream::SafeConstIterator __parse_lahe;
...
::hilti::rt::stream::SafeConstIterator __parse_lahe_2;

Overall it looks like for this code base this pass has no practical relevance.

> I'm leaning towards not merging at this point if there's no immediate benefit.

I'll keep this branch around: should we at some point use control and data flow information for optimizations, we will probably still need annotations for pure functions (at least if their bodies are invisible to our optimizer in C++ library code).

rsmmr · 2024-01-24T13:43:48Z

> __location__("foo.spicy:131:2");
> (void());

We could add a narrow, pattern-based optimization to the C++ code generator to skip these directly. The same goes for default<void>(); or, even better, for the whole chunk of these:

(*self).__error = __error;
default<void>();
__error = (*self).__error;

This won't have an impact on performance, but it would make the code a bit prettier.
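A minimal sketch of what such a narrow, pattern-based filter might look like if it ran over already emitted lines. It assumes the generated code is available as a list of strings, which is an assumption of this example rather than how the code generator is structured, and `skipNoOps` and `startsWith` are hypothetical names. It covers the (void()); and default<void>(); statements only, not the three-line __error chunk.

```cpp
#include <cstddef>
#include <string>
#include <vector>

// Returns true if `s` begins with `prefix`.
static bool startsWith(const std::string& s, const std::string& prefix) {
    return s.compare(0, prefix.size(), prefix) == 0;
}

// Drops no-op statements from a list of emitted C++ lines, together with a
// directly preceding location update that only served the dropped statement.
std::vector<std::string> skipNoOps(const std::vector<std::string>& lines) {
    std::vector<std::string> out;

    for ( std::size_t i = 0; i < lines.size(); ++i ) {
        const bool is_noop = lines[i] == "(void());" || lines[i] == "default<void>();";
        if ( ! is_noop ) {
            out.push_back(lines[i]);
            continue;
        }

        // The no-op itself is skipped; also retract its location update.
        if ( ! out.empty() && startsWith(out.back(), "__location__(") )
            out.pop_back();
    }

    return out;
}
```

Matching on exact strings keeps the filter narrow, at the cost of silently missing any variation in whitespace or formatting.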

bbannier · 2024-01-24T14:38:52Z

> We could add a narrow, pattern-based optimization to the C++ code generator to skip these directly.

I was hoping to build something on top of your AST rewrite to implement true AST node removal instead of just emptying them out like now.

I am not really a fan of doing one-off fixes in the code generator since this seems fragile and also pretty hard (e.g., these roundtripping assignments between self.__error and __error).
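To illustrate the difference between emptying nodes and removing them, here is a small sketch on a hypothetical statement tree. The `Stmt` struct, its `empty` flag, and `pruneEmpty` are invented for this example and say nothing about the actual AST API.

```cpp
#include <algorithm>
#include <cstddef>
#include <iterator>
#include <memory>
#include <string>
#include <vector>

// Hypothetical statement node; `empty` marks a node the optimizer blanked out.
struct Stmt {
    std::string text;
    bool empty = false;
    std::vector<std::unique_ptr<Stmt>> children;
};

// Erases blanked-out nodes from their parents instead of leaving them in the
// tree. Returns the number of nodes that were erased.
std::size_t pruneEmpty(Stmt& parent) {
    std::size_t removed = 0;

    // Clean up the subtrees first so nested empty nodes are counted too.
    for ( auto& child : parent.children )
        removed += pruneEmpty(*child);

    auto& cs = parent.children;
    auto first = std::remove_if(cs.begin(), cs.end(),
                                [](const std::unique_ptr<Stmt>& c) { return c->empty; });
    removed += static_cast<std::size_t>(std::distance(first, cs.end()));
    cs.erase(first, cs.end());

    return removed;
}
```

With nodes actually gone from the tree, a code generator would have nothing left to emit for them, so no placeholder statements would need to be filtered out afterwards.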

bbannier self-assigned this May 9, 2023

bbannier force-pushed the topic/bbannier/dead_stores branch 3 times, most recently from b93a7b9 to 4f33757 Compare May 10, 2023 13:09

bbannier changed the title ~~Remove dead stores~~ Add optimizations for dead code removal May 11, 2023

bbannier force-pushed the topic/bbannier/dead_stores branch 2 times, most recently from 9b0a754 to 1ab2f25 Compare May 23, 2023 14:09

bbannier marked this pull request as ready for review May 23, 2023 15:13

bbannier requested a review from rsmmr May 23, 2023 15:13

rsmmr reviewed May 24, 2023

bbannier force-pushed the topic/bbannier/dead_stores branch 2 times, most recently from a4be6ee to e9ee59e Compare May 25, 2023 11:43

bbannier linked an issue Jun 12, 2023 that may be closed by this pull request

Add optimization removing unused temporaries #1425

Open

bbannier mentioned this pull request Nov 10, 2023

Overhead due to unneeded __error assignments #1592

Closed

bbannier added 6 commits January 22, 2024 12:52

Simplify top-level Makefile.

a3b88c8

Add optimizer pass to remove dead stores.

84556c3

Implement short-circuiting of logical OR and AND in optimizer.

210c11d

Constant-fold some expressions involving variables.

da1bba0

Add optimizer pass removing dead code.

06970d4

Add setter for Function attributes.

39e67fe

bbannier force-pushed the topic/bbannier/dead_stores branch from e9ee59e to 3f7b548 Compare January 22, 2024 13:45

bbannier added 4 commits January 22, 2024 16:03

Add optimizer pass detecting pure functions and if possible elide calls.

68c8fc2

Functions with locals of structs with lifecycle hooks are not pure.

1b3842b

Take parameter kind into account when deciding function purity.

70c3502

Mark runtime functions &pure were possible.

234b156

bbannier force-pushed the topic/bbannier/dead_stores branch from 3f7b548 to 234b156 Compare January 22, 2024 15:03

bbannier requested a review from rsmmr January 23, 2024 22:25

bbannier marked this pull request as draft January 24, 2024 13:25

rsmmr removed their request for review January 29, 2024 09:16
