Document wf constraints on control flow in cleanup blocks #106612

JakobDegen · 2023-01-09T02:34:26Z

Was recently made aware of this code, which has this potential ICE:

rust/compiler/rustc_codegen_ssa/src/mir/analyze.rs

Lines 308 to 314 in a377893

    
    span_bug!(
        mir.span,
        "funclet {:?} has 2 parents - {:?} and {:?}",
        funclet,
        s,
        succ
    );

Roughly speaking, the code there is attempting to partition the cleanup blocks into funclets that satisfy a "unique successor" property, and the ICE is set off if that's not possible. This PR documents the well-formedness constraints that MIR must satisfy to avoid setting off that ICE.

The constraints documented are slightly stronger than the cases in which the ICE would have been set off in that code. This is necessary though, since whether or not that ICE gets set off can depend on iteration order in some graphs.

This sort of constraint is kind of ugly, but I don't know a better alternative at the moment. It's worth knowing that two important optimizations are still correct:

Removing edges in the cfg: Fewer edges => fewer paths => stronger dominance relations => more contractions, and more contractions can't turn a forest into not-a-forest.
Contracting an edge u -> v when u only has one successor and v only has one predecessor: u already dominated v, so this contraction was going to happen anyway.

There is definitely a MIR opt somewhere that can run afoul of this, but I don't know where it is. @saethlin was able to set it off though, so maybe he'll be able to shed some light on it.

r? @RalfJung I suppose, and cc @tmiasko who might have insight/opinions on this

rustbot · 2023-01-09T02:34:33Z

This PR changes MIR

cc @oli-obk, @RalfJung, @JakobDegen, @davidtwco, @celinval, @vakaras

JakobDegen · 2023-01-09T02:59:42Z

Advice on how to phrase an "I don't know graph theory"-readable description of this constraint is much appreciated. It's unfortunately not exactly intuitive, but I also don't know a simpler version that still accepts the MIR we generate and rejects the MIR that codegen ICEs on.

saethlin · 2023-01-09T03:12:26Z

For completeness, the MIR opt that can run afoul of this is #106613 (though the direct problem is really an optimization opportunity that this optimization opens up).

tmiasko · 2023-01-09T11:03:53Z

The constraints documented are slightly stronger than the cases in which the ICE would have been set off in that code. This is necessary though, since whether or not that ICE gets set off can depend on iteration order in some graphs.

Can you give an example?

It would be nice to add a test case that fails validation once custom MIR supports cleanup blocks.

Implementations check that each vertex has at most one successor in the resulting graph. What rules out cycles?

JakobDegen · 2023-01-09T11:48:45Z

Can you give an example?

Consider this cfg:

Everything besides a is a cleanup block.

If our reverse postorder is abed we make three funclets and everything is fine. If our reverse postorder is instead adeb, then when processing the b->e edge, we'll attempt to make a new funclet at e, which will cause an ICE when it adds a second successor to d.

Implementations check that each vertex has at most one successor in the resulting graph. What rules out cycles?

Oops. Will fix

tmiasko · 2023-01-09T22:33:40Z

Thanks for the explanation, I see now what you mean regarding iteration order.

As far as I understand (with the caveat that I am not really familiar with the Windows-specific parts of exception handling code generation), the resulting nesting in the example is invalid, since it doesn't satisfy the "the funclet pads’ unwind destinations cannot form a cycle" condition. The existing checks are simply incomplete.

JakobDegen · 2023-01-10T22:02:27Z

Yeah, I don't think these were really intended to be "checks" in the first place, just "ICEing is the most reasonable thing to do in this code path."

JakobDegen · 2023-01-10T22:02:40Z

@rustbot author as well, for the cycle fix

RalfJung · 2023-01-12T17:01:39Z

@tmiasko can you take over the review? I'm a bit lost here...

RalfJung · 2023-01-12T17:03:06Z

compiler/rustc_middle/src/mir/syntax.rs

+///  4. The induced subgraph on cleanup blocks must look roughly like an upside down tree. This is
+///     necessary to ensure that landing pad information can be correctly codegened. More precisely:


What is the "subgraph on cleanup blocks"? Is it the graph where you remove all non-cleanup blocks and only keep edges between cleanup blocks?

What exactly is forbidden here? Is it essentially any kind of merging control flow? As in, every cleanup block must only have one cleanup block predecessor -- is that the condition? The analysis you implemented seems a lot more complicated than that, and I am lost in all the domination talk...

Or does the "upside down" mean that every cleanup block must only have one cleanup block successor? Branching is upside down merging. Though why branching control flow would be an issue is beyond me.

But anyway the usual definition of a (directed) forest is that each node has at most one predecessor and there are no cycles, so that might be a less graph-theoretic way of saying it.
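To make the "at most one successor and no cycles" reading concrete, here is a minimal sketch of such a check on a plain edge list. The function name and the edge-list representation are made up for illustration and are not taken from the PR; it also assumes every node index in `edges` is below `num_nodes`.

```rust
/// Returns true if the graph is an "upside down" forest: every node has at
/// most one distinct successor and following successors never loops.
/// Hypothetical helper for illustration only, not code from rustc.
fn is_upside_down_forest(num_nodes: usize, edges: &[(usize, usize)]) -> bool {
    // parent[u] is the unique successor of u, if it has one.
    let mut parent: Vec<Option<usize>> = vec![None; num_nodes];
    for &(from, to) in edges {
        match parent[from] {
            // A second, different successor means control flow branches.
            Some(existing) if existing != to => return false,
            // First edge out of `from`, or a duplicate of the existing edge.
            _ => parent[from] = Some(to),
        }
    }
    // An acyclic successor chain visits at most `num_nodes` nodes, so it has
    // at most `num_nodes - 1` steps. Taking more steps implies a cycle.
    for start in 0..num_nodes {
        let mut cur = start;
        let mut steps = 0;
        while let Some(next) = parent[cur] {
            cur = next;
            steps += 1;
            if steps >= num_nodes {
                return false;
            }
        }
    }
    true
}
```

With this reading, two nodes merging into a third is accepted, while a node with two different successors or a two-node loop is rejected.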

What is the "subgraph on cleanup blocks"? Is it the graph where you remove all non-cleanup blocks and only keep edges between cleanup blocks?

Yes. I'll spell this out. I also realize this should have said "subgraph on reachable cleanup blocks" so will fix that as well.

Or does the "upside down" mean that every cleanup block must only have one cleanup block successor?

This is the idea, but only after contracting dominators. One of the things that makes this constraint hard to understand is that there are no local structures that are disallowed in the graph. Indeed, if the cleanup blocks have just one entrance point, then there are no restrictions at all (since that entrance point will necessarily dominate all other reachable cleanup blocks). I don't think we can actually strengthen the restriction much either, since drop elab actually generates cycles (for dropping arrays) and branches (for drop flags).

Instead this constraint is about how control flow can merge "between entrance points to cleanup code." One simple example of a graph that is disallowed is the one above, another example is a W shaped graph (all edges pointing down).
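A rough sketch of how the "contract dominators, then require at most one successor and no cycles" idea could be checked on a toy CFG. This is not the code in validate.rs. The function names, the successor-list representation, and the set-based dominator computation are all assumptions made for illustration. It assumes block 0 is the entry, the CFG is non-empty, and every block is reachable.

```rust
/// Dominator sets via the classic iterative data-flow algorithm.
/// `dom[v][d]` is true when block `d` dominates block `v`.
fn dominators(succs: &[Vec<usize>]) -> Vec<Vec<bool>> {
    let n = succs.len();
    let mut preds = vec![Vec::new(); n];
    for (u, targets) in succs.iter().enumerate() {
        for &v in targets {
            preds[v].push(u);
        }
    }
    let mut dom = vec![vec![true; n]; n];
    dom[0] = vec![false; n];
    dom[0][0] = true;
    let mut changed = true;
    while changed {
        changed = false;
        for v in 1..n {
            // Intersect the dominator sets of all predecessors, then add `v`.
            let mut new = vec![true; n];
            for &p in &preds[v] {
                for d in 0..n {
                    new[d] = new[d] && dom[p][d];
                }
            }
            new[v] = true;
            if new != dom[v] {
                dom[v] = new;
                changed = true;
            }
        }
    }
    dom
}

/// Hypothetical check: contract every cleanup block into its topmost cleanup
/// dominator, then require an upside down forest on the contracted nodes.
fn cleanup_cfg_is_well_formed(succs: &[Vec<usize>], is_cleanup: &[bool]) -> bool {
    let n = succs.len();
    let dom = dominators(succs);
    // The dominators of a block form a chain, so the cleanup dominator with
    // the fewest dominators of its own is the topmost one.
    let rep = |b: usize| -> usize {
        (0..n)
            .filter(|&d| dom[b][d] && is_cleanup[d])
            .min_by_key(|&d| dom[d].iter().filter(|&&x| x).count())
            .unwrap() // a cleanup block dominates itself, so this is non-empty
    };
    let mut parent: Vec<Option<usize>> = vec![None; n];
    for u in (0..n).filter(|&u| is_cleanup[u]) {
        for &v in succs[u].iter().filter(|&&v| is_cleanup[v]) {
            let (ru, rv) = (rep(u), rep(v));
            if ru == rv {
                continue; // edge disappears after contraction
            }
            match parent[ru] {
                Some(p) if p != rv => return false, // two different successors
                _ => parent[ru] = Some(rv),
            }
        }
    }
    // More than n - 1 steps along successor edges implies a cycle.
    for start in 0..n {
        let (mut cur, mut steps) = (start, 0);
        while let Some(next) = parent[cur] {
            cur = next;
            steps += 1;
            if steps >= n {
                return false;
            }
        }
    }
    true
}
```

On toy inputs this accepts a "V" (two independently entered cleanup paths merging) and any cleanup region with a single entrance point, and it rejects a "W" where the middle block has edges into both merge points.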

Ah, I was hoping I could avoid thinking about dominators. ;) I never delved deeply enough into compilers to get solid intuition for them... In this case, do you compute dominators before

So, an example of something we cannot have is a control flow split where one of the successors is also reachable from another independent cleanup path? But a "V", where two independent cleanup paths merge, is okay?

An alternative definition that doesn't use dominators.

It must be possible to partition reachable cleanup blocks into groups such that:

Each group has a unique entry block. Edges from outside into the group must target the group's entry block.

Edges from within a particular group to outside must target the same block.

Edges in-between groups cannot form a cycle.

There are no restrictions on edges within the group.
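The four conditions above can be sketched as a checker for a candidate partition. The `Partitioned` struct, its field names, and `partition_is_valid` are hypothetical and only serve the illustration. The sketch assumes every block is reachable and that cleanup blocks never branch back to non-cleanup blocks.

```rust
/// Hypothetical input: a CFG plus a proposed grouping of its cleanup blocks.
struct Partitioned {
    /// succs[b] lists the successors of block b.
    succs: Vec<Vec<usize>>,
    /// group[b] is Some(g) for a cleanup block in group g, None otherwise.
    group: Vec<Option<usize>>,
    /// entry[g] is the entry block of group g.
    entry: Vec<usize>,
}

fn partition_is_valid(cfg: &Partitioned) -> bool {
    let num_groups = cfg.entry.len();
    // Sanity check: each entry block must belong to its own group.
    if !cfg.entry.iter().enumerate().all(|(g, &e)| cfg.group[e] == Some(g)) {
        return false;
    }
    // exit_target[g] is the single block that edges leaving group g may target.
    let mut exit_target: Vec<Option<usize>> = vec![None; num_groups];
    for (u, targets) in cfg.succs.iter().enumerate() {
        for &v in targets {
            // Only edges into cleanup blocks are constrained here.
            let Some(gv) = cfg.group[v] else { continue };
            if cfg.group[u] == Some(gv) {
                continue; // no restrictions on edges within a group
            }
            // Edges from outside must target the group's entry block.
            if cfg.entry[gv] != v {
                return false;
            }
            // Edges leaving a group must all target the same block.
            if let Some(gu) = cfg.group[u] {
                match exit_target[gu] {
                    Some(t) if t != v => return false,
                    _ => exit_target[gu] = Some(v),
                }
            }
        }
    }
    // Each group has at most one successor group, so a walk of num_groups
    // or more steps along exit targets implies a cycle between groups.
    for start in 0..num_groups {
        let (mut g, mut steps) = (start, 0);
        while let Some(t) = exit_target[g] {
            g = cfg.group[t].expect("exit targets are cleanup blocks");
            steps += 1;
            if steps >= num_groups {
                return false;
            }
        }
    }
    true
}
```

Note that this only validates a partition that is handed to it. Deciding whether any valid partition exists is a separate question that the sketch does not attempt to answer.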

I never delved deeply enough into compilers to get solid intuition for them... In this case, do you compute dominators before

End of the sentence may have gotten lost there? Otherwise you'll have to expand a bit

So, an example of something we cannot have is a control flow split where one of the successors is also reachable from another independent cleanup path? But a "V", where two independent cleanup paths merge, is okay?

Yes.

Edges from outside into the group must target the group's entry block.

Obviously a nit but note that we have to restrict to edges from reachable blocks

End of the sentence may have gotten lost there? Otherwise you'll have to expand a bit

Sorry... got interrupted and then forgot to finish the sentence.^^

I was about to ask whether dominance is computed before or after restricting the graph to cleanup blocks. The comment should clarify that.

For me personally this restriction is so arbitrary and unmotivated that the examples would also be helpful, but it is very possible that this makes more sense to others. :)

I was about to ask whether dominance is computed before or after restricting the graph to cleanup blocks. The comment should clarify that.

Dominance is traditionally only defined for a control flow graph (i.e. a directed graph with a "start" vertex), so it doesn't really make sense to compute it after restricting to cleanup blocks, which typically do not form a control flow graph. That being said, there is a fairly natural extension of the definition (in which we consider a set of start vertices, instead of just one). In that case it doesn't matter which we pick. I'll clarify anyway.

For me personally this restriction is so arbitrary and unmotivated that the examples would also be helpful, but it is very possible that this makes more sense to others. :)

Yeah, I don't really understand it either, but I also don't feel like going and reading the Itanium ABI spec to find out...

The constraints are described in https://llvm.org/docs/ExceptionHandling.html#funclet-transitions. Their relation to MIR can only be understood in the context of code generation for the MSVC target (other targets don't use funclets).

Oh I thought Itanium did as well. Interesting

tmiasko · 2023-01-14T07:05:33Z

@bors try @rust-timer queue

bors · 2023-01-14T07:05:42Z

⌛ Trying commit 7057f835cfc7bf395bef792e116575864578123a with merge 027f53d68204819904d833b75f4a81a74686fd0a...

bors · 2023-01-14T09:46:17Z

☀️ Try build successful - checks-actions
Build commit: 027f53d68204819904d833b75f4a81a74686fd0a (027f53d68204819904d833b75f4a81a74686fd0a)

RalfJung · 2023-01-14T09:59:16Z

r? @tmiasko

JakobDegen · 2023-01-16T23:03:24Z

So there is no concern about target dependence, and we could use crater to assess how often this is tripped. I suggest we do that at some point, regardless of whether this is on or off by default.

Maybe regular crater runs under -Zvalidate-mir would be a good idea anyway...

JakobDegen · 2023-01-16T23:12:44Z

@bors try @rust-timer queue

bors · 2023-01-16T23:12:52Z

⌛ Trying commit 4bc963e with merge 8e42a53653740a9bdf11fbf5f1c3d28e4d2e6d01...

tmiasko · 2023-01-16T23:42:13Z

The cleanup_kinds code, where the existing "check" is located, is always executed. It might be an interesting experiment to run it only when necessary, especially if this validation were to be enabled by default.

JakobDegen · 2023-01-16T23:57:39Z

The cleanup_kinds code, where the existing "check" is located, is always executed. It might be an interesting experiment to run it only when necessary, especially if this validation were to be enabled by default.

From the current appearance of codegen, it seems to be unconditionally used. I don't really understand this code well enough to be sure though

tmiasko · 2023-01-17T00:01:14Z

From the current appearance of codegen, it seems to be unconditionally used.

Yes, but for !base::wants_msvc_seh(), the code in llbb_characteristics is equivalent to:

    let needs_landing_pad = !fx.mir[self.bb].is_cleanup && fx.mir[target].is_cleanup;
    let is_cleanupret = false;
    (needs_landing_pad, is_cleanupret)

JakobDegen · 2023-01-17T00:18:43Z

Ah, ok. Yeah, that seems reasonable, should probably be done separately from this PR though

bors · 2023-01-17T01:53:13Z

☀️ Try build successful - checks-actions
Build commit: 8e42a53653740a9bdf11fbf5f1c3d28e4d2e6d01 (8e42a53653740a9bdf11fbf5f1c3d28e4d2e6d01)

rust-timer · 2023-01-17T04:30:47Z

Finished benchmarking commit (8e42a53653740a9bdf11fbf5f1c3d28e4d2e6d01): comparison URL.

Overall result: ✅ improvements - no action needed

Benchmarking this pull request likely means that it is perf-sensitive, so we're automatically marking it as not fit for rolling up. While you can manually mark this PR as fit for rollup, we strongly recommend not doing so since this PR may lead to changes in compiler perf.

@bors rollup=never
@rustbot label: -S-waiting-on-perf -perf-regression

Instruction count

This is a highly reliable metric that was used to determine the overall result at the top of this comment.

	mean	range	count
Regressions ❌ (primary)	-	-	0
Regressions ❌ (secondary)	-	-	0
Improvements ✅ (primary)	-	-	0
Improvements ✅ (secondary)	-0.4%	[-0.5%, -0.4%]	2
All ❌✅ (primary)	-	-	0

Max RSS (memory usage)

This is a less reliable metric that may be of interest but was not used to determine the overall result at the top of this comment.

	mean	range	count
Regressions ❌ (primary)	-	-	0
Regressions ❌ (secondary)	2.9%	[2.9%, 2.9%]	1
Improvements ✅ (primary)	-	-	0
Improvements ✅ (secondary)	-4.1%	[-5.1%, -3.0%]	2
All ❌✅ (primary)	-	-	0

Cycles

This is a less reliable metric that may be of interest but was not used to determine the overall result at the top of this comment.

	mean	range	count
Regressions ❌ (primary)	-	-	0
Regressions ❌ (secondary)	3.8%	[2.1%, 5.3%]	4
Improvements ✅ (primary)	-	-	0
Improvements ✅ (secondary)	-	-	0
All ❌✅ (primary)	-	-	0

tmiasko · 2023-01-17T11:34:28Z

@bors r+

bors · 2023-01-17T11:34:29Z

📌 Commit 4bc963e has been approved by tmiasko

It is now in the queue for this repository.

bors · 2023-01-17T11:34:37Z

⌛ Testing commit 4bc963e with merge f34cc65...

bors · 2023-01-17T14:15:09Z

☀️ Test successful - checks-actions
Approved by: tmiasko
Pushing f34cc65 to master...

rust-timer · 2023-01-17T16:37:50Z

Finished benchmarking commit (f34cc65): comparison URL.

Overall result: no relevant changes - no action needed

@rustbot label: -perf-regression

Instruction count

This benchmark run did not return any relevant results for this metric.

Max RSS (memory usage)

This is a less reliable metric that may be of interest but was not used to determine the overall result at the top of this comment.

	mean	range	count
Regressions ❌ (primary)	0.9%	[0.9%, 0.9%]	1
Regressions ❌ (secondary)	-	-	0
Improvements ✅ (primary)	-	-	0
Improvements ✅ (secondary)	-3.3%	[-4.1%, -2.5%]	2
All ❌✅ (primary)	0.9%	[0.9%, 0.9%]	1

Cycles

This benchmark run did not return any relevant results for this metric.

rustbot assigned RalfJung Jan 9, 2023

rustbot added S-waiting-on-review (Status: Awaiting review from the assignee but also interested parties.) and T-compiler (Relevant to the compiler team, which will review and decide on the PR/issue.) labels Jan 9, 2023

rustbot added S-waiting-on-author (Status: This is awaiting some action, such as code changes or more information, from the author.) and removed S-waiting-on-review (Status: Awaiting review from the assignee but also interested parties.) labels Jan 10, 2023

tmiasko reviewed Jan 11, 2023

RalfJung reviewed Jan 12, 2023

JakobDegen force-pushed the cleanup-wf branch from 7d57491 to 54a5c42 Compare January 13, 2023 16:38

tmiasko reviewed Jan 13, 2023
