Enable mutable noalias for LLVM >= 12 #82834

nikic · 2021-03-06T11:46:22Z

Enable mutable noalias by default on LLVM 12, as previously known miscompiles have been resolved. Now it's time to find the next one ;)

The -Z mutable-noalias option no longer has an explicit default and accepts -Z mutable-noalias=yes and -Z mutable-noalias=no to override the LLVM version based default behavior.
The decision on whether to apply the noalias attribute is moved into rustc_codegen_llvm. rustc_middle only provides us with the necessary information to make the decision.
noalias is not emitted for types that are !Unpin, as a heuristic for self-referential structures (see Enable noalias annotations #54878 and Resolve unsound interaction between noalias and self-referential data (incl. generators, async fn) #63818).

nikic · 2021-03-06T11:46:33Z

@bors try @rust-timer queue

rust-timer · 2021-03-06T11:46:34Z

Awaiting bors try build completion.

@rustbot label: +S-waiting-on-perf

bors · 2021-03-06T11:46:42Z

⌛ Trying commit f8452a5159f665c77df589e44b37fa17fa508df0 with merge 3352d8ec2c800d702ecbbdc68f5502978773c4f3...

bors · 2021-03-06T12:47:06Z

☀️ Try build successful - checks-actions
Build commit: 3352d8ec2c800d702ecbbdc68f5502978773c4f3 (3352d8ec2c800d702ecbbdc68f5502978773c4f3)

rust-timer · 2021-03-06T12:47:07Z

Queued 3352d8ec2c800d702ecbbdc68f5502978773c4f3 with parent 51748a8, future comparison URL.

rust-timer · 2021-03-06T14:21:21Z

Finished benchmarking try commit (3352d8ec2c800d702ecbbdc68f5502978773c4f3): comparison url.

Benchmarking this pull request likely means that it is perf-sensitive, so we're automatically marking it as not fit for rolling up. Please note that if the perf results are neutral, you should likely undo the rollup=never given below by specifying rollup- to bors.

Importantly, though, if the results of this run are non-neutral do not roll this PR up -- it will mask other regressions or improvements in the roll up.

@bors rollup=never
@rustbot label: +S-waiting-on-review -S-waiting-on-perf

bjorn3 · 2021-03-06T14:31:12Z

When looking at incr-unchanged to exclude any slowdown in LLVM, there are many improvements of up to 1.4%, a few slowdowns of up to 0.6% and a regression of ctfe-stress of 1.8%.

When looking at all benchmarks, there are big losses of up to 3.3%. For the style-servo-opt full run for example LLVM_module_codegen took 28.5% longer.

joshtriplett · 2021-03-08T19:13:47Z

I don't think we should expect this to be compile-time-neutral; LLVM does have to do more work. It'd be helpful to have runtime performance benchmarks to compare these to. Ultimately, we may end up trading compile-time for runtime here.

bjorn3 · 2021-03-08T19:44:43Z

It'd be helpful to have runtime performance benchmarks to compare these to.

I was looking at incr-unchanged for that. Those bail out before LLVM gets invoked.

joshtriplett · 2021-03-08T23:14:14Z

@bjorn3 I meant "runtime performance" as in "performance of the compiled code". I'd expect the primary benefit of noalias to be better code generation, not faster code generation.

panstromek · 2021-03-09T07:30:48Z

@joshtriplett I think that was the idea. If you enable noalias for the compiler, you expect the Rust part of it to get faster, so incr-unchanged can roughly tell you that.

nikic · 2021-03-18T22:30:39Z

I think this is ready for review now, r? @nagisa

I followed @RalfJung's suggestion to not apply noalias to !Unpin, which should avoid issues with self-referential structures in practice, though the more general T-lang problem remains open.

compiler/rustc_middle/src/ty/layout.rs

nagisa

Note that when looking at the performance results you can open the -Zself-profile outputs to see where the regressions end up being. For example: old vs new.

There are two areas of comptime regressions here: one is in LLVM internals (which makes sense as LLVM is doing strictly more work optimizing things now) and the other in generating the IR (which also makes sense as we're generating more IR)

This LGTM to me overall. r=me with or without @RalfJung's comment addressed.

compiler/rustc_codegen_llvm/src/abi.rs

nikic · 2021-03-19T12:59:51Z

It's probably worthwhile to check the perf results again with the full implementation. The previous perf run was with it unconditionally enabled, without the Unpin and version checks.

@bors try @rust-timer queue

rust-timer · 2021-03-19T12:59:52Z

Awaiting bors try build completion.

@rustbot label: +S-waiting-on-perf

bors · 2021-03-19T13:00:06Z

⌛ Trying commit ee2eefeecbc5a8af4c5e81a05bffdad00912f3f8 with merge 989a6de73b924cf690879f45fab0a3889c267b43...

bors · 2021-03-19T13:48:41Z

☀️ Try build successful - checks-actions
Build commit: 989a6de73b924cf690879f45fab0a3889c267b43 (989a6de73b924cf690879f45fab0a3889c267b43)

rust-timer · 2021-03-19T13:48:43Z

Queued 989a6de73b924cf690879f45fab0a3889c267b43 with parent eb95ace, future comparison URL.

bjorn3 · 2021-03-22T12:59:49Z

Yes, noalias is enabled by default after this PR. The perf runs are on a rustc version that is built using a rustc version built using the same sources as this PR: The bootstrap rustc builds a rustc which itself builds a rustc. This last rustc is what is shipped to the user and what is used for perf runs.

felix91gr · 2021-03-22T13:02:03Z

Ah, perfect. Thank you, @bjorn3 💛

the8472 · 2021-03-23T17:05:09Z

I don't think we should expect this to be compile-time-neutral; LLVM does have to do more work. It'd be helpful to have runtime performance benchmarks to compare these to. Ultimately, we may end up trading compile-time for runtime here.

Would tracking the produced binary size in perf.rlo make sense as a cheaper proxy measure instead doing full benchmarks? On the assumption that LLVM optimizing more results in things being optimized away and thus smaller code.

memoryruins · 2021-03-23T17:10:02Z

tracking the produced binary size in perf.rlo

relevant issue rust-lang/rustc-perf#145

felix91gr · 2021-03-23T17:11:35Z

I don't think we should expect this to be compile-time-neutral; LLVM does have to do more work. It'd be helpful to have runtime performance benchmarks to compare these to. Ultimately, we may end up trading compile-time for runtime here.

Would tracking the produced binary size in perf.rlo make sense as a cheaper proxy measure instead doing full benchmarks? On the assumption that LLVM optimizing more results in things being optimized away and thus smaller code.

I'm not 100% sure that that would always be the case. LLVM optimizes by default for runtime performance, and many times that would entail a larger binary size. Inlining and loop unrolling are very common examples of this. Monomorphization as well.

That said, maybe there's something related to the noalias optimizations that makes LLVM reduce the binary size. I think some testing (or asking around, although with how new the feature is, maybe not many know what it does to binary size in practice) is in order :3

the8472 · 2021-03-23T17:21:35Z

I'm not 100% sure that that would always be the case.

correlation < 1 is expected, which is why wrote "proxy measure". I just hope it's still big enough to be useful.

In light of rust-lang/rust#82834, we must ensure that the intrusive linked list pointers never get mutable-noalias optimizations (see also rust-lang/rust#63818). Adding a `PhantomPinned` to the `Links` struct ensures it will always be `!Unpin`, disabling mutable-noalias. Signed-off-by: Eliza Weisman <[email protected]>

Add mutable-noalias to the release notes for 1.54 It was enabled in rust-lang#82834 and disabled in 1.53 by rust-lang#86036, but it was never disabled on (then) nightly, so it still landed in 1.54. This was mentioned on rust-lang#86696 but never made it into the release notes. r? `@XAMPPRocky` cc `@nikic`

We haven't seen any regressions upstream since it was last enabled again in March 2021: rust-lang/rust#82834. This results in a negligible increase in binary size of 24-56 kiB, depending on build configuration. Runtime perf only changed on x64 builds. 46 test cases got faster and 18 test cases got slower, with a couple of significant regressions in FIDL microbenchmarks. Overall the results look positive. Fixed: 76297 Change-Id: Id4a2b643e30e748e8d200f9d88c54ecc0ea02b2c Reviewed-on: https://fuchsia-review.googlesource.com/c/fuchsia/+/674307 Commit-Queue: Tyler Mandry <[email protected]> Reviewed-by: Dan Johnson <[email protected]>

archshift · 2022-08-30T18:19:04Z

Is there any plan to enable emitting LLVM alias.scope / noalias metadata for references not originating in function arguments?

bjorn3 · 2022-08-30T18:29:48Z

I suspect that depends on deciding a final memory model for rust.

archshift · 2022-08-30T19:05:07Z

I was under the impression that there are clear cases where we can guarantee no mutable aliasing around references?

felix91gr · 2022-08-31T06:57:29Z

Is it plausible that you two aren't talking about the same thing?

I'm not entirely sure of what context @archshift is talking about, but I think there are indeed some contexts in which we can guarantee that.

Gui, do you think you could show us an example of what you mean?

I get the feeling that the context you're talking about is more specific than what Bjorn is talking about when they mean that the memory model is needed to know the answer.

RalfJung · 2022-08-31T10:24:13Z

An old closed PR is definitely not the right place for such a discussion though. :) Please take this to Zulip, IRLO, or a new issue.

rustbot added the S-waiting-on-perf Status: Waiting on a perf run to be completed. label Mar 6, 2021

rustbot added S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. and removed S-waiting-on-perf Status: Waiting on a perf run to be completed. labels Mar 6, 2021

nikic force-pushed the mutable-noalias branch 2 times, most recently from ce8fb3b to 73006de Compare March 18, 2021 21:59

nikic changed the title ~~[perf] Try enabling mutable noalias again~~ Enabling mutable noalias for LLVM >= 12 Mar 18, 2021

nikic force-pushed the mutable-noalias branch from 73006de to ee2eefe Compare March 18, 2021 22:19

rust-highfive assigned nagisa Mar 18, 2021

RalfJung reviewed Mar 19, 2021

View reviewed changes

compiler/rustc_middle/src/ty/layout.rs Outdated Show resolved Hide resolved

nagisa reviewed Mar 19, 2021

View reviewed changes

compiler/rustc_codegen_llvm/src/abi.rs Outdated Show resolved Hide resolved

rustbot added the S-waiting-on-perf Status: Waiting on a perf run to be completed. label Mar 19, 2021

the8472 mentioned this pull request Mar 23, 2021

Collect binary size statistics rust-lang/rustc-perf#145

Closed

Darksonn mentioned this pull request Mar 27, 2021

Try to avoid noalias attributes on intrusive linked lists tokio-rs/tokio#3654

Merged

hawkw mentioned this pull request Mar 28, 2021

fix(util): make intrusive list links never Unpin hawkw/mycelium#97

Merged

ericseppanen mentioned this pull request Apr 24, 2021

add Lsn type neondatabase/neon#64

Merged

briansmith mentioned this pull request May 5, 2021

Regression: Miscompilation due to bug in "mutable noalias" logic #84958

Closed

bjorn3 mentioned this pull request Jul 26, 2021

Ref parameter incorrectly decorated with noalias attribute #63787

Closed

This was referenced Aug 25, 2021

Update RELEASES.md for 1.54.0 #86696

Merged

Add mutable-noalias to the release notes for 1.54 #88325

Merged

jyn514 modified the milestones: 1.53.0, 1.54.0 Aug 25, 2021

tatsuya6502 mentioned this pull request Sep 11, 2021

Segmentation faults in moka-cht under heavy workloads on a many-core machine moka-rs/moka#34

Closed

RalfJung mentioned this pull request Jun 22, 2022

do not mark interior mutable shared refs as dereferenceable #98017

Merged

archshift mentioned this pull request Sep 1, 2022

make use of LLVM's scoped noalias metadata #16515

Open

RalfJung mentioned this pull request Nov 5, 2022

Resolve unsound interaction between noalias and self-referential data (incl. generators, async fn) #63818

Open

CeleritasCelery mentioned this pull request May 1, 2023

root! macro CeleritasCelery/rune#3

Open

Enable mutable noalias for LLVM >= 12 #82834

Enable mutable noalias for LLVM >= 12 #82834

Conversation

nikic commented Mar 6, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

nikic commented Mar 6, 2021

Uh oh!

rust-timer commented Mar 6, 2021

Uh oh!

bors commented Mar 6, 2021

Uh oh!

bors commented Mar 6, 2021

Uh oh!

rust-timer commented Mar 6, 2021

Uh oh!

rust-timer commented Mar 6, 2021

Uh oh!

bjorn3 commented Mar 6, 2021

Uh oh!

joshtriplett commented Mar 8, 2021

Uh oh!

bjorn3 commented Mar 8, 2021

Uh oh!

joshtriplett commented Mar 8, 2021

Uh oh!

panstromek commented Mar 9, 2021

Uh oh!

nikic commented Mar 18, 2021

Uh oh!

Uh oh!

nagisa left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

nikic commented Mar 19, 2021

Uh oh!

rust-timer commented Mar 19, 2021

Uh oh!

bors commented Mar 19, 2021

Uh oh!

bors commented Mar 19, 2021

Uh oh!

rust-timer commented Mar 19, 2021

Uh oh!

bjorn3 commented Mar 22, 2021

Uh oh!

felix91gr commented Mar 22, 2021

Uh oh!

the8472 commented Mar 23, 2021

Uh oh!

memoryruins commented Mar 23, 2021

Uh oh!

felix91gr commented Mar 23, 2021

Uh oh!

the8472 commented Mar 23, 2021

Uh oh!

archshift commented Aug 30, 2022

Uh oh!

bjorn3 commented Aug 30, 2022

Uh oh!

archshift commented Aug 30, 2022

Uh oh!

felix91gr commented Aug 31, 2022

Uh oh!

RalfJung commented Aug 31, 2022

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

16 participants

nikic commented Mar 6, 2021 •

edited

Loading