Add `#[rustc_no_mir_inline]` for standard library UB checks #121114

Noratrieb · 2024-02-14T21:12:05Z

should help with #121110 and also with #120848

The MIR inliner cannot know whether the checks are enabled or not, so inlining them is an unnecessary compile-time pessimization when debug assertions are disabled. LLVM knows whether they are enabled or not, so it can optimize accordingly without wasting time.
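As a rough illustration of the shape being discussed (not the standard library's actual code), here is a precondition check that only does work when debug assertions are on, kept behind its own function. `check_index` and `get_unchecked_sketch` are hypothetical names, and `cfg!(debug_assertions)` is an assumed stand-in for however the library decides whether checks are enabled. The internal `#[rustc_no_mir_inline]` attribute appears only in a comment because it is not available on stable.

```rust
/// Hypothetical precondition check, kept in its own function.
/// In the standard library, a function like this is where the internal
/// `#[rustc_no_mir_inline]` attribute would go (assumption of this sketch).
#[inline]
fn check_index(index: usize, len: usize) {
    // `cfg!(debug_assertions)` stands in for the library's own
    // "are UB checks enabled?" query.
    if cfg!(debug_assertions) && index >= len {
        panic!("unsafe precondition violated: index {index} out of bounds for length {len}");
    }
}

/// Hypothetical unchecked accessor guarded by the debug-only check above.
///
/// # Safety
/// `index` must be less than `data.len()`.
pub unsafe fn get_unchecked_sketch(data: &[u32], index: usize) -> u32 {
    check_index(index, data.len());
    // SAFETY: the caller guarantees `index < data.len()`.
    unsafe { *data.get_unchecked(index) }
}
```

With the check in a separate function, a front-end inliner that cannot evaluate the condition would only copy dead weight into every caller, while a backend that knows the setting can inline and fold it.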

r? @saethlin

rustbot · 2024-02-14T21:12:15Z

Some changes occurred to MIR optimizations

cc @rust-lang/wg-mir-opt

compiler/rustc_mir_transform/src/inline.rs

saethlin · 2024-02-14T22:21:46Z

@bors try @rust-timer queue

bors · 2024-02-14T22:22:56Z

⌛ Trying commit 749d7e2 with merge 90741d3...

Add `#[rustc_no_mir_inline]` for standard library UB checks

should help with rust-lang#121110 and also with rust-lang#120848

I am not entirely sure whether this is the correct solution and I haven't validated it, I just quickly threw it together before going to sleep.

r? `@saethlin`

bors · 2024-02-14T23:51:08Z

☀️ Try build successful - checks-actions
Build commit: 90741d3 (90741d31a5aa2901d7f92a68a87c1ff5a32b3afa)

rust-timer · 2024-02-15T04:18:19Z

Finished benchmarking commit (90741d3): comparison URL.

Overall result: ❌ regressions - ACTION NEEDED

Benchmarking this pull request likely means that it is perf-sensitive, so we're automatically marking it as not fit for rolling up. While you can manually mark this PR as fit for rollup, we strongly recommend not doing so since this PR may lead to changes in compiler perf.

Next Steps: If you can justify the regressions found in this try perf run, please indicate this with @rustbot label: +perf-regression-triaged along with sufficient written justification. If you cannot justify the regressions please fix the regressions and do another perf run. If the next run shows neutral or positive results, the label will be automatically removed.

@bors rollup=never
@rustbot label: -S-waiting-on-perf +perf-regression

Instruction count

This is a highly reliable metric that was used to determine the overall result at the top of this comment.

	mean	range	count
Regressions ❌ (primary)	0.6%	[0.2%, 1.3%]	16
Regressions ❌ (secondary)	2.6%	[0.5%, 5.3%]	8
Improvements ✅ (primary)	-0.3%	[-0.3%, -0.3%]	1
Improvements ✅ (secondary)	-	-	0
All ❌✅ (primary)	0.5%	[-0.3%, 1.3%]	17

Max RSS (memory usage)

Results

This is a less reliable metric that may be of interest but was not used to determine the overall result at the top of this comment.

	mean	range	count
Regressions ❌ (primary)	1.8%	[0.7%, 2.6%]	3
Regressions ❌ (secondary)	-	-	0
Improvements ✅ (primary)	-6.0%	[-18.2%, -1.1%]	4
Improvements ✅ (secondary)	-5.4%	[-5.4%, -5.4%]	1
All ❌✅ (primary)	-2.7%	[-18.2%, 2.6%]	7

Cycles

Results

This is a less reliable metric that may be of interest but was not used to determine the overall result at the top of this comment.

	mean	range	count
Regressions ❌ (primary)	1.4%	[1.3%, 1.6%]	2
Regressions ❌ (secondary)	3.7%	[2.4%, 6.0%]	6
Improvements ✅ (primary)	-	-	0
Improvements ✅ (secondary)	-	-	0
All ❌✅ (primary)	1.4%	[1.3%, 1.6%]	2

Binary size

Results

This is a less reliable metric that may be of interest but was not used to determine the overall result at the top of this comment.

	mean	range	count
Regressions ❌ (primary)	0.6%	[0.0%, 2.2%]	71
Regressions ❌ (secondary)	2.2%	[0.0%, 4.3%]	18
Improvements ✅ (primary)	-0.2%	[-0.3%, -0.0%]	6
Improvements ✅ (secondary)	-	-	0
All ❌✅ (primary)	0.5%	[-0.3%, 2.2%]	77

Bootstrap: 636.575s -> 636.125s (-0.07%)
Artifact size: 306.16 MiB -> 306.10 MiB (-0.02%)

saethlin · 2024-02-15T04:54:13Z

Usually what I do at this point is look at the LLVM IR that's generated for the most-regressed case and see which precondition check dominates.

Generally I suspect `slice::from_raw_parts`. That function and its `_mut` version are used all over, and I think the actual check implementation is about twice as big as it needs to be, because the current implementation handles the case of invalid alignment. So, assuming that it's `slice::from_raw_parts`, I have two ideas:

- Write the `slice::from_raw_parts` precondition in a way that dodges the extra code: `ptr::is_aligned_to` incurs checking from `Alignment::new`, and `ptr.addr() % alignment` incurs a check for `alignment == 0`. Just doing `& (align - 1) == 0` is probably appropriate here?
- Replace the use of that function in `Vec::deref` with `&*ptr::slice_from_raw_parts`. The precondition check in there should be redundant anyway, unless someone manages to smuggle an invalid `Vec` via transmute.
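The first idea can be sketched as follows, assuming the alignment is a nonzero power of two (which `align_of::<T>()` always is). `is_aligned_mask` and `aligned_and_non_null` are hypothetical helper names for illustration, not functions from `core`.

```rust
/// Hypothetical helper. `align` is assumed to be a nonzero power of two,
/// so there is no division and no `align == 0` branch.
#[inline]
pub fn is_aligned_mask(addr: usize, align: usize) -> bool {
    (addr & (align - 1)) == 0
}

/// Hypothetical precondition in the spirit of the one discussed above:
/// the pointer must be non-null and aligned for `T`.
#[inline]
pub fn aligned_and_non_null<T>(ptr: *const T) -> bool {
    // `align_of::<T>()` is always a nonzero power of two.
    !ptr.is_null() && is_aligned_mask(ptr as usize, std::mem::align_of::<T>())
}
```

The mask form relies entirely on the power-of-two assumption; for an arbitrary `align` it would give wrong answers, which is the trade-off for avoiding the extra checks.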

We should probably do both eventually on their own merits, but it might be useful to perf them separately.
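For the second idea, a minimal sketch of building the slice reference through `ptr::slice_from_raw_parts` instead of `slice::from_raw_parts`. `as_slice_sketch` is a hypothetical function for illustration only and is not the actual `Vec::deref` code.

```rust
/// Hypothetical helper: turns raw parts into a slice reference by
/// dereferencing the raw slice pointer directly.
///
/// # Safety
/// Same requirements as `std::slice::from_raw_parts`: `ptr` must be
/// non-null, aligned, and valid for reads of `len` elements for `'a`.
pub unsafe fn as_slice_sketch<'a, T>(ptr: *const T, len: usize) -> &'a [T] {
    // `slice_from_raw_parts` is a safe function that only assembles the
    // fat pointer; the unsafe step is the dereference.
    // SAFETY: the caller upholds the requirements listed above.
    unsafe { &*std::ptr::slice_from_raw_parts(ptr, len) }
}
```

Because nothing is validated here, this form is only appropriate where the invariants are already guaranteed by the surrounding type.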

Rollup merge of rust-lang#121196 - Nilstrieb:the-clever-solution, r=saethlin

Always inline check in `assert_unsafe_precondition` with cfg(debug_assertions)

The current complexities in `assert_unsafe_precondition` are delicately balancing several concerns, among them compile times for the cases where there are no debug assertions. This comes at a large runtime cost when the assertions are enabled, making the debug assertion compiler a lot slower, which is very annoying.

To avoid this, we always inline the check when building with debug assertions.

Numbers (compiling stage1 library after touching core):
- master: 80s
- just adding `#[inline(always)]` to the `cfg(bootstrap)` `debug_assertions` (equivalent to a bootstrap bump (uhh, i just realized that i was on a slightly outdated master so this bump might have happened already), (rust-lang#121112)): 67s
- this: 54s

So this seems like a good solution. I think we can still get the same run-time perf improvements for other users too by massaging this code further (see my other PR about adding `#[rustc_no_mir_inline]` rust-lang#121114) but this is a simpler step that solves the imminent problem of "holy shit my rustc is sooo slow".

Funny consequence: This now means compiling the standard library with debug assertions makes it faster (than without, when using debug assertions downstream)!

r? `@saethlin` (or anyone else if someone wants to review this)

fixes rust-lang#121110, supposedly

scottmcm · 2024-02-22T06:58:22Z

I'd love to have #[rustc_no_mir_inline], regardless of whether using it in these specific cases here ends up working.

For example, in

rust/library/core/src/slice/mod.rs

Lines 979 to 985 in c1b478e

    
    // Introducing a function boundary here means that the two halves
    // get `noalias` markers, allowing better optimization as LLVM
    // knows that they're disjoint, unlike in the original slice.
    revswap(front_half, back_half, half_len);

    #[inline]
    fn revswap<T>(a: &mut [T], b: &mut [T], n: usize) {

it's very useful to preserve the function boundary so that LLVM sees `noalias`; MIR-inlining it is therefore counter-productive. But it's also important that it be `#[inline]`, especially when it's called on arrays.
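A self-contained sketch of the pattern from that snippet, written against stable Rust. `reverse_sketch` is a hypothetical name and the body is a simplified reconstruction for illustration, not the library source. On stable the helper can only carry `#[inline]`; the comment marks where `#[rustc_no_mir_inline]` would go.

```rust
/// Hypothetical in-place reverse that swaps the front half with the back half.
pub fn reverse_sketch<T>(s: &mut [T]) {
    let half_len = s.len() / 2;
    let (front_half, rest) = s.split_at_mut(half_len);
    // For odd lengths the middle element stays where it is.
    let back_start = rest.len() - half_len;
    let back_half = &mut rest[back_start..];

    // The call keeps a function boundary between the two `&mut` halves,
    // which is what lets the backend treat them as non-aliasing.
    revswap(front_half, back_half, half_len);

    // In the standard library, `#[rustc_no_mir_inline]` could be added here
    // to keep the boundary through MIR while still allowing LLVM to inline.
    #[inline]
    fn revswap<T>(a: &mut [T], b: &mut [T], n: usize) {
        // Re-slicing to `n` makes the indexing below provably in bounds.
        let (a, b) = (&mut a[..n], &mut b[..n]);
        let mut i = 0;
        while i < n {
            std::mem::swap(&mut a[i], &mut b[n - 1 - i]);
            i += 1;
        }
    }
}
```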

Noratrieb · 2024-02-22T07:00:42Z

I need to write down why this is a good idea and necessary but to remind myself: the reason is that mir inlining doesn't see through debug_assertions.

Noratrieb · 2024-02-22T18:01:54Z

I think we should merge this despite the regressions, count them as part of the cost of unsafe precondition checking, and work on that instead of blocking this PR, which is a significant improvement for users.
Trying out removing the check from `Vec` deref anyway (though that's probably more significant for runtime than compile time).
@bors try @rust-timer queue

saethlin · 2024-02-24T19:29:50Z

In addition to that, we should recoup some of these regressions by improving our lowering of if false and if true: #120650 #121421

Can you add a mir-opt test that demonstrates that this prevents MIR inlining?

Noratrieb · 2024-02-24T19:57:50Z

I have added a test, and confirmed that the test fails properly when removing the attribute from the callee (never trust a test you didn't see fail etc etc).

saethlin · 2024-02-24T20:09:12Z

@bors r+

bors · 2024-02-24T20:09:15Z

📌 Commit 7e10cc5 has been approved by saethlin

It is now in the queue for this repository.

Noratrieb · 2024-02-24T20:19:03Z

fuck
@bors r-

Co-authored-by: Ben Kimock <[email protected]>

Noratrieb · 2024-02-24T20:20:08Z

@bors r=saethlin

bors · 2024-02-24T20:20:11Z

📌 Commit 81d7069 has been approved by saethlin

It is now in the queue for this repository.

bors · 2024-02-25T03:47:34Z

⌛ Testing commit 81d7069 with merge e9f9594...

bors · 2024-02-25T05:54:32Z

☀️ Test successful - checks-actions
Approved by: saethlin
Pushing e9f9594 to master...

rust-timer · 2024-02-25T07:08:56Z

Finished benchmarking commit (e9f9594): comparison URL.

Overall result: ❌✅ regressions and improvements - ACTION NEEDED

Next Steps: If you can justify the regressions found in this perf run, please indicate this with @rustbot label: +perf-regression-triaged along with sufficient written justification. If you cannot justify the regressions please open an issue or create a new PR that fixes the regressions, add a comment linking to the newly created issue or PR, and then add the perf-regression-triaged label to this PR.

@rustbot label: +perf-regression
cc @rust-lang/wg-compiler-performance

Instruction count

This is a highly reliable metric that was used to determine the overall result at the top of this comment.

	mean	range	count
Regressions ❌ (primary)	0.5%	[0.2%, 1.2%]	33
Regressions ❌ (secondary)	1.5%	[0.3%, 5.5%]	12
Improvements ✅ (primary)	-0.3%	[-0.3%, -0.3%]	3
Improvements ✅ (secondary)	-0.3%	[-0.3%, -0.2%]	8
All ❌✅ (primary)	0.4%	[-0.3%, 1.2%]	36

Max RSS (memory usage)

Results

This is a less reliable metric that may be of interest but was not used to determine the overall result at the top of this comment.

	mean	range	count
Regressions ❌ (primary)	2.7%	[0.9%, 4.5%]	5
Regressions ❌ (secondary)	1.5%	[1.5%, 1.5%]	1
Improvements ✅ (primary)	-9.5%	[-13.2%, -5.8%]	2
Improvements ✅ (secondary)	-4.5%	[-6.3%, -1.1%]	4
All ❌✅ (primary)	-0.8%	[-13.2%, 4.5%]	7

Cycles

Results

This is a less reliable metric that may be of interest but was not used to determine the overall result at the top of this comment.

	mean	range	count
Regressions ❌ (primary)	1.0%	[1.0%, 1.0%]	1
Regressions ❌ (secondary)	3.6%	[2.5%, 6.2%]	4
Improvements ✅ (primary)	-	-	0
Improvements ✅ (secondary)	-1.1%	[-1.1%, -1.1%]	1
All ❌✅ (primary)	1.0%	[1.0%, 1.0%]	1

Binary size

Results

This is a less reliable metric that may be of interest but was not used to determine the overall result at the top of this comment.

	mean	range	count
Regressions ❌ (primary)	0.7%	[0.1%, 2.3%]	61
Regressions ❌ (secondary)	2.1%	[0.1%, 4.4%]	18
Improvements ✅ (primary)	-0.1%	[-0.2%, -0.0%]	15
Improvements ✅ (secondary)	-	-	0
All ❌✅ (primary)	0.6%	[-0.2%, 2.3%]	76

Bootstrap: 651.95s -> 651.212s (-0.11%)
Artifact size: 311.05 MiB -> 311.01 MiB (-0.01%)

Noratrieb · 2024-02-25T09:32:38Z

That's more regressions than the previous check showed. Still, my previous analysis should apply. #121421 should get some of it back as well

apiraino · 2024-02-29T15:30:19Z

fixes #121245

rustbot assigned saethlin Feb 14, 2024

rustbot added S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. T-compiler Relevant to the compiler team, which will review and decide on the PR/issue. T-libs Relevant to the library team, which will review and decide on the PR/issue. labels Feb 14, 2024

saethlin reviewed Feb 14, 2024

rustbot added the S-waiting-on-perf Status: Waiting on a perf run to be completed. label Feb 14, 2024

rustbot added perf-regression Performance regression. and removed S-waiting-on-perf Status: Waiting on a perf run to be completed. labels Feb 15, 2024

saethlin added S-waiting-on-author Status: This is awaiting some action (such as code changes or more information) from the author. and removed S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. labels Feb 15, 2024

saethlin mentioned this pull request Feb 15, 2024

Totally-not-a-tracking-issue for UB-detecting debug assertions in the standard library #120848

Open

9 tasks

Noratrieb mentioned this pull request Feb 16, 2024

Always inline check in assert_unsafe_precondition with cfg(debug_assertions) #121196

Merged

Noratrieb force-pushed the no-inline! branch from 749d7e2 to 3389c50 Compare February 22, 2024 17:57

Noratrieb force-pushed the no-inline! branch from 3389c50 to 7e10cc5 Compare February 24, 2024 19:57

bors added S-waiting-on-bors Status: Waiting on bors to run and complete tests. Bors will change the label on completion. and removed S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. labels Feb 24, 2024

bors added S-waiting-on-author Status: This is awaiting some action (such as code changes or more information) from the author. and removed S-waiting-on-bors Status: Waiting on bors to run and complete tests. Bors will change the label on completion. labels Feb 24, 2024

Add #[rustc_no_mir_inline] for standard library UB checks

81d7069

Co-authored-by: Ben Kimock <[email protected]>

Noratrieb force-pushed the no-inline! branch from 7e10cc5 to 81d7069 Compare February 24, 2024 20:19

bors added S-waiting-on-bors Status: Waiting on bors to run and complete tests. Bors will change the label on completion. and removed S-waiting-on-author Status: This is awaiting some action (such as code changes or more information) from the author. labels Feb 24, 2024

bors added the merged-by-bors This PR was explicitly merged by bors. label Feb 25, 2024

bors merged commit e9f9594 into rust-lang:master Feb 25, 2024
12 checks passed

rustbot added this to the 1.78.0 milestone Feb 25, 2024

RalfJung mentioned this pull request Feb 25, 2024

Rustup rust-lang/miri#3320

Merged

bors added a commit to rust-lang/miri that referenced this pull request Feb 25, 2024

Auto merge of #3320 - RalfJung:rustup, r=RalfJung

4fbef53

Rustup Let's see if rust-lang/rust#121114 gets perf back to the old level.

RalfJung mentioned this pull request Feb 25, 2024

CI: Linux test often takes >1h rust-lang/miri#3299

Closed

Noratrieb deleted the no-inline! branch February 29, 2024 16:18

RalfJung pushed a commit to RalfJung/rust that referenced this pull request Mar 3, 2024

Auto merge of rust-lang#3320 - RalfJung:rustup, r=RalfJung

02ee564

Rustup Let's see if rust-lang#121114 gets perf back to the old level.
