Should we allow StorageLive on a live local? #99160

RalfJung · 2022-07-11T16:39:04Z

Currently, we disallow StorageLive being called on a local that is already live (both in Miri, and in the MIR docs). This was originally motivated by some very vague wording in LLVM making it unclear whether LLVM would behave properly when there are multiple lifetime.start without intermediate lifetime.end.

However, LLVM has in the mean time clarified that such a redundant lifetime.start is fine, and just resets the contents of the storage to be uninitialized. So @JakobDegen is suggesting that we change our StorageLive semantics for the case where the local is already live: this could be be defined behavior, and it will re-allocate the backing local, potentially giving it a new address and resetting its contents to be uninitialized.

I do not have any fundamental objections to this, but I also don't know if this could cause other problems, or maybe even help in other situations. Having such a "reallocation" primitive has previously been suggested by @tmandry ; the proposed semantics would mean we don't need another primitive. But maybe some MIR opts would prefer the stricter interpretation of the storage annotations? #98896 sounds like @vakaras would prefer a stricter interpretation than what we currently implement for StorageDead, so making StorageLive more relaxed might be a problem for them.

So, let's collect all the potential benefits of the current semantics of StorageLive before deciding that we change it. (And also all benefits of the change, of course.)

Cc @rust-lang/wg-unsafe-code-guidelines @rust-lang/wg-mir-opt

The text was updated successfully, but these errors were encountered:

JakobDegen · 2022-07-13T01:36:20Z

To be clear, for now I don't have a real reason for wanting this besides that it seems like unnecessary UB. I don't know of an optimization that is affected by this in either direction - will think about it though.

vakaras · 2022-07-13T08:59:17Z

Doing this would imply that the compiler consumers (for example, verifiers) would have to compute themselves whether StorageLive is allocation or reallocation. Compiler consumers can do that, but then they would not be able to check whether the compiler is doing the right thing (for example, I found #98896 just by trying to verify a simple example).

As a selfish guiding design principle, I personally would prefer for the compiler to be as explicit as possible about what it knows and what it does not know because this helps to ensure that compiler consumers interpret data in the same way as the compiler, which is important for the consumers used for quality assurance in critical fields.

RalfJung · 2022-07-17T17:35:11Z

To be clear, for now I don't have a real reason for wanting this besides that it seems like unnecessary UB. I don't know of an optimization that is affected by this in either direction - will think about it though.

Same here. I just find the current situation unsatisfying -- either we should require proper structure for both StorageLive and StorageDead, or for none of them. Right now we require it for StorageLive but not for StorageDead (that's #98896), and that is just inconsistent.

jyn514 · 2023-04-10T17:34:11Z

@saethlin says "This is a MIR semantics question. We're not sure yet if we want to pin the semantics of Rust on MIR, but t-opsem is a good owner for this issue." I'm going to label MIR semantics issues as T-opsem going forward, feel free to speak up if you think isn't a good fit.

@rustbot label T-opsem

RalfJung · 2024-05-29T19:07:43Z

#119366 is an argument for allowing StorageLive on an already live local.

RalfJung · 2024-06-08T09:55:11Z

@rust-lang/opsem I propose that we allow StorageLive when the local is already live, with the semantics that the local is implicitly freed and then re-allocated (with fresh, uninitialized contents).
Cc @rust-lang/wg-mir-opt

Basically, StorageLive will implicitly first execute a StorageDead if needed. This is similar to how we treat StorageDead, which is a NOP when the local is already dead, which is equivalent to implicitly first executing a StorageLive if needed. We have to treat StorageDead that way due to #98896. mir-opts are also sometimes struggling with maintaining the subtle invariant that StorageLive on an already live local is UB. And in particular given the fact that we do allow return to serve as implicit StorageDead for all locals that are still live, the current rules are causing issues for inlining; that's what #119366 is about.

Some out-of-tree MIR consumers would like to use StorageLive/StorageDead as exact markers for the lifetime of local variables, but at this point that seems better served by a dedicated analysis than by trying to make all mir-opts ensure this non-trivial invariant.

@rfcbot close

rfcbot · 2024-06-08T09:55:13Z

Team member @RalfJung has proposed to close this. The next step is review by the rest of the tagged team members:

No concerns currently listed.

Once a majority of reviewers approve (and at most 2 approvals are outstanding), this will enter its final comment period. If you spot a major issue that hasn't been raised at any point in this process, please speak up!

See this document for info about what commands tagged team members can give me.

digama0 · 2024-06-08T17:08:46Z

@RalfJung What is the impact of this change on inlining? Is it now simply a matter of doing nothing to the StorageLive/StorageDead calls or does the unsoundness issue persist?

RalfJung · 2024-06-08T17:32:48Z

I think doing nothing is sound under this proposal. @tmiasko thought about that more than I did, though.

CAD97 · 2024-06-08T19:53:21Z

I don't fully grasp the extent of the implications on MIR, but this makes sense to me given the context Ralf provided.

@rfcbot reviewed

rfcbot · 2024-06-08T19:53:25Z

🔔 This is now entering its final comment period, as per the review above. 🔔

celinval · 2024-06-09T21:25:04Z

I have a question, in this case, will a StorageLive imply StorageDead of the initial variable? What happens to a raw pointer that holds the address of the previous local?

digama0 · 2024-06-09T22:39:27Z

My understanding is that yes, a StorageLive implies a StorageDead of the initial variable, and this also means that a raw pointer holding the previous address is no longer pointing to a valid allocation. (I think we want the flexibility to take advantage of the re-initialization to be able to change the storage location.)

RalfJung · 2024-06-10T05:26:59Z

Yes, that's what I wrote:

with the semantics that the local is implicitly freed and then re-allocated (with fresh, uninitialized contents).

…stead of ill-formed This matches the proposal at <rust-lang/rust#99160 (comment)>, and drastically simplifies the well-formedness checks.

rfcbot · 2024-06-18T19:56:40Z

The final comment period, with a disposition to close, as per the review above, is now complete.

As the automated representative of the governance process, I would like to thank the author for their work and everyone else who contributed.

…errors StorageLive: refresh storage (instead of UB) when local is already live Blocked on [this FCP](rust-lang#99160 (comment)), which also contains the motivation. Fixes rust-lang#99160 Fixes rust-lang#98896 (by declaring it not-a-bug) Fixes rust-lang#119366 Fixes rust-lang/unsafe-code-guidelines#129

Rollup merge of rust-lang#126154 - RalfJung:storage-live, r=compiler-errors StorageLive: refresh storage (instead of UB) when local is already live Blocked on [this FCP](rust-lang#99160 (comment)), which also contains the motivation. Fixes rust-lang#99160 Fixes rust-lang#98896 (by declaring it not-a-bug) Fixes rust-lang#119366 Fixes rust-lang/unsafe-code-guidelines#129

StorageLive: refresh storage (instead of UB) when local is already live Blocked on [this FCP](rust-lang/rust#99160 (comment)), which also contains the motivation. Fixes rust-lang/rust#99160 Fixes rust-lang/rust#98896 (by declaring it not-a-bug) Fixes rust-lang/rust#119366 Fixes rust-lang/unsafe-code-guidelines#129

This was referenced Jul 11, 2022

Clarify MIR semantics of storage statements #99050

Merged

StorageDead is not dominated by StorageLive #98896

Closed

RalfJung mentioned this issue Sep 18, 2022

Reorder nesting scopes and declare bindings without drop schedule #101410

Merged

RalfJung mentioned this issue Jan 31, 2023

Remove both StorageLive and StorageDead in CopyProp. #107524

Merged

rustbot added the T-opsem Relevant to the opsem team label Apr 10, 2023

jyn514 added A-MIR Area: Mid-level IR (MIR) - https://blog.rust-lang.org/2016/04/19/MIR.html C-discussion Category: Discussion or questions that doesn't represent real issues. labels Apr 10, 2023

RalfJung mentioned this issue Aug 1, 2023

What about: StorageDead/StorageLive? rust-lang/unsafe-code-guidelines#129

Closed

rfcbot added proposed-final-comment-period Proposed to merge/close by relevant subteam, see T-<team> label. Will enter FCP once signed off. disposition-close This PR / issue is in PFCP or FCP with a disposition to close it. labels Jun 8, 2024

RalfJung mentioned this issue Jun 8, 2024

StorageLive: refresh storage (instead of UB) when local is already live #126154

Merged

rfcbot added the final-comment-period In the final comment period and will be merged soon unless new substantive objections are raised. label Jun 8, 2024

rfcbot removed the proposed-final-comment-period Proposed to merge/close by relevant subteam, see T-<team> label. Will enter FCP once signed off. label Jun 8, 2024

rfcbot added finished-final-comment-period The final comment period is finished for this PR / Issue. and removed final-comment-period In the final comment period and will be merged soon unless new substantive objections are raised. labels Jun 18, 2024

rfcbot added the to-announce Announce this issue on triage meeting label Jun 18, 2024

bors closed this as completed in 035285b Jun 19, 2024

apiraino removed the to-announce Announce this issue on triage meeting label Jul 4, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Should we allow StorageLive on a live local? #99160

Should we allow StorageLive on a live local? #99160

RalfJung commented Jul 11, 2022

JakobDegen commented Jul 13, 2022

vakaras commented Jul 13, 2022

RalfJung commented Jul 17, 2022

jyn514 commented Apr 10, 2023

RalfJung commented May 29, 2024

RalfJung commented Jun 8, 2024 •

edited

Loading

rfcbot commented Jun 8, 2024 •

edited by digama0

Loading

digama0 commented Jun 8, 2024

RalfJung commented Jun 8, 2024

CAD97 commented Jun 8, 2024

rfcbot commented Jun 8, 2024

celinval commented Jun 9, 2024

digama0 commented Jun 9, 2024

RalfJung commented Jun 10, 2024

rfcbot commented Jun 18, 2024

Should we allow StorageLive on a live local? #99160

Should we allow StorageLive on a live local? #99160

Comments

RalfJung commented Jul 11, 2022

JakobDegen commented Jul 13, 2022

vakaras commented Jul 13, 2022

RalfJung commented Jul 17, 2022

jyn514 commented Apr 10, 2023

RalfJung commented May 29, 2024

RalfJung commented Jun 8, 2024 • edited Loading

rfcbot commented Jun 8, 2024 • edited by digama0 Loading

digama0 commented Jun 8, 2024

RalfJung commented Jun 8, 2024

CAD97 commented Jun 8, 2024

rfcbot commented Jun 8, 2024

celinval commented Jun 9, 2024

digama0 commented Jun 9, 2024

RalfJung commented Jun 10, 2024

rfcbot commented Jun 18, 2024

RalfJung commented Jun 8, 2024 •

edited

Loading

rfcbot commented Jun 8, 2024 •

edited by digama0

Loading