Add clarifying context to the most confusing pointer APIs #95851

Gankra · 2022-04-09T15:39:49Z

No description provided.

rust-highfive · 2022-04-09T15:39:53Z

(rust-highfive has picked a reviewer for you, use r? to override)

Gankra · 2022-04-09T15:42:12Z

library/core/src/ptr/const_ptr.rs

    ///
-    /// [`offset`]: #method.offset


NB: I dropped the note that it was the "inverse" of offset because this is a genuinely confusing statement.

offset is an asymmetric binary operation, it's like talking about the "inverse" of x^y. There are two answers! (yth root and log_x)

Gankra · 2022-04-09T15:42:59Z

cc @workingjubilee and @oli-obk, hope I'm not butchering the context here

library/core/src/ptr/const_ptr.rs

RalfJung · 2022-04-09T16:41:59Z

library/core/src/ptr/const_ptr.rs

+    /// frustrating if using this pattern made it impossible for an operation to work in
+    /// const contexts. Unfortunately, observing any properties of a pointer's address
+    /// is a very dangerous and problematic operation in const contexts, because it's a huge
+    /// reproducibility issue (which has actual soundness implications with compilation units).


It's also, like, just impossible since we don't know the address at which allocations are placed.

I think "it's bad for at least two simple reasons" is good enough, unless you think it's more important to focus on that detail over the others?

(my concern is that if you get too deep into the details people will get mired in insisting it can work/makes sense/can be adjusted to make sense, and I think "nondeterminism bad" is the simplest way to dismiss that all.)

I find "dangerous and problematic" a bit vague and not as convincing as "it's simply not possible". But, no strong opinion.

In my eyes nondeterminism is kind of a poor argument here, considering that in general this function is nondeterministic at runtime in the sense that your allocation/stack whatever could be aligned to different things from one execution to the next, right? I think many folks would probably also respond to this with "ok, sure, but I'm fine with it".

It also seems to me that if we really wanted to, we probably could do "better" (e.g., if we guaranteed that all linker-level allocations were 8-byte-aligned, and then propagated that information down, creating something like monomorphized constants by the "parent" alignment). It's just actually doing that is very hard and not particularly worthwhile (you'd end up with a lot of duplicate work I'd guess).

Is there a reason we need this paragraph, instead of moving directly to the next one, which makes no real argument for why this is the case, aside from justifying it being OK to not worry about it since "we're not doing const SIMD anyway"?

Sure, we can constrain the non-determinism, and then some more things become possible. But others remain impossible (e.g. in your case, asking about any alignment >8).

The problem is not that allocation non-determinism makes the behavior of this function non-deterministic. The problem is that, as a const fn, this function is executed before allocation non-determinism has been resolved. Like, we'd literally have to predict the future to be able to always implement this as a const fn.

Yeah, that makes sense. I think we could probably tackle all realistic programs in a similar way, but it's a pile of worms and a bunch of complexity. In any case, I'm not strictly opposed to the wording here, but it also feels like it's an unnecessary diversion -- what do you think about just skipping this middle paragraph, and moving directly to the lack of need for it, i.e., const not supporting SIMD etc anyway?

I personally would leave a short note saying that const-eval runs before addresses are picked, so the final alignment if unknoweable, and that's why this operation always "fails" during const.

Is there a reason we need this paragraph, instead of moving directly to the next one, which makes no real argument for why this is the case, aside from justifying it being OK to not worry about it since "we're not doing const SIMD anyway"?

puppy-dog eyes b-b-b-but

In my experience, SIMD programmers don't actually care about Miri's needs that much. It's not like they hate Miri, it just doesn't help them understand why they have to handle this. They want to instead hear something like,

"Yes, the promises are technically very weak, but this function will exercise a good faith effort to work as you would expect it to when your code actually runs on a hard processor. But if your code is executed on Miri's virtual machine, Miri is allowed to do really really weird things, like put all the data of your program in virtual registers instead of truly addressable virtual memory, and then give you pointers into the registers, and then alignment doesn't truly exist? It's... it's weird.

Just write code that is sound and this method will give it approximately the performance characteristics you would want at runtime. But if it is executed during CTFE, then the performance characteristics don't really matter, because the actual runtime cost becomes zero, and no matter how much SIMD acceleration you bring, you can't beat zero time."

Gankra · 2022-04-09T17:20:09Z

Updated the description of offset_from for Ralf's review (good feedback!)

workingjubilee · 2022-08-02T02:32:30Z

library/core/src/ptr/const_ptr.rs

+    /// Thankfully, because the precise SIMD pattern we're interested in *already* has a fallback
+    /// mode where the alignment completely fails, the compiler can just make this operation always
+    /// fail to align the pointer when doing const evaluation, and then everything is
+    /// always deterministic and reproducible (and the const evaluator is an interpreter anyway,
+    /// so you weren't actually going to get amazing SIMD speedups).


More directly: given a slice with a random position in memory, we can't know the head of the slice is SIMD-aligned, and given a slice, we can have any number of values. Thus we must have a head and tail handling routine, because asserting on the size is not good enough (may be off-alignment) and asserting on the alignment is not good enough (may be an uneven number of elements, thus not trivially vectorized without tail-handling), and asserting on both just gives you a panic generator where a useful program should be, because you usually don't really have any way to pre-establish those guarantees in a useful manner based on the type information currently present in the system. And if you actually do, you have much bigger tricks to deploy on your code than this.

Thus, "you better believe you are going to be writing code to handle this correctly, Miri's semantics be damned."

bors · 2023-09-08T16:22:21Z

☔ The latest upstream changes (presumably #115672) made this pull request unmergeable. Please resolve the merge conflicts.

original source: rust-lang#95851

rust-highfive assigned m-ou-se Apr 9, 2022

rust-highfive added the S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. label Apr 9, 2022

Gankra commented Apr 9, 2022

View reviewed changes

This comment has been minimized.

Sign in to view

Gankra force-pushed the clarify-ptr branch from ab539a4 to 6c179a7 Compare April 9, 2022 15:56

Gankra mentioned this pull request Apr 9, 2022

Add sub_ptr on pointers (the usize version of offset_from) #95837

Merged

RalfJung reviewed Apr 9, 2022

View reviewed changes

library/core/src/ptr/const_ptr.rs Outdated Show resolved Hide resolved

RalfJung reviewed Apr 9, 2022

View reviewed changes

library/core/src/ptr/const_ptr.rs Outdated Show resolved Hide resolved

RalfJung reviewed Apr 9, 2022

View reviewed changes

library/core/src/ptr/const_ptr.rs Outdated Show resolved Hide resolved

RalfJung reviewed Apr 9, 2022

View reviewed changes

Add clarifying context to the most confusing pointer APIs

a161062

Gankra force-pushed the clarify-ptr branch from 6c179a7 to a161062 Compare April 9, 2022 17:19

JohnCSimon added S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. and removed S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. labels May 8, 2022

apiraino added the T-libs Relevant to the library team, which will review and decide on the PR/issue. label May 23, 2022

JohnCSimon added S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. and removed S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. labels Jul 3, 2022

Mark-Simulacrum assigned Mark-Simulacrum and unassigned m-ou-se Jul 27, 2022

workingjubilee reviewed Aug 2, 2022

View reviewed changes

Mark-Simulacrum added S-waiting-on-author Status: This is awaiting some action (such as code changes or more information) from the author. and removed S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. labels Aug 14, 2022

workingjubilee mentioned this pull request Dec 5, 2022

attempt to clarify align_to docs #105245

Merged

camelid added S-waiting-on-author Status: This is awaiting some action (such as code changes or more information) from the author. and removed S-waiting-on-author Status: This is awaiting some action (such as code changes or more information) from the author. labels Feb 11, 2023

RalfJung mentioned this pull request Sep 9, 2023

offset_from: docs improvements #113797

Merged

Gankra closed this Sep 18, 2023

RalfJung added a commit to RalfJung/rust that referenced this pull request Sep 26, 2023

take more clarifying text from Gankra's PR

9b7f9c4

original source: rust-lang#95851

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add clarifying context to the most confusing pointer APIs #95851

Add clarifying context to the most confusing pointer APIs #95851

Gankra commented Apr 9, 2022

rust-highfive commented Apr 9, 2022

Gankra Apr 9, 2022

Gankra commented Apr 9, 2022

This comment has been minimized.

RalfJung Apr 9, 2022

Gankra Apr 9, 2022

Gankra Apr 9, 2022

RalfJung Apr 9, 2022

Mark-Simulacrum Jul 26, 2022

RalfJung Jul 26, 2022

Mark-Simulacrum Jul 28, 2022

RalfJung Aug 1, 2022

workingjubilee Aug 2, 2022

workingjubilee Aug 2, 2022

Gankra commented Apr 9, 2022

workingjubilee Aug 2, 2022 •

edited

Loading

bors commented Sep 8, 2023

Add clarifying context to the most confusing pointer APIs #95851

Add clarifying context to the most confusing pointer APIs #95851

Conversation

Gankra commented Apr 9, 2022

rust-highfive commented Apr 9, 2022

Choose a reason for hiding this comment

Gankra commented Apr 9, 2022

This comment has been minimized.

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Gankra commented Apr 9, 2022

workingjubilee Aug 2, 2022 • edited Loading

Choose a reason for hiding this comment

bors commented Sep 8, 2023

workingjubilee Aug 2, 2022 •

edited

Loading