-
Notifications
You must be signed in to change notification settings - Fork 12.7k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Make TypeFolder::fold_*
return Result
#91230
Conversation
@bors try @rust-timer queue |
Awaiting bors try build completion. @rustbot label: +S-waiting-on-perf |
⌛ Trying commit 362320b3e16971f16544d13786bd4f6fcfb7465f with merge 045cf79edfbfad98d41a0fefbb824ac4306f2dbd... |
☀️ Try build successful - checks-actions |
Queued 045cf79edfbfad98d41a0fefbb824ac4306f2dbd with parent 862962b, future comparison URL. |
Finished benchmarking commit (045cf79edfbfad98d41a0fefbb824ac4306f2dbd): comparison url. Summary: This change led to large relevant mixed results 🤷 in compiler performance.
If you disagree with this performance assessment, please file an issue in rust-lang/rustc-perf. Benchmarking this pull request likely means that it is perf-sensitive, so we're automatically marking it as not fit for rolling up. While you can manually mark this PR as fit for rollup, we strongly recommend not doing so since this PR led to changes in compiler perf. Next Steps: If you can justify the regressions found in this try perf run, please indicate this with @bors rollup=never |
Perf is better than last time, but not neutral. It would be nice to look into it a bit, but I also think this is "landable" as-is. |
(I still have to do an actual review) |
Co-authored-by: Alan Egerton <[email protected]>
Co-authored-by: Alan Egerton <[email protected]>
Co-authored-by: Alan Egerton <[email protected]>
Co-authored-by: Alan Egerton <[email protected]>
362320b
to
6db9605
Compare
cachegrind diff for the worst scenario (diesel full check)
I haven't yet made sense of this result, but from a first glance I'm wondering if maybe a cache is no longer being used to the same degree. For example, the early/short-circuiting return in the event of error that is introduced just before this cache insertion: rust/compiler/rustc_trait_selection/src/traits/query/normalize.rs Lines 326 to 327 in 6db9605
That said, I'm not convinced this is the actual source of the regression, so I'll keep digging... unless anything obvious jumps out at a more seasoned mind in the interim? |
Per discussion on Zulip the perf regression here is suspected to be due LLVM's PGO decisions. @rustbot label: +perf-regression-triaged Marking ready for review. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
A couple comments, but mostly good.
One thought I had while reading this is it does introduce quite a bit of boilerplate for the few fallible cases. I wonder if there is some way to encode this into the trait to make it nicer? That would be in a followup regardless.
ae98c5e
to
04f1c09
Compare
I pushed 9f714ef, which delegates from In fact, I wonder whether |
I do mean altering the TypeFolder api, yes. But definitely for followup, if at all. AFAIK, rustc_data_structures has no stability requirements. But removing map_id could be done in a followup PR. This LGTM. @bors r+ |
📌 Commit afa6f92 has been approved by |
Thanks for picking this up @eggyal! |
☀️ Test successful - checks-actions |
Finished benchmarking commit (e6d2de9): comparison url. Summary: This change led to large relevant mixed results 🤷 in compiler performance.
If you disagree with this performance assessment, please file an issue in rust-lang/rustc-perf. Next Steps: If you can justify the regressions found in this perf run, please indicate this with @rustbot label: +perf-regression |
…llible-folders, r=jackh726 Reduce boilerplate around infallible folders Further to rust-lang#91230 (comment) r? `@jackh726`
It's curious how the post-merge perf result is so different to the pre-merge result at #91230 (comment) (that was put down to LLVM/PGO optimisations). I don't think anything material changed in this PR between the two runs, so I guess something material was merged in the interim. Nevertheless: cachegrind diff for the worst scenario (deep-vector incr-unchanged opt)
Some material differences there in hashing, perhaps indicating a difference in cache use—possibly around resolving consts? Because I don't think this is due to PGO: |
Error: Label because can only be set by Rust team members Please let |
Implements rust-lang/compiler-team#432.
Initially this is just a rebase of @LeSeulArtichaut's work in #85469 (abandoned; see #85485 (comment)). At that time, it caused a regression in performance that required some further exploration... with this rebased PR bors can hopefully report some perf analysis from which we can investigate further (if the regression is indeed still present).
r? @jackh726 cc @nikomatsakis