Conversation
With #44494, this cuts about 22M allocations (out of 59M) from the compiler benchmark in #44492. Without #44494, it still reduces the number of allocations, but not as much. This was originally optimized in 100666b, but the behavior of our compiler has changed to allow inlining the Tuple{UseRef, Int} into the outer struct, forcing a reallocation on every iteration.
        return OOB_TOKEN
    end
end
@inline getindex(x::UseRef) = _useref_getindex(x.urs.stmt, x.op)
I had hoped our calling convention would have been enough already to make this inlining awkwardness unnecessary. What causes it to be needed?
We want UseRef to be SROA'd, so that UseRefIterator can be SROA'd. Without it UseRefIterator gets allocated.
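For intuition, here is a minimal self-contained sketch with made-up toy types (ToyUseRef/ToyIterator are stand-ins, not the compiler's real UseRef/UseRefIterator): once the accessor is @inline, the call boundary disappears, so SROA can see that neither wrapper escapes and can drop both allocations.

```julia
# Toy model (hypothetical names, not the compiler's real types): with the
# accessor marked @inline, SROA can prove that neither wrapper escapes and
# eliminate both allocations; without the inlining, the mutable wrapper
# would be heap-allocated on every iteration.
mutable struct ToyUseRef
    stmt::Vector{Int}
    op::Int
end

struct ToyIterator            # immutable outer struct holding the mutable ref
    ref::ToyUseRef
end

@inline toy_getindex(it::ToyIterator) = it.ref.stmt[it.ref.op]

function sum_uses(stmt::Vector{Int})
    s = 0
    for op in eachindex(stmt)
        s += toy_getindex(ToyIterator(ToyUseRef(stmt, op)))  # fresh wrappers each time
    end
    return s
end

const data = collect(1:1000)
sum_uses(data)                      # warm up / compile
@show @allocated sum_uses(data)     # expected to be (near) zero when SROA fires
```

Swapping the @inline for @noinline should make the per-iteration allocations reappear, which is the situation the forced inlining here is meant to avoid.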
vtjnash left a comment
It does seem slightly odd that that needs to be mutable, since that implies we eventually need to copy the stmt back to the Instruction stream.
Yes, that's how this API works. At the end you need to put the stmt back.
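To make that contract concrete, here is a rough, runnable sketch using invented stand-in types (ToyURS/ToyOp are made up for illustration, not the compiler's actual definitions); the shape mirrors how the iterator is meant to be used: read and rewrite uses through the handles, then fetch the updated statement back at the end and store it into the instruction stream yourself.

```julia
# Toy stand-ins (ToyURS/ToyOp are invented names, not the compiler's types);
# the point is the last line: after rewriting uses through the handles, the
# caller has to take the updated statement back out and store it in the IR.
mutable struct ToyURS
    stmt::Vector{Any}                       # stand-in for a statement's operands
end

struct ToyOp                                # stand-in for a UseRef-like handle
    urs::ToyURS
    i::Int
end

Base.iterate(u::ToyURS, i::Int = 1) =
    i > length(u.stmt) ? nothing : (ToyOp(u, i), i + 1)
Base.getindex(op::ToyOp) = op.urs.stmt[op.i]            # read a use
Base.setindex!(op::ToyOp, v) = (op.urs.stmt[op.i] = v)  # record a replacement
Base.getindex(u::ToyURS) = u.stmt                       # the "put the stmt back" step

urs = ToyURS(Any[:f, :x, 1])
for op in urs                               # walk the uses
    op[] === :x && (op[] = :x_renamed)      # rewrite one operand
end
new_stmt = urs[]                            # caller copies this back into the stream
```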
Merging this now - the test for zero allocation will be added in #44557.