
WIP/RFC unbox more immutables #18632

Closed
wants to merge 25 commits into from

Conversation

carnaval
Contributor

@carnaval carnaval commented Sep 22, 2016

The codegen/GC part of this is basically working.
I'm now wondering about semantics, and I'd like us to discuss the following issues a bit before I clean up the code and we start the review (there are a number of duplicate paths in codegen that can be merged together/simplified, and some things are plain wrong and/or inefficient).

This patch allows us to unbox most immutables. By unbox I mean: allocate/store them on the stack, inline them in other objects, and inline them in arrays.
Why most? There are (for now) two problems: cycles and #undef.

Cycles are a fundamental problem: if A has a field of type B and B a field of type A, we obviously can't inline them into each other. The cycle needs to be broken, and the annoying part is that it should be done in a predictable way. For now, on this PR, it's done in DFS order, which means that, for example, the layout of B will differ depending on whether we ever instantiated an A first. Not good. The proposal I remember about this (Jameson @ JuliaCon, IIRC) was to make types boxed iff they are part of any field cycle.
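To make the cycle concrete, here is a hypothetical pair of types (names invented, 0.5-era `immutable` syntax) that form a field cycle through a type parameter:

```julia
# A{B} has a field of type B, and B has a field of type A{B}.
immutable A{T}
    x::T
end

immutable B
    a::A{B}
end
# Inlining each into the other is impossible, so one of the two must
# stay boxed; on this branch, which one depends on DFS order, i.e. on
# whether an A was laid out before B.
```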
#undef is annoying because it makes a difference at the Julia level between isbits types and other immutables. To minimize breakage, I've gone the route of preserving the current behavior.

So if A has a pointer field and we make, e.g., an uninitialized array of A, this branch uses the nullness of that field as the marker that the corresponding slot in the array is #undef. This only works if the field of a valid instance of A can never be null, i.e., if A.ninitialized >= field_index_of_the_ptr_field.
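A hypothetical illustration of that rule (type name invented; `Vector{T}(n)` is the 0.5-era uninitialized-array syntax):

```julia
# `a` is a pointer field that every valid instance initializes
# (ninitialized >= 1), so a null `a` can serve as the #undef marker.
immutable PtrPair
    a::String
    b::Int
end

v = Vector{PtrPair}(4)    # uninitialized; elements stored inline
isassigned(v, 1)          # false: the inline slot's `a` field is null
v[1] = PtrPair("x", 1)
isassigned(v, 1)          # true: `a` now holds a real pointer
```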

This makes most code (at least all of the test suite :-)) work, but I think the following rules are really weird (see the sketch after the list):

A type T will be inlined into fields/arrays and stack-allocated if

  • it is immutable
  • it cannot reach itself through any sequence of field accesses
  • it has at least one never-#undef pointer field or no pointer fields at all
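A sketch of how the #undef rule plays out, with invented type names (the cycle rule is the A/B example above):

```julia
immutable P             # inlined: immutable with no pointer fields
    x::Int
end

immutable Q             # inlined: its pointer field can never be #undef
    s::String
end

immutable R             # boxed: `s` may be left #undef by `new()`,
    s::String           # so a null pointer can't double as the marker
    R() = new()
    R(s) = new(s)
end
```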

The only difference between a type that is boxed and one that is not is memory layout, but I'd assume we want that layout to be easily predictable, since, for example, people routinely interface with C.

An alternative proposed by Yichao was to make inlining entirely opt-in and error out if it is not possible. I'm worried this will lead to yet another annotation that people will sprinkle everywhere.

For performance, specially crafted tests (like summing the rows of a very skinny matrix using subarrays) show some improvement by avoiding GC allocation. Not super satisfying for now, and casual inspection of the generated asm shows a lot of stack movement. We can work on that, though, probably by improving LLVM's visibility into our rooting mechanism and/or just using statepoints.
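For reference, a sketch of the kind of microbenchmark meant here (shapes and names assumed):

```julia
# Each `view` builds an immutable SubArray; keeping those off the heap
# is what avoids the GC allocations mentioned above.
function sumrows(A)
    s = 0.0
    for i in 1:size(A, 1)
        s += sum(view(A, i, :))
    end
    return s
end

A = randn(100_000, 4)   # tall and skinny: many short-lived SubArrays
sumrows(A)
```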

(To sweeten the deal, I've thrown in improved undef-ref errors.)

@JeffBezanson
Member

Awesome!!

Probably a predictable request: is it possible to have the part that's just an optimization (stack allocation) first, without changing the layout of anything? That could probably be merged very quickly.

How does stack marking work in the gc?

@carnaval
Contributor Author

Yeah, I'm just afraid that it'll make things worse: every time the immutable goes in or out of local scope it'll have to be unboxed/boxed, so we may end up making more boxes than today.

GC objects on the stack have a pointer to them in the GC frame, and the special treatment in the GC is done by checking whether they fall inside the task stack's bounds.
With statepoints we could avoid that altogether and put the alloca's sp-offset directly in the stackmap, along with the constant tag value, approaching zero runtime cost.

@nalimilan
Member

Am I right that this will dramatically improve the performance of Nullable? :-)

// VecElement types are unwrapped in LLVM.
addr = strct.V;
else
    assert(0);
Member

That assert should probably go away?

@yuyichao
Contributor

My argument for opt-in is also (local) predictability.

The question is: for given types T1 and T2, what information do we need to determine whether Tuple{T1,T2} will be stored inline? I think we should make this rely only on the properties of Tuple, T1, and T2, not on their interactions. That is not the case if we treat any type in a cycle as non-inlineable: T2 might have a field of type Tuple{Int,T2}, in which case Tuple{Int,Float64} is inlined and Tuple{Float64,T2} is inlined, but Tuple{Int,T2} is not. If instead we make this opt-in and mark Tuple as an inlined type, the decision becomes predictable from local information (in fact, it relies only on Tuple), and there will be an error at type-construction time if T2 is marked as inlined too.
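A sketch of that scenario (T2 invented for illustration; it cannot actually be constructed, but its layout must still be decided):

```julia
immutable T2
    t::Tuple{Int,T2}    # field cycle: T2 -> Tuple{Int,T2} -> T2
end

# Under "anything on a field cycle stays boxed":
#   Tuple{Int,Float64} : inlined (no cycle)
#   Tuple{Float64,T2}  : inlined (T2 itself is boxed, breaking the cycle)
#   Tuple{Int,T2}      : boxed, only because T2's definition mentions it
# Whether Tuple{T1,T2} is inlined thus depends on T2's definition,
# not just on properties of Tuple, T1, and T2 in isolation.
```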

@quinnj
Member

quinnj commented Nov 17, 2016

Healthy bump given the 0.6 timeline; I think we'd all love to see this get in.

@vchuravy vchuravy added this to the 1.0 milestone Dec 24, 2016
@vchuravy
Member

I would have liked to see this in 0.6, since I think it is a very valuable optimisation, especially for RefValue, and it is necessary on the GPU for some core features; but with the feature freeze for v0.6 in about a week, that doesn't look likely.

It would be awesome to have this early on for the next release cycle!

@yuyichao
Contributor

This is actually independent of stack allocation of unescaped RefValues.

@vtjnash
Member

vtjnash commented Dec 24, 2016

I would have liked to see this in 0.6 since I think this is a very valuable optimisation especially for RefValue

Just thought I'd drop by to point out that this PR won't alter RefValue (that optimization is, most nearly, d189cb3). They are relatively orthogonal optimizations.
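A sketch of the distinction (example assumed, not from the PR): the d189cb3-style optimization targets a mutable box that never escapes a function, whereas this PR changes how immutables are laid out inside fields and arrays.

```julia
# An unescaped mutable box that escape analysis can demote to the
# stack; this PR's layout changes are orthogonal to it.
function accumulate_to(n)
    r = Ref(0)          # Base.RefValue{Int}; never leaves this function
    for i in 1:n
        r[] += i
    end
    return r[]
end
```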

@vchuravy
Member

Yeah, sorry, I mixed the two things up in my head, since this includes a version of d189cb3 with stack_new.

@StefanKarpinski
Member

This is an optimization and therefore not release blocking. Realistically, this PR is not going to be merged in this form, so we may as well close it and take it off the 1.0 milestone.

@StefanKarpinski StefanKarpinski removed this from the 1.0 milestone Jul 20, 2017
@davidanthoff
Contributor

Is there a high-level issue that tracks progress on this general theme? It would be good to have something open that refers to this optimization.

@StefanKarpinski
Member

cc @Keno

@vtjnash
Member

vtjnash commented Jul 20, 2017

It would be nice to keep this open to make it easier to find: while it won't be merged, we are likely to take many pieces from it.

@StefanKarpinski
Member

Why don't you make a "high level issue that tracks progress on this general theme" instead and link to this PR from there along with all the other relevant PRs?

@Keno
Member

Keno commented Jan 17, 2020

@vtjnash are you ready to close this issue now that you've implemented many of the pieces?

@vtjnash vtjnash closed this Jan 17, 2020
@DilumAluthge DilumAluthge deleted the ob/ptrfree branch March 25, 2021 22:10