
internals: better representation for code #31191

Merged 1 commit into master on Mar 29, 2019
Conversation

@vtjnash (Member) commented Feb 27, 2019

I had thought it might be nice to combine Lambda into MethodInstance, but that turned out to be really awkward, since they need to be constructed at different times. We want MethodInstance to be fairly unique on the specialization tuple (since it's our key for looking up various properties), but we may also need to store several copies of the code properties, specialized at various times along assorted dimensions of different inferred properties and world-applicability bounds. That meant we were forced to mutate things at various points in time, and it was really easy for things to become inconsistent. I think it'll be conceptually simpler if each type does a bit less. I even feel much better about describing this in the dev docs (still to do).

This'll help solve many of the problems with edges getting
mis-represented and broken by adding an extra level of indirection between
MethodInstance (now really just representing a particular specialization of a
method) and the executable object, now called Lambda (representing some
functional operator that converts some input arguments to some output
values, with whatever metadata is convenient to contain there).
This fixes many of the previous representation problems with back-edges,
since a MethodInstance (like Method) no longer tries to also represent a
computation. That task is now relegated strictly to Lambda.

@vtjnash added the labels "needs docs" (Documentation for this change is required) and "needs nanosoldier run" (This PR should have benchmarks run on it) on Feb 27, 2019
@JeffBezanson (Member) commented:
I think Lambda would benefit from a more specific name, indicating that it's a pretty low-level thing. Maybe NativeFunction, Callable, NativeCode, ... ?

Review thread on base/compiler/typeinfer.jl (outdated, resolved)
@vtjnash (Member, Author) commented Feb 28, 2019

I think NativeCode could be good. Or CodeInstance? (I think there's some chance in the future that it could merge with CodeInfo.)

@vchuravy (Member) commented:
I like CodeInstance

@vtjnash removed the "needs docs" label on Mar 1, 2019
@vtjnash (Member, Author) commented Mar 1, 2019

@nanosoldier runbenchmarks(ALL, vs=":master")

@nanosoldier (Collaborator) commented:
Your benchmark job has completed - possible performance regressions were detected. A full report can be found here. cc @ararslan

@vtjnash (Member, Author) commented Mar 4, 2019

@nanosoldier runbenchmarks(ALL, vs=":master")

@nanosoldier (Collaborator) commented:
Your benchmark job has completed - possible performance regressions were detected. A full report can be found here. cc @ararslan

@vtjnash removed the "needs nanosoldier run" label on Mar 5, 2019
@vtjnash (Member, Author) commented Mar 5, 2019

Alright, I admit I cheated a little bit to make the nanosoldier results look pretty nice. Anyway, I'm pretty happy with this chunk of work now. There are still many improvements that can be made (such as correctness of invoke, correctness of codegen, tracking forward edges, and using more caches for performance), but it should be enough to take care of known existing issues such as #29425, #29267, #29498, and #28595, and to set us up for further refinements in the future.

Review thread on base/compiler/typeinfer.jl (outdated, resolved), on these lines:
    print(iob, " (method too new to be called from this world context.)")
elseif ex.world > max_world(method)
    print(iob, " (method deleted before this world age.)")
Member:

I don't quite understand what changed to make this obsolete.

Member Author:

Methods don't have world-age bounds. They were never supposed to, and it doesn't make any logical sense for them to claim to have them.

Member:

Then what does the Method.max_world field mean?

Member Author:

It doesn't mean anything; that's why it's not correct to have, and it is now removed.

Member:

Does that mean that the world keyword argument to hasmethod should be deprecated (in 2.0)?

Member Author:

No. While Methods don't have any particularly meaningful world age, the lookup algorithm for a method does.

Review threads on src/builtins.c and src/julia.h (outdated, resolved)
{
    JL_TIMING(INFERENCE);
    if (jl_typeinf_func == NULL)
        return NULL;
    if (jl_is_method(mi->def.method) && mi->def.method->unspecialized == mi)
Member commented (on the code above):

If nospecialize is set on all arguments, will we still get a MethodInstance here to infer? It is useful to at least resolve things like loops and literals.

Member Author:

That's a nice feature that this work will allow us to add. Since we don't have it yet, I'm not concerned about it right now.

Review threads on src/gf.c and src/dump.c (resolved)
@Keno (Member) commented Mar 12, 2019

I see about a 10% performance regression in building the system image. Is that expected or am I doing something wrong?

@vtjnash (Member, Author) commented Mar 12, 2019

That might be about right, as some of the now-correct computations are much more expensive than what we had been doing. It would be helpful to know where the hotspots are, if you can gather some profiling data.

@vtjnash (Member, Author) commented Mar 12, 2019

I sometimes see ~10% normal variance (even NFC PRs like #31306 seem to show several percent variance), but this does seem like it may be a consistent regression. The PR:

Base  ─────────── 26.874227 seconds
Base64  ─────────  3.679461 seconds
CRC32c  ─────────  0.008079 seconds
SHA  ────────────  0.170542 seconds
FileWatching  ───  0.104288 seconds
Unicode  ────────  0.006611 seconds
Mmap  ───────────  0.072765 seconds
Serialization  ──  1.190580 seconds
Libdl  ──────────  0.029516 seconds
Markdown  ───────  1.036088 seconds
LibGit2  ────────  2.653477 seconds
Logging  ────────  0.282062 seconds
Sockets  ────────  1.571644 seconds
Printf  ─────────  0.007632 seconds
Profile  ────────  0.195968 seconds
Dates  ──────────  1.756954 seconds
DelimitedFiles  ─  0.109094 seconds
Random  ─────────  0.713382 seconds
UUIDs  ──────────  0.012662 seconds
Future  ─────────  0.004585 seconds
LinearAlgebra  ──  9.753868 seconds
SparseArrays  ───  3.961595 seconds
SuiteSparse  ────  1.476663 seconds
Distributed  ────  6.395261 seconds
SharedArrays  ───  0.159251 seconds
Pkg  ──────────── 10.984870 seconds
Test  ───────────  0.871862 seconds
REPL  ───────────  0.789499 seconds
Statistics  ─────  0.187705 seconds
Stdlibs total  ── 48.199787 seconds
Sysimage built. Summary:
Total ───────  75.075785 seconds 
Base: ───────  26.874227 seconds 35.7961%
Stdlibs: ────  48.199787 seconds 64.2015%
Generating precompile statements... 904 generated in  83.484133 seconds (overhead  57.593296 seconds)
julia> @time include("compiler/compiler.jl")
 20.449931 seconds (34.89 M allocations: 1.706 GiB, 4.91% gc time)
 18.274587 seconds (22.55 M allocations: 1.088 GiB, 4.04% gc time)
 18.564789 seconds (22.49 M allocations: 1.085 GiB, 4.21% gc time)

master branch point:

Base  ─────────── 24.532865 seconds
Base64  ─────────  3.872494 seconds
CRC32c  ─────────  0.008283 seconds
SHA  ────────────  0.181091 seconds
FileWatching  ───  0.090749 seconds
Unicode  ────────  0.006631 seconds
Mmap  ───────────  0.072882 seconds
Serialization  ──  1.177631 seconds
Libdl  ──────────  0.029778 seconds
Markdown  ───────  2.044509 seconds
LibGit2  ────────  2.682191 seconds
Logging  ────────  0.309563 seconds
Sockets  ────────  1.529000 seconds
Printf  ─────────  0.006040 seconds
Profile  ────────  0.167854 seconds
Dates  ──────────  1.732500 seconds
DelimitedFiles  ─  0.108260 seconds
Random  ─────────  0.650892 seconds
UUIDs  ──────────  0.012112 seconds
Future  ─────────  0.005767 seconds
LinearAlgebra  ──  9.602296 seconds
SparseArrays  ───  3.823795 seconds
SuiteSparse  ────  1.450669 seconds
Distributed  ────  6.428315 seconds
SharedArrays  ───  0.158997 seconds
Pkg  ──────────── 10.749139 seconds
Test  ───────────  0.852206 seconds
REPL  ───────────  0.805774 seconds
Statistics  ─────  0.174339 seconds
Stdlibs total  ── 48.747497 seconds
Sysimage built. Summary:
Total ───────  73.282180 seconds 
Base: ───────  24.532865 seconds 33.4773%
Stdlibs: ────  48.747497 seconds 66.5203%
Generating precompile statements... 957 generated in  81.815108 seconds (overhead  55.129309 seconds)
julia> @time include("compiler/compiler.jl")
 19.697466 seconds (31.17 M allocations: 1.508 GiB, 4.70% gc time)
 19.200277 seconds (25.49 M allocations: 1.232 GiB, 4.44% gc time)
 19.460540 seconds (25.47 M allocations: 1.231 GiB, 4.39% gc time)

Review threads on src/julia.h (three, all outdated and resolved)
@vtjnash (Member, Author) commented Mar 27, 2019

@JeffBezanson OK to merge? There's additional work I'd like to get started on that builds on this.

@vtjnash vtjnash merged commit 8c44566 into master Mar 29, 2019
@vtjnash vtjnash deleted the jn/lambda-edges branch March 29, 2019 16:11
vchuravy added a commit to JuliaLabs/Cassette.jl that referenced this pull request Apr 2, 2019
vchuravy added a commit to JuliaLabs/Cassette.jl that referenced this pull request Apr 3, 2019
vchuravy added a commit to JuliaLabs/Cassette.jl that referenced this pull request Apr 3, 2019

if (internal == 1) {
    mi->uninferred = jl_deserialize_value(s, &mi->uninferred);
    jl_gc_wb(mi, mi->uninferred);
Member commented (on the code above):

To me it looks like the order here was (and remains to this day) reversed from serialization: compare

julia/src/dump.c, lines 726–736 at c3235cd:

write_uint8(s->s, internal);
if (!internal) {
    // also flag this in the backref table as special
    uintptr_t *bp = (uintptr_t*)ptrhash_bp(&backref_table, v);
    assert(*bp != (uintptr_t)HT_NOTFOUND);
    *bp |= 1;
}
if (internal == 1)
    jl_serialize_value(s, (jl_value_t*)mi->uninferred);
jl_serialize_value(s, (jl_value_t*)mi->specTypes);
jl_serialize_value(s, mi->def.value);
with

julia/src/dump.c, lines 1611–1627 at c3235cd:

int internal = read_uint8(s->s);
mi->specTypes = (jl_value_t*)jl_deserialize_value(s, (jl_value_t**)&mi->specTypes);
jl_gc_wb(mi, mi->specTypes);
mi->def.value = jl_deserialize_value(s, &mi->def.value);
jl_gc_wb(mi, mi->def.value);
if (!internal) {
    assert(loc != NULL && loc != HT_NOTFOUND);
    arraylist_push(&flagref_list, loc);
    arraylist_push(&flagref_list, (void*)pos);
    return (jl_value_t*)mi;
}
if (internal == 1) {
    mi->uninferred = jl_deserialize_value(s, &mi->uninferred);
    jl_gc_wb(mi, mi->uninferred);
}

If this is indeed a bug, I can submit a fix.
