ideas for reducing compiler latency via the interpreter #33333

timholy · 2019-09-20T00:22:34Z

This is a parallel issue to #33326, focused instead on the interpreter. The basic idea is that one of the best ways to reduce inference time is to not infer at all.

It's worth noting that JuliaInterpreter may be a useful playground for a subset of these ideas.

timholy · 2019-09-20T00:23:55Z

Idea 1: infer less stuff from top level. The heuristic might be that from top level, any call that (1) doesn't call a method already on the stack (i.e., no recursion) and (2) doesn't contain a backwards-pointing :goto or :gotoifnot (i.e., no loop) is unlikely to be performance sensitive and doesn't deserve inference time.

timholy · 2019-09-20T00:28:58Z

Idea 2: split method bodies, using the interpreter for everything except basic blocks that are inside loops. This is a more extreme version of idea 1, isolating the performance-sensitive parts of methods for compilation and interpreting everything else.

timholy · 2019-09-20T00:34:41Z

Idea 3: measure run time performance and compile the hot spots. This is harder than it sounds, because (1) measuring performance costs performance, and (2) you may have to solve the "entry point" problem if you discover that you're in the middle of something that's taking forever and need to switch to compiled mode mid-stream (JuliaDebug/JuliaInterpreter.jl#44).

timholy · 2019-09-20T00:42:35Z

Idea 4: make the interpreter faster. Relying on the interpreter for more stuff inevitably puts pressure on its performance, and right now it's not great. Some thoughts:

introduce local method tables to circumvent much of the cost of method dispatch (already pioneered in JuliaInterpreter)
more efficient :foreigncall handling (already pioneered in JuliaInterpreter, which just compiles these)
an efficiently-interpretable bytecode for Julia's compressed ASTs. A possible sketch can be found in WIP: a new serialization format for optimizing & executing lowered IR JuliaDebug/JuliaInterpreter.jl#309. (There are some things I'd definitely change, but hopefully some of the ideas are good.)
(edit by @JeffBezanson ) The interpreter currently looks up all globals, even constants, in the module's hash table. Removing that overhead is a big piece of low-hanging fruit.

JeffBezanson · 2019-09-20T02:50:36Z

Idea 5: This is another approach to incrementally switching from the interpreter to the compiler.

Currently, if we want to compile a given method, we also need to compile all of the methods it directly calls (i.e. not inlined, and not dynamically dispatched via jl_apply_generic). Instead, we could emit a call to a stub that invokes the interpreter for the called method. If and when we decide to compile that method, we replace the pointer to refer to the compiled version.

Then we could try various strategies for picking methods to compile. One simple one would be to initially only compile methods with loops (backwards branches). For other methods, keep a counter and compile them once they are called a certain number of times or with a certain frequency.

jl_compile_method_internal in gf.c generally decides whether to compile things; for instance it handles the logic for --compile=min.

JeffBezanson added the compiler:latency Compiler latency label Sep 20, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ideas for reducing compiler latency via the interpreter #33333

ideas for reducing compiler latency via the interpreter #33333

timholy commented Sep 20, 2019 •

edited

Loading

timholy commented Sep 20, 2019

timholy commented Sep 20, 2019

timholy commented Sep 20, 2019

timholy commented Sep 20, 2019 •

edited by JeffBezanson

Loading

JeffBezanson commented Sep 20, 2019

ideas for reducing compiler latency via the interpreter #33333

ideas for reducing compiler latency via the interpreter #33333

Comments

timholy commented Sep 20, 2019 • edited Loading

timholy commented Sep 20, 2019

timholy commented Sep 20, 2019

timholy commented Sep 20, 2019

timholy commented Sep 20, 2019 • edited by JeffBezanson Loading

JeffBezanson commented Sep 20, 2019

timholy commented Sep 20, 2019 •

edited

Loading

timholy commented Sep 20, 2019 •

edited by JeffBezanson

Loading