Use const propagating inference in REPL tab completions #36437

Keno · 2020-06-25T21:42:13Z

We can overload getfield now along the lines of:

julia> struct Foo
       end

julia> Base.propertynames(::Foo) = (:a,:b,:c)

julia> Base.getproperty(::Foo, s::Symbol) = s === :a ? 1 : s === :b ? 2 : s === :c

and while we do get tab completion if the value is assigned to a symbol, e.g.

julia> x = Foo()
Foo()

julia> x.<TAB>
a b c

it doesn't work for even slightly more complicated expressions, like:

julia> Foo().<TAB> # No results

julia> arr = Any[Foo(), 1]
julia> arr[1]. # No results

This is probably my main frustration when using the REPL with packages that make use of this features heavily (e.g. PyCall), since I rely on tab completion for discovery of ABIs that I'm not particularly familiar with.

I'm thinking we might be able to solve this by having the REPL completer perform type inference for a thunk of

propertynames(<expr>)

e.g. propertynames(Foo()) for the first example:

1 ─ %1 = Main.Foo()::Core.Compiler.Const(Foo(), false)
│   %2 = Main.propertynames(%1)::Core.Compiler.Const((:a, :b, :c), false)
└──      return %2

The compiler knows what the result will be, so we should be able to use it.

The same strategy could work for the second example, but we'd need a mode where inference is allowed to read values out of global variables (of course those results couldn't be cached, etc.). We may also want to have a way to put a total time bound on inference to avoid blocking the interactive use case too much (or maybe even be able to cancel inference - e.g. if another key is pressed).

The text was updated successfully, but these errors were encountered:

JeffBezanson · 2020-06-26T15:23:27Z

It seems pretty reasonable to me to just evaluate the whole expression before the dot. It would be very unusual to put side effects there.

Keno · 2020-06-26T20:05:21Z

I agree that it's unusual, but I also think it's dangerous. Think of things like

pop!(a).<TAB>

Is that perhaps bad style? yes, but I think it might be common enough that we shouldn't just evaluate it. Also, type inference might able to give answers even when just straight up evaluation can't (e.g. if you know the type and propertynames is always based on the type).

tkf · 2020-06-27T23:49:05Z

Eval-based completion would be useful and very robust but I'm against making it the default behavior, too. It'd be nice to add extra key bind like Ctrl+Shift+TAB to do this.

staticfloat · 2023-03-30T20:43:32Z

I took a stab at this here: #49199

This PR generalizes the idea from #49199 and uses inference to analyze the types of REPL expression. This approach offers several advantages over the current `get_[value|type]`-based implementation: - The need for various special cases is eliminated, as lowering normalizes expressions, and inference handles all language features. - Constant propagation allows us to obtain accurate completions for complex expressions safely (see #36437). Analysis on arbitrary REPL expressions can be done by the following steps: - Lower a given expression - Form a top-level `MethodInstance` from the lowered expression - Run inference on the top-level `MethodInstance` This PR implements `REPLInterpreter`, a custom `AbstractInterpreter` that: - aggressively resolve global bindings to enable reasonable completions for lines like `Mod.a.|` (where `|` is the cursor position) - does not optimize the inferred code, as `REPLInterpreter` is only used to obtain the type or constant information of given expressions In particular, aggressive binding resolution presents challenges for `REPLInterpreter`'s cache validation (since #40399 hasn't been resolved yet). To avoid cache validation issue, `REPLInterpreter` only allows aggressive binding resolution for top-level frame representing REPL input code (`repl_frame`) and for child `getproperty` frames that are constant propagated from the `repl_frame`. This works, since 1.) these frames are never cached, and 2.) their results are only observed by the non-cached `repl_frame` Note that the code cache for `REPLInterpreter` is separated from the native code cache, ensuring that code caches produced by `REPLInterpreter`, where bindings are aggressively resolved and the code is not really optimized, do not affect the native code execution. A hack has also been added to avoid serializing `CodeInstances`s produced by `REPLInterpreter` during precompilation to workaround #48453. closes #36437 replaces #49199

This PR generalizes the idea from #49199 and uses inference to analyze the types of REPL expression. This approach offers several advantages over the current `get_[value|type]`-based implementation: - The need for various special cases is eliminated, as lowering normalizes expressions, and inference handles all language features. - Constant propagation allows us to obtain accurate completions for complex expressions safely (see #36437). Analysis on arbitrary REPL expressions can be done by the following steps: - Lower a given expression - Form a top-level `MethodInstance` from the lowered expression - Run inference on the top-level `MethodInstance` This PR implements `REPLInterpreter`, a custom `AbstractInterpreter` that: - aggressively resolve global bindings to enable reasonable completions for lines like `Mod.a.|` (where `|` is the cursor position) - aggressively concrete evaluates `:inconsistent` calls to provide reasonable completions for cases like `Ref(Some(42))[].|` - does not optimize the inferred code, as `REPLInterpreter` is only used to obtain the type or constant information of given expressions Aggressive binding resolution presents challenges for `REPLInterpreter`'s cache validation (since #40399 hasn't been resolved yet). To avoid cache validation issue, `REPLInterpreter` only allows aggressive binding resolution for top-level frame representing REPL input code (`repl_frame`) and for child `getproperty` frames that are constant propagated from the `repl_frame`. This works, since 1.) these frames are never cached, and 2.) their results are only observed by the non-cached `repl_frame` `REPLInterpreter` also aggressively concrete evaluate `:inconsistent` calls within `repl_frame`, allowing it to get get accurate type information about complex expressions that otherwise can not be constant folded, in a safe way, i.e. it still doesn't evaluate effectful expressions like `pop!(xs)`. Similarly to the aggressive binding resolution, aggressive concrete evaluation doesn't present any cache validation issues because `repl_frame` is never cached. Also note that the code cache for `REPLInterpreter` is separated from the native code cache, ensuring that code caches produced by `REPLInterpreter`, where bindings are aggressively resolved and the code is not really optimized, do not affect the native code execution. A hack has also been added to avoid serializing `CodeInstances`s produced by `REPLInterpreter` during precompilation to workaround #48453. closes #36437 replaces #49199

StefanKarpinski · 2023-04-03T21:56:56Z

My favorite example I can cook up is this:

julia> fs = [()->Foo(), ()->(println("hi!"); Foo())]
2-element Vector{Function}:
 #3 (generic function with 1 method)
 #4 (generic function with 1 method)

julia> fs[1]().

a  b  c
julia> fs[2]().

a  b  c
julia> fs[2]().a
hi!
1

…g#49206) This PR generalizes the idea from JuliaLang#49199 and uses inference to analyze the types of REPL expression. This approach offers several advantages over the current `get_[value|type]`-based implementation: - The need for various special cases is eliminated, as lowering normalizes expressions, and inference handles all language features. - Constant propagation allows us to obtain accurate completions for complex expressions safely (see JuliaLang#36437). Analysis on arbitrary REPL expressions can be done by the following steps: - Lower a given expression - Form a top-level `MethodInstance` from the lowered expression - Run inference on the top-level `MethodInstance` This PR implements `REPLInterpreter`, a custom `AbstractInterpreter` that: - aggressively resolve global bindings to enable reasonable completions for lines like `Mod.a.|` (where `|` is the cursor position) - aggressively concrete evaluates `:inconsistent` calls to provide reasonable completions for cases like `Ref(Some(42))[].|` - does not optimize the inferred code, as `REPLInterpreter` is only used to obtain the type or constant information of given expressions Aggressive binding resolution presents challenges for `REPLInterpreter`'s cache validation (since JuliaLang#40399 hasn't been resolved yet). To avoid cache validation issue, `REPLInterpreter` only allows aggressive binding resolution for top-level frame representing REPL input code (`repl_frame`) and for child `getproperty` frames that are constant propagated from the `repl_frame`. This works, since 1.) these frames are never cached, and 2.) their results are only observed by the non-cached `repl_frame` `REPLInterpreter` also aggressively concrete evaluate `:inconsistent` calls within `repl_frame`, allowing it to get get accurate type information about complex expressions that otherwise can not be constant folded, in a safe way, i.e. it still doesn't evaluate effectful expressions like `pop!(xs)`. Similarly to the aggressive binding resolution, aggressive concrete evaluation doesn't present any cache validation issues because `repl_frame` is never cached. Also note that the code cache for `REPLInterpreter` is separated from the native code cache, ensuring that code caches produced by `REPLInterpreter`, where bindings are aggressively resolved and the code is not really optimized, do not affect the native code execution. A hack has also been added to avoid serializing `CodeInstances`s produced by `REPLInterpreter` during precompilation to workaround JuliaLang#48453. closes JuliaLang#36437 replaces JuliaLang#49199

quinnj mentioned this issue Sep 7, 2021

Nested tab complete quinnj/JSON3.jl#177

Closed

aviatesk mentioned this issue Mar 31, 2023

REPLCompletions: replace get_type by the proper inference #49206

Merged

oscardssmith closed this as completed in #49206 Apr 3, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Use const propagating inference in REPL tab completions #36437

Use const propagating inference in REPL tab completions #36437

Keno commented Jun 25, 2020

JeffBezanson commented Jun 26, 2020

Keno commented Jun 26, 2020

tkf commented Jun 27, 2020

staticfloat commented Mar 30, 2023

StefanKarpinski commented Apr 3, 2023

Use const propagating inference in REPL tab completions #36437

Use const propagating inference in REPL tab completions #36437

Comments

Keno commented Jun 25, 2020

JeffBezanson commented Jun 26, 2020

Keno commented Jun 26, 2020

tkf commented Jun 27, 2020

staticfloat commented Mar 30, 2023

StefanKarpinski commented Apr 3, 2023