
Slicing rules for desugaring #41

Closed
4 tasks done
rolyp opened this issue May 19, 2020 · 16 comments

rolyp (Collaborator) commented May 19, 2020

For explorable-viz/fluid#278 and explorable-viz/fluid#277 we need syntax sugar. Slicing needs to be defined for desugaring.

  • syntax of surface language (as separate figure), including if..then..else fluid#115
  • desugaring definition, with auxiliary function to totalise an eliminator
  • forward slicing rules
  • backward slicing rules
  • prove that desugar_fwd and desugar_bwd form a Galois connection – see Desugaring Galois connection #52
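To make the Galois connection requirement in the last task concrete, here is a toy sketch (in Haskell; `fwd` and `bwd` are illustrative stand-ins, not the real `desugar_fwd`/`desugar_bwd`): two monotone maps form a Galois connection when `fwd x ⊑ y` iff `x ⊑ bwd y`.

```haskell
-- Toy illustration of the Galois connection property, on the lattice
-- Bool with False ⊑ True. fwd/bwd are stand-ins for desugar_fwd/desugar_bwd.
leq :: Bool -> Bool -> Bool
leq x y = not x || y                      -- False ⊑ True

leq2 :: (Bool, Bool) -> (Bool, Bool) -> Bool
leq2 (a, b) (c, d) = leq a c && leq b d   -- pointwise order on pairs

fwd :: (Bool, Bool) -> Bool               -- stand-in for desugar_fwd
fwd (a, b) = a || b

bwd :: Bool -> (Bool, Bool)               -- stand-in for desugar_bwd
bwd c = (c, c)

-- adjointness, checked exhaustively over the finite domain
galois :: Bool
galois = and [ leq (fwd (a, b)) y == leq2 (a, b) (bwd y)
             | a <- [False, True], b <- [False, True], y <- [False, True] ]
```

The real proof obligation replaces these toy maps with the forward and backward desugaring-slicing relations.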
rolyp mentioned this issue May 19, 2020 (7 tasks)
rolyp assigned min-nguyen and unassigned rolyp Jun 29, 2020
rolyp (Collaborator, Author) commented Jun 29, 2020

@min-nguyen Here’s the sketch from earlier that shows the type of the relation, the form of the judgement, an example rule and the Galois connection that we’ll derive from it.
[Attached image: sketch of the relation type, the form of the judgement, an example rule and the derived Galois connection]

Once you’ve completed the final tidy-up of explorable-viz/fluid#300, we can have a chat about how to get started with formalising desugaring.

rolyp (Collaborator, Author) commented Jul 6, 2020

@min-nguyen If you’re happy getting started with this over the next couple of days, a concrete suggestion would be to sketch out a definition of the “desugars to” relation (written above) in the style of Fig. 9 in the paper. Start by giving the syntax of the surface language in the style of Fig. 2. (Not necessarily in LaTeX – paper would be fine.) Then define a relation like Fig. 9. You can ignore the greyed-out T :: that occurs to the left of the judgement in every premise/conclusion – these correspond to traces in the implementation. For now we needn’t worry about these for desugaring, although we will want them at some point. Ping me if you have questions; otherwise let’s talk when I’m free on Weds.

rolyp (Collaborator, Author) commented Jul 13, 2020

@min-nguyen See below. I recommend using GitHub branches when working with the formalism – knowing you’re on a branch should allow you to feel more relaxed about making changes! Feel free to ping me about macros or any other LaTeX questions.

[Three attached images: handwritten notes on the formalism]

rolyp (Collaborator, Author) commented Jul 21, 2020

@min-nguyen Notes on desugaring rules that want to reuse the surface language:
[Attached image: notes on desugaring rules that reuse the surface language]

And on “totalising” an eliminator with a default continuation, needed for the desugaring of partial generators (and which we can also use to add catch-all _ clauses to the surface language):
[Attached image: notes on totalising an eliminator with a default continuation]

rolyp (Collaborator, Author) commented Jul 30, 2020

@min-nguyen We sketched the form of the relation, but I don’t think we discussed any specific rules. Something I mentioned last time: we may not need a trace after all for desugaring. Instead we can probably use the original (unsugared) expression, which contains enough information to run the computation backward. (You can actually take this approach with evaluation as well, at least for a deterministic language, but it becomes prohibitively expensive. IIRC, you end up having to do a lot of evaluation during backwards slicing – in fact more than you would do just executing the program once.)

[Attached image: a rule from the desugaring relation, with the corresponding backward slicing rule underneath]

So, in the upper part of the diagram above, I’ve given a rule from the desugaring relation. Underneath is the corresponding backward slicing rule. The basic idea behind the bwd rules is to push the annotations on “core” language constructors back onto “surface” language constructors. For now you can just have an intuitive stab at this; when we come to define fwd slicing, we’ll need to make sure (and eventually prove) that the bwd propagation of annotations is sound w.r.t. fwd propagation, and vice versa. Don’t worry about this too much for now.

I think it will be useful here (and in the core language) to distinguish expressions which are unannotated all the way down (where the type of the annotations is Unit), from expressions where a bit is associated with every constructor (where the type of the annotations is 𝔹). For example, the parser should build SExpr Unit, which desugars to Expr Unit, which evaluates to Expl Unit and Val Unit. By doing map (const True), we can switch to a slicing setting, where desugar_fwd goes from SExpr 𝔹 to Expr 𝔹, and eval_fwd goes from Expr 𝔹 to Expl 𝔹 and Val 𝔹. That allows us to ignore annotations (since they’re unit) when we don’t care about them.

With that in mind, I’m abusing metavariables a bit here: on the left of the bwd arrow, we should probably think of s as ranging over SExpr Unit, whereas on the right of the bwd arrow, as ranging over SExpr 𝔹.
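The Unit/𝔹 annotation idea above can be sketched with a functor (hypothetical Haskell stand-ins for the PureScript `SExpr`; constructor names are illustrative):

```haskell
{-# LANGUAGE DeriveFunctor #-}
-- Hypothetical stand-in for annotated surface syntax: one annotation of
-- type a per constructor. The parser builds SExpr (); fmap (const True)
-- switches to the slicing setting over Bool.
data SExpr a = SVar a String
             | SPair a (SExpr a) (SExpr a)
  deriving (Show, Eq, Functor)

toSliced :: SExpr () -> SExpr Bool
toSliced = fmap (const True)
```

In this setting `desugar_fwd` would go from `SExpr Bool` to `Expr Bool`, while plain desugaring works over `SExpr ()`.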

rolyp (Collaborator, Author) commented Aug 6, 2020

@min-nguyen Ok, I’ve updated the implementation along the lines of what I proposed yesterday. There is now a type class [Bounded]JoinSemilattice for bona fide join semilattices, with join total and bot a nullary constant. There are instances for Boolean and Unit.

The old [Bounded]JoinSemilattice type class is now called [Bounded]Slices and has a partial maybeJoin and botOf: a -> a.

I’ve parameterised the Slices instances for Expr and Val so that they work for annotations of an arbitrary JoinSemilattice. For notational convenience (so that we can write joins in backward slicing without worrying that they may technically be undefined), every Slices is also a JoinSemilattice which implements join as fromJust <<< maybeJoin, using a helper called definedJoin.

I’ve also generalised the expr and val helpers so that they construct a term using bot, rather than assuming the annotation to be of type Boolean (which I think is the problem you were facing yesterday). And the Pretty instance is similarly parameterised over an arbitrary BoundedJoinSemilattice (since it uses expr, although only once).
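The classes described above might look roughly like this in Haskell (a hedged transcription; the actual implementation is PureScript and the details may differ):

```haskell
-- Bona fide join semilattices: total join, with bot a nullary constant.
class JoinSemilattice a where
  join :: a -> a -> a

class JoinSemilattice a => BoundedJoinSemilattice a where
  bot :: a

instance JoinSemilattice Bool where join = (||)
instance BoundedJoinSemilattice Bool where bot = False

instance JoinSemilattice () where join _ _ = ()
instance BoundedJoinSemilattice () where bot = ()

-- Slices: join may be undefined (maybeJoin); botOf erases annotations.
class Slices a where
  maybeJoin :: a -> a -> Maybe a
  botOf :: a -> a

-- definedJoin recovers a total join by asserting definedness.
definedJoin :: Slices a => a -> a -> a
definedJoin x y = maybe (error "maybeJoin: undefined") id (maybeJoin x y)
```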

rolyp (Collaborator, Author) commented Aug 6, 2020

@min-nguyen Nib was loose on my pencil 🤦 .

Thinking some more, having eval work with Expr Unit and eval_fwd work with Expr 𝔹 sounds nice, but might not be easy to do in practice. We will have an environment of type Env Unit (containing primitives and prelude) which we use to evaluate the program, which we will need to be of type Env 𝔹 when we come to forward slice in that environment. Although conceptually this is as simple as applying map (const true), unfortunately map doesn’t really work with environments: an environment is a DAG with a vast amount of redundancy (lots of sharing), and recursing over the DAG effectively unravels it into an enormous tree. Without some kind of memoisation/dynamic programming, it’s not a feasible computation.

So, in the formalism I think we can say the annotation type is (meta-)unit if it is convenient to do so, but in the implementation we will for now assume it’s always Boolean, even if sometimes we don’t care.

rolyp (Collaborator, Author) commented Aug 11, 2020

@min-nguyen Ok, I’ve had a chance to look over what you’ve done. This is looking good – you’ve really made progress. We’ve still a fair way to go to reach the point we need to get to, but that’s not a reflection on what you’ve done, just the normal process of iterating over something until it converges.

I have lots of minor suggestions, and a couple of other thoughts that we might need to discuss. If you could aim to process these changes by Friday then we can talk about other things.

  • I suggest dropping the σ,e index on the totalising relation; they look like additional arguments. It can be reasonable to use subscripts to generate overloads/variants of relation symbols, but it’s best used judiciously and probably also best to use a different font (e.g. a sans serif font) so that it’s clear the symbol isn’t acting as a metavariable.
  • Let’s stick to UK spelling, so totalising.
  • It’s probably overkill to define totalise as an inductively defined relation. It should be “obviously” total and deterministic, and thus definable directly as a function (via a set of defining recursive equations – look at the LaTeX environments used in Fig. 10 or 11). One reason to use inductive definitions is when we need to be able to treat the definition as a data type (as with evaluation traces); another is when the thing you’re defining is a proper relation, or at least not obviously a function. Neither of those apply here, so a function is probably best.
  • The var-elim and var-expr cases of totalise could use some parentheses around the eliminator argument. The nil case could probably lose the parentheses inside the Cons.
  • Contexts (Fig. 1) need to allow Δ as a metavariable.
  • Fig. 17 refers to “extended” terms in a few places. Let’s standardise on “surface-language terms”. For the use inside the figure itself we can probably say “Raw surface terms”.
  • Let’s standardise on “list range” rather than “list sequence”. Don’t forget the rule names as well.
  • Fig. 6 (Typing rules for terms) still includes surface-language terms.
  • Let’s always put the Nil case before the Cons case (e.g. in the definition of Pattern, Fig. 17)
  • Double and triple primes don’t always format well in LaTeX. There are macros in scope called (I think) \twoPrime and \threePrime which should help with this.
  • Drop the let binding from the list-comp-gen rule (but see comment below).
  • The list-comp-qual desugaring rule overlaps with list-comp-true, which makes the rules non-deterministic (and potentially non-terminating). This can be fixed by adding an appropriate side-condition to list-comp-qual.
  • I feel like we should add regular list notation to the surface language now, so that [] desugars to Nil and [s · \vec{s}] desugars to Cons(e, e') in the obvious way. Otherwise it seems weird that we support list comprehensions but not normal lists!
  • I think we need to be more explicit about an operation (probably best defined as a function) that converts a pattern and a continuation into a (singleton) eliminator. In particular list-comp-gen invokes totalise with an argument that isn’t an eliminator. (p \mapsto e doesn’t really make sense.)
  • The list-comp-decl rule is problematic because structured let bindings (let p = e in e’) don’t exist in the core language either, nor are they trivial to add without introducing the notion of pattern (or singleton eliminator) into the core. (Currently they are defined in the appendix.) Assuming we have a function that we can use to turn p and e into an eliminator σ, we can use that here and then desugar the let qualified into a match..as. That in turn suggests a better way of dealing with structured let more generally, rather than the way it’s currently dealt with in the appendix.
  • We should probably assume range is a curried function.
  • I don’t think we need Fig. 25 (pattern-matching by pattern).
  • Some of the RHSs in the backward slicing rules don’t look right – backward slicing must produce a slice of the original expression. In particular list-comp-true and list-comp-qual look wrong. I haven’t looked carefully yet at the others.

The list-comp-gen rule is going to take some thought. You seem to have a slightly odd mixture here: an anonymous function (which doesn’t exist in the core language), which you bind to a name f. Generating names during desugaring will require freshness side-conditions, which will make the rules non-deterministic, which will mean (for backward slicing) we will need a trace after all to record the names that were picked. That might be a complication to avoid if we can, which suggests we need anonymous functions in the core. But then without some further refactoring, we’ll end up with two kinds of closure (recursive and non-recursive). These are low-level details, mostly presentational, but we’ll need to decide how best to rearrange things. For now I’ve suggested dropping the binding to f and assuming anonymous functions exist in the core.

Finally, there’s some thinking to do about annotation-propagation in the slicing rules. The slicing rules we’ve given elsewhere don’t uniformly assume expressions have annotations on them (although the implementation does), but they do propagate annotations on data constructors and constants. We’ll need to do something similar for desugaring. But I think it’s fine to get the structure of the rules right first, and then think about the annotations.
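The “totalise with a default continuation” operation mentioned in the bullets can be sketched on a toy list-only eliminator (illustrative types, not the paper’s definitions):

```haskell
-- Toy list eliminator: one branch per constructor, either possibly missing.
data Elim k = Elim { nilBranch :: Maybe k, consBranch :: Maybe k }
  deriving (Show, Eq)

-- totalise fills each missing branch with the default continuation, so
-- matching can no longer get stuck (cf. adding a catch-all _ clause).
totalise :: k -> Elim k -> Elim k
totalise def (Elim n c) =
  Elim (Just (maybe def id n)) (Just (maybe def id c))
```

The real operation also recurses into nested eliminators; this sketch only shows the top-level idea of completing the missing branches.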

rolyp (Collaborator, Author) commented Aug 26, 2020

@min-nguyen Changes from today’s discussion:

  • (Figs. 6, 7 and perhaps elsewhere) Let’s generalise the list typing rules so that lists are polymorphic.
  • Let’s write [ ] as [] and ... as .. (a la Haskell). I would put the .. in code font if it isn’t already.
  • Rename “nil” and “cons” rules in Fig. 17 to “empty list” and “non-empty list”.
  • We need pattern forms for list notation as well (empty and non-empty). Fig. 19 needs the corresponding typing rules.
  • The list typing rules (but not the range typing) need to be made polymorphic.
  • We need a separate typing judgement for the sequence form that occurs in the second premise of the non-empty list typing rule. There are other approaches here, but this is consistent with the treatment of sequences of qualifiers.
  • if-then-else should now desugar to a lambda.
  • match-as needs to desugar the σ too (needs a separate definition).
  • totalise and branch can probably be combined into a single definition (see figure below).
  • Drop s' output of forward slicing judgement.
  • list-comp-gen backward slicing rule: express second premise in terms of untotalise function, which takes the original p and the output of totalise, σ, and produces a (slice of) the original e that p was mapped to.
  • Introduce “Surface terms” to Fig. 17, ranged over by t, which attach an annotation to a raw surface term. All the recursive occurrences of s in the definition of s should become t (and similarly for the occurrences of s in q).
  • We needn’t actually have lambdas as part of the core language – I think it’s enough just to assume that it’s clear how to add lambdas (since they’re described in the appendix), and then make use of them in the desugaring. The problem with adding them to the core language is that they give rise to a different notion of closure. We don’t want 2 kinds of closure in the core and unifying them isn’t easy. (It’s not so much technically challenging, just inelegant.)

Then we’ll be in a position to enrich Fig. 24 to specify how the annotations on an e are converted back into annotations on a t.

[Attached image: combined totalise/branch definition]

rolyp (Collaborator, Author) commented Sep 8, 2020

@min-nguyen I’m thinking we should aim for ICFP next year (deadline ~Feb). I’ll email separately about that. First some thoughts on how to move forward with the desugaring formalisation and implementation. These need to be tied together as we move forward and you should be in the driving seat for both, with my guidance.

A couple of high-level points, now that things are a bit clearer. First, I think we should treat eliminators as core language only — very similar to the way Haskell pattern-matching compiles to case trees. We already have some of this in place, as we had to introduce a pattern syntax for list comprehensions and a way of “totalising” these into eliminators. What I propose we do further is allow functions to be defined “equationally”, i.e. as lists of equations with patterns on the LHS. These equations should be turned into an eliminator by merging their patterns. This operation is only partial; if it fails, the equations are ill-formed, e.g. because they try to merge constructors from distinct data types.

The implementation does this already, so this is just a question of formalising this aspect of the implementation. The upside is that we will have a clean separation of concerns between the surface language (which uses more familiar pattern syntax) and core language (which uses eliminators), and a small but useful contribution of the paper will be to show how Galois slicing works in that setting.
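The merging of per-clause eliminators can be sketched on the same kind of toy list-only eliminator (illustrative; real merging handles all datatypes, merges nested eliminators recursively, and fails on clauses that mix constructors from different types — here simplified so that at most one clause may define each branch):

```haskell
-- Toy list eliminator, as before.
data Elim k = Elim { nilB :: Maybe k, consB :: Maybe k }
  deriving (Show, Eq)

-- Partial merge: Nothing signals ill-formed (overlapping) equations.
merge :: Elim k -> Elim k -> Maybe (Elim k)
merge (Elim n1 c1) (Elim n2 c2) = Elim <$> pick n1 n2 <*> pick c1 c2
  where
    pick (Just x) Nothing  = Just (Just x)
    pick Nothing  (Just y) = Just (Just y)
    pick Nothing  Nothing  = Just Nothing
    pick (Just _) (Just _) = Nothing   -- overlapping branches: ill-formed
```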

The other point concerns list notation (let’s use this term to mean notation like [x, y, …] rather than Nil and Cons). (Minor aside: it’s annoying to have both forms in the same surface language, but it seems we can’t do without; we might be able to make this slightly less painful by renaming Nil to [], and perhaps Cons to infix :, but I won’t suggest that just yet.)

The key observation about list notation is that I think we need to be syntactically explicit about the implied Cons nodes. Currently we give the syntax of lists as either [] or [t · \vec{t}], where \vec is effectively interpreted as a meta-level type constructor for sequences. This doesn’t really allow the surface language to track the annotations on the Cons nodes in the desugared expression. I think we need to model the grammar of list notation more explicitly:

[Attached image: grammar of the explicit list notation]

The “list rest” metavariable is the letter ell. I’ve left the typing and evaluation rules for ell for you to do (they’re straightforward). Hopefully this makes some kind of sense; if the rationale for doing things this way isn’t quite clear yet, hopefully it will become so once you start writing down the rules that propagate annotations between the two notations. (Returning to the aside above: if we were to rename Nil to [], we no longer need any extra rules for [].)
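The explicit grammar might be sketched like this (illustrative Haskell types, not Fluid’s; list elements are shown already desugared for brevity):

```haskell
-- Core expressions: each constructor carries an annotation slot a.
data Expr a = EInt a Int | ENil a | ECons a (Expr a) (Expr a)
  deriving (Show, Eq)

-- "List rest" (the ell metavariable): each implied Cons node gets its own
-- annotation, so desugaring can transfer annotations between notations.
data ListRest a = End a                        -- the closing ]
                | Next a (Expr a) (ListRest a) -- element, then rest
  deriving (Show, Eq)

desugarRest :: ListRest a -> Expr a
desugarRest (End a)      = ENil a              -- ] keeps its annotation on Nil
desugarRest (Next a e l) = ECons a e (desugarRest l)
```

The point is that every Cons in the desugared term has a corresponding annotation position in the surface notation, which is what the flat `[t · \vec{t}]` presentation couldn’t track.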

I’ll create a new issue for the “equational definitions” desugaring, and create a list of todo’s for list notation and other minor tweaks, this afternoon. (I’m tied up now until about 3pm.)

rolyp (Collaborator, Author) commented Sep 9, 2020

@min-nguyen Todo’s relating to list notation and other minor comments:

  • Let’s swap metavariables s and t so that s ranges over surface terms (I think this will be more memorable, but it means updating all the figures – sorry!)
  • New list notation as described above and associated typing and evaluation rules.
  • List patterns will also need to support the new list notation, with an analogous grammar.
  • Eliminators no longer need to be part of the surface language, and so can be removed from Fig. 17. We will need to address the question of partiality but we’ll do that as part of Piecewise definitions #51.
  • I don’t really understand the definition of eliminator desugaring (Fig. 23). Luckily we don’t need it any more.
  • Based on your observation that desugaring and forward slicing of desugaring are not very different, I suggest we turn Fig. 22 into a definition of forward slicing, rather than desugaring. This will mean extending each rule so that the conclusion takes a surface term on the LHS rather than a raw surface term and then specifies how annotations are produced on the output term. We won’t give an explicit definition of desugaring itself, but just say (in the text) that it’s the same as the forward slicing rules, but with the annotations disregarded.
  • Define forward and backward slicing for desugaring “list rest”. This is where you’ll see the point of the explicit list notation, because the annotations will be transferred to and from the desugared form.
  • Kill match from the surface language until we complete Piecewise definitions #51
  • The list-comp-true rule could be renamed list-comp-done.
  • I don’t think the t’ ∈ Bool side-condition on list-comp-guard is necessary (and I don’t really know what it means).
  • We need to take care to visually distinguish “object language” names like concatMap and range from meta-language names like totalise. For the latter, use the same formatting macro as we use for the val function that occurs in Fig. 15, which is also a meta-level function.
  • Figs. 26 & 27: If we want to use metavariables to give the signatures of these operations, I would use an equational notation, e.g. totalise ρ κ = σ. (That’s not a particularly common way of giving a function signature, but it’s better than treating metavariables as types. Note that in this case the first p is in the wrong font.) Alternatively, we could give these operations actual types, using indexed families (the set theoretic notation for dependent types). We can talk about how to do this; it’s not necessary, but it is a way of making definitions precise, and can be useful when operations do complex things like transform variable contexts in certain ways.
  • Fig. 26: a Cons eliminator has syntax Cons σ, not Cons ↦ σ. A couple of the equations have an extra .
  • Fig. 26: also, the Cons rules that defer to totalise don’t need to wrap the totalise call in parentheses – in the metalanguage we’re assuming functions have the notation f(x), in contrast to f x in the object language, which does sometimes need to be parenthesised.

rolyp (Collaborator, Author) commented Sep 19, 2020

@min-nguyen New to do list (migrating 2 remaining tasks from last week):

  • Update totalise and untotalise to new list notation.
  • Fig. 27: this definition looks broken in a few places: the third (false) equation looks incorrect; metavariables look wrong in equations 6 and 8 (cons cases). Inner parentheses in the fourth (pair) equation are also unnecessary. And I would probably use τ rather than σ on the LHS in the cases where σ is not already being introduced.
  • Fix list notation errors in various figures. Ensure macros are used consistently for all new notation.
  • Pattern typing rules should use \patt- macros.
  • List notation start and end symbols [ and ] should use same font as exNil macro. You should probably wrap the brackets in squiggly brackets { }; this will stop LaTeX treating the [ as a bracket symbol and messing up the layout.
  • Ensure variable fonts are used consistently: taking types as an example, A (math font) is a metavariable which ranges over (object-language) types; List (code font) is an object language type.
  • Forward slicing rules should use forward-slicing arrow.
  • Add the forward and backward annotation-propagation behaviour to the desugaring rules. Start with the list syntax.
  • Form of the judgement (metavariable p) is incorrect in Fig. 21.
  • Figs. 28 and 29 should use the \elim- macros and (for list eliminators) the \branchNil and \branchCons macros.
  • Use \vec rather than \overrightarrow for vectors.

rolyp (Collaborator, Author) commented Oct 2, 2020

@min-nguyen Adding some todo’s for today’s chat:

  • add annotations to the “list rest” and qualifier syntax
  • explicit if constructor for guards (check the macros do their work here!)
  • add the [] case to the desugaring rules (we should probably add rules for all surface language forms at some point, but it’s not urgent now)
  • remove spurious α’s in a few places in the outputs
  • identify all the places where there are annotations on the LHS or RHS of every judgement (e.g. nested function applications, qualifiers inside list comprehensions, etc.) and ensure the rules specify how all of these are consumed/produced as necessary – annotations should be conjoined going forward and disjoined going backwards
  • by the same token, remember that λσ is a raw term (and thus has its own annotation)
  • for the meta-level sequence notation \vec{q}, I suggest using the “·” (\cdot) symbol for cons and concat, so for example q · \vec{q} would pattern-match a sequence into a head and a tail (you can use the macro \concat for this).

It would be great if you could deal with these and make a start on #51 by next Friday. We can also start talking about/defining the lattices induced by the α᾽s, which we’ll need for the Galois connection.

rolyp (Collaborator, Author) commented Oct 30, 2020

@min-nguyen Pic from today:
[Attached image: sketch from today’s discussion]
(It’s unfortunate that Haskell uses left-arrow, right-arrow and equals to indicate variable binding in various contexts.)
To do’s we discussed:

  • fix spacing after , in surface syntax
  • drop Fig. 22, and fix second case of Fig. 23 as discussed a couple of weeks ago
  • check metavariables in list-comp-decl fwd rule
  • turn elim into the definition of clause desugaring (write using desugaring arrow)
  • elim is incorrect as defined: it claims to take a clause, but we call it with arbitrary continuations. It also needs to produce an eliminator containing desugared expressions. To address these points, perhaps elim needs to take not a clause as input, but a non-empty list of patterns and an already desugared expression – see implementation
  • merge should be desugaring for non-empty list of clauses (write using desugaring arrow)
  • unmerge can become a resugaring: write with backwards desugaring arrow, and give as an inductively defined relation rather than express via zipWith
  • use clause desugaring in list-comp-decl rule
  • use clause desugaring in list-comp-gen rule (see figure above)
  • totalise should be defined for eliminators, not (pattern, expr) pairs (totaliseRest no longer needed?)
  • untotalise should also be defined for eliminators (untotaliseRest no longer needed)
  • write out explicitly recursive definition of merge (as inductively defined relation)
  • conts (:) cases look incorrect – revisit once unmerge is rewritten in explicitly recursive, relational style

You should be able to have a go at finishing explorable-viz/fluid#115 as well (including migrating any examples of match..as where the scrutinee is a Boolean to if..then..else).

rolyp (Collaborator, Author) commented Nov 5, 2020

@min-nguyen Here you go. Page 12 is a little bit blurry but there’s very little on that page anyway.

[Seven attached images (IMG_0086–IMG_0092): scanned pages of handwritten notes]

rolyp (Collaborator, Author) commented Nov 30, 2020

@min-nguyen I think we can consider this task done. Well done, more of a slog than expected, but worth it!
