Faster pattern matching for built-in types

This issue is for discussing approaches to improving performance of pattern matching for built-in types.

The current approach is inefficient. Details can be found in [this](https://github.com/IntersectMBO/plutus/blob/81ac3ed283e2928faa51185eb6c29696f9d3af39/plutus-core/plutus-core/src/PlutusCore/Default/Builtins.hs#L902) Note, the gist is that currently pattern matching over lists is implemented as (pseudocode)

```haskell
matchList :: [a] -> b -> (a -> [a] -> b) -> b
matchList xs z f = chooseList xs (\_ -> z) (\_ -> f (head xs) (tail xs)) ()
```

where `chooseList`, `head` and `tail` are all built-in functions (`chooseList` returns either its second or its third argument depending on whether its first argument is null or not, respectively). Therefore in case of `nil` we end up performing one builtin call and in case of `cons` we end up performing three builtin calls. Note how we also have to pass a unit argument around in order not to always evaluate all branches: Plutus is strict, hence `chooseList xs z (f (head xs) (tail xs))` would have the wrong semantics.

There have been two proposals on how to implement faster pattern matching for built-in types:

1. allow expressing pattern matching built-in functions directly, implemented in #5486 
2. piggy-back on the pattern matching machinery that we use for sums-of-products, implemented in #5704

Both PRs are documented, see the initial comment on each PR if you want to understand the details of design and implementation.

Both approaches work and give us comparable speedup on built-in-list-specific benchmarks. 

This is what we get for (1):

![Screenshot from 2024-01-12 00-41-30](https://github.com/IntersectMBO/plutus/assets/10480926/ac4642ef-9ad9-4d07-a3ae-4030a978efb8)

![Screenshot from 2024-01-12 00-45-13](https://github.com/IntersectMBO/plutus/assets/10480926/3912e777-c036-4ad6-b62c-7d725e18c37a)

This is what we get for (2):

![Screenshot from 2024-01-12 00-27-27](https://github.com/IntersectMBO/plutus/assets/10480926/54d35ee5-b5b3-4c98-a783-1bf582c1bb99)

![Screenshot from 2024-01-12 00-27-41](https://github.com/IntersectMBO/plutus/assets/10480926/8308a400-3262-4bd1-8703-e4a5a1568a52)

It may appear that (2) is faster, however the benchmarking machine apparently is capable (see [this](https://input-output-rnd.slack.com/archives/G01EE4NQ9U0/p1704935052577519?thread_ts=1704934648.640219&cid=G01EE4NQ9U0) slack discussion) of being wrong by 6% and probably even more, hence we can't really rely on these results, but it's still clear that both the approaches give us meaningful improvement of about the same scale.

If we analyze performance analytically, this is what we'll find:

1. (1) has to keep `unit` arguments around in order for pattern matching not to force all the branches at once and only force the picked one, while (2), being backed by SOPs, has no such restriction since `case` is specifically designed to be a proper pattern matching construct (unlike function application which is strict in Plutus). E.g. for (1) we have the following diff:

![Screenshot from 2024-01-10 01-06-15](https://github.com/IntersectMBO/plutus/assets/10480926/d8d52e3a-50cd-4a7a-a656-f97b3ec35c90)

while for (2) it's

![Screenshot from 2024-01-10 01-05-01](https://github.com/IntersectMBO/plutus/assets/10480926/1ca1d0c5-ecf1-4917-bcc9-33abc3884ef3)

I.e. (2) wins for this one.

2. in (1) `matchList` is literally a single built-in function call, while in (2) it's a built-in function call + a `case`:

![Screenshot from 2024-01-12 01-59-18](https://github.com/IntersectMBO/plutus/assets/10480926/99534eac-d89b-4d97-a996-6b1cca953bda)

which affects not only performance, but also size, which is an even scarcer resource.

I.e. (1) wins for this one.

3. (1) requires us to amend the builtins machinery in a way that allows for returning an applied function. Not only is this a much larger change (propagating through many parts of the builtins machinery, including tests) compared to what (2) requires, it also introduces a slowdown for all existing builtins, because handling exactly one output of a built-in function is a bit faster than handling any non-zero number of them (if a builtin returns `f x y` we simply return all of those terms separately and let the evaluator handle reducing the application -- we have to do that, because each evaluator deals with reducing applications differently), because one doesn't need to case on the result to figure out if it contains a single output or multiple of them.

I.e. (2) wins for this one.

4. the whole issue we've been discussing here is improving performance of pattern matching for built-in types. Extending the builtins machinery seems very appropriate for that, particularly with something as straightforward as "allow for returning a function application", while teaching builtins about SOPs and hardcoding a specific SOP representation right into some built-in functions feels weird -- why do builtins and SOPs have to be intertwined this way?

Maybe it's fine, but I believe (1) wins for this one.

5. as per Michael's [comment](https://github.com/IntersectMBO/plutus/issues/5711#issuecomment-1895818477) SOPs aren't supported with all versions of Plutus, so we need to figure out what to do for early ones that don't support SOPs

I.e. (1) wins for this one.

Overall, there's no clear winner. Performance comparison is unclear with both approaches delivering meaningful improvement, although my very subjective feeling is that (2) slightly wins when it comes to performance. But also (1) makes builtins more expressive, maybe for some another reason we'll end up needing to return function applications from builtins eventually anyway?

I'm personally torn between the two options. Which one should we choose? 

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Faster pattern matching for built-in types #5711

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Faster pattern matching for built-in types #5711

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions