Word eval refactor #1136

robdockins · 2021-03-26T04:06:30Z

Refactor various aspects of the evaluator, mostly dealing with the packed word representation, but also aiming to remove the eta-delay mechanism, which should no longer be required with the current evaluation semantics. The main trick, I think, is to find a way to implement sufficiently-lazy sequence primitives without also introducing unnecessary unpacking or thunking.

robdockins · 2021-04-01T18:18:30Z

I think this refactoring has reached a pretty good point to merge. The main goals of this were to factor out the SeqMap and WordValue types into separable modules with abstract APIs, remove the legacy eta-delay mechanism, and to fix such strictness bugs as I could without harming performance. This should help enable the more aggressive performance-related refactoring stages I have in mind, as well as eventually allowing better reuse inside the SAWCore evaluator. As a side benefit, I was able to merge more of the primitive operations to use identical code paths; all the shift operations now use unified code paths. Among the core operations, only sequence indexing and update have separate definitions anymore.

With the performance considerations in mind, this PR contains a lot of small, self-contained commits and I spent a lot of time performance testing after each change. It should be relatively straightforward to bisect for performance problems inside this PR, as I believe each commit leaves the project in a testable state. At the end, the benchmarks I tried ended up performance neutral, or slightly faster than they started, with the exception of the Karatsuba example, which slowed down somewhat due to a lazier # operator.

Along the way, I generalized the type of the take operation as discussed in issue #1064. This required making take and drop primitive instead of splitAt, which will have minor downstream consequences for cryptol-saw-core.

The next stages of refactoring are quite a bit more speculative, but should hopefully be largely contained within the SeqMap and WordValue modules.

robdockins · 2021-04-02T21:21:50Z

I think this last series of patches lets us finally close out the remaining known strictness bugs in the evaluator, #640, #619, and #422.

Instead of every `WordValue` being wrapped in the `SEval` monad, create a separate `ThunkWordVal` constructor that captures exactly when lazy behavior is needed. This simplifies several things, and makes it more clear some places where we can actually be stricter in the spine of word values.

It is now lazier in its arguments when required.

This brings the evaluator more in line with the reference semantics, although there may still be some differences.

into the `Value` module. Remove most uses of the `WordValue` constructors in other modules, but don't quite make the type abstract yet.

This allows us to generalize the type of `take` and simplifies the implementations.

Make sure they are sufficently lazy in their arguments. Join is still to eager in some cases, but that will require a larger refactoring.

Introduce the concept of "index segments", which allows us to delay deciding if index values should be treated as packed or unpacked until they are consumed in an indexing operation.

Memoization now only occurs in the barrel shifter when a symbolic bit is encountered, instead of at every shift.

as the symbolic simulators.

This breaks the test for issue211, but we were cheating on it anyway.

Update it's type to prepare for upcoming changes

We should either decide that the semantics of `#` requires it to be strict in its arguments, or we should fix the definition.

Replace it with `takeWordVal` and `dropWordVal` instead.

This should eliminte the possiblity of building up chains of thunked word values.

This lets us share work, and lets us optimistically treat the word as packed if we know it has already been forced. It also lets us delay unpacking decisions.

With the lazier join operator, this squashes a space leak in examples that do a lot of hashing.

Also, test for looping computations in addition to `error` computations. This doesn't let us cheat by eta-expanding the error, as we were doing before.

I think we've finally cracked the nut on this strictness bug. Fixes #640

The main benefit of this reorganization is that it notices when a memoized `SeqMap` has been forced at all of its locations. This allows us to discard the underlying computation, which will never need to be consulted again. This, in turn, allows the garbage collector to reclaim the associated memory and help prevent certain classes of space leaks.

GaloisInc/cryptol#1048 GaloisInc/cryptol#1136 GaloisInc/cryptol#1165 GaloisInc/cryptol#1171

robdockins marked this pull request as ready for review April 1, 2021 18:18

robdockins requested review from brianhuffman and yav April 1, 2021 18:18

robdockins force-pushed the word-eval-refactor branch from 0c4c025 to d842145 Compare April 1, 2021 18:30

robdockins force-pushed the word-eval-refactor branch 2 times, most recently from b39bab1 to 7bcc47e Compare April 6, 2021 17:43

robdockins mentioned this pull request Apr 13, 2021

error at the empty type #1160

Closed

robdockins added 21 commits April 13, 2021 10:27

Remove obsolete functions

d4e5fc2

Rework the # operator.

3942a3d

It is now lazier in its arguments when required.

Remove the eta delay mechanisim from the evaluator.

b24e2f5

This brings the evaluator more in line with the reference semantics, although there may still be some differences.

Squash warnings

5a84ea5

Break SeqMap out into a separate module

3f71046

Relocate most of the logic that works directly on WordValue values

b9635b2

into the `Value` module. Remove most uses of the `WordValue` constructors in other modules, but don't quite make the type abstract yet.

Complete the process of making WordVal abstract.

7fc748d

Move WordValue into a separate module.

3f18544

Complete the job of making SeqMap abstract.

87d8912

Update cryptol-remote-api

a112ed8

Make take and drop primitive instead of splitAt.

d3accfb

This allows us to generalize the type of `take` and simplifies the implementations.

Fix test suite

b5454bc

Tweak split, join, reverse and transpose.

5011771

Make sure they are sufficently lazy in their arguments. Join is still to eager in some cases, but that will require a larger refactoring.

Update the reference interpreter

69a589d

Remove uses of panic in SeqMap

6bf4d2e

Continue consolidating WordValue code into the related module

a2aa432

Consolidate some code related to indexing and shifting.

ebc8327

Introduce the concept of "index segments", which allows us to delay deciding if index values should be treated as packed or unpacked until they are consumed in an indexing operation.

Reduce the amount of sequence memoization that occurs in shifts.

8ca536d

Memoization now only occurs in the barrel shifter when a symbolic bit is encountered, instead of at every shift.

Rework shifting in the concrete simulator to use the same code paths

d291318

as the symbolic simulators.

Make indexPrim lazier in its sequence argument.

07f4ba9

robdockins added 17 commits April 13, 2021 10:27

Fix a bug with width accounting

62bcaba

Fix another width accounting bug

6d53e91

Stop eta-expanding the error combinator.

55a48ba

This breaks the test for issue211, but we were cheating on it anyway.

Change largeBitsVal into bitmapWordVal.

e65bf8c

Update it's type to prepare for upcoming changes

Temporary fix to the test for issue211.

b812cd5

We should either decide that the semantics of `#` requires it to be strict in its arguments, or we should fix the definition.

Update reference implementation and documentation

cc8822f

Remove the splitWordVal operation.

149428c

Replace it with `takeWordVal` and `dropWordVal` instead.

Tweak the way delayWordValue works.

5f4b979

This should eliminte the possiblity of building up chains of thunked word values.

Don't reexport the SeqMap API

6853ba0

Keep a thunk for packed words alongside bitmap representations.

f7d66df

This lets us share work, and lets us optimistically treat the word as packed if we know it has already been forced. It also lets us delay unpacking decisions.

Make joinWordVal lazier in its arguments.

77f6718

Increase the strictness in the SuiteB SHA padding function.

5d96829

With the lazier join operator, this squashes a space leak in examples that do a lot of hashing.

Make joinWords lazier in its arguments

d4c3402

Make the "large bit size" a bit more reasonable.

838a252

Update the test for issue 211 to restore the old test behavior.

31ab387

Also, test for looping computations in addition to `error` computations. This doesn't let us cheat by eta-expanding the error, as we were doing before.

Add test case for issue640

69ad34b

I think we've finally cracked the nut on this strictness bug. Fixes #640

Fix test suite

aac42f4

robdockins force-pushed the word-eval-refactor branch from 7bcc47e to aac42f4 Compare April 13, 2021 17:33

robdockins added 2 commits April 13, 2021 15:35

Fixes that help a small amount with space leaks

29c5bc9

robdockins merged commit 76d1dc9 into master Apr 14, 2021

This was referenced Apr 14, 2021

Revised semantics for errors #619

Closed

strictness of indexing #422

Closed

possible memory leak #1043

Closed

Cryptol/pr1136 GaloisInc/saw-core#200

Closed

brianhuffman mentioned this pull request Apr 26, 2021

Updates flowing from cryptol PRs #1048 and #1136 GaloisInc/saw-script#1191

Merged

robdockins added a commit to GaloisInc/saw-script that referenced this pull request Apr 27, 2021

Updates flowing from cryptol PRs

6eb7a6c

GaloisInc/cryptol#1048 GaloisInc/cryptol#1136 GaloisInc/cryptol#1165 GaloisInc/cryptol#1171

robdockins added a commit to GaloisInc/saw-script that referenced this pull request May 18, 2021

Updates flowing from cryptol PRs

08c5995

GaloisInc/cryptol#1048 GaloisInc/cryptol#1136 GaloisInc/cryptol#1165 GaloisInc/cryptol#1171

brianhuffman mentioned this pull request Jul 13, 2021

Some uninterpreted functions don't work with goal_eval_unint GaloisInc/saw-script#1045

Closed

RyanGlScott deleted the word-eval-refactor branch March 22, 2024 14:48

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Word eval refactor #1136

Word eval refactor #1136

robdockins commented Mar 26, 2021

robdockins commented Apr 1, 2021

robdockins commented Apr 2, 2021

Word eval refactor #1136

Word eval refactor #1136

Conversation

robdockins commented Mar 26, 2021

robdockins commented Apr 1, 2021

robdockins commented Apr 2, 2021