opt_merge: hashing performance and correctness #4677

widlarizer · 2024-10-18T21:13:03Z

This is a direct remake of #4175 sans the 64-bit hash value. I'm making use of the interface in #4524 and requiring that PR (and containing its commits at the moment). Instead of xorshifts, values are sorted, though a final xorshift is included as a part of the fudge (--hash-seed=N) mechanism.

Additionally, I discovered opt_merge behaves incorrectly in the case of hash collisions. This suggests that this PR might in rare cases bring improvements in quality of results for flows that use opt_merge, since prior to it, hash collisions would inhibit merging. I modified the sharemap from a dict<hash_t, Cell*> to an equivalent std::unordered_multimap so that multiple cells can be associated with the same hash. This can't be a separate change since this bug actually broke the build just by changing how the hashes are constructed.

Sorry for the spam to code owners due to being based on the above mentioned wide-reaching PR #4524, I don't have a way of removing you from the reviewer list. The diff for this PR is not going to be very readable on github either until that's merged

Co-authored-by: KrystalDelusion <[email protected]>

widlarizer · 2024-11-11T11:09:48Z

The current implementation regresses opt_merge runtime many times over. I'll see if std::unordered_multimap is salvageable.
Currently, std::unordered_multimap, as I use it

maps from hash_t to Cell*
hashes hash_t with std::hash (unnecessary)
compares Cell*s with pointer equality

Instead it should:

map from Cell* to Cell*
hash with hash_cell_function
compare with compare_cell_parameters_and_connections

…idelines

This reverts commit 4db3e78.

widlarizer · 2024-11-13T22:05:46Z

pool fails to bring a clear advantage over std::unordered_set:

ibex pool
Elapsed time: 0:42.16[h:]min:sec. CPU time: user 41.87 sys 0.17 (99%). Peak memory: 164800KB.

ibex std::unordered_set
Elapsed time: 0:42.85[h:]min:sec. CPU time: user 42.56 sys 0.16 (99%). Peak memory: 164076KB.

ibex pool
Elapsed time: 0:49.41[h:]min:sec. CPU time: user 48.76 sys 0.49 (99%). Peak memory: 634672KB.

ibex std::unordered_set
Elapsed time: 0:47.62[h:]min:sec. CPU time: user 46.95 sys 0.51 (99%). Peak memory: 629852KB.

On par on these designs in comparison with the state prior to touching opt_merge (cf5585e)

ibex
Elapsed time: 0:42.12[h:]min:sec. CPU time: user 41.87 sys 0.13 (99%). Peak memory: 162396KB.

jpeg
Elapsed time: 0:48.41[h:]min:sec. CPU time: user 47.70 sys 0.54 (99%). Peak memory: 621304KB.

When isolated, std takes home the perf prize:

$ hyperfine "./uut/std/bin/yosys -p \"read_rtlil garbage/ibex-pre-opt-merge.il; opt_merge\"" --warmup 5
Benchmark 1: ./uut/std/bin/yosys -p "read_rtlil garbage/ibex-pre-opt-merge.il; opt_merge"
  Time (mean ± σ):      99.8 ms ±   1.4 ms    [User: 90.6 ms, System: 8.8 ms]
  Range (min … max):    97.7 ms … 102.7 ms    29 runs

$ hyperfine "./uut/pool/bin/yosys -p \"read_rtlil garbage/ibex-pre-opt-merge.il; opt_merge\"" --warmup 5
Benchmark 1: ./uut/pool/bin/yosys -p "read_rtlil garbage/ibex-pre-opt-merge.il; opt_merge"
  Time (mean ± σ):     151.6 ms ±   1.5 ms    [User: 142.4 ms, System: 8.6 ms]
  Range (min … max):   149.4 ms … 154.2 ms    19 runs

Memory consumption is equal because the RTLIL read and Yosys state dominates the memory consumption in this case

rmlarsen · 2024-11-18T19:06:46Z

@widlarizer nice work!

widlarizer and others added 18 commits October 18, 2024 12:01

hashlib: redo interface for flexibility

c852dd3

driver: add --hash-seed

76eacb4

abc: sort stats

15c51d7

hashlib: fix pyosys

1bfddea

hashlib: only include in one place

6a570b3

hashlib: use hash_t across the board

2bc5ca0

hashlib: hash_t can be set to 64-bit

25cd9fb

hashlib: fudge always

d14d2dd

hashlib: don't xorshift in between upper and lower word

88f0774

hashlib: allow forcing Hasher state, use it for IdString trivial hashing

df44003

hashlib: prevent naive hashing of IdString when hashing SigBit

9bdecc5

hash: solo hashing interface, override for SigBit

fb45749

hashlib: restore hash_obj_ops for pointers to indexed types

363a902

hashlib: remove is_new from HasherDJB32, implement hash_top for IdString

cf086d6

hashlib: run_hash uses hash_top_ops, not hash_ops

3ddea7d

docs: document the ideas behind the hashing interface

59d8562

Docs: Formatting and fixes

efab718

docs: formatting and fixes

0bcb41c

Co-authored-by: KrystalDelusion <[email protected]>

widlarizer force-pushed the emil/opt_merge-hashing branch from ae88e5f to c68b6c2 Compare November 8, 2024 19:36

widlarizer added 7 commits November 11, 2024 13:14

docs: move hashing-based container details into internal docs from gu…

1f40e57

…idelines

hashlib: add deprecated mkhash function to prevent plugin breakage

b4f3806

hashlib: acc -> eat

4084d92

hashlib: legacy mkhash_add -> djb2_add

cf5585e

opt_merge: avoid hashing strings

ccfe209

opt_merge: fix the many collisions case

29969dd

opt_merge: missing include

e82e917

widlarizer force-pushed the emil/opt_merge-hashing branch from c68b6c2 to 2e1e5a8 Compare November 12, 2024 13:43

opt_merge: switch to unordered_set

45cbadc

widlarizer force-pushed the emil/opt_merge-hashing branch from 2e1e5a8 to 45cbadc Compare November 12, 2024 13:59

widlarizer added 5 commits November 12, 2024 15:57

opt_merge: try pool?

4db3e78

opt_merge: hash type

24c96f2

opt_merge: fix fixpoint abuse

e30afc9

Revert "opt_merge: try pool?"

1b49bce

This reverts commit 4db3e78.

fixup! opt_merge: fix the many collisions case

1415cd9

povik mentioned this pull request Nov 18, 2024

Improve opt_merge performance #4175

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

opt_merge: hashing performance and correctness #4677

opt_merge: hashing performance and correctness #4677

widlarizer commented Oct 18, 2024

widlarizer commented Nov 11, 2024 •

edited

Loading

widlarizer commented Nov 13, 2024 •

edited

Loading

rmlarsen commented Nov 18, 2024

opt_merge: hashing performance and correctness #4677

Are you sure you want to change the base?

opt_merge: hashing performance and correctness #4677

Conversation

widlarizer commented Oct 18, 2024

widlarizer commented Nov 11, 2024 • edited Loading

widlarizer commented Nov 13, 2024 • edited Loading

rmlarsen commented Nov 18, 2024

widlarizer commented Nov 11, 2024 •

edited

Loading

widlarizer commented Nov 13, 2024 •

edited

Loading