Extend restoration benchmarks to include large random wallets. #2081

KtorZ · 2020-08-26T13:55:01Z

Issue Number

#2032

Overview

I have defined a new special wallet scheme analogous to the any% scheme used until now, but built on top of the RndState, so that we still get to store and retrieve the address space too.
I have removed the 2% case and "replaced" it with a 1% case on large random wallets.

Comments

Tested on the testnet, which works fine. The reason to add this one is to be able to create arbitrarily large random wallets that "make sense", and then measure the time taken for other specific operations such as listing addresses or estimating fees (which I'll do as a separate PR).

Anviking · 2020-08-26T14:05:18Z

lib/core/src/Cardano/Wallet/Primitive/AddressDiscovery/Random.hs

+-- seed.
+--
+-- The first argument is expected to be a ratio (between 0 and 1) of addresses
+-- we ought to simply recognize as ours. So, giving .5 means that 50% of the


Seems outdated. Should be ratio (between 0% and 100%) and giving @50 means 50% of the I presume?

Right, I had it as an argument initially and switched to a type-level parameter later on.

giving @50 means 50% of the I presume?

Indeed.

Anviking · 2020-08-26T14:11:04Z

lib/core/src/Cardano/Wallet/Primitive/AddressDiscovery/Random.hs

+-- For benchmarking and testing arbitrary large random wallets.
+
+-- | An "unsound" alternative that can be used for benchmarking and stress
+-- testing. It re-uses the same underlying structure as the `RndState` but


same underlying structure

Is there a main difference, say in complexity, between RndAnyState and AnyAddressState?

Would this be true?

RndAnyState, like RndState, grow with the number of addresses used. This is unlike SeqState/AnyAddressState, where which has a bounded gap of user addresses it keeps in memory

If not, something like it, would be cool to understand what makes RndAnyState indicative of performance of random wallets, and why it's not enough to have AnyAddressState.

Re my earlier comment:

Ah, I'm blind 😅

innerState :: RndState network

"same underlying structure" makes sense then

:) ... It still stores the UTxO and transaction history, but the AnyState do not store the address space. BUT, we've observed some major slowness when it comes to storing addresses, so I find it quite important to also include that in the benchmark.

I think that we could have a similar SeqAnyState which also include the seq state, and get rid of the AnyState altogether.

Anviking · 2020-08-26T14:26:50Z

lib/shelley/bench/Restore.hs

+                 socketFile
+                 np
+                 vData
+                 "1-percent-naked.timelog"


Maybe just 1-percent.timelog would be easier

Anviking · 2020-08-26T14:30:03Z

lib/shelley/bench/Restore.hs


-        , bench ("restore " <> network <> " 2% ownership")


Removing because it was too slow, I presume?

Maybe it's worth adding 0.1%, or 0.5%, to still have more data points than 0% and 1%.

(Useful, I imagine, if you'd ever want to do some curve fitting on the data)

Removing because it was too slow, I presume?

Correct.

Maybe it's worth adding 0.1%, or 0.5%, to still have more data points than 0% and 1%.

Good idea 👍

Anviking · 2020-08-26T14:37:52Z

lib/core/src/Cardano/Wallet/Primitive/AddressDiscovery/Random.hs

+-- For benchmarking and testing arbitrary large random wallets.
+
+-- | An "unsound" alternative that can be used for benchmarking and stress
+-- testing. It re-uses the same underlying structure as the `RndState` but


Re my earlier comment:

Ah, I'm blind 😅

innerState :: RndState network

"same underlying structure" makes sense then

This wallet is very analogous to the existing any% wallet scheme we designed a while ago, with a subtle difference: it is built __on top of__ the 'RndState' and, as a result, does perform the same database operation and addresses management as the standard random wallets. So the benchmark results obtained from this are much closer to what an actual random wallet of the same size would look like.

This is actually the implementation we ought to test in these benchmark.

…, .5%, 1%)

KtorZ · 2020-08-26T14:57:08Z

@Anviking added .1% and .5% benchmarks + fix comment to correctly mention the type parameter.

https://github.com/input-output-hk/cardano-wallet/compare/f0a4bf9b25f9d1801fbd1a67b6b294f03096b451..580bd968daa9607431dd80c8d6d3ca7f0b334f68

Anviking · 2020-08-26T15:02:52Z

lib/core/src/Cardano/Wallet/Primitive/AddressDiscovery/Random.hs

+            (False, _) | crc32 bytes < p ->
+                let
+                    (path, gen') = findUnusedPath
+                        (gen inner) (accountIndex inner) (unavailablePaths inner)


Couldn't this extra findUnusedPath call, compared to the inner state implementation, affect the results?

A bit yes. We don't benchmark this function independently but I believe that even on large wallets with 100K+ addresses, finding an unused path in a space of 2^31 is quite fast.

So I'd expect it to be quite negligible.

KtorZ · 2020-08-26T15:14:02Z

bors r+

iohk-bors · 2020-08-26T17:34:41Z

Build succeeded

KtorZ requested a review from Anviking August 26, 2020 13:55

KtorZ self-assigned this Aug 26, 2020

KtorZ added the RESOLVING ISSUE Mark a PR as resolving issues, for auto-generated CHANGELOG label Aug 26, 2020

Anviking reviewed Aug 26, 2020

View reviewed changes

KtorZ added 3 commits August 26, 2020 16:56

use 'ShelleyKey' instead of 'IcarusKey' for any% wallet benchmark

3ce0d58

This is actually the implementation we ought to test in these benchmark.

add more cases for any% wallet to have some level of comparisons (.1%…

580bd96

…, .5%, 1%)

KtorZ force-pushed the KtorZ/2032/extend-restoration-benchmarks branch from f0a4bf9 to 580bd96 Compare August 26, 2020 14:56

Anviking approved these changes Aug 26, 2020

View reviewed changes

iohk-bors bot merged commit ef1ed03 into master Aug 26, 2020

iohk-bors bot deleted the KtorZ/2032/extend-restoration-benchmarks branch August 26, 2020 17:34

KtorZ mentioned this pull request Aug 27, 2020

'listAddresses' is extremely slow / unusable for large wallets. #2032

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Extend restoration benchmarks to include large random wallets. #2081

Extend restoration benchmarks to include large random wallets. #2081

KtorZ commented Aug 26, 2020

Anviking Aug 26, 2020

KtorZ Aug 26, 2020

Anviking Aug 26, 2020

Anviking Aug 26, 2020

Anviking Aug 26, 2020 •

edited

Loading

KtorZ Aug 26, 2020

Anviking Aug 26, 2020

Anviking Aug 26, 2020

KtorZ Aug 26, 2020

Anviking Aug 26, 2020 •

edited

Loading

KtorZ commented Aug 26, 2020 •

edited

Loading

Anviking Aug 26, 2020

KtorZ Aug 26, 2020

KtorZ Aug 26, 2020

KtorZ commented Aug 26, 2020

iohk-bors bot commented Aug 26, 2020

Extend restoration benchmarks to include large random wallets. #2081

Extend restoration benchmarks to include large random wallets. #2081

Conversation

KtorZ commented Aug 26, 2020

Issue Number

Overview

Comments

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Anviking Aug 26, 2020 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Anviking Aug 26, 2020 • edited Loading

Choose a reason for hiding this comment

KtorZ commented Aug 26, 2020 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

KtorZ commented Aug 26, 2020

iohk-bors bot commented Aug 26, 2020

Build succeeded

Anviking Aug 26, 2020 •

edited

Loading

Anviking Aug 26, 2020 •

edited

Loading

KtorZ commented Aug 26, 2020 •

edited

Loading