Generate txhashset archives on 720 block intervals. #2813
Conversation
Finally, all 3 brothers are here 😃 👍
Hello brother.
I think once this is parameterized we may want to extend this to a 24 hour period (1,440 blocks) rather than 250 blocks (at least for mainnet and floonet).
We have attempted to keep all "time" based values to round numbers in terms of hours, days, weeks etc rather than 100s, 1000s of blocks.
It might be less intuitive in the code but it makes it easier to reason about ("time is money").
For an extreme example - https://www.grin-forum.org/t/wtb-one-hour-in-atomic-swap-for-1-8-btc/4831 where John Tromp is attempting to purchase an "hour" of grin...
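To illustrate the convention being suggested, a 12-hour archive interval can be written in terms of hour/day heights rather than as a raw block count. A minimal sketch, assuming grin's 60-second block time; the constant names echo the consensus module but are defined here purely for illustration:

```rust
// Illustrative constants (not the PR's code): heights derived from a
// 60-second block time, so "time" values stay visible in the source.
const BLOCK_TIME_SEC: u64 = 60;
const HOUR_HEIGHT: u64 = 3600 / BLOCK_TIME_SEC; // 60 blocks per hour
const DAY_HEIGHT: u64 = 24 * HOUR_HEIGHT;       // 1,440 blocks per day

// Expressing the archive interval as "half a day" keeps the time meaning
// explicit, rather than a bare 720.
const TXHASHSET_ARCHIVE_INTERVAL: u64 = DAY_HEIGHT / 2; // 720 blocks (~12 hours)

fn main() {
    assert_eq!(TXHASHSET_ARCHIVE_INTERVAL, 720);
    println!("archive every {} blocks", TXHASHSET_ARCHIVE_INTERVAL);
}
```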
This is ready for review whenever you get a chance @antiochp
```
@@ -251,15 +251,17 @@ impl MessageHandler for Protocol {
			sm_req.hash, sm_req.height
		);

		let txhashset = self.adapter.txhashset_read(sm_req.hash);
```
We need to think this through a bit. Our peer sends us a height and hash (the sm_req) but we are now effectively ignoring this and just sending the most recent snapshot.
I wonder if we need to take the height sent by our peer and quantize it to the nearest previous snapshot (i.e. the closest 720-block snapshot to that height).
Alternatively, maybe we should be ignoring the values passed in by our peer - all we care about is serving the most recent snapshot?
We just need to be careful that our peer receives consistent data in terms of -
- the full set of headers, and
- the corresponding txhashset archive
It is critical that these are on the same fork. If for whatever reason they are on a fork (i.e. another peer sent them the "wrong" latest headers) then they need to be able to rewind headers to the point where this txhashset exists on the chain. If they cannot, then they cannot recover from this and the full txhashset is wasted.
I don't think this is a problem in practice because the txhashset is at least n blocks old (based on the horizon).
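One way to make the quantization idea concrete is plain integer division by the interval. A minimal sketch with a hypothetical helper (not the PR's actual code):

```rust
/// Hypothetical helper illustrating the quantization idea: snap a peer's
/// requested height down to the most recent archive boundary.
fn archive_height(requested_height: u64, interval: u64) -> u64 {
    // Integer division floors, so heights 1..719 map to 0, 720..1439 to 720, etc.
    (requested_height / interval) * interval
}

fn main() {
    let interval = 720;
    assert_eq!(archive_height(1_000, interval), 720);
    assert_eq!(archive_height(720, interval), 720);
    assert_eq!(archive_height(719, interval), 0);
}
```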
I thought about it for a while, but I don't think we would want to provide them any other txhashset. I considered verifying that the requested header is within an expected range, but it seemed unlikely enough to not justify any additional complexity.
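For reference, the kind of range check that was considered (and rejected as not worth the complexity) might look roughly like this; the helper name and tolerance are hypothetical, not anything from the PR:

```rust
/// Hypothetical sanity check: only serve an archive if the requested height
/// is not unreasonably far from our own chain head.
fn request_in_expected_range(requested: u64, head: u64, tolerance: u64) -> bool {
    requested <= head && head - requested <= tolerance
}

fn main() {
    let head = 100_000;
    assert!(request_in_expected_range(99_500, head, 1_440));
    assert!(!request_in_expected_range(10_000, head, 1_440));
}
```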
I need to think through the horizon behavior here a bit more but looks good so far.
Sure, take your time. This does effectively change the horizon temporarily for newly synced nodes. Once most nodes have this change, it might make sense to just make txhashset_archive_header the new cut_through_horizon. Perhaps as part of the first hard fork?
Currently we make some guarantees/assumptions about what blocks are available from peers. We maintain 7 days of full block history (based on the cut-through horizon), and every running node has at least 48 hours of full blocks after a successful sync. One of the edge cases we need to account for here is when we ask for full blocks during fast sync from another node that has recently sync'd, i.e. other nodes (possibly a majority of nodes on the network) are only guaranteed to have blocks from 48 hours ago. Currently this is exactly 48 hours ago. So there are a couple of conflicting requirements here pulling in competing directions.
So currently we are kind of stuck doing the same thing as all other peers and setting our sync_threshold to 48 hours. If we were to deploy a node with different behaviour, for example using more of a "step function" in terms of 720-block periods, then we risk being a bad citizen with respect to these conflicting requirements. We either have too few blocks locally (temporarily) or we are asking for too many blocks and cannot reliably retrieve them (if our peers are also syncing).
The way it's coded in this PR, it would be the latter. This was coded that way intentionally, for the reasons you listed above. Although initial sync might be delayed very slightly if it requests blocks from a peer that just synced, it shouldn't be a large problem because the significant majority of peers would have more than 48 hours worth of blocks.
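A rough sketch of the trade-off described above, assuming a 48-hour sync threshold and a 720-block archive step; the constants and helper below are illustrative, not taken from the PR:

```rust
// Illustrative constants mirroring the discussion: 60 blocks per hour.
const HOUR_HEIGHT: u64 = 60;
const SYNC_THRESHOLD: u64 = 48 * HOUR_HEIGHT;   // ~48 hours of full blocks
const ARCHIVE_INTERVAL: u64 = 12 * HOUR_HEIGHT; // 720-block archive step

/// Hypothetical request height: the latest archive boundary at or below
/// head minus the 48-hour threshold.
fn requested_snapshot_height(head: u64) -> u64 {
    (head.saturating_sub(SYNC_THRESHOLD) / ARCHIVE_INTERVAL) * ARCHIVE_INTERVAL
}

fn main() {
    let head = 200_123;
    let snapshot = requested_snapshot_height(head);
    let blocks_needed = head - snapshot;
    // The syncing node needs between 48 and 60 hours of full blocks from its
    // peers, i.e. possibly more than the 48 hours a freshly synced peer holds.
    assert!(blocks_needed >= SYNC_THRESHOLD);
    assert!(blocks_needed < SYNC_THRESHOLD + ARCHIVE_INTERVAL);
    println!("snapshot at {}, {} full blocks still needed", snapshot, blocks_needed);
}
```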
@cadmuspeverell thanks for your work, and how are you doing? I would like to take a look and test this PR. Could you please resolve the conflict? Thanks.
👍
Q) If we only produce a new txhashset every 12 hours - do we now reuse one already generated if we are asked for it more than once in a 12-hour period? I don't remember if we implemented de-duping behavior before (we had a smaller window before).
Yep! That logic was already there - and seemed to work well when I was doing my testing. I'll resolve that darned conflict now and then maybe @garyyu can try it out to make sure I didn't miss anything? 😸
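The reuse ("de-duping") behaviour being referred to can be pictured roughly as follows; the file naming and layout here are hypothetical sketches, not grin's actual zip handling:

```rust
use std::fs;
use std::path::{Path, PathBuf};

/// Hypothetical path for the zip covering a given archive height.
fn archive_zip_path(root: &Path, archive_height: u64) -> PathBuf {
    root.join(format!("txhashset_snapshot_{}.zip", archive_height))
}

/// Sketch of the de-duping idea: if a zip for the requested archive height
/// already exists on disk, serve it again rather than taking a new snapshot.
fn get_or_create_archive(root: &Path, archive_height: u64) -> std::io::Result<PathBuf> {
    let path = archive_zip_path(root, archive_height);
    if path.exists() {
        // Reuse: another peer already requested this 12-hour snapshot.
        return Ok(path);
    }
    // Placeholder for the expensive step: snapshot the txhashset and zip it.
    fs::write(&path, b"zip bytes would go here")?;
    Ok(path)
}

fn main() -> std::io::Result<()> {
    let root = std::env::temp_dir();
    let first = get_or_create_archive(&root, 720)?;
    let second = get_or_create_archive(&root, 720)?; // served from the existing file
    assert_eq!(first, second);
    Ok(())
}
```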
As of v0.5.3, Grin++ gracefully handles receiving a different TxHashSet height than requested. Since grin already handles this, merging this code shouldn't adversely affect the network. I didn't test every scenario, but I did run the code for a day or so, and didn't see any unusual bans. There were only ever 2 or 3 zips in the temp folder, even though dozens of peers synced from my node. I'm happy with it.
After my 2nd reading everything is good in my view, and thanks @cadmuspeverell for this 1st PR 👍 and a long review :-)
Looks like this PR was merged into […]. If we do need to push […], I'm not 100% sure if this is the approach we will always take, but we have some tight time constraints between […].
But it was tagged with the 1.1.1 milestone before; you can see the history above.
@garyyu There is no 1.1.1 milestone anymore; in recent meetings we've collectively decided that the next release will be 2.0.0, and that there will only be a 1.1.1 if a critical issue is uncovered that warrants it.
Also, given the time constraints we should only be pushing hard-fork related PRs to the branch, and labeling any other fixes or changes as post-2.0.0 (I've created a 2.x.x milestone for that).
Sorry this wasn't communicated properly, as there have been one or two extra meetings scheduled over the past days, but we really feel it's important that everyone stick to this plan for the next few weeks.
* generate txhashset archives on 250 block intervals.
* moved txhashset_archive_interval to global and added a simple test.
* cleaning up the tests and adding license.
* increasing cleanup duration to 24 hours to prevent premature deletion of the current txhashset archive
* bug fixes and changing request_state to request height using archive_interval.
* removing stopstate from chain_test_helper to fix compile issue
This is the first of several PRs to resolve #2740. With this PR, txhashset archives will only be generated for blocks 250, 500, 750, etc. This inadvertently solves #2806 as well.
Marked as WIP to encourage review and discussion while I write tests.
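A minimal sketch of the generation rule described above (archives only at interval boundaries); the helper name is hypothetical, and the interval was later changed from 250 to 720:

```rust
/// Hypothetical check: only heights that fall exactly on the archive
/// interval are treated as archive points.
fn is_archive_height(height: u64, interval: u64) -> bool {
    height > 0 && height % interval == 0
}

fn main() {
    let interval = 250;
    assert!(is_archive_height(250, interval));
    assert!(is_archive_height(500, interval));
    assert!(!is_archive_height(251, interval));
}
```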