Chain stalls while scaling out from 1 to 4 nodes, changing quorum size fixes things #796

panghalamit · 2019-08-07T22:44:07Z

Scenario:
While scaling out nodes by voting them one by one in successive blocks, if a roundchange timer is triggered when validator set size <=3, the nodes keep on timing out on roundchange, without exchanging any new block msgs. The chain stalls and nodes keep on timing out forever.

On careful inspection of logs from multiple runs and the code, issue is in the roundchange msg handling logic in roundchange.go
https://github.com/jpmorganchase/quorum/blob/4c74f47525a8db6ffaf7a539c875a8c71236f15c/consensus/istanbul/core/roundchange.go#L96-L109
the if (line 96) and else if (line 101) condition are same for N<=3, with f = 0. The code in line 103 is never executed, which has logic for current proposer to either send locked proposal, or pendingRequest. Hence, nodes keep on sending roundchange messages and timing out and chain doesn't make any progress.

Changing condition in line 101, from num == 2f+1 to num == ceil(2N/3) or num == N-f. fixes this failure scenario.
Please find attached logs for 4 quorum nodes in failure scenario.

27728.zip

The change includes adding QuorumSize() function to validator set interface, which returns quorum size specific to formulae used. This change only updates the number of confirmations (quorum size) in roundchange.go but it could be added at other places where 2f+1 is used.

…talls node scale out

jimthematrix · 2019-08-08T14:45:45Z

As additional background, besides ceil(2N/3), we have also evaluated three other formulae: ceil((N+f+1)/2), floor(2N/3)+1 and N-f. It turns out ceil(2N/3) and ceil((N+f+1)/2) are equivalent, while floor(2N/3)+1 and N-f are equivalent:

N	f	2f+1	ceil(2N/3) or ceil((N+f+1)/2)	floor(2N/3)+1 or N-f
2	0	1	2	2
3	0	1	2	3
4	1	3	3	3
5	1	3	4	4
6	1	3	4	5
7	2	5	5	5
8	2	5	6	6
9	2	5	6	7

We felt ceil(2N/3) already guarantees super majority in all cases, whereas floor(2N/3)+1 or N-f might be overly strict in some cases (N=3, 6, 9, etc.) In our testing we have run the new formula ceil(2N/3) alongside the original in our test pipelines (where we stand up and tear down Quorum networks constantly throughout the day). While the original formula produced a few stalled chains during the add-one-node-at-a-time ramp up, all attributed to the roundchange loop, the new formula has not failed at all.

We couldn't find existing literature that explains if ceil(2N/3) and floor(2N/3)+1 would cause IBFT to behave differently. But empirical data definitely points to a positive enhancement over the existing formula.

jimthematrix · 2019-08-08T14:50:38Z

A question we should consider is whether there's value in making the formula switchable. We could adopt a default, between ceil(2N/3) and floor(2N/3)+1. But have the other non-default switched on via an optional command line switch. This may allow the technical community to experiment with both and collect empirical data.

If that's considered useful, @panghalamit I do want to make a suggestion, to remove the formula parameter from the function QuorumSize(), but instead have the function implementation detect the formula in effect via a configuration setting, that can be controlled with a command line switch.

jpmsam · 2019-08-08T17:27:05Z

@panghalamit @jimthematrix thank you for your contribution and for your feedback on the formula that is being used in round change. We have been actively testing both formulas for all phases of IBFT, not just the round change and more specifically in non 3f+1 networks with dynamic validators. On smaller non 3f+1 validator networks, n-f provides the best consistency in an network that should tolerate f failures but both formulas converge on a similar quorum for larger networks. To calculate n-f, we use floor(2n/3) + 1 so that we don't have to calculate f separately.

Our intention is to make the update on all phases of IBFT, not just the round change. It would be helpful if you can provide your feedback on the below quorum branches based on the networks that you have been testing against.

https://github.com/jbhurat/quorum/tree/update_consensus_floor (floor(2N/3)+1 )
docker image: jbhurat/quorum:ibft_floor

https://github.com/jbhurat/quorum/tree/update_istanbul_consensus ceil(2N/3)
docker: jbhurat/quorum:ibft_ceil

We won't merge this PR with only the round change but we'd like to see it updated to include the other phases of IBFT to use a consistent formula.

No need to make it a flag as most users won't control the entire network and that might lead to unintentional breaks. Users similar to you should be able to update the code to experiment with other formulas.

jimthematrix · 2019-08-08T18:35:16Z

@jpmsam thanks for the review and input Sam, we'll use our test pipeline to exercise the two formulae and report our findings. My prediction is that neither will turn up any failures given the conditions we run them in. Would love to see more details on On smaller non 3f+1 validator networks, n-f provides the best consistency to help us better understand the difference.

… to floor(2*N/3)+1

jpmsam · 2019-08-08T20:22:21Z

In a 3 node validator network, f should be 0. In using ceil(2N/3), you are tolerating f=1. n-f guarantees that f is 0 by requiring a quorum of 3. Similar logic applies to 6,9,12... validator networks.

panghalamit · 2019-08-09T13:31:00Z

Updated code to add the formula everywhere it is used. The formula used is (floor(2N/3) +1).

hmoniz · 2019-08-16T16:49:21Z

Hi @panghalamit @jimthematrix. I'm sorry about the back and forth. @jpmsam and I actually discussed this today and we think that ceil(2N/3) is the most appropriate formula, precisely for the reasons you identified. It matches the lower bound for safety, while not being as strict as n-f, thus ensuring the best performance. Thank you for this contribution! (We are happy to make the updates too.)

…to ibft-quorum-formula

jimthematrix · 2019-08-19T13:05:32Z

@hmoniz thanks for your input, no trouble at all, we totally understand the complexity of this and are happy to help in different ways to get it right. The latest updates from Amit have been running in our health check part of the pipeline since two weeks ago and haven’t caused any failures (again the deliberate hostility of the environment has reliably produced failures in the past with the old formula). Please let us know if any further changes are needed.

consensus/istanbul/validator/default.go

consensus/istanbul/core/prepare_test.go

consensus/istanbul/validator/default_test.go

…talls node scale out

… to floor(2*N/3)+1

`op.isMutating()` was added for checking mutating VM operation, but `operation.writes` should be used now as it registers all mutating ops and is consistent with upstream. Additionally, there is a new mutating opcode `CREATE2`, but `CREATE2` has not been added to the `op.isMutating()` set. The VM is in read-only mode when a private contract calls a public contract.

Tessera 0.10.0 Release updates

…ceil(2n/3), tests updated based on suggestion

hmoniz

Sorry, I just have a few more comments. Thank you for incorporating all the suggestions!

consensus/istanbul/validator/default_test.go

consensus/istanbul/backend/engine.go

consensus/istanbul/core/commit_test.go

consensus/istanbul/core/prepare_test.go

panghalamit · 2019-08-22T17:09:49Z

consensus/istanbul/backend/engine.go

@@ -289,8 +289,8 @@ func (sb *backend) verifyCommittedSeals(chain consensus.ChainReader, header *typ
 		}
 	}

-	// The length of validSeal should be larger than number of faulty node + 1


In the master the condition is
if validSeal < 2*snap.ValSet.F()
shouldn't it be this instead?
if validSeal <= snap.ValSet.F()

consensus/istanbul/core/commit_test.go

…to ibft-quorum-formula

…onfirmations required to move between states to Ceil(2n/3)

jbhurat · 2019-09-19T18:24:29Z

Hi @jimthematrix and @panghalamit, to make a controlled transition to the new formula for existing chains we have made an enhancement where the change in formula only happen after ceil2Nby3Block defined in genesis config block has passed. Can you please review the changes in https://github.com/jbhurat/quorum/tree/Ceil2Nby3Block and let us know your thoughts

jimthematrix · 2019-09-25T14:32:10Z

@jbhurat sorry for the delayed response, the additional changes in the Ceil2Nby3Block branch looks good to me.

just to confirm my understanding, we expect the procedure to migrate an existing branch to be:

if all nodes in an existing chain are upgraded together without modifying and re-applying chainconfig, all nodes start applying Ceil(2N/3) formula and all is good
if an existing chain is not able to coordinate a synchronized upgrade, they should insert Ceil2Nby3Block into the genesis.json and give it a block number in the future, and re-init chain config (geth init genesis.json), to ensure the chain can still function when a mix of old and new node versions are running, as long as all nodes are eventually upgraded before the fork block is hit

jbhurat · 2019-09-25T15:14:21Z

Hi @jimthematrix, Ceil(2N/3) formula will only be applied after Ceil2Nby3Block value has passed, so in both the cases above, geth init genesis.json will have to be run for the new formula to be used.

jimthematrix · 2019-09-25T15:33:28Z

I saw in DefaultConfig this value is set to 0, doesn't that mean once upgraded, if the node doesn't find Ceil2Nby3Block in the chain config, it'll assume 0 and apply the new formula right away?

jbhurat · 2019-09-25T16:02:45Z

In DefaultConfig the value is set to 0, but it is not being applied. If you look at RequestTimeout, it has a default value of 10000 and it is being applied to IstanbulRequestTimeoutFlag in flags.go.

The reason we went with the approach of applying the flag, for Ceil(2N/3) formula, was that when a node restarts, it doesn't know if 2F + 1 or Ceil(2N/3) was used unless we store that info in the block or level db which we wanted to avoid

jimthematrix · 2019-09-25T18:22:44Z

thanks for explaining that. The mechanism looks good to us 👍

…rn nil pointer dereference when istanbul config section is missing from genesis

jbhurat · 2019-09-27T18:14:36Z

Hi @jimthematrix and @panghalamit, do you guys want to merge those changes in this PR and we will go ahead and merge this PR

…m-formula

jimthematrix · 2019-09-27T18:50:06Z

@jbhurat just finished merging from your branch to this PR, please double check the result. thanks!

fyi @panghalamit

jpmsam · 2019-09-30T19:27:14Z

Thanks @jimthematrix and @panghalamit for your contribution and for incorporating all the feedback.

panghalamit added 3 commits August 7, 2019 17:46

initial commit for pr

c8e9178

replaced quorum size in roundchange.go with ceil(2N/3) to fix chain s…

bc55567

…talls node scale out

updated quorumsize unit test to use CONSTANTS as defined

a914f8a

updated to include use quorumsizewhere applicable and changed default…

3dee78e

… to floor(2*N/3)+1

Merge branch 'master' into ibft-quorum-formula

c34d30f

panghalamit added 3 commits August 16, 2019 13:29

updated quorum formula to ceil(2N/3)

6098dc1

Merge branch 'master' into ibft-quorum-formula

90123ae

Merge branch 'ibft-quorum-formula' of github.com:kaleido-io/quorum in…

97123e2

…to ibft-quorum-formula

hmoniz reviewed Aug 22, 2019

View reviewed changes

panghalamit and others added 13 commits August 22, 2019 09:46

initial commit for pr

247e48f

replaced quorum size in roundchange.go with ceil(2N/3) to fix chain s…

125b3eb

…talls node scale out

updated quorumsize unit test to use CONSTANTS as defined

d0a5b41

v2.2.5

637d669

updated to include use quorumsizewhere applicable and changed default…

c8d3391

… to floor(2*N/3)+1

Tessera 0.10.0 - Documentation update (Consensys#801)

8886310

Tessera 0.10.0 Release updates

updated quorum formula to ceil(2N/3)

4d2319d

quorum size formula doesnt take any inputs and only returns value by …

2a3f5e1

…ceil(2n/3), tests updated based on suggestion

rebased with master

b55bddd

removed constants

9d4ffd6

cleaning up unused constants

2073b07

Merge branch 'master' into ibft-quorum-formula

0158e45

updated missed import

031daa0

hmoniz reviewed Aug 22, 2019

View reviewed changes

panghalamit commented Aug 22, 2019

View reviewed changes

panghalamit added 2 commits August 22, 2019 14:33

updated with suggested changes

a98c32f

Merge branch 'master' into ibft-quorum-formula

ef496ff

jbhurat reviewed Aug 27, 2019

View reviewed changes

consensus/istanbul/core/commit_test.go Outdated Show resolved Hide resolved

panghalamit and others added 4 commits August 27, 2019 16:48

updated inconsistent testcase

68f568a

Merge branch 'ibft-quorum-formula' of github.com:kaleido-io/quorum in…

2cf2657

…to ibft-quorum-formula

Merge branch 'master' into ibft-quorum-formula

b380271

Adding Ceil2Nby3Block genesis config option to change the number of c…

f84b5f9

…onfirmations required to move between states to Ceil(2n/3)

Merge branch 'master' into ibft-quorum-formula

60bb862

Adding a nil check for newcfg.Istanbul so that geth init doesn't retu…

c05673f

…rn nil pointer dereference when istanbul config section is missing from genesis

jimthematrix added 2 commits September 27, 2019 14:42

Merge remote-tracking branch 'jbhurat/Ceil2Nby3Block' into ibft-quoru…

ae62307

…m-formula

Removed duplicate impl of QuorumSize() in validator

75ecbf4

jimthematrix added 2 commits September 30, 2019 09:49

Addressing review comments from Jitu

9a45052

Updated nodeinfo tests based on Jitu's feedback

cd0a976

jpmsam approved these changes Sep 30, 2019

View reviewed changes

jpmsam merged commit e127852 into Consensys:master Sep 30, 2019

jpmsam mentioned this pull request Nov 8, 2019

Istanbul Byzantine Fault Tolerance ethereum/EIPs#650

Closed

KimKyungup mentioned this pull request Feb 9, 2022

How many committed seal should be checked? 2F+1 or F+1 or QuorumSize? #1324

Open

KimKyungup mentioned this pull request Jun 2, 2022

Change quorum size of consensus klaytn/klaytn#1403

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Chain stalls while scaling out from 1 to 4 nodes, changing quorum size fixes things #796

Chain stalls while scaling out from 1 to 4 nodes, changing quorum size fixes things #796

panghalamit commented Aug 7, 2019

jimthematrix commented Aug 8, 2019

jimthematrix commented Aug 8, 2019

jpmsam commented Aug 8, 2019

jimthematrix commented Aug 8, 2019

jpmsam commented Aug 8, 2019

panghalamit commented Aug 9, 2019

hmoniz commented Aug 16, 2019

jimthematrix commented Aug 19, 2019

hmoniz left a comment

panghalamit Aug 22, 2019

jbhurat commented Sep 19, 2019

jimthematrix commented Sep 25, 2019

jbhurat commented Sep 25, 2019

jimthematrix commented Sep 25, 2019

jbhurat commented Sep 25, 2019

jimthematrix commented Sep 25, 2019

jbhurat commented Sep 27, 2019

jimthematrix commented Sep 27, 2019 •

edited

Loading

jpmsam commented Sep 30, 2019

Chain stalls while scaling out from 1 to 4 nodes, changing quorum size fixes things #796

Chain stalls while scaling out from 1 to 4 nodes, changing quorum size fixes things #796

Conversation

panghalamit commented Aug 7, 2019

jimthematrix commented Aug 8, 2019

jimthematrix commented Aug 8, 2019

jpmsam commented Aug 8, 2019

jimthematrix commented Aug 8, 2019

jpmsam commented Aug 8, 2019

panghalamit commented Aug 9, 2019

hmoniz commented Aug 16, 2019

jimthematrix commented Aug 19, 2019

hmoniz left a comment

Choose a reason for hiding this comment

panghalamit Aug 22, 2019

Choose a reason for hiding this comment

jbhurat commented Sep 19, 2019

jimthematrix commented Sep 25, 2019

jbhurat commented Sep 25, 2019

jimthematrix commented Sep 25, 2019

jbhurat commented Sep 25, 2019

jimthematrix commented Sep 25, 2019

jbhurat commented Sep 27, 2019

jimthematrix commented Sep 27, 2019 • edited Loading

jpmsam commented Sep 30, 2019

jimthematrix commented Sep 27, 2019 •

edited

Loading