Introduce an explicit delay for QUALITY phase #785

masih · 2024-12-11T10:16:34Z

Capturing what was discussed in 11-12-2024 standup. Below is a summary of an idea originally proposed by @Kubuxu:

The duration of QUALITY phase is directly governed by delta. This phase will await the maximum length of timeout (or proceeds if there is strong quorum for the proposal).

The QUALITY phase is also very important to the progress velocity of an instance: insufficiently propagated QUALITY messages will lead to PREPARE for base, and most likely additional rounds. This behaviour was observed repeatedly in mainnet testing, specially during bootstrap phase, which is less than ideal. Because in bootstrap phase there really is no chain forkiness: the entire network is trying to decide on chains that have far lower propability of reorg than chains at steady state.

We could just increase the delta to make sure enough time is given to QUALITY messages to propagate but larger delta would result in further increase of the time it takes for an instance to terminate affecting every phase. Further, it would specifically impact CONVERGE phase, because that phase also waits for the timeout to pass regardless.

So the proposal here is to introduce a dedicated timeout for QUALITY phase, at least in the dynamic manifest for testing purposes. If proven to be successful we then proceed to propose the changes in a FIP etc.

The rationale for having this dedicated timeout in QUALITY phase only instead of both CONVERGE and QUALITY is that if QUALITY messages are sufficiently propagated and we still hit CONVERGE, then the chances are the chain is too forky and we are better off finalising on base and starting a new instance with a fresh proposal than trying to finalise on a nonbase in the current instance. Therefore, the chances are a faster CONVERGE and the start of a fresh instance results in higher overall progress velocity compared to delaying CONVERGE.

Of course, we can test this thesis at scale by introducing delay for both QUALITY and CONVERGE.

BigLep · 2024-12-18T15:50:14Z

2024-12-18 standup notes:

agreed it's more elegant if it it's a multiplier. We'll do a float multiplier.
this will be exposed for passive testing (will be in the manifest)

github-project-automation bot added this to F3 Dec 11, 2024

github-project-automation bot moved this to Todo in F3 Dec 11, 2024

masih added this to the Milestone 2.5: Mainnet Deployment Readiness milestone Dec 11, 2024

BigLep modified the milestones: Milestone 2.5: Mainnet Activation, Milestone 2: Mainnet Passive Testing Dec 13, 2024

BigLep mentioned this issue Dec 18, 2024

Hash common fields for message propagation #792

Open

BigLep assigned Kubuxu Dec 18, 2024

BigLep moved this from Todo to In progress in F3 Dec 18, 2024

Kubuxu linked a pull request Dec 19, 2024 that will close this issue

Add quality duration multiplier #805

Merged

BigLep moved this from In progress to In review in F3 Dec 19, 2024

Kubuxu closed this as completed in #805 Dec 20, 2024

github-project-automation bot moved this from In review to Done in F3 Dec 20, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Introduce an explicit delay for QUALITY phase #785

Introduce an explicit delay for QUALITY phase #785

masih commented Dec 11, 2024 •

edited

Loading

BigLep commented Dec 18, 2024

Introduce an explicit delay for QUALITY phase #785

Introduce an explicit delay for QUALITY phase #785

Comments

masih commented Dec 11, 2024 • edited Loading

BigLep commented Dec 18, 2024

masih commented Dec 11, 2024 •

edited

Loading