Parallel retry join #13606

ncabatoff · 2022-01-08T18:28:54Z

When using retry_join stanzas to set up new raft nodes via config (instead of issuing explicit join requests), each of the "leaders" specified is tried in turn until one is found that works, with a 2s delay between each attempt. It's common when using retry_join to use the same config, listing all nodes, in every node's HCL, which means a new node could take quite a while to join if the actual leader in the cluster is listed last.

This PR changes how retry_join is handled so that we reach out to every node listed in parallel, and try to complete the join to the first node that replies without an error. We maintain the current behaviour of a 2s delay between each attempt, only now it's a 2s delay between retrying all nodes, instead of a 2s delay between each node attempt.

pmmukh

Couple of nits/questions, otherwise LGTM!

vault/raft.go

ncabatoff added 3 commits January 8, 2022 11:48

Initial refactoring.

a64e347

Do first part of join attempts in parallel.

d47460d

Fix bug I introduced to ha-only mode.

ab7d280

ncabatoff mentioned this pull request Jan 10, 2022

Ignore retry_join elements that refer to ourself. #13544

Closed

Add CL.

e2cf2dd

pmmukh approved these changes Jan 13, 2022

Reviewer feedback

307767d

ncabatoff merged commit 7e74beb into main Jan 17, 2022

ncabatoff deleted the parallel-retry-join branch January 17, 2022 15:33

pmmukh added a commit that referenced this pull request Feb 28, 2022

Revert "Parallel retry join (#13606)"

06b0a81

This reverts commit 7e74beb.
