NRG: Install leader snapshot on scaleup by MauriceVanVeen · Pull Request #7509 · nats-io/nats-server

MauriceVanVeen · 2025-11-04T18:04:27Z

When scaling up a stream from R1 to R3 a snapshot is made of the R1 stream and SendSnapshot is called to share the initial state with the new peers. However, this snapshot would solely be in the log and not installed. If the upper layer JetStream catchup were to fail halfway, the two incomplete peers could try to become the leader. This could then result in the stream becoming desynced.

We can ensure these peers never become leader before they're fully synced by installing the snapshot, as that ensures the upper layer can process it during recovery. If the previous R1 leader is not online to perform the catchup, the follower can now successfully call n.DrainAndReplaySnapshot() without needing to reset clustered state. Allowing it to reuse the installed snapshot and not become leader until after the snapshot has been successfully processed.

Signed-off-by: Maurice van Veen github@mauricevanveen.com

neilalexander

LGTM

neilalexander · 2025-11-04T18:20:43Z

FYI:

=== RUN   TestJetStreamClusterSnapshotAndRestoreWithHealthz
    jetstream_cluster_3_test.go:4811: S-1 - JetStream stream '$G > TEST' is not current: group node unhealthy
--- FAIL: TestJetStreamClusterSnapshotAndRestoreWithHealthz (2.56s)

neilalexander

LGTM

Signed-off-by: Maurice van Veen <github@mauricevanveen.com>

Includes the following: - #7499 - #7503 - #7508 - #7510 - #7509 - #7512 - #7516 - #7515 Signed-off-by: Neil Twigg <neil@nats.io>

Includes the following: - #7416 - #7425 - #7486 - #7495 - #7482 - #7496 - #7499 - #7503 - #7508 (excluding weak pointer/cache-related changes that apply only to 2.12.x) - #7510 - #7509 - #7512 - #7516 - #7515 Signed-off-by: Neil Twigg <neil@nats.io>

MauriceVanVeen requested a review from a team as a code owner November 4, 2025 18:04

neilalexander approved these changes Nov 4, 2025

View reviewed changes

MauriceVanVeen force-pushed the maurice/r1-scaleup branch from 864f629 to 3b55741 Compare November 4, 2025 18:48

neilalexander approved these changes Nov 5, 2025

View reviewed changes

NRG: Install leader snapshot on scaleup

1ac5ba5

Signed-off-by: Maurice van Veen <github@mauricevanveen.com>

MauriceVanVeen force-pushed the maurice/r1-scaleup branch from 3b55741 to 1ac5ba5 Compare November 5, 2025 09:25

neilalexander merged commit b6d9254 into main Nov 5, 2025
130 of 136 checks passed

neilalexander deleted the maurice/r1-scaleup branch November 5, 2025 10:14

This was referenced Nov 5, 2025

Cherry-picks for 2.12.2-RC.2 #7513

Merged

Cherry-picks for 2.11.11-RC.2 #7514

Merged

neilalexander added a commit that referenced this pull request Nov 5, 2025

Cherry-picks for 2.12.2-RC.2 (#7513)

a885313

Includes the following: - #7499 - #7503 - #7508 - #7510 - #7509 - #7512 - #7516 - #7515 Signed-off-by: Neil Twigg <neil@nats.io>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

NRG: Install leader snapshot on scaleup#7509

NRG: Install leader snapshot on scaleup#7509
neilalexander merged 1 commit intomainfrom
maurice/r1-scaleup

MauriceVanVeen commented Nov 4, 2025

Uh oh!

neilalexander left a comment

Uh oh!

neilalexander commented Nov 4, 2025

Uh oh!

neilalexander left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

Conversation

MauriceVanVeen commented Nov 4, 2025

Uh oh!

neilalexander left a comment

Choose a reason for hiding this comment

Uh oh!

neilalexander commented Nov 4, 2025

Uh oh!

neilalexander left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants