VReplication: Take replication lag into account in VStreamManager healthcheck result processing #15761
Merged
mattlord merged 6 commits into vitessio:main on Apr 22, 2024
…ponse handling Signed-off-by: Matt Lord <mattalord@gmail.com>
Codecov Report
Attention: Patch coverage is
Additional details and impacted files
@@            Coverage Diff             @@
##             main   #15761      +/-   ##
==========================================
+ Coverage   68.38%   68.40%   +0.01%
==========================================
  Files        1556     1556
  Lines      195361   195437      +76
==========================================
+ Hits       133598   133689      +91
+ Misses      61763    61748      -15
Signed-off-by: Matt Lord <mattalord@gmail.com>
7e30dd7 to 9c64cbd
The event slice is not protected. Signed-off-by: Matt Lord <mattalord@gmail.com>
Signed-off-by: Matt Lord <mattalord@gmail.com>
rohit-nayak-ps approved these changes Apr 22, 2024
Signed-off-by: Matt Lord <mattalord@gmail.com>
Signed-off-by: Matt Lord <mattalord@gmail.com>
frouioui approved these changes Apr 22, 2024
This was referenced Apr 22, 2024
timvaillancourt pushed a commit to slackhq/vitess that referenced this pull request May 30, 2024
…reamManager healthcheck result processing (vitessio#15761) (vitessio#15772) Signed-off-by: Matt Lord <mattalord@gmail.com> Co-authored-by: vitess-bot[bot] <108069721+vitess-bot[bot]@users.noreply.github.com> Co-authored-by: Matt Lord <mattalord@gmail.com>
timvaillancourt added a commit to slackhq/vitess that referenced this pull request May 30, 2024
* [release-17.0] VReplication: Take replication lag into account in VStreamManager healthcheck result processing (vitessio#15761) (vitessio#15772) Signed-off-by: Matt Lord <mattalord@gmail.com> Co-authored-by: vitess-bot[bot] <108069721+vitess-bot[bot]@users.noreply.github.com> Co-authored-by: Matt Lord <mattalord@gmail.com> * fix signature Signed-off-by: Tim Vaillancourt <tim@timvaillancourt.com> --------- Signed-off-by: Matt Lord <mattalord@gmail.com> Signed-off-by: Tim Vaillancourt <tim@timvaillancourt.com> Co-authored-by: vitess-bot[bot] <108069721+vitess-bot[bot]@users.noreply.github.com> Co-authored-by: Matt Lord <mattalord@gmail.com>
twthorn added a commit to slackhq/vitess that referenced this pull request Aug 12, 2024
Description
The vtgate VStreamManager, which manages VStream API requests made against a vtgate, watches a healthcheck stream from the tablets that it's streaming data from for each high-level vstream. When processing the source tablet's healthcheck responses, however, it was not taking replication lag into account as it should. See the issue for more details.

This PR corrects that by checking the replication lag reported in the healthcheck response against the vtgate's live value tied to the --discovery_low_replication_lag flag (the value is also a dynamic variable, so it can change at runtime).

I think that we should backport this all the way to 17.0 as it's a small and straightforward fix for a somewhat glaring bug.
Related Issue(s)
Checklist