vttablet: heartbeat always enabled #6665
shlomi-noach wants to merge 14 commits into vitessio:master from
-heartbeat_enable flag now always assumed to be enabled
-heartbeat_interval is now always set to some value in the range (0..1] seconds, default 250ms
Signed-off-by: Shlomi Noach <2607934+shlomi-noach@users.noreply.github.com>
#6668 depends on this.
ping, request for review 🙏 #6668 depends on this.
I believe I understand why The problem is that Quote:
However, we enabled I could, potentially, add a
57d5fe3 addresses the above comment about test failures.
Just checking in, since I haven't been following the recent code changes: it used to be the case that the same command line flag controlled both whether to write heartbeats and whether to disable a tablet as unhealthy if no heartbeat lag could be determined, which created a messy rollout problem where you couldn't have Has that been resolved by recent refactorings so that
That is an excellent point. I believe it is still the case that restarting a replica with
I was thinking maybe 00100d4, as a temporary experiment, would fix the failing tests -- it seemed it could be related. It did not. The tests fail on the same issue. Reverted that commit. The tests fail like so: It seems like I have no further ideas at this time. I've been at this for a couple of days now with no progress.
Here's how we could solve the rollout issue with enabling heartbeats.
As far as the throttler goes (#6668), we can have it implicitly enable heartbeat rather than changing the default.
The throttling PR, #6668, now implicitly enables heartbeat, without changing the config. The tests pass and behavior seems correct. This PR is now a non-blocker.
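To illustrate what "implicitly enables heartbeat, without changing the config" could look like, here is a minimal Go sketch. It is not the code from #6668: `tabletConfig`, its fields, and `shouldWriteHeartbeat` are hypothetical names invented for this example. The point is only that the decision to write heartbeats is derived from both settings, while the operator's configured value is left untouched.

```go
package main

import "fmt"

// tabletConfig is a hypothetical, pared-down stand-in for a tablet's
// configuration. The field names are assumptions for illustration only.
type tabletConfig struct {
	HeartbeatEnable bool // what the operator explicitly configured
	ThrottlerEnable bool // whether the throttler is turned on
}

// shouldWriteHeartbeat reports whether heartbeats should be written.
// The throttler needs heartbeats, so enabling it implies heartbeats,
// but cfg itself is never modified.
func shouldWriteHeartbeat(cfg tabletConfig) bool {
	return cfg.HeartbeatEnable || cfg.ThrottlerEnable
}

func main() {
	cfg := tabletConfig{HeartbeatEnable: false, ThrottlerEnable: true}
	fmt.Println("write heartbeats:", shouldWriteHeartbeat(cfg))
	fmt.Println("configured value unchanged:", cfg.HeartbeatEnable)
}
```

Deriving the effective value this way keeps the existing default intact for everyone who does not use the throttler, which is the property that made this PR a non-blocker.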
@shlomi-noach I think we decided not to make this the default. Can we close this PR and mark the issue as "Won't Fix"?
This will not be implemented at this time. See #6665 (comment)
Closes #6662
-heartbeat_enable flag now always assumed to be enabled
-heartbeat_interval is now always set to some value in the range (0..1] seconds, default 250ms