check replication lag on state change before starting query service by deepthi · Pull Request #5000 · vitessio/vitess

deepthi · 2019-07-15T20:14:30Z

check replication delay on replica during state_change before setting it to SERVING
restored tablets should always start as NOT_SERVING
SecondsBehindMaster from SHOW SLAVE STATUS gets set to 0 when replication is stopped and restarted. Implemented logic in backup/restore to ensure that either replication has caught up to master, or has progressed from last known position before this can be trusted to compute replication delay.

Signed-off-by: deepthi deepthi@planetscale.com

setassociative

This looks good -- interested in what your approach to validating this was with live data and if you had any problems with the tablet getting stuck in a non-serving mode (iirc I was never able to figure out why that happened to me).

go/vt/vttablet/tabletmanager/state_change.go

rafael · 2019-07-25T19:11:00Z

@deepthi - This looks great. I think there are some Println that we need to remove from the tests.

Before we merge. Could you verify manually that this works? Run a manual integration test? The way we reproduce this is:

Have a script that writes tons of data and create lag.
Take a backup while this script is running.
Have a script that reads from replica.
Notice the tablet UI that when it comes from backup, it does no longer serve any query.

go/vt/vttablet/tabletmanager/healthcheck_test.go

go/vt/vttablet/tabletmanager/state_change.go

sougou · 2019-07-27T18:15:01Z

This change looks good. @deepthi I can merge this once you've satisfied @rafael's request.

deepthi · 2019-07-29T17:24:33Z

This change looks good. @deepthi I can merge this once you've satisfied @rafael's request.

In runHealthCheckLocked, when we set _replicationDelay we also set _healthy and _healthyTime. Should these fields also be set here?

rafael · 2019-08-14T22:51:40Z

@deepthi - I was doing some of our internal testing I think the bug is still present in this branch. I think I was able to create instructions that should help you reproduce in any environment.

Using vtbench:

Create a credential files with this format:

# vtbench_mysql_creds.json
{
   "vtgate_user":[
      "vt_pass"
  ]
}

Create a table with the following schema:

CREATE TABLE `test_table` (
  `i` int(11) DEFAULT NULL,
  `c` char(10) DEFAULT NULL
) ENGINE=InnoDB DEFAULT CHARSET=utf8mb4

Use vtbench to insert data into the table:

vtbench -host localhost -protocol mysql -port 15306 -user vtgate_user -db-credentials-file ./vtbench_mysql_creds.json -db @master --count 300000 --threads 25 -sql "INSERT INTO test_table (i,c)  VALUES(1,'record one') /* vt_bench:thread */"```

This should create lag pretty quickly in the replicas.
Once replicas are lagging run the following:

while true; do vtbench -host localhost -protocol mysql -port 15306 -user vtgate_user -db-credentials-file ./vtbench_mysql_creds.json -db @replica --count 30000 --threads 25 -sql 'select count(*) from test_table'; done

If lag is beyond the serving threshold, you will see vtbench failures.

Trigger a backup to one of the replicas.
You will notice that when the replica comes back from backup, some queries slip in and get served:

deepthi · 2019-09-12T04:10:42Z

The latest changes do ensure that replica does not go into serving if it is lagged more than unhealthyThreshold after completing a backup.
Scenarios tested:

tablet is healthy before backup, comes back to serving after backup
tablet is degraded before backup, still degraded after backup, comes back to serving
tablet is degraded before backup, unhealthy after backup, goes to non_serving
tablet is unhealthy before backup, stays non_serving after backup
when tablet is restored from backup, it goes non_serving if the backup is far behind current master, and goes serving only after catching up

There are a few oddities to be noted:

when trying to change tablet type and state after completing a backup, if the tablet is lagged, it stays in BACKUP type until it is caught up and only then transitions into REPLICA type. This seems to be an artifact of how state_change has been implemented in the code.
in my testing, I stop writing to the master after a while and wait for the replica to catch up (post-backup). The lag will keep growing even as the gap between Executed_Gtid_Set and Retrieved_Gtid_Set narrows. I suspect this is because there are no new transactions on the master (which would trigger a proper re-eval of SecondsBehindMaster). Only when the Executed_Gtid_Set catches up to master, the SecondsBehindMaster goes to 0. So instead of gradually dropping, it suddenly goes to 0. This is probably not going to happen in a real system where there is continuous traffic to master.

rafael

@deepthi - Nice work! I think this approach should be good for what we need. Added minor comments.

go/vt/mysqlctl/backup.go

rafael · 2019-09-12T20:45:10Z

go/vt/mysqlctl/builtinbackupengine.go

This is a new failure mode that we need to be aware, but I think it makes sense to fail if we can't get master position.

Agreed. Also the same type of tablet/mysql loss is something that needs to be handled already by an operator as vttablet/mysqld could go down and fall behind the ability to catch up for several reasons already, e.g., network partition. This failure mode should be well covered by normal tooling.

The thing I'm more interested in actually is going to be

when trying to change tablet type and state after completing a backup, if the tablet is lagged, it stays in BACKUP type until it is caught up and only then transitions into REPLICA type. This seems to be an artifact of how state_change has been implemented in the code.

go/vt/vttablet/tabletmanager/state_change.go

go/vt/mysqlctl/backup.go

teejae · 2019-09-13T01:06:21Z

go/vt/mysqlctl/builtinbackupengine.go

i don't think it's necessary to extract all the params into separate vars, unless you need a few of the vars to overwrite, and then i'd only make those.

I did this for 2 reasons:

it serves as documentation of what params we actually use from the object in this function

it avoided making a whole lot of name changes through out the functions

i think the docs belong in the params struct docs. if it isn't used, then there was not point of putting it in the struct.

if we never change the lines, then the code will get progressively more cluttered.

that params struct is used to pass the params down two levels of calls. Not all params are used at both levels so it is a union of the two sets of params. Do you still feel like we should not extract the params into vars?

$0.02: I find that the backup param makes ExecuteBackup & RestoreBackup much easier to read.

If the primary reason for extracting args is to minimize churn in this review I would suggest a follow up where we move to dereferencing params instead of extracting them like this. That would leave the interface cleaner / more easy to scan and remove this noisy block of param extraction.

setassociative · 2019-09-13T21:12:06Z

go/vt/mysqlctl/builtinbackupengine.go

$0.02: I find that the backup param makes ExecuteBackup & RestoreBackup much easier to read.

If the primary reason for extracting args is to minimize churn in this review I would suggest a follow up where we move to dereferencing params instead of extracting them like this. That would leave the interface cleaner / more easy to scan and remove this noisy block of param extraction.

setassociative · 2019-09-13T21:31:47Z

go/vt/mysqlctl/backupengine.go

Because your params structs are used as an abstraction for "control of some action this mysqlctl API takes" I would suggest dropping BackupHandle as a user facing value. It seems that anything outside of mysqlctl.Backup|Restore is going to have the value they set overridden so we might as well not complicate matters by allowing it to be set before calling the Backup|Restore func.

No strong opinions on if this is done by making private or passing it as a separate param outside this abstraction in to the ultimate ExecuteBackup/FindBackupToRestore

good point. I'll change this.

setassociative · 2019-09-13T21:54:51Z

go/vt/mysqlctl/backupengine.go

For BackupParams and RestoreParams docs on the non-obvious args would be ✨. (To me the non-obvious one is mostly HookExtraEnv though I'm just assuming I understand DbName, LocalMetadata, and what the Keyspace/Shard pair is for).

setassociative · 2019-09-13T22:38:48Z

go/vt/mysqlctl/builtinbackupengine.go

doc nit: "Wait for a reliable 'seconds behind master' value" or something. My initial read of this was that it was going to wait for a number of seconds that we considered reliable to have us get a new seconds behind master reading.

I'll fix the comment

setassociative · 2019-09-13T22:48:08Z

go/vt/mysqlctl/builtinbackupengine.go

Agreed. Also the same type of tablet/mysql loss is something that needs to be handled already by an operator as vttablet/mysqld could go down and fall behind the ability to catch up for several reasons already, e.g., network partition. This failure mode should be well covered by normal tooling.

The thing I'm more interested in actually is going to be

when trying to change tablet type and state after completing a backup, if the tablet is lagged, it stays in BACKUP type until it is caught up and only then transitions into REPLICA type. This seems to be an artifact of how state_change has been implemented in the code.

deepthi · 2019-09-14T00:55:57Z

@setassociative is this a concern? Can you articulate what you see as potential problems with it?

when trying to change tablet type and state after completing a backup, if the tablet is lagged, it stays in BACKUP type until it is caught up and only then transitions into REPLICA type. This seems to be an artifact of how state_change has been implemented in the code.

setassociative · 2019-09-15T17:09:45Z

@deepthi With respect to the BACKUP mode thing: sorry, I don't have concerns and should have been more clear. I was calling it out simply because that felt the more "new" change and thus a little more interesting.

Signed-off-by: deepthi <deepthi@planetscale.com>

…dMaster during backup/restore. separate disallowQueryService from disallowQueryReason. disallowQueryReason was being used to permanently disable query service, but for lagging tablets we want to disable it temporarily. change ExecuteBackup and ExecuteRestore to accept BackupParams and RestoreParams instead of a long list of arguments. Signed-off-by: deepthi <deepthi@planetscale.com>

… minor edits Signed-off-by: deepthi <deepthi@planetscale.com>

sougou

The more stable fix will be to encapsulate this functionality within the module that reports replica lag so all users can benefit from this, which includes other healthcheck workflows.

We can do that as a different PR.

deepthi requested a review from sougou as a code owner July 15, 2019 20:14

setassociative reviewed Jul 17, 2019

View reviewed changes

go/vt/vttablet/tabletmanager/state_change.go Outdated Show resolved Hide resolved

go/vt/vttablet/tabletmanager/state_change.go Outdated Show resolved Hide resolved

go/vt/vttablet/tabletmanager/state_change.go Outdated Show resolved Hide resolved

deepthi requested review from dweitzman and rafael July 23, 2019 00:18

deepthi changed the title ~~WIP: check replication lag on state change before starting query service~~ check replication lag on state change before starting query service Jul 23, 2019

rafael reviewed Jul 25, 2019

View reviewed changes

go/vt/vttablet/tabletmanager/healthcheck_test.go Outdated Show resolved Hide resolved

go/vt/vttablet/tabletmanager/healthcheck_test.go Outdated Show resolved Hide resolved

go/vt/vttablet/tabletmanager/state_change.go Outdated Show resolved Hide resolved

deepthi mentioned this pull request Aug 15, 2019

Tablets serving stale data after backup #4426

Closed

deepthi force-pushed the ds-4426 branch 2 times, most recently from 0547ddc to 35be115 Compare August 27, 2019 00:38

deepthi force-pushed the ds-4426 branch from 35be115 to b8cab7c Compare September 12, 2019 03:53

rafael reviewed Sep 12, 2019

View reviewed changes

deepthi force-pushed the ds-4426 branch from b8cab7c to 76cdc19 Compare September 12, 2019 22:44

teejae reviewed Sep 13, 2019

View reviewed changes

deepthi force-pushed the ds-4426 branch from 76cdc19 to 214e636 Compare September 13, 2019 21:48

setassociative reviewed Sep 13, 2019

View reviewed changes

deepthi force-pushed the ds-4426 branch from 3defc17 to a26cdfb Compare September 16, 2019 14:19

deepthi added 4 commits September 16, 2019 18:01

update replicationDelay for all serving non-master tablet types

639b695

Signed-off-by: deepthi <deepthi@planetscale.com>

start restored tablets in NON-SERVING state

8fe0fb8

Signed-off-by: deepthi <deepthi@planetscale.com>

keep BackupHandle out of Params structs, document struct fiels, other…

06e5a5f

… minor edits Signed-off-by: deepthi <deepthi@planetscale.com>

deepthi force-pushed the ds-4426 branch from a26cdfb to 06e5a5f Compare September 17, 2019 01:01

sougou approved these changes Sep 17, 2019

View reviewed changes

deepthi merged commit 3a21af2 into vitessio:master Sep 18, 2019

deepthi mentioned this pull request Sep 30, 2019

Don't abort restore if master is unreachable #5254

Merged

deepthi mentioned this pull request Oct 30, 2019

PlannedReparentShard: Fix more known-recoverable problems. #5376

Merged

spark4 mentioned this pull request Nov 12, 2019

Serry deploy tinyspeck/vitess#140

Closed

spark4 mentioned this pull request Nov 22, 2019

Slack sync upstream 2019 11 09.r0 tinyspeck/vitess#142

Merged

rafael mentioned this pull request Dec 11, 2019

Slack sync upstream 2019 12 11.r0 tinyspeck/vitess#143

Merged

deepthi deleted the ds-4426 branch May 14, 2020 16:49

deepthi mentioned this pull request Mar 17, 2021

restore: check disable_active_reparents properly before waiting for position update #7703

Merged

8 tasks

Conversation

deepthi commented Jul 15, 2019 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

setassociative left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

rafael commented Jul 25, 2019

Uh oh!

Uh oh!

Uh oh!

Uh oh!

sougou commented Jul 27, 2019

Uh oh!

deepthi commented Jul 29, 2019

Uh oh!

rafael commented Aug 14, 2019 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

deepthi commented Sep 12, 2019 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

rafael left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

deepthi commented Sep 14, 2019

Uh oh!

setassociative commented Sep 15, 2019

Uh oh!

sougou left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

deepthi commented Jul 15, 2019 •

edited

Loading

rafael commented Aug 14, 2019 •

edited

Loading

deepthi commented Sep 12, 2019 •

edited

Loading