VDiff External DB by rafael · Pull Request #144 · tinyspeck/vitess

rafael · 2020-01-11T00:35:07Z

Description

Enables vdiff to connect to an external database. This PR leverages most of the work provided by https://github.com/vitessio/vitess/pull/5367/files.
Specifically, vreplication/vdiff.go is a copy of vt/wrangler/vdiff.go that enables vdiff to be run directly from a tablet. The following changes were required to make this work:
- Discovery mechanism from open source version is removed and there is an assumption that the target tablet is always the tablet where vdiff is running. Source tablet is determined by the same discovery mechanism that vreplication uses to find a source for the stream.
- vstreamer_client interface was enhanced to be able to run vdiff against an external source.
- The way diffs are reported were tweaked and they are exposed via an API endpoint in vttablet.
- It always assumes that there will only be a single vreplication stream.

Asks for reviewer

Keep in my mind that eventually this code will be reverted. We only need this while we are in the migration process.
Did not add tests, as there is enough coverage provided in vreplication: vdiff vitessio/vitess#5367.
The following two PR's from upstream have cherry-picked into this branch, so no need to review changes related to filepos flavor.
- Updates how master gtid position is obtained for file:pos flavor vitessio/vitess#5689
- Fixes bug in filepos flavor vitessio/vitess#5688

Signed-off-by: Rafael Chacon <rafael@slack-corp.com>

* Prior to this commit, flavorpos was using lexicographical comparison of the gtids. Thas was a bug in this context. Signed-off-by: Rafael Chacon <rafael@slack-corp.com>

When generating masterGTIDSet in file:pos most likely you will have a topology like the following: Source A -> Target B (B has a vreplication stream from A) From the target perspective, the source A is the master and you want to generate a gtid that is based on binlog file position of that server. As an example, let's see this topology: Master A -> Source B -> Target C (C has vreplication stream from B) Prior to this change, masterGTIDSet was returning the binlogfile:pos of A. But in reality, the Target C wants the position of B. Signed-off-by: Rafael Chacon <rafael@slack-corp.com>

setassociative

This is an admittedly clumsy read through as it's large and I don't have deep knowledge of the existing vreplication/vdiff code but it feels good to me. Backed by extensive manual + testing I think this is good to merge.

setassociative · 2020-01-14T01:04:25Z

go/vt/vttablet/tabletmanager/action_agent.go


 	// The db name is set by the Start function called above
-	agent.VREngine = vreplication.NewEngine(ts, tabletAlias.Cell, mysqld, func() binlogplayer.DBClient {
+	agent.VREngine = vreplication.NewEngine(ts, tabletAlias.GetCell(), agent.Tablet(), mysqld, func() binlogplayer.DBClient {


Just to be clear when we agent.Tablet() we get a clone of the proto object at that point -- is there danger that it will change in the background in some meaningful way before we come through and run the vtdiff?

Yeah, that is correct. There is no danger, because in this context we always assume that you will be running vdiff from the current tablet, against it's external source.

setassociative · 2020-01-14T01:10:33Z

go/vt/vttablet/tabletmanager/vreplication/controller.go

 	healthcheckRetryDelay      = flag.Duration("vreplication_healthcheck_retry_delay", 5*time.Second, "healthcheck retry delay")
 	healthcheckTimeout         = flag.Duration("vreplication_healthcheck_timeout", 1*time.Minute, "healthcheck retry delay")
 	retryDelay                 = flag.Duration("vreplication_retry_delay", 5*time.Second, "delay before retrying a failed binlog connection")
+	onlyOnceVdiff              sync.Once


it doesn't look like this is used -- is that intentional?

oh good catch. This is from a previous incarnation when I did not have an api to control vdiff. Let me remove.

setassociative · 2020-01-14T01:31:43Z

go/vt/vttablet/tabletmanager/vreplication/vstreamer_client.go

+		// Wait for the conn.ExecuteFetch() call to return.
+		<-done
+		// Close the connection. Upon Recycle() it will be thrown out.
+		conn.Close()


Because you defer conn.Close() at the executeFetchContext call site this will happen twice. Safe to call multiple times but not sure if you want to leave it out of this code path.

This one of the other things I had to copy from other places to make this work against an external database. This comes from: https://github.com/tinyspeck/vitess/blob/master/go/vt/mysqlctl/query.go#L98

* Address PR review + some other cleanup per linter Signed-off-by: Rafael Chacon <rafael@slack-corp.com>

Rafael Chacon added 6 commits January 5, 2020 16:55

VDiff ad-hoc version for slack

c2c8dab

Merge branch 'master' into vdiff-external-db

f20d7cb

Removes extra log items

1c13207

Signed-off-by: Rafael Chacon <rafael@slack-corp.com>

Print actual position where it actually stop

fbed963

Signed-off-by: Rafael Chacon <rafael@slack-corp.com>

Fixes bug in filepos flavor

ccc4b5d

* Prior to this commit, flavorpos was using lexicographical comparison of the gtids. Thas was a bug in this context. Signed-off-by: Rafael Chacon <rafael@slack-corp.com>

setassociative approved these changes Jan 14, 2020

View reviewed changes

Cleanup per review

58423f8

* Address PR review + some other cleanup per linter Signed-off-by: Rafael Chacon <rafael@slack-corp.com>

rafael merged commit 2ffd3e0 into vifl-master Jan 14, 2020

rafael deleted the vdiff-external-db branch January 14, 2020 21:59

rafael mentioned this pull request May 14, 2020

vifl master branch #158

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

VDiff External DB#144

VDiff External DB#144
rafael merged 7 commits intovifl-masterfrom
vdiff-external-db

rafael commented Jan 11, 2020 •

edited

Loading

Uh oh!

setassociative left a comment

Uh oh!

setassociative Jan 14, 2020

Uh oh!

rafael Jan 14, 2020

Uh oh!

setassociative Jan 14, 2020

Uh oh!

rafael Jan 14, 2020

Uh oh!

setassociative Jan 14, 2020

Uh oh!

rafael Jan 14, 2020

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

rafael commented Jan 11, 2020 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Asks for reviewer

Uh oh!

setassociative left a comment

Choose a reason for hiding this comment

Uh oh!

setassociative Jan 14, 2020

Choose a reason for hiding this comment

Uh oh!

rafael Jan 14, 2020

Choose a reason for hiding this comment

Uh oh!

setassociative Jan 14, 2020

Choose a reason for hiding this comment

Uh oh!

rafael Jan 14, 2020

Choose a reason for hiding this comment

Uh oh!

setassociative Jan 14, 2020

Choose a reason for hiding this comment

Uh oh!

rafael Jan 14, 2020

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

rafael commented Jan 11, 2020 •

edited

Loading