Skip to content

Fix bug found in dev around repairReplication patch in PR #4024#4081

Merged
sougou merged 1 commit intovitessio:masterfrom
tinyspeck:fix-repair-replication-orc-active-recovery-empty-response
Jul 12, 2018
Merged

Fix bug found in dev around repairReplication patch in PR #4024#4081
sougou merged 1 commit intovitessio:masterfrom
tinyspeck:fix-repair-replication-orc-active-recovery-empty-response

Conversation

@zmagg
Copy link
Copy Markdown
Contributor

@zmagg zmagg commented Jul 12, 2018

Fixes a bug introduced in the repairReplication #4024 where replica tablets sometimes do not reconnect replication to the master tablet after restarting. This bug is triggered when Orchestrator has never recovered the shard the tablet is a member of.

This fixes the bug by allowing repairReplication to continue when audit-recovery returns an empty response, as an empty response indicates that Orchestrator is not actively recovering that shard, and has never recovered the shard before. Previously, #4024 assumed an empty response was an error and skipped repairReplication after getting an empty response back from Orchestrator's audit-recovery endpoint.

active-recovery response when it has no history of recovery on that
cluster.

Signed-off-by: Maggie Zhou <mzhou@slack-corp.com>
@zmagg zmagg force-pushed the fix-repair-replication-orc-active-recovery-empty-response branch from 5a4f6cc to d5b2279 Compare July 12, 2018 01:22
@sougou sougou merged commit debb5ac into vitessio:master Jul 12, 2018
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants