add option for warming reads to mirror primary read queries onto replicas from vtgates to warm bufferpools #13206
Conversation
Review Checklist
Hello reviewers! 👋 Please follow this checklist when reviewing this Pull Request.
General
- If a new flag is being introduced:
- If a workflow is added or modified:
- Bug fixes
- Non-trivial changes
- New/Existing features
- Backward compatibility
@olyazavr this is really cool! 🎉 I wonder if this could be more "observable". Is it possible to add … Assuming …
@timvaillancourt: metrics in the vtgate about warm-up might generate a large metric volume, which makes me think the vtgate is not the best place for this observability. What would you think of putting these metrics on the vttablet?
@jfg956 that's a good idea, and it has given me a few more ideas 👍 I think a single … I think …
Last, a deferrable new question: is there an upper limit to how many queries …
@timvaillancourt those are all really good ideas! I'm waiting to get some sort of approval or comment from the Vitess team that this is even a feature they would consider merging before I go and implement more parts of this.
@olyazavr makes sense! They're all deferrable ideas, don't let them block things 👍. Let me know if I can help with anything.
maxenglander
left a comment
Hey @olyazavr, this is awesome, and something I am excited to see land. I work at PlanetScale, so my review does not carry the power of approval. I think you can continue to hold off on making changes until someone from the Vitess eng. team chimes in. That said, I left some questions and small suggestions.
At a higher-level, had some bigger questions:
- From your experience using this in production, what CPU/memory impact have you observed on VTGates? What about overall performance?
- I think it might be nice to have the option to change this value at runtime. If setting 1-2% for warming has a noticeable impact on overall performance, this might be something we only want to set during a maintenance window prior to PRS. No idea how much work it would be to incorporate structural support for this in your PR. If it's a lot I imagine it could be added later on.
I know that the Vitess eng. team will eventually request an addition to changelog/ and a website PR, so noting that as well. I also think adding or modifying existing E2E tests would be a good idea and something they will likely request.
go/vt/vtgate/engine/route.go
+1 to the idea in the main thread to have a warming pool or something else that caps the inflight warming requests.
go/vt/vtgate/executor_select_test.go
Looks like there's a missing mysqlCtx argument between the context and the method.
go/vt/vtgate/vcursor_impl.go
Might be nice to pull this 5*time.Second up into a constant, make it configurable by flag, or maybe default to --query-timeout.
Also, not sure if this is something we could/would want to use:
vitess/go/vt/vtgate/engine/route.go
Line 186 in 344280a
Or this?
vitess/go/vt/vtgate/safe_session.go
Line 206 in aab7c89
go/vt/vtgate/executor_select_test.go
Would be nice to modify this test so that it runs through a bunch of different cases which exercise the various case statements you have in executeWarmingReplicaRead, and also some tests for negatives like insert, update, and queries that compose select with insert and update.
Would also be good to add tests with trailing comments to see how that lands on the replicas.
go/vt/vtgate/executor_select_test.go
The query serving team may disagree with me, but it might be good to set this to a number greater than 0 to help shake out any issues over the next release cycle.
This would create nondeterministic replica queries, which can make for flaky tests (we encountered this problem).
go/vt/vtgate/executor_select_test.go
Same thought here and everywhere else.
go/vt/vtgate/engine/primitive.go
Why return interface{} here instead of VCursor?
go/vt/vtgate/engine/route.go
Might be good to add some stats to add visibility into this.
Ah I see in the main conversation now that there's talk about getting this at the tablet level. FWIW I think it could be useful here in case for whatever reason the queries fail to reach the tablets.
Good point. I think the viper work is actually ready, if you want to use that to make this dynamically configurable.
GuptaManan100
left a comment
I really like the idea of warming reads!
go/vt/vtgate/engine/route.go
MultiEqual might also be something to consider allowing in the opcodes.
This PR is being marked as stale because it has been open for 30 days with no activity. To rectify, you may do any of the following:
If no action is taken within 7 days, this PR will be closed.
This PR was closed because it has been stale for 7 days with no activity.
Force-pushed d56d434 to c8576ca
GuptaManan100
left a comment
I was able to track down one of the unit test failures to this. Because we aren't passing in the context that we cancel later, and instead pass in context.Background, the topo watchers don't shut down properly, which eventually shows up as leaked goroutines:
noleak.go:56: found unexpected goroutines:
[Goroutine 50 in state chan receive, with vitess.io/vitess/go/vt/topo/memorytopo.(*Conn).Watch.func1 on top of the stack:
goroutine 50 [chan receive]:
vitess.io/vitess/go/vt/topo/memorytopo.(*Conn).Watch.func1()
/Users/manangupta/vitess/go/vt/topo/memorytopo/watch.go:56 +0x5c
created by vitess.io/vitess/go/vt/topo/memorytopo.(*Conn).Watch in goroutine 15
/Users/manangupta/vitess/go/vt/topo/memorytopo/watch.go:55 +0x250
Goroutine 51 in state chan receive, with vitess.io/vitess/go/vt/topo.(*Server).WatchSrvVSchema.func1 on top of the stack:
goroutine 51 [chan receive]:
vitess.io/vitess/go/vt/topo.(*Server).WatchSrvVSchema.func1()
/Users/manangupta/vitess/go/vt/topo/srv_vschema.go:74 +0x9c
created by vitess.io/vitess/go/vt/topo.(*Server).WatchSrvVSchema in goroutine 15
/Users/manangupta/vitess/go/vt/topo/srv_vschema.go:70 +0x14c
Goroutine 52 in state select, with vitess.io/vitess/go/vt/vtgate.(*sandboxTopo).WatchSrvVSchema.func1 on top of the stack:
goroutine 52 [select]:
vitess.io/vitess/go/vt/vtgate.(*sandboxTopo).WatchSrvVSchema.func1()
/Users/manangupta/vitess/go/vt/vtgate/sandbox_test.go:323 +0x94
created by vitess.io/vitess/go/vt/vtgate.(*sandboxTopo).WatchSrvVSchema in goroutine 15
/Users/manangupta/vitess/go/vt/vtgate/sandbox_test.go:321 +0x16c
Goroutine 59 in state chan receive, with vitess.io/vitess/go/vt/topo/memorytopo.(*Conn).Watch.func1 on top of the stack:
goroutine 59 [chan receive]:
vitess.io/vitess/go/vt/topo/memorytopo.(*Conn).Watch.func1()
/Users/manangupta/vitess/go/vt/topo/memorytopo/watch.go:56 +0x5c
created by vitess.io/vitess/go/vt/topo/memorytopo.(*Conn).Watch in goroutine 15
/Users/manangupta/vitess/go/vt/topo/memorytopo/watch.go:55 +0x250
Goroutine 60 in state chan receive, with vitess.io/vitess/go/vt/topo.(*Server).WatchSrvVSchema.func1 on top of the stack:
goroutine 60 [chan receive]:
vitess.io/vitess/go/vt/topo.(*Server).WatchSrvVSchema.func1()
/Users/manangupta/vitess/go/vt/topo/srv_vschema.go:74 +0x9c
created by vitess.io/vitess/go/vt/topo.(*Server).WatchSrvVSchema in goroutine 15
/Users/manangupta/vitess/go/vt/topo/srv_vschema.go:70 +0x14c
Goroutine 61 in state select, with vitess.io/vitess/go/vt/vtgate.(*sandboxTopo).WatchSrvVSchema.func1 on top of the stack:
goroutine 61 [select]:
vitess.io/vitess/go/vt/vtgate.(*sandboxTopo).WatchSrvVSchema.func1()
/Users/manangupta/vitess/go/vt/vtgate/sandbox_test.go:323 +0x94
created by vitess.io/vitess/go/vt/vtgate.(*sandboxTopo).WatchSrvVSchema in goroutine 15
/Users/manangupta/vitess/go/vt/vtgate/sandbox_test.go:321 +0x16c
]
I would have committed these changes directly, but unfortunately I don't have access to do that 😢
utils.MustMatch(t, wantQueriesReplica, replica.Queries)
replica.Queries = nil
_, err = executor.Execute(ctx, nil, "TestSelect", session, "insert into user (age, city) values (5, 'Boston')", map[string]*querypb.BindVariable{})
Suggested change:
- _, err = executor.Execute(ctx, nil, "TestSelect", session, "insert into user (age, city) values (5, 'Boston')", map[string]*querypb.BindVariable{})
+ _, err = executor.Execute(ctx, nil, "TestWarmingReads", session, "insert into user (age, city) values (5, 'Boston')", map[string]*querypb.BindVariable{})
GuptaManan100
left a comment
TestHelpOutput is also failing:
--- FAIL: TestHelpOutput (0.70s)
--- FAIL: TestHelpOutput/vtcombo (0.06s)
flags_test.go:142: []: (-want +got)
(
"""
... // 421 identical lines
--vttablet_skip_buildinfo_tags string comma-separated list of buildinfo tags to skip from merging with --init_tags. each tag is either an exact match or a regular expression of the form '/regexp/'. (default "/.*/")
--wait_for_backup_interval duration (init restore parameter) if this is greater than 0, instead of starting up empty when no backups are found, keep checking at this interval for a backup to appear
+ --warming-reads-percent int Percentage of reads on the primary to forward to replicas. Useful for keeping buffer pools warm (default 0)
+ --warming-reads-pool-size int Size of goroutine pool for warming reads (default 500) (default 500)
+ --warming-reads-query-timeout duration Timeout of warming read queries (default 5s) (default 5s)
--warn_memory_rows int Warning threshold for in-memory results. A row count higher than this amount will cause the VtGateWarnings.ResultsExceeded counter to be incremented. (default 30000)
--warn_payload_size int The warning threshold for query payloads in bytes. A payload greater than this threshold will cause the VtGateWarnings.WarnPayloadSizeExceeded counter to be incremented.
... // 11 identical lines
"""
)
--- FAIL: TestHelpOutput/vtgate (0.10s)
flags_test.go:142: []: (-want +got)
strings.Join({
... // 32472 identical bytes
"\n --warming-reads-pool-size int ",
" Size of goroutine pool for warming reads (default 500)",
+ " (default 500)",
"\n --warming-reads-query-timeout duration ",
" Timeout of warming read queri",
+ "es (d",
"e",
+ "fault 5",
"s",
+ ")",
" (default 5s)\n --warn_memory_rows int ",
" Warning threshold for in-memory results. ",
... // 563 identical bytes
}, "")
FAIL
This requires fixing the vtgate.txt and vtcombo.txt file in the flags/endtoend directory to match the new expectation.
Co-authored-by: Manan Gupta <35839558+GuptaManan100@users.noreply.github.com> Signed-off-by: Olga Shestopalova <olgash@mit.edu>
Signed-off-by: Olga Shestopalova <oshestopalova@hubspot.com>
go/flags/endtoend/vtcombo.txt
--vttablet_skip_buildinfo_tags string comma-separated list of buildinfo tags to skip from merging with --init_tags. each tag is either an exact match or a regular expression of the form '/regexp/'. (default "/.*/")
--wait_for_backup_interval duration (init restore parameter) if this is greater than 0, instead of starting up empty when no backups are found, keep checking at this interval for a backup to appear
--warming-reads-percent int Percentage of reads on the primary to forward to replicas. Useful for keeping buffer pools warm (default 0)
--warming-reads-pool-size int Size of goroutine pool for warming reads (default 500)
In Vitess, pools usually mean connection pools. When I read the description I thought it was a WaitGroup or something like that. It actually turns out to be a chan bool whose capacity is set by this flag.
The flag name and description are misleading; they need to be changed to reflect the actual usage. It should be something like --warming-reads-concurrency, documented as "Number of concurrent warming reads allowed".
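The buffered-channel pattern the reviewer is describing can be sketched as follows; `warmingReadsConcurrency`, `tryAcquire`, and `release` are illustrative names, not the PR's actual identifiers:

```go
package main

import "fmt"

func main() {
	// A buffered channel used as a semaphore: its capacity is the number of
	// concurrent warming reads allowed (the suggested
	// --warming-reads-concurrency), not a pool of connections or goroutines.
	const warmingReadsConcurrency = 2
	sem := make(chan bool, warmingReadsConcurrency)

	tryAcquire := func() bool {
		select {
		case sem <- true:
			return true // slot acquired; a warming read may proceed
		default:
			return false // at capacity: drop the warming read, never block the caller
		}
	}
	release := func() { <-sem } // free the slot when the warming read finishes

	fmt.Println(tryAcquire()) // true
	fmt.Println(tryAcquire()) // true
	fmt.Println(tryAcquire()) // false: both slots in use
	release()
	fmt.Println(tryAcquire()) // true again after a release
}
```

The non-blocking `select` with a `default` branch is what makes warming reads best-effort: when the limit is reached, the extra read is simply skipped rather than queued behind the caller's real query.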
I edited the PR description at the top to list all 3 flags. Once these changes are made, that needs to change again.
Signed-off-by: Olga Shestopalova <oshestopalova@hubspot.com>
Signed-off-by: Olga Shestopalova <oshestopalova@hubspot.com>
So close. flags_test is still failing. You can fix that after addressing the feedback, and then this will be …
@ajm188 given https://vitess.slack.com/archives/CMDJ2KFEZ/p1695982611699689, can we say that this PR does not need a separate website docs PR to update the flags?
Yep, that's correct!
Signed-off-by: Olga Shestopalova <oshestopalova@hubspot.com>
Force-pushed 8d4030b to 498d1dc
What else is needed for approval here?
deepthi
left a comment
Nice work! Thank you for the contribution!
Description
When reparenting to a replica, if that replica has recently been restarted, it will have a cold bufferpool; for bufferpool-reliant workloads this means a performance hit for a few minutes until the new primary's bufferpool warms up.
As such, it would be great to have the ability to mirror a certain percentage of SELECTs from the current primary to the replicas at the vtgate level, so that when time comes to reparent, the replicas will have a warmer bufferpool than before and not suffer this consequence.
This adds three vtgate flags that can be used to enable and control mirroring a percentage of SELECTs from the current primary to replicas.
We've been running with this feature at HubSpot for years now and it has significantly improved performance during reparents/rolling restarts. Previously, rolling servers to release a fix or feature was risky and could impact running apps and customers; now it's invisible, largely because we are no longer reparenting to a replica with a cold bufferpool.
Related Issue(s)
Fixes #13205
Checklist
Deployment Notes