VStream: Prevent buffering entire transactions (OOM risk), instead send chunks to client by twthorn · Pull Request #18849 · vitessio/vitess

twthorn · 2025-10-31T22:52:37Z

Description

There is a bug that causes OOM errors for vtgate when a very large transaction (e.g., multi-GB) but with many reasonably sized operations is sent over VStream.

The problem is caused by this logic. We buffer with eventss the entire transaction before sending it. Very large transactions eg multi-GB can cause OOM errors. Example described here

it was from an 11GB size transaction (11 MB per row * 1000 rows). So it seems there was some issue with breaking this transaction up between tablet to vtgate and vtgate to client. We are still looking into it, but wanted to share early findings

This PR aims to fix this by allowing for locking across multiple received event batches from a tablet. And thus allows for sending chunked transactions even before the COMMIT is received, while still preserving the order for the VStream (ie even for multi-shard, the transactions cannot be interleaved, each transaction is sent in its entirety before sending the next transaction of any shard).

I am open to putting this behind a flag. There may be performance implication from this additional locking.

Another approach may be a size in bytes like vstream_packet_size for a tablet, but for the vtgate. If that size in bytes is exceeded a lock is acquired, and then we will start sending the transaction as chunks (and stop accumulating it in memory). Open to discussion on this.

Testing

For reproduction, I added a test that fails without this change (asserts that transactions should NOT be accumulated before sending):

tthornton at tthornt-ltm9dwz in ~/git/twthorn/vitess on fix_vtgate_oom [$]
$ go test -v ./go/vt/vtgate -run TestVStreamLargeTransactionMemory 2>&1 | tail -30
=== RUN   TestVStreamLargeTransactionMemory
E1103 15:49:17.252008   26813 vstream_manager.go:442] Error in vstream for keyspace:"TestVStream" shard:"-20" gtid:"pos": context canceled
context ended while streaming from TestVStream/-20
    vstream_manager_test.go:2403: Max memory used: 57 MB
    vstream_manager_test.go:2407:
                Error Trace:    /Users/tthornton/git/twthorn/vitess/go/vt/vtgate/vstream_manager_test.go:2407
                Error:          "57" is not less than "25"
                Test:           TestVStreamLargeTransactionMemory
                Messages:       Memory usage should stay low due to immediate transaction chunk sending. VTGate should not accumulate all 50 chunks (~50MB) before sending.
--- FAIL: TestVStreamLargeTransactionMemory (0.01s)
FAIL
FAIL    vitess.io/vitess/go/vt/vtgate   1.147s
FAIL

With these changes, the test passes.

Docs PR: vitessio/website#2028

Related Issue(s)

Fixes: Bug Report: VTGates OOM on large transactions #18850

Checklist

"Backport to:" labels have been added if this change should be back-ported to release branches
If this change is to be back-ported to previous releases, a justification is included in the PR description
Tests were added or are not required
Did the new or modified tests pass consistently locally and on CI?
Documentation was added or is not required

Deployment Notes

AI Disclosure

vitess-bot · 2025-10-31T22:52:39Z

…nd chunks to client Signed-off-by: twthorn <thomaswilliamthornton@gmail.com>

codecov · 2025-10-31T23:22:44Z

Codecov Report

❌ Patch coverage is 80.32787% with 12 lines in your changes missing coverage. Please review.
✅ Project coverage is 69.81%. Comparing base (79af4c1) to head (0fa4663).
⚠️ Report is 42 commits behind head on main.

Files with missing lines	Patch %	Lines
go/vt/vtgate/vstream_manager.go	80.32%	12 Missing ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##             main   #18849      +/-   ##
==========================================
+ Coverage   69.73%   69.81%   +0.07%     
==========================================
  Files        1608     1610       +2     
  Lines      214776   215360     +584     
==========================================
+ Hits       149781   150347     +566     
- Misses      64995    65013      +18

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.
📦 JS Bundle Analysis: Save yourself from yourself by tracking and limiting bundle sizes in JS merges.

go/vt/vtgate/vstream_manager.go

Signed-off-by: twthorn <thomaswilliamthornton@gmail.com>

…n/vstream_test Signed-off-by: twthorn <thomaswilliamthornton@gmail.com>

Signed-off-by: twthorn <thomaswilliamthornton@gmail.com>

Copilot

Pull request overview

This PR addresses a critical OOM (Out Of Memory) issue in VTGate when streaming very large transactions (multi-GB) through VStream. The root cause was that VTGate buffered entire transactions in memory before sending them to clients, even when transactions were chunked from tablets.

Key Changes:

Introduces a configurable transaction chunk size threshold (default 128MB) that triggers lock-based contiguous delivery for large transactions
Implements dynamic lock acquisition when transactions exceed the threshold to ensure non-interleaved delivery across shards while still allowing chunked transmission
Adds comprehensive unit and e2e tests to verify chunking behavior and prevent transaction interleaving

Reviewed changes

Copilot reviewed 8 out of 8 changed files in this pull request and generated 2 comments.

Show a summary per file

File	Description
proto/vtgate.proto	Adds `transaction_chunk_size` field to VStreamFlags for configurable chunking threshold
go/vt/proto/vtgate/vtgate.pb.go	Generated protobuf code for the new transaction_chunk_size field
go/vt/proto/vtgate/vtgate_vtproto.pb.go	Generated vtproto code for serialization/deserialization of transaction_chunk_size
go/vt/vtgate/vstream_manager.go	Core implementation: adds transaction state tracking, dynamic lock acquisition for large transactions, and chunked event sending
go/vt/vtgate/vstream_manager_test.go	New unit test verifying that large transactions from one shard don't interleave with events from other shards
go/test/endtoend/vreplication/vstream_test.go	Updates e2e tests with 1KB chunk size to ensure chunking is actually tested
go/test/endtoend/vreplication/initial_data_test.go	Adds helper function to insert large transactions for testing chunking behavior
go/test/endtoend/vreplication/vreplication_test.go	Minor refactoring to move connection initialization earlier in test function

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

go/vt/vtgate/vstream_manager.go

proto/vtgate.proto

mattlord · 2025-11-25T19:53:15Z

I also added the needs website docs label as we'll need to update this page: https://vitess.io/docs/24.0/reference/vreplication/vstream/

Signed-off-by: twthorn <thomaswilliamthornton@gmail.com>

go/vt/vtgate/vstream_manager.go

mattlord · 2025-11-25T19:47:38Z

go/vt/vtgate/vstream_manager.go

+				// Large incomplete transaction detected - acquire lock to prevent interleaving
+				// Lock will be held across subsequent callbacks until transaction completes


At this point there may already be interleaved events, no? Is that a problem?

There will actually not be interleaved events.

This is because current/default behavior is to send transactions atomically, only once all events from BEGIN to COMMIT are accumulated.

There are two cases when we reach this code:

The lock is not held - this means that other streams are sending transactions atomically. When we go to acquire the lock, atomic transactions may have completed just before us. But they are atomic so it's fine. All other streams will be halted while we send our chunked transaction.

The lock is held - this means another shard is holding the lock and chunking. In this case we wait until that shard has finished its transaction. Then we acquire the lock, thus the complete, chunked transaction of another shard will be sent prior to us beginning to chunk our shard's transaction.

In either case, the events between different transactions of different shards are not interleaved. The only interleaving happens at the transaction-level, ie whole transactions interleaved across shards (not inter-transaction event level)

go/vt/vtgate/vstream_manager.go

Signed-off-by: twthorn <thomaswilliamthornton@gmail.com>

mattlord

I'm approving now, although I'd like to discuss my comments before we merge (still need a second reviewer). Please let me know what you think.

Nice work on this, @twthorn ! I appreciate your patience and persistence on this. ❤️

go/vt/vtgate/vstream_manager.go

mattlord · 2025-12-04T19:12:03Z

go/vt/vtgate/vstream_manager.go

+// defaultTransactionChunkSizeBytes is the default threshold for chunking transactions.
+// 0 (the default value for protobuf int64) means disabled, clients must explicitly set a value to opt in for chunking.
+const defaultTransactionChunkSizeBytes = 0


Do we still need this now that the flag's default is also 0? I guess it makes it cleaner when we later make it opt-out. Totally fine to leave it here.

Yes, I think it's good to have it there for future ease of updating, and i added a comment related to that. It also makes the code explicit and self documenting

Signed-off-by: twthorn <thomaswilliamthornton@gmail.com>

…nd chunks to client (vitessio#18849) Signed-off-by: twthorn <thomaswilliamthornton@gmail.com>

#764) * VStream: Prevent buffering entire transactions (OOM risk), instead send chunks to client (vitessio#18849) Signed-off-by: twthorn <thomaswilliamthornton@gmail.com> * Fix static code checks Signed-off-by: twthorn <thomaswilliamthornton@gmail.com> * Remove utils import Signed-off-by: twthorn <thomaswilliamthornton@gmail.com> * Fix keyspaces to watch test Signed-off-by: twthorn <thomaswilliamthornton@gmail.com> --------- Signed-off-by: twthorn <thomaswilliamthornton@gmail.com> Co-authored-by: Tanjin Xu <109303790+tanjinx@users.noreply.github.com>

…nd chunks to client (vitessio#18849) Signed-off-by: twthorn <thomaswilliamthornton@gmail.com>

…nd chunks to client (vitessio#18849) Signed-off-by: twthorn <thomaswilliamthornton@gmail.com> Signed-off-by: Thomas Thornton <thomaswilliamthornton@gmail.com>

* Improve cgroup metric management (vitessio#18791) Signed-off-by: Matt Lord <mattalord@gmail.com> Signed-off-by: Thomas Thornton <thomaswilliamthornton@gmail.com> * VStream: Prevent buffering entire transactions (OOM risk), instead send chunks to client (vitessio#18849) Signed-off-by: twthorn <thomaswilliamthornton@gmail.com> Signed-off-by: Thomas Thornton <thomaswilliamthornton@gmail.com> * Run VStream copy only when VGTID requires it, use TablesToCopy in those cases (vitessio#18938) Signed-off-by: twthorn <thomaswilliamthornton@gmail.com> Signed-off-by: Thomas Thornton <thomaswilliamthornton@gmail.com> Signed-off-by: Thomas Thornton <thomaswilliamthornton@gmail.com> * Regenerate vtgate.pb.go proto file Signed-off-by: Thomas Thornton <thomaswilliamthornton@gmail.com> * Fix tests Signed-off-by: Thomas Thornton <thomaswilliamthornton@gmail.com> * Complete PR vitessio#18791 backport: Update metrics_cgroup.go Apply missing changes from PR vitessio#18791 to metrics_cgroup.go: - Replace cgroup1Manager and cgroup2Manager with single cgroupManager - Add errCgroupMetricsNotAvailable error variable - Add sync.Once for lazy initialization - Remove cgroup v1 support, only support cgroup v2 - Simplify implementation with unified cgroup manager This fixes compilation errors in metrics_cgroup_test.go. * Add missing github.com/containerd/cgroups dependency Required by metrics_cgroup.go for cgroup v1/v2 support. Signed-off-by: Thomas Thornton <thomaswilliamthornton@gmail.com> * Fix cgroups import to use v3 The v1 cgroups package is incompatible with Go 1.24.10. Use cgroups/v3 consistently throughout the file. Signed-off-by: Thomas Thornton <thomaswilliamthornton@gmail.com> * Fix goimports formatting Signed-off-by: Thomas Thornton <thomaswilliamthornton@gmail.com> --------- Signed-off-by: Thomas Thornton <thomaswilliamthornton@gmail.com>

twthorn requested review from beingnoble03, mattlord, rohit-nayak-ps and shlomi-noach as code owners October 31, 2025 22:52

github-actions bot added this to the v24.0.0 milestone Oct 31, 2025

VStream: Prevent buffering entire transactions (OOM risk), instead se…

80fb058

…nd chunks to client Signed-off-by: twthorn <thomaswilliamthornton@gmail.com>

twthorn force-pushed the fix_vtgate_oom branch from 23bdc58 to 80fb058 Compare October 31, 2025 22:57

mattlord reviewed Nov 3, 2025

View reviewed changes

go/vt/vtgate/vstream_manager.go Show resolved Hide resolved

twthorn added 2 commits November 3, 2025 15:49

VStream: Add large transaction test for VTGate

9b5b92f

Signed-off-by: twthorn <thomaswilliamthornton@gmail.com>

VStream: Add transaction chunk size flag for VTGate

b2ff2d2

Signed-off-by: twthorn <thomaswilliamthornton@gmail.com>

twthorn requested a review from harshit-gangal as a code owner November 4, 2025 01:53

Fix lint static code checks

840b95c

Signed-off-by: twthorn <thomaswilliamthornton@gmail.com>

twthorn requested a review from mattlord November 4, 2025 02:32

twthorn added 5 commits November 13, 2025 12:07

Test transaction chunking in e2e tests, simplify lock logic

0c9ff21

Signed-off-by: twthorn <thomaswilliamthornton@gmail.com>

Merge branch 'main' into fix_vtgate_oom

ac0f86e

Signed-off-by: twthorn <thomaswilliamthornton@gmail.com>

Add large transaction to trigger chunking in all tests in vreplicatio…

b6d9b5e

…n/vstream_test Signed-off-by: twthorn <thomaswilliamthornton@gmail.com>

Fit data within column limits for e2e vstream tests

5e46b81

Signed-off-by: twthorn <thomaswilliamthornton@gmail.com>

Regenerate proto files

33b9bf1

Signed-off-by: twthorn <thomaswilliamthornton@gmail.com>

Copilot finished reviewing on behalf of mattlord November 25, 2025 14:20

Copilot AI reviewed Nov 25, 2025

View reviewed changes

go/vt/vtgate/vstream_manager.go Outdated Show resolved Hide resolved

proto/vtgate.proto Outdated Show resolved Hide resolved

mattlord added the NeedsWebsiteDocsUpdate What it says label Nov 25, 2025

Handle rollback, clean up comments used when debugging

ccace81

Signed-off-by: twthorn <thomaswilliamthornton@gmail.com>

twthorn mentioned this pull request Nov 26, 2025

Add docs for TransactionChunkSize vitessio/website#2028

Merged

twthorn removed the NeedsWebsiteDocsUpdate What it says label Nov 26, 2025

Regenerate proto files

a3bde55

Signed-off-by: twthorn <thomaswilliamthornton@gmail.com>

twthorn added the NeedsWebsiteDocsUpdate What it says label Nov 26, 2025

mattlord reviewed Nov 30, 2025

View reviewed changes

Prevent minimize skew with transaction chunking, add comments and logs

c05ae93

Signed-off-by: twthorn <thomaswilliamthornton@gmail.com>

twthorn requested a review from mattlord December 2, 2025 20:42

twthorn added 3 commits December 3, 2025 10:14

Make VStream metrics test less brittle

f392c83

Signed-off-by: twthorn <thomaswilliamthornton@gmail.com>

Release lock after VStream ends

dc91e10

Signed-off-by: twthorn <thomaswilliamthornton@gmail.com>

Make transaction chunking opt-in

c334f00

Signed-off-by: twthorn <thomaswilliamthornton@gmail.com>

mattlord removed the NeedsWebsiteDocsUpdate What it says label Dec 4, 2025

mattlord approved these changes Dec 4, 2025

View reviewed changes

Clean up handling of chuking and minimize skew, add comments

0fa4663

Signed-off-by: twthorn <thomaswilliamthornton@gmail.com>

nickvanw approved these changes Dec 5, 2025

View reviewed changes

mattlord merged commit 488ef3d into vitessio:main Dec 5, 2025
103 of 105 checks passed

twthorn added a commit to slackhq/vitess that referenced this pull request Dec 11, 2025

VStream: Prevent buffering entire transactions (OOM risk), instead se…

b78d5d7

…nd chunks to client (vitessio#18849) Signed-off-by: twthorn <thomaswilliamthornton@gmail.com>

twthorn mentioned this pull request Dec 11, 2025

VStream: Prevent buffering entire transactions (OOM risk), instead se… slackhq/vitess#764

Merged

5 tasks

twthorn mentioned this pull request Jan 9, 2026

Support for transaction chunking in vitess-connector debezium/dbz#1519

Open

twthorn added a commit to slackhq/vitess that referenced this pull request Jan 28, 2026

VStream: Prevent buffering entire transactions (OOM risk), instead se…

ea160b5

…nd chunks to client (vitessio#18849) Signed-off-by: twthorn <thomaswilliamthornton@gmail.com>

twthorn mentioned this pull request Jan 28, 2026

[slack-22.0] V22 cdc backports round two slackhq/vitess#784

Merged

5 tasks

		// Large incomplete transaction detected - acquire lock to prevent interleaving
		// Lock will be held across subsequent callbacks until transaction completes

Conversation

twthorn commented Oct 31, 2025 • edited by mattlord Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Testing

Related Issue(s)

Checklist

Deployment Notes

AI Disclosure

Uh oh!

vitess-bot bot commented Oct 31, 2025

Review Checklist

General

Tests

Documentation

New flags

If a workflow is added or modified:

Backward compatibility

Uh oh!

codecov bot commented Oct 31, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Uh oh!

Uh oh!

mattlord commented Nov 25, 2025

Uh oh!

Uh oh!

Uh oh!

mattlord Nov 25, 2025

Choose a reason for hiding this comment

Uh oh!

twthorn Dec 2, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

mattlord left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

mattlord Dec 4, 2025

Choose a reason for hiding this comment

Uh oh!

twthorn Dec 4, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

twthorn commented Oct 31, 2025 •

edited by mattlord

Loading

codecov bot commented Oct 31, 2025 •

edited

Loading

twthorn Dec 2, 2025 •

edited

Loading