Postgres: avoid tiny writes to improve performance in read-heavy scenarios#29812
Conversation
@smallinsky FYI.
@smallinsky e352893 implements the message parsing loop in a separate goroutine, enabling analysis. No performance impact observed.
To count the messages, consume the copied message stream in a separate goroutine.
@Tener
Yep, working on it right now. I just made a repo with my scripts: https://github.com/Tener/teleport-bench-dbs. I'll reference it in my findings.
rosstimothy
left a comment
Can we write a benchmark test that exercises this before and after the change so we have some tangible numbers on just how much better this solution is?
@rosstimothy Sure, I just added the results from the benchmark I've been using throughout this work.
Co-authored-by: rosstimothy <39066650+rosstimothy@users.noreply.github.com>
That's not quite what I was talking about. The numbers look good, but it'd be nice if we had a
That would be nice indeed, and we do have a relevant RFD open. It isn't implemented, however, which is why I went with custom scripts.
I don't think that RFD applies. That one is about implementing tests to exercise db access via
I can write a limited benchmark, for example exercising only [...]. I believe that the tests proposed in RFD#141 will ultimately be much more useful, being end-to-end and representing actual real-world workloads.
@rosstimothy I've added the benchmark to the PR. The results are in the PR description.
rosstimothy
left a comment
Thanks for the benchmark test! LGTM other than a few nits.
Co-authored-by: rosstimothy <39066650+rosstimothy@users.noreply.github.com>
This change is analogous to the Postgres change made in #29812.
Currently, for Postgres, we read individual server messages one by one before forwarding them to the client, also one by one. This limits performance when the messages are small, as sending them in this manner is costly: each takes at least one syscall. In particular, this can be observed in read-heavy scenarios, such as `pg_dump` or `SELECT * FROM large_table`.

This PR restructures the handling of the server byte stream. The main driver is an `io.Copy` call, which performs well thanks to the large buffers involved. For analysis purposes, we copy the byte stream for consumption in a separate goroutine, which counts the parsed messages. The byte stream is copied synchronously courtesy of `io.Pipe`, which ensures the stream does not consume significant memory.

I have tested the performance of this part of the code using home-grown scripts available here.
I ran the tests on my local MacBook, with native Postgres (`postgres (PostgreSQL) 15.3 (Homebrew)`):

- `make read-benchmark` on `master` @ 9960697d35
- `make read-benchmark` on `tener/postgres-io-copy` @ bf8ca89573

`master` is CPU-bound, limited by single-core performance on the Teleport side, and is ~13.7 times slower than native performance (without Teleport). This PR closes the gap by a large margin, leaving only 15% overhead.

Results from the synthetic benchmark, also added in the PR:
Contrast with `master`: for this benchmark, the new code achieves a speedup of at least 1.38x, or as much as 192x, depending on the message size. Running the benchmark with `-benchmem` (results not shown for brevity) indicates the updated code allocates less memory in fewer individual allocations, ultimately leading to decreased GC pressure.

Changelog: Postgres: improve performance in read-heavy scenarios.
Fixes #26868.