Reduce memory usage in writer with more memory efficient output buffer implementation by chenyangfb · Pull Request #24913 · prestodb/presto

chenyangfb · 2025-04-14T15:11:05Z

Description

Currently ChunkedSliceOutput is used for storing compressed output in writer. It managed list of buffers with size of power of 2 (e.g. 8k, 16k, 32k), and reuse buffers after flushing. It could leads to extra memory usage and OOM due to 1) mismatch in compressed output size and buffer size, 2) reusing buffers and not freeing buffers leads to extra memory usage by design.

Common scenario which leads to OOM includes

large number of streams with small amount of data (100k stream with 1k compressed bytes), each using minimal buffer size (e.g. 8k)
Each stream is wasting half of largest buffer (e.g. 8M out of 16M buffer)
Writer memory usage is high even after flushing (Reduce memory usage in writer by freeing unused buffers #23724 support freeing unused buffer in chunk supplier during reset)

This PR introduce OrcLazyChunkedOutputBuffer which focus on avoiding used memory.

Create buffer size based on the size of compressed output, this avoid the issue 1) and 2) mentioned above
lazy initialization in OrcLazyChunkedOutputBuffer and OrcOutputBuffer
Reset all the closed buffers, only keep the active buffer.

This behavior is controlled by lazyOutputBuffer in OrcWriterOptions, and it's disabled by default.

Impact

Reduce memory usage in writer.

Test Plan

Tested with Spark workload with high memory usage.
~10% improvement in run time and resource usage (memory reservation time), reduced GC time.
Tested with general Spark workload
No change in cpu time, slight reduction in run time and GC time.

== RELEASE NOTES ==
General Changes
* Improve efficiency of output buffer implementation to reduce memory usage in writer.

steveburnett · 2025-04-21T15:51:25Z

Thanks for the release note! Formatting and rephrasing suggestions to help follow the Release Notes Guidelines:

Release Notes

General Changes
* Improve efficiency of output buffer implementation to reduce memory usage in writer.

chenyangfb · 2025-04-22T11:55:08Z

Thanks for the release note! Formatting and rephrasing suggestions to help follow the Release Notes Guidelines:
Release Notes

General Changes
* Improve efficiency of output buffer implementation to reduce memory usage in writer. 

Thanks. updated

steveburnett · 2025-04-30T19:33:03Z

Please format the release note with a row of three ` above and below, like this:

== RELEASE NOTES ==

General Changes
* Improve efficiency of output buffer implementation to reduce memory usage in writer.

tdcmeehan · 2025-05-19T16:27:43Z

presto-orc/src/main/java/com/facebook/presto/orc/ChunkedSliceOutput.java

+                        bufferPool.size(),
+                        bufferPool.stream().mapToInt(b -> b.length).sum());
+                bufferPool.clear();
+                System.setProperty("RESET_OUTPUT_BUFFER", "RESET_OUTPUT_BUFFER");


What does this do?

sdruzkin · 2025-06-05T17:00:34Z

Can you please rebase it? Also would be nice to run this change in Vader to get 1-2k samples. I'll do a second round after that.

chenyangfb · 2025-06-06T16:10:54Z

Rebased. What is Vader? How do I run this change in Vader?

Can you please rebase it? Also would be nice to run this change in Vader to get 1-2k samples. I'll do a second round after that.

chenyangfb · 2025-06-09T17:39:03Z

Discussed with Sergii offline, ran Validation Service (Vader), results looks good, ~2k successful samples without failures https://fburl.com/scuba/dwrf_reader_checksum/dzgyy5xi

sdruzkin · 2025-06-26T19:28:31Z

LGTM, please do the minor change I requested and we will land it.

chenyangfb · 2025-07-03T21:39:01Z

rebased

sdruzkin · 2025-06-26T19:25:24Z

presto-orc/src/main/java/com/facebook/presto/orc/OrcLazyChunkedOutputBuffer.java

+    @Override
+    public void writeHeader(int value)
+    {
+        buffer[bufferPosition] = (byte) (value & 0x00_00FF);


Please change to a more concise:

buffer[bufferPosition++] = (byte) (value & 0x00_00FF); buffer[bufferPosition++] = (byte) ((value & 0x00_FF00) >> 8); buffer[bufferPosition++] = (byte) ((value & 0xFF_0000) >> 16);

chenyangfb force-pushed the orc_output_buffer branch 4 times, most recently from b88e026 to 04a0df7 Compare April 14, 2025 17:13

chenyangfb changed the title ~~Add OrcLazyChunkedOutputBuffer which is more memory efficient~~ Reduce memory usage in writer with more memory efficient output buffer implementation Apr 14, 2025

chenyangfb force-pushed the orc_output_buffer branch from 04a0df7 to 4d7394d Compare April 14, 2025 18:15

chenyangfb marked this pull request as ready for review April 14, 2025 18:18

chenyangfb requested review from a team and sdruzkin as code owners April 14, 2025 18:18

chenyangfb requested a review from presto-oss April 14, 2025 18:18

chenyangfb force-pushed the orc_output_buffer branch from 4d7394d to 4b14d15 Compare April 18, 2025 23:52

chenyangfb force-pushed the orc_output_buffer branch 2 times, most recently from 5eeb72e to 48e410b Compare May 5, 2025 21:56

tdcmeehan reviewed May 19, 2025

View reviewed changes

chenyangfb force-pushed the orc_output_buffer branch from 48e410b to 8323da4 Compare June 6, 2025 16:08

chenyangfb force-pushed the orc_output_buffer branch from 8323da4 to 7bc6321 Compare June 13, 2025 22:37

Add OrcLazyChunkedOutputBuffer which is more memory efficient

5252922

chenyangfb force-pushed the orc_output_buffer branch from 7bc6321 to 5252922 Compare July 3, 2025 21:37

sdruzkin approved these changes Jul 5, 2025

View reviewed changes

sdruzkin merged commit 75243fd into prestodb:master Jul 5, 2025
108 checks passed

unidevel mentioned this pull request Jul 11, 2025

Add release notes for 0.294 unix280/presto#37

Closed

7 tasks

unidevel mentioned this pull request Jul 24, 2025

Add release notes for 0.294 unix280/presto#39

Merged

7 tasks

unidevel mentioned this pull request Jul 27, 2025

Add release notes for 0.294 unix280/presto#40

Merged

9 tasks

prestodb-ci mentioned this pull request Jul 28, 2025

Add release notes for 0.294 #25633

Merged

6 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Reduce memory usage in writer with more memory efficient output buffer implementation #24913

Reduce memory usage in writer with more memory efficient output buffer implementation #24913
sdruzkin merged 1 commit intoprestodb:masterfrom
chenyangfb:orc_output_buffer

chenyangfb commented Apr 14, 2025 •

edited

Loading

Uh oh!

steveburnett commented Apr 21, 2025

Uh oh!

chenyangfb commented Apr 22, 2025

Uh oh!

steveburnett commented Apr 30, 2025

Uh oh!

tdcmeehan May 19, 2025

Uh oh!

sdruzkin commented Jun 5, 2025

Uh oh!

chenyangfb commented Jun 6, 2025

Uh oh!

chenyangfb commented Jun 9, 2025

Uh oh!

sdruzkin commented Jun 26, 2025 •

edited

Loading

Uh oh!

chenyangfb commented Jul 3, 2025

Uh oh!

sdruzkin Jun 26, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

chenyangfb commented Apr 14, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Impact

Test Plan

Uh oh!

steveburnett commented Apr 21, 2025

Uh oh!

chenyangfb commented Apr 22, 2025

Uh oh!

steveburnett commented Apr 30, 2025

Uh oh!

tdcmeehan May 19, 2025

Choose a reason for hiding this comment

Uh oh!

sdruzkin commented Jun 5, 2025

Uh oh!

chenyangfb commented Jun 6, 2025

Uh oh!

chenyangfb commented Jun 9, 2025

Uh oh!

sdruzkin commented Jun 26, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

chenyangfb commented Jul 3, 2025

Uh oh!

sdruzkin Jun 26, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

chenyangfb commented Apr 14, 2025 •

edited

Loading

sdruzkin commented Jun 26, 2025 •

edited

Loading