
RowConverter keeps growing in size while merging streams on high-cardinality dictionary fields #7200

Closed
JayjeetAtGithub opened this issue Aug 4, 2023 · 10 comments · Fixed by #7401
Labels
bug Something isn't working

Comments

@JayjeetAtGithub
Contributor

Describe the bug

When executing SortPreservingMerge over multiple streams of record batches with a high-cardinality dictionary field as the sort key, the RowConverter instance used to merge the RowCursorStreams keeps growing in memory, because it accumulates the dictionary mappings internally in its OrderPreservingInterner structure. This unbounded memory growth eventually causes DataFusion to be killed by the OOM killer.

To Reproduce

Detailed steps to reproduce this issue are given here.
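
For illustration, here is a minimal standalone sketch of the growth at the RowConverter level (a hedged example, assuming arrow-row ~45, where convert_columns takes &mut self and interns dictionary values; the dict_batch helper is made up for the example):

use std::sync::Arc;

use arrow_array::types::Int32Type;
use arrow_array::{ArrayRef, DictionaryArray, Int32Array, StringArray};
use arrow_row::{RowConverter, SortField};
use arrow_schema::{ArrowError, DataType};

// Hypothetical helper: every batch contains `len` brand-new dictionary
// values, so each call forces new entries into the interner.
fn dict_batch(start: i32, len: i32) -> ArrayRef {
    let keys = Int32Array::from_iter_values(0..len);
    let values =
        StringArray::from_iter_values((start..start + len).map(|i| format!("val-{i}")));
    Arc::new(DictionaryArray::<Int32Type>::new(keys, Arc::new(values)))
}

fn main() -> Result<(), ArrowError> {
    let mut converter = RowConverter::new(vec![SortField::new(DataType::Dictionary(
        Box::new(DataType::Int32),
        Box::new(DataType::Utf8),
    ))])?;

    for batch in 0..100 {
        let _rows = converter.convert_columns(&[dict_batch(batch * 1024, 1024)])?;
        // size() keeps growing because the interned dictionary mappings
        // are never released
        println!("after batch {batch}: converter size = {} bytes", converter.size());
    }
    Ok(())
}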

Expected behavior

SortPreservingMerge on streams of record batches with high-cardinality dictionary-encoded sort keys should be memory-aware and keep memory usage within a user-defined limit.

Additional context

Possible solution:

  1. Keep track of the RowConverter's memory usage via its size() method, which for dictionary fields includes the size of the OrderPreservingInterner.
  2. If the size of the RowConverter grows beyond a user-defined memory limit, take note of the RowCursorStreams that are still being converted, delete the converter, create a new one, and redo the aborted conversions (see the hedged sketch below).
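
A hedged sketch of what step 2 could look like at the converter level; convert_with_limit, memory_limit, and pending_columns are illustrative names rather than existing DataFusion APIs, and re-encoding the rows already produced by the old converter is omitted:

use arrow_array::ArrayRef;
use arrow_row::{RowConverter, Rows, SortField};
use arrow_schema::ArrowError;

// Hypothetical sketch: recreate the RowConverter once its reported size
// exceeds a user-defined limit, then continue converting with the fresh
// converter. Rows produced by the old converter would have to be
// re-encoded before they can be compared with the new ones (not shown).
fn convert_with_limit(
    converter: &mut RowConverter,
    sort_fields: &[SortField],
    memory_limit: usize,
    pending_columns: &[Vec<ArrayRef>],
) -> Result<Vec<Rows>, ArrowError> {
    let mut out = Vec::with_capacity(pending_columns.len());
    for columns in pending_columns {
        // 1. Track the interner growth via RowConverter::size()
        if converter.size() > memory_limit {
            // 2. Drop the bloated converter and start over with a new one
            *converter = RowConverter::new(sort_fields.to_vec())?;
        }
        out.push(converter.convert_columns(columns)?);
    }
    Ok(out)
}
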
JayjeetAtGithub added the bug (Something isn't working) label on Aug 4, 2023

@alamb
Contributor

alamb commented Aug 9, 2023

I was thinking about how to do this -- one thought I had was to "rewrite" any existing Rows using a RowConverter / convert_rows https://docs.rs/arrow-row/45.0.0/arrow_row/struct.RowConverter.html#method.convert_rows

Something like

let mut old_converter: RowConverter = ...;
let old_rows: Rows = old_converter.convert_columns(&input)?;
...
// now we need to rewrite to use `new_converter`:
let mut new_converter: RowConverter = ...;
let new_rows: Rows = new_converter.convert_columns(
  // convert old Rows back to Arrays
  &old_converter.convert_rows(old_rows.iter())?
)?;

So in that way you don't need to keep around the original input columns

You have probably already thought of this, but I figured I would write it down

@alamb
Contributor

alamb commented Aug 15, 2023

@wiedld, @tustvold @crepererum @JayjeetAtGithub and I had a discussion and here are some notes:

The proposal is to look at the input before starting the merge or converting any rows, and to change how the row converter works for high-cardinality dictionaries.

The assumption is that for low-cardinality dictionaries (a small number of distinct values), using preserve_dictionaries is important for performance, but for high-cardinality dictionaries (a large number of distinct values), using preserve_dictionaries not only consumes large amounts of memory as described in this ticket, but will also be slower, as the size of the interned keys will be substantial.

If we do not use preserve_dictionaries, the RowInterner will no longer keep a mapping, and thus the memory consumption will not grow.

So specifically this would look like:

  1. Based on some heuristic, if the dictionary is high cardinality, then use the normal string encoding (set preserve_dictionaries to false)
  2. If the dictionary is low cardinality, then use the dictionary encoding (set preserve_dictionaries to true, the default); a sketch of both cases follows
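
A minimal sketch of choosing the encoding per column (not the actual fix; it assumes the preserve_dictionaries setting mentioned above is exposed as a builder method on arrow-row's SortField, and that the high-cardinality flags come from whatever heuristic is chosen below):

use arrow_row::{RowConverter, SortField};
use arrow_schema::{ArrowError, DataType};

// Hypothetical: build a RowConverter whose dictionary columns use the
// interned (dictionary-preserving) encoding only when cardinality is low.
fn make_converter(
    sort_types: &[DataType],
    is_high_cardinality: &[bool],
) -> Result<RowConverter, ArrowError> {
    let fields: Vec<SortField> = sort_types
        .iter()
        .zip(is_high_cardinality)
        .map(|(data_type, &high_cardinality)| {
            // High cardinality: normal value encoding (preserve_dictionaries = false).
            // Low cardinality: keep the default dictionary-preserving encoding.
            SortField::new(data_type.clone()).preserve_dictionaries(!high_cardinality)
        })
        .collect();
    RowConverter::new(fields)
}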

Open questions:

  1. What heuristic to use to determine high cardinality (The heuristic needs to be reasonably fast / memory efficient to compute)
  2. Can we improve the performance of preserve_dictionaries=false conversion? (See ticket: Improve the performance of "DictionaryValue" row encoding, arrow-rs#4712)
  3. How to verify this doesn't cause a performance regression

Other options we discussed:

  1. Try updating the state of the RowConverter to prune out unused entries (not clear we could make this work)
  2. Recreate the RowConverter
  3. Use the "non dictionary encoding" mode (what is described above)

@alamb
Contributor

alamb commented Aug 17, 2023

I filed apache/arrow-rs#4712 to track a possible performance improvement

@alamb
Contributor

alamb commented Aug 23, 2023

@JayjeetAtGithub -- in terms of calculating "high cardinality" dictionaries, perhaps we can use some sort of heuristic like "total number of distinct values used in the dictionary is greater than N", where "N" is a constant like 8 or 32 (maybe @tustvold has some thoughts on the right values to use).

You can find the number of values used with this method: https://docs.rs/arrow/latest/arrow/array/struct.DictionaryArray.html#method.occupancy

(and then compute the number of set bits)
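
A hedged sketch of that heuristic, with the threshold left as a parameter since the right constant is still an open question:

use arrow_array::types::ArrowDictionaryKeyType;
use arrow_array::DictionaryArray;

// Count how many distinct dictionary values are actually referenced by
// the keys and compare against a constant threshold such as 8 or 32.
fn is_high_cardinality<K: ArrowDictionaryKeyType>(
    array: &DictionaryArray<K>,
    threshold: usize,
) -> bool {
    // occupancy() returns a BooleanBuffer with one set bit per
    // dictionary value referenced by at least one key
    array.occupancy().count_set_bits() > threshold
}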

@JayjeetAtGithub
Contributor Author

Sounds good!

@alamb
Contributor

alamb commented Aug 24, 2023

@JayjeetAtGithub I was thinking about this issue after some analysis I did on https://github.com/influxdata/influxdb_iox/issues/8568. My observation is that the RowConverter memory consumption explodes for high-cardinality dictionaries wherever it is used (not just in merge). Now that I type it out, it seems obvious 😆

Thus it seems like it might be a good pattern to encapsulate / reuse the logic with some sort of wrapper around the row converter. Maybe something like:

/// Wrapper around a RowConverter that automatically
/// picks the appropriate dictionary encoding
struct DataFusionRowConverter {
  inner: Option<RowConverter>,
}

impl DataFusionRowConverter {
  pub fn convert_columns(
    &mut self,
    columns: &[ArrayRef],
  ) -> Result<Rows, ArrowError> {
    if self.inner.is_none() {
      // Check the arrays, detect high-cardinality dictionaries,
      // and fall back to normal encoding for that case
    }
    // after the first batch, use the pre-configured row converter
    self.inner.as_mut().unwrap().convert_columns(columns)
  }
}

@JayjeetAtGithub
Contributor Author

JayjeetAtGithub commented Aug 24, 2023

Quick question: are we implementing this wrapper inside arrow-rs or arrow-datafusion? When doing it inside DataFusion, I get a lot of private-field errors.

I think arrow-rs is the right place.

@alamb
Contributor

alamb commented Aug 25, 2023

Quick question: are we implementing this wrapper inside arrow-rs or arrow-datafusion? When doing it inside DataFusion, I get a lot of private-field errors.

I think it can be done in either repo, but the code would look different depending on where it is.

@alamb
Contributor

alamb commented Aug 25, 2023

TLDR I left some comments on the PRs -- they are looking good. I think we should put the code into DataFusion to start.
