Skip to content

Collect data size statistic #11185

Merged
arhimondr merged 0 commit intoprestodb:masterfrom
arhimondr:collect-colunm-data-size
Aug 6, 2018
Merged

Collect data size statistic #11185
arhimondr merged 0 commit intoprestodb:masterfrom
arhimondr:collect-colunm-data-size

Conversation

@arhimondr
Copy link
Member

In the Hive connector for CHAR, VARCHAR, VARBINARY columns.

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

should be only non-null rows

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

averageColumnLength only includes non-null rows.

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This name sounds strange, how about TOTAL_SIZE_IN_BYTES

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Similar to the enum, let's drop the values part and call this totalSizeInBytes everywhere?

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

As a separate refactor, it might be nice to introduce a constant for the name:

@AggregationFunction(value = SumDataSizeForStats.NAME, hidden = true)
public final class SumDataSizeForStats
{
    public static final String NAME = "$internal$sum_data_size_for_stats";
    ...

@arhimondr arhimondr force-pushed the collect-colunm-data-size branch from c5af79f to 51a1064 Compare August 4, 2018 13:56
@arhimondr
Copy link
Member Author

@electrum @rschlussel Comments addressed

Copy link
Contributor

@rschlussel rschlussel left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Check the travis build. The failure looks related.

@arhimondr arhimondr force-pushed the collect-colunm-data-size branch from 51a1064 to 35b77b3 Compare August 6, 2018 14:15
@arhimondr arhimondr closed this Aug 6, 2018
@arhimondr arhimondr force-pushed the collect-colunm-data-size branch from 35b77b3 to 9b31a8c Compare August 6, 2018 14:56
@arhimondr arhimondr deleted the collect-colunm-data-size branch August 6, 2018 14:57
@arhimondr arhimondr merged commit 9b31a8c into prestodb:master Aug 6, 2018
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants