Improve BenchmarkPartitionedOutputOperator by lukasz-stec · Pull Request #12234 · trinodb/trino

lukasz-stec · 2022-05-04T07:20:20Z

Description

Introduce a consistent TestType naming convention.
Add descriptions to the TestType cases.
Add more Dictionary test cases.
Add RowType with RLE field case.

Extracted (except for the docs commit) from #11289.

Is this change a fix, improvement, new feature, refactoring, or other?

benchmark refactoring + documentation

Is this a change to the core query engine, a connector, client library, or the SPI interfaces? (be specific)

only BenchmarkPartitionedOutputOperator

How would you describe this change to a non-technical end user or system administrator?

Improved code readability

Related issues, pull requests, and links

Documentation

( ) No documentation is needed.
( ) Sufficient documentation is included in this PR.
( ) Documentation PR is available with #prnumber.
( ) Documentation issue #issuenumber is filed, and can be handled later.

Release notes

( ) No release notes entries required.
( ) Release notes entries required with the following suggested text:

# Section
* Fix some things. ({issue}`issuenumber`)

sopel39

lgtm % nits (optional)

sopel39 · 2022-05-04T12:22:31Z

core/trino-main/src/test/java/io/trino/operator/output/BenchmarkPartitionedOutputOperator.java

nit: that should be separate commit

sopel39 · 2022-05-04T12:27:14Z

core/trino-main/src/test/java/io/trino/operator/output/BenchmarkPartitionedOutputOperator.java

nit: it would be better to use composition instead of overriding createPage. Right now you pass some arguments to TestType, but you also optionally override a method. With composition, you could reuse createRandomDictionaryPage rather than override it every time. Composition is also cleaner and more consistent than inheritance

makes sense. I refactored it to use new PageGenerator interface
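
For readers following this thread, the composition approach discussed here can be sketched roughly as follows. This is a minimal illustration, not the code from the PR: the `PageGenerator` signature, the use of `long[]` as a stand-in for a Trino page, and the helper names `randomValues` and `dictionaryValues` are all assumptions made for the example.

```java
import java.util.Random;

// Illustrative sketch only; the real benchmark builds Trino Page objects.
class PageGeneratorSketch {
    // Hypothetical shape of the PageGenerator interface (actual signature is not shown in the thread).
    @FunctionalInterface
    interface PageGenerator {
        long[] createPage(int positionCount, Random random);
    }

    // Generator producing fully random values.
    static PageGenerator randomValues() {
        return (positionCount, random) -> random.longs(positionCount).toArray();
    }

    // Generator drawing every value from a small dictionary of the given size.
    static PageGenerator dictionaryValues(int dictionarySize) {
        return (positionCount, random) -> {
            long[] dictionary = random.longs(dictionarySize).toArray();
            long[] page = new long[positionCount];
            for (int i = 0; i < positionCount; i++) {
                page[i] = dictionary[random.nextInt(dictionarySize)];
            }
            return page;
        };
    }

    // Each case is configured with a generator instead of overriding createPage.
    enum TestType {
        BIGINT(randomValues()),
        DICTIONARY_BIGINT(dictionaryValues(4));

        private final PageGenerator pageGenerator;

        TestType(PageGenerator pageGenerator) {
            this.pageGenerator = pageGenerator;
        }

        long[] createPage(int positionCount) {
            // Fixed seed keeps the sketch deterministic.
            return pageGenerator.createPage(positionCount, new Random(42));
        }
    }

    public static void main(String[] args) {
        for (TestType type : TestType.values()) {
            System.out.println(type + " -> " + type.createPage(8).length + " positions");
        }
    }
}
```

With this shape, a new test case only needs to pick (or parameterize) an existing generator, so a helper such as the dictionary generator is written once and shared across cases.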

sopel39 · 2022-05-04T12:28:18Z

core/trino-main/src/test/java/io/trino/operator/output/BenchmarkPartitionedOutputOperator.java

this should go to commit that adds row-wise benchmark. BTW: do you know if it works?

there is no explicit row-wise benchmark. It has to be set up manually by choosing positionCount ~= partitionCount.

And yes, row-wise code path pollution works; it has a visible influence on benchmark results:

```arm
### without row-wise pollution
Benchmark                                   (channelCount)  (enableCompression)  (nullRate)  (partitionCount)  (positionCount)  (type)  Mode  Cnt    Score    Error  Units
BenchmarkPartitionedOutputOperator.addPage               2                false           0               512              512  BIGINT  avgt   10  276.219  ± 3.432  ms/op
### with row-wise pollution
Benchmark                                   (channelCount)  (enableCompression)  (nullRate)  (partitionCount)  (positionCount)  (type)  Mode  Cnt    Score    Error  Units
BenchmarkPartitionedOutputOperator.addPage               2                false           0               512              512  BIGINT  avgt   10  352.991  ± 9.117  ms/op
```
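
To put the two scores quoted in this comment in perspective, the arithmetic can be spelled out as below. This is a small sketch, assuming JMH's Error column is read as the half-width of a confidence interval around Score; the class and method names are made up for illustration.

```java
import java.util.Locale;

// Illustrative helper for comparing two JMH avgt results (names are hypothetical).
class PollutionSlowdown {
    // Relative increase of the polluted score over the baseline score.
    static double relativeSlowdown(double baselineMs, double pollutedMs) {
        return (pollutedMs - baselineMs) / baselineMs;
    }

    // True when [scoreA - errorA, scoreA + errorA] and [scoreB - errorB, scoreB + errorB] intersect.
    static boolean intervalsOverlap(double scoreA, double errorA, double scoreB, double errorB) {
        return scoreA + errorA >= scoreB - errorB && scoreB + errorB >= scoreA - errorA;
    }

    public static void main(String[] args) {
        double slowdown = relativeSlowdown(276.219, 352.991);
        System.out.println(String.format(Locale.ROOT, "slowdown: %.1f%%", slowdown * 100));
        System.out.println("intervals overlap: " + intervalsOverlap(276.219, 3.432, 352.991, 9.117));
    }
}
```

With the figures quoted in the comment, this works out to a slowdown of roughly 28%, and the two intervals do not overlap.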

lukasz-stec

As suggested, I refactored the benchmark's TestType to generate pages via composition instead of overriding the createPage method.

lukasz-stec · 2022-05-04T13:44:28Z

core/trino-main/src/test/java/io/trino/operator/output/BenchmarkPartitionedOutputOperator.java

lukasz-stec · 2022-05-04T13:44:56Z

core/trino-main/src/test/java/io/trino/operator/output/BenchmarkPartitionedOutputOperator.java

lukasz-stec · 2022-05-04T13:55:47Z

core/trino-main/src/test/java/io/trino/operator/output/BenchmarkPartitionedOutputOperator.java

cla-bot bot added the cla-signed label May 4, 2022

lukasz-stec requested a review from sopel39 May 4, 2022 07:20

lukasz-stec mentioned this pull request May 4, 2022

PartitionedOutputOperator RLE blocks support #11289

Merged

lukasz-stec force-pushed the ls/014-poo-benchmark-docs branch from 2c46d6a to 6f9558c Compare May 4, 2022 11:02

lukasz-stec changed the title ~~Document BenchmarkPartitionedOutputOperator cases~~ Improve BenchmarkPartitionedOutputOperator May 4, 2022

sopel39 approved these changes May 4, 2022

lukasz-stec added 7 commits May 4, 2022 15:10

Use PageGenerator in BenchmarkPartitionedOutputOperator

65bda09

Use PageGenerator composition inside TestType instead of overriding createPage method. It makes the intent clearer, especially if the PageGenerators are re-used.

Extract createDictionaryPartitionChannelPage

3dea0a5

Add Row with RLE field case to BenchmarkPartitionedOutputOperator

15508ba

Make pageCount for DICTIONARY_BIGINT consistent with BIGINT

dd8d122

Add dictionary cases to BenchmarkPartitionedOutputOperator

a2f7bb9

Pollute row-wise processing

93bb653

Extend BenchmarkPartitionedOutputOperator.pollute with row-wise code path pollution

Document BenchmarkPartitionedOutputOperator cases

8e75abd

Introduce a consistent TestType naming convention. Add descriptions to the TestType cases.

lukasz-stec force-pushed the ls/014-poo-benchmark-docs branch from 6f9558c to 8e75abd Compare May 4, 2022 13:58

lukasz-stec commented May 4, 2022

sopel39 approved these changes May 5, 2022

sopel39 merged commit f626817 into trinodb:master May 5, 2022

github-actions bot added this to the 380 milestone May 5, 2022

mosabua mentioned this pull request May 5, 2022

Add Trino 380 release notes #12184

Merged

sopel39 merged 7 commits into trinodb:master from starburstdata:ls/014-poo-benchmark-docs
