Skip to content

Simplify Hive connector scale writers test#18688

Merged
raunaqmorarka merged 1 commit intotrinodb:masterfrom
gaurav8297:fix_scale_writer_flaky
Aug 17, 2023
Merged

Simplify Hive connector scale writers test#18688
raunaqmorarka merged 1 commit intotrinodb:masterfrom
gaurav8297:fix_scale_writer_flaky

Conversation

@gaurav8297
Copy link
Copy Markdown
Member

@gaurav8297 gaurav8297 commented Aug 15, 2023

Description

Two changes:

  • First, We need to use a large table (sf2) to see the
    effect. Otherwise, a single writer will write the
    entire data before ScaledWriterScheduler is able to
    scale it to multiple machines.

  • Second, since we now use page size to scale up instead
    of physical written bytes, we don't have to test on different
    file formats. This reduces the number of tests and overall
    runtime.

fixes: #18670

Additional context and related issues

Release notes

(x) This is not user-visible or is docs only, and no release notes are required.
( ) Release notes are required. Please propose a release note for me.
( ) Release notes are required, with the following suggested text:

@cla-bot cla-bot bot added the cla-signed label Aug 15, 2023
@github-actions github-actions bot added tests:hive hive Hive connector labels Aug 15, 2023
@gaurav8297 gaurav8297 force-pushed the fix_scale_writer_flaky branch 3 times, most recently from 628da68 to c2de9d5 Compare August 15, 2023 23:09
Copy link
Copy Markdown
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

How much memory it will use and how long can it take?

@findepi
Copy link
Copy Markdown
Member

findepi commented Aug 16, 2023

test (plugin/trino-hive)
failed 7 hours ago in 1h 0m 22s

Error: The operation was canceled.

is the build failure related?

@findepi
Copy link
Copy Markdown
Member

findepi commented Aug 16, 2023

cc @lukasz-stec

Copy link
Copy Markdown
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

intentional?

Copy link
Copy Markdown
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

yes, to test ci

@gaurav8297 gaurav8297 force-pushed the fix_scale_writer_flaky branch from c2de9d5 to 429156b Compare August 16, 2023 17:39
@gaurav8297 gaurav8297 changed the title Use sf5 table in testScaleWriters Simplify Hive connector scale writers test Aug 16, 2023
@gaurav8297 gaurav8297 requested a review from lukasz-stec August 16, 2023 17:41
@gaurav8297 gaurav8297 force-pushed the fix_scale_writer_flaky branch from 429156b to 60ba1bd Compare August 16, 2023 18:30
First, We need to use large table (sf2) to see the
effect. Otherwise, a single writer will write the
entire data before ScaledWriterScheduler is able to
scale it to multiple machines.

Second, Since we now use page size to scale up instead
of physical written bytes, we don't have to test on different
file formats. This reduces the number of tests and overall
runtime.
@raunaqmorarka raunaqmorarka merged commit 5c3b039 into trinodb:master Aug 17, 2023
@github-actions github-actions bot added this to the 424 milestone Aug 17, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

cla-signed hive Hive connector

Development

Successfully merging this pull request may close these issues.

Flaky test TestHiveConnectorTest.testScaleWriters

4 participants