Limit number of nodes that execute writing stages #15987
radek-kondziolka wants to merge 3 commits into trinodb:master
Conversation
Close #15877?
Yes, #15877 was closed. Let's wait with merging until the documentation PR is prepared.
Instead of having a separate rule, you can do this directly inside AddExchange#visitTableWriter
Yes, it could. I do it here because:
- Most probably in the future we can make it adaptive based on stats.
- There is a similar rule that also sets the PartitioningScheme: ApplyPreferredTableExecutePartitioning. That rule is very simple, just based on a configuration toggle, and could be part of AddExchanges as well.
This is why I decided to wrap it in a separate rule.
TBH, I don't think we need a separate rule for this since we are just applying the configuration. In ApplyPreferredTableExecutePartitioning we are making a decision based on estimates whether to use preferred partitioning or not, so that one kinda makes sense.
And in the future, if we ever make it adaptive based on stats, we can always add a new rule and remove this from AddExchanges. It shouldn't be a big change.
I think the easier approach is preferred. If it's just a few lines of code, then AddExchange#visitTableWriter is good enough.
Be sure to have proper test coverage. With AddExchange I think you have to have BasePlanTest kind of tests.
@gaurav8297, the rule LimitMaxWriterNodesCount is now much more complicated. In particular, we skip the rule in some cases that would not be skipped in the AddExchange rule. I do not think that we should merge them; it is more complicated than it seemed at the beginning.
core/trino-main/src/test/java/io/trino/execution/scheduler/TestScaledWriterScheduler.java
Added an option query.max-writer-nodes-count to QueryManagerConfig, and a matching session property, that limits the number of nodes that take part in writing tasks.
The option query.max-writer-nodes-count is used as the maximum number of nodes that take part in writing stages when ScaledWriterScheduler is used.
The optimizer rule LimitMaxWriterNodesCount was added to limit the number of nodes that take part in executing writer stages.
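As a rough illustration of what such a limit does (this is a hypothetical sketch, not Trino's actual scheduler code; the class and method names below are invented), capping the writer node count amounts to clamping the candidate node list a writer stage may use:

```java
import java.util.List;

// Hypothetical sketch: cap the candidate node list for a writer stage.
// NodePicker and limitWriterNodes are illustrative names, not Trino's API.
public class NodePicker
{
    public static <T> List<T> limitWriterNodes(List<T> candidateNodes, int maxWriterNodesCount)
    {
        if (maxWriterNodesCount <= 0) {
            throw new IllegalArgumentException("maxWriterNodesCount must be positive");
        }
        // Use at most maxWriterNodesCount of the available nodes for writing tasks
        return candidateNodes.subList(0, Math.min(candidateNodes.size(), maxWriterNodesCount));
    }
}
```

If fewer nodes are available than the configured limit, all of them are used; the limit is an upper bound, not a target.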
Have we also considered making the target catalog's connector participate in deciding how many writer tasks to use? Clusters can be connected to very heterogeneous systems, so for example one might want all nodes to write to object storage or Kafka but only a limited number of writers to databases.
You mean round-robin writers or partitioned? For partitioned, the connector can always assign a fixed partition->node mapping.
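A fixed partition->node mapping over a limited writer set can be sketched as a simple deterministic assignment (illustrative only; this is not Trino's actual bucket-to-node mapping code):

```java
// Hypothetical sketch of a fixed partition -> node assignment over a limited
// writer node set; illustrative only, not Trino's bucket-to-node mapping.
public class FixedPartitionMapping
{
    public static int nodeForPartition(int partitionId, int writerNodeCount)
    {
        if (writerNodeCount <= 0) {
            throw new IllegalArgumentException("writerNodeCount must be positive");
        }
        // Each partition is deterministically owned by exactly one writer node
        return Math.floorMod(partitionId, writerNodeCount);
    }
}
```

Because the mapping is deterministic, every row of a given partition always lands on the same writer node, so limiting writerNodeCount limits write parallelism for partitioned data.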
For round-robin writers. EDIT: Is it possible to partition writes on some column which is actually not part of the output? E.g. if I wanted to limit writes to 2 nodes, maybe I could partition the output on some generated column and assign it to two nodes. That would also achieve what I'm trying to think of.
@hashhar, for now it is not possible.
Description
The option query.max-writer-node-count was added to limit the number of nodes that take part in executing writer stages. It was implemented by some changes in ScaledWriterScheduler (for unpartitioned data) and by adding the new rule LimitMaxWriterNodesCount to the optimizer (for partitioned data).
Release notes
( ) This is not user-visible or is docs only, and no release notes are required.
(x) Release notes are required, please propose a release note for me.
( ) Release notes are required, with the following suggested text:
Documentation
A documentation update is needed and will be done soon.