[native] Push down FilterNode into TableScan by Yuhta · Pull Request #23755 · prestodb/presto

Yuhta · 2024-10-01T15:38:13Z

Sometimes the planner put a FilterNode right after TableScanNode. This change merges these 2 nodes by moving the filter expression into remaining filter of the table scan when possible. This results in fewer IO and CPU because table scan can leverage the information in remaining filter to skip file stripes in case of random sampling.

Also add $path and $bucket in split info columns and fix split counts in coordinator UI.

aditi-pandit

Thanks @Yuhta. Had a high level question:

Am curious why we are doing the pushdown of FilterNode into TableScan during Presto -> Velox plan conversion ? This could be an optimization in Velox Local Planner as well. That would make it common to Presto and Spark.

presto-native-execution/presto_cpp/main/PrestoTask.cpp

presto-native-execution/presto_cpp/main/types/PrestoToVeloxConnector.cpp

Yuhta · 2024-10-01T19:27:29Z

@aditi-pandit Local planner has no dependency on Hive so it does not have knowledge about remaining filter. I guess this was the same reason why Presto planner does not put it in remaining filter as well (did not verify). Seems Prestissimo is a reasonable place to fix this, as we have all the dependency needed and we have the knowledge that this will benefit the execution layer (Velox).

presto-native-execution/presto_cpp/main/types/PrestoToVeloxQueryPlan.cpp

aditi-pandit · 2024-10-01T20:27:37Z

@aditi-pandit Local planner has no dependency on Hive so it does not have knowledge about remaining filter. I guess this was the same reason why Presto planner does not put it in remaining filter as well (did not verify). Seems Prestissimo is a reasonable place to fix this, as we have all the dependency needed and we have the knowledge that this will benefit the execution layer (Velox).

Hmm... Yeah, Velox Local Planner doesn't have dependency on Hive. Alright, your approach is reasonable.

Also add `$path` and `$bucket` in split info columns and fix split counts in coordinator UI.

aditi-pandit

Thanks @Yuhta for this code, and answering all my questions.

jaystarshot · 2024-11-04T18:08:19Z

Please consider adding release notes following our release notes guide - link.

Using below for now

*Merged FilterNode into TableScanNode where possible, reducing I/O and CPU; added $path and $bucket to split info, and fixed split counts in coordinator UI. :pr:23755``

Yuhta force-pushed the tasks/T202164911/0 branch from 493b3ce to 8c97879 Compare October 1, 2024 15:45

Yuhta marked this pull request as ready for review October 1, 2024 17:32

Yuhta requested a review from a team as a code owner October 1, 2024 17:32

aditi-pandit reviewed Oct 1, 2024

View reviewed changes

presto-native-execution/presto_cpp/main/PrestoTask.cpp Show resolved Hide resolved

presto-native-execution/presto_cpp/main/types/PrestoToVeloxConnector.cpp Show resolved Hide resolved

aditi-pandit reviewed Oct 1, 2024

View reviewed changes

presto-native-execution/presto_cpp/main/types/PrestoToVeloxQueryPlan.cpp Show resolved Hide resolved

Yuhta force-pushed the tasks/T202164911/0 branch from 8c97879 to ee4bb8e Compare October 2, 2024 15:10

[native] Push down FilterNode into TableScan

cff51f6

Also add `$path` and `$bucket` in split info columns and fix split counts in coordinator UI.

Yuhta force-pushed the tasks/T202164911/0 branch from ee4bb8e to cff51f6 Compare October 2, 2024 15:10

aditi-pandit approved these changes Oct 2, 2024

View reviewed changes

aditi-pandit merged commit 176e886 into prestodb:master Oct 2, 2024

jaystarshot mentioned this pull request Nov 1, 2024

Add release notes for 0.290 #23936

Merged

25 tasks

Yuhta mentioned this pull request Nov 5, 2024

[native] Revert remaining filter pushdown from FilterNode #23855

Merged

tdcmeehan added the from:Meta PR from Meta label Dec 13, 2024

majetideepak mentioned this pull request Mar 4, 2025

Prestissimo over reporting split/driver counts #23441

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[native] Push down FilterNode into TableScan#23755

[native] Push down FilterNode into TableScan#23755
aditi-pandit merged 1 commit intoprestodb:masterfrom
Yuhta:tasks/T202164911/0

Yuhta commented Oct 1, 2024

Uh oh!

aditi-pandit left a comment •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

Yuhta commented Oct 1, 2024

Uh oh!

Uh oh!

aditi-pandit commented Oct 1, 2024

Uh oh!

aditi-pandit left a comment

Uh oh!

jaystarshot commented Nov 4, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Comments

Conversation

Yuhta commented Oct 1, 2024

Uh oh!

aditi-pandit left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Yuhta commented Oct 1, 2024

Uh oh!

Uh oh!

aditi-pandit commented Oct 1, 2024

Uh oh!

aditi-pandit left a comment

Choose a reason for hiding this comment

Uh oh!

jaystarshot commented Nov 4, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Comments

aditi-pandit left a comment •

edited

Loading