
Add a limit on total number of bytes read from storage in table scan #14739

Merged
tdcmeehan merged 1 commit into prestodb:master from fgwang7w:14701
Jul 31, 2020

Conversation

@fgwang7w
Member

@fgwang7w fgwang7w commented Jun 28, 2020

Fixes #14701

== RELEASE NOTES ==

General Changes
* Add `query.max-scan-physical-bytes` configuration property and `query_max_scan_physical_bytes` session property to limit the total number of bytes read from storage during a table scan. The default limit is 1PB.
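For illustration, the new hard limit could be set cluster-wide in the coordinator config; the values below are hypothetical, only the property name comes from this PR:

```properties
# etc/config.properties - illustrative value; name from this PR's release notes
query.max-scan-physical-bytes=500TB
```

A single query could then override it with the session property, e.g. `SET SESSION query_max_scan_physical_bytes = '1TB';` (again an illustrative value).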

@fgwang7w fgwang7w changed the title from "#14701: Support query level scan bytes limits" to "Support query level scan bytes limits" Jun 28, 2020
@fgwang7w fgwang7w requested a review from mbasmanova June 29, 2020 04:04
Contributor

@mbasmanova mbasmanova left a comment


@fgwang7w Some initial comments. Would it make sense to have both a soft and a hard limit? E.g. the soft limit would generate a warning, while the hard limit would fail the query. See query_max_scan_physical_bytes for an example.

Contributor

getQueryMaxScanPhysicalBytes(session) returns a value that can be used as is without further checking.

This code ensures that this property defaults to what's specified in config:

                dataSizeProperty(
                        QUERY_MAX_SCAN_PHYSICAL_BYTES,
                        "Maximum scan physical bytes of a query",
                        queryManagerConfig.getQueryMaxScanPhysicalBytes(),
                        false),

and this code defines the default value that is used if nothing is specified in config file:

private DataSize queryMaxScanPhysicalBytes = DataSize.succinctDataSize(1, PETABYTE);

Hence, the code can be simplified like this:

        for (QueryExecution query : queryTracker.getAllQueries()) {
            DataSize limit = getQueryMaxScanPhysicalBytes(query.getSession());
            DataSize scan = query.getQueryInfo().getQueryStats().getRawInputDataSize();
            if (scan.compareTo(limit) >= 0) {
                query.fail(new ExceededScanLimitException(limit));
            }
        }
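Boiled down, the check above compares an aggregated byte counter against a per-query limit. A minimal self-contained sketch of that pattern, using plain long byte counts and hypothetical stand-in types (not Presto's actual DataSize/QueryExecution classes):

```java
import java.util.List;

public class ScanLimitEnforcer {
    // Hypothetical stand-in for a running query: bytes read so far and its limit.
    public static class Query {
        public final long rawInputBytes;   // bytes read from storage so far
        public final long limitBytes;      // per-query limit from session/config
        public boolean failed;

        public Query(long rawInputBytes, long limitBytes) {
            this.rawInputBytes = rawInputBytes;
            this.limitBytes = limitBytes;
        }
    }

    // Mirrors the suggested loop: fail any query at or over its limit.
    public static void enforceScanLimits(List<Query> queries) {
        for (Query query : queries) {
            if (query.rawInputBytes >= query.limitBytes) {
                query.failed = true; // Presto would call query.fail(new ExceededScanLimitException(...))
            }
        }
    }

    public static void main(String[] args) {
        Query small = new Query(100, 1_000);
        Query huge = new Query(5_000, 1_000);
        enforceScanLimits(List.of(small, huge));
        System.out.println(small.failed + " " + huge.failed); // false true
    }
}
```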

Member Author

Many thanks for the proposal. I have made another version of this limit based on your suggestion. The new version now includes a method defined in QueryExecution to collect the scanned data size, implemented in SqlQueryExecution to collect the finalQueryInfo's rawInputDataSize. Please help review again.

Contributor

Let's clarify what the limit applies to. E.g., does it apply to the amount of data read from storage before or after compression?

Member Author

This is the amount of input bytes consumed from storage, via QueryInfo's getQueryStats().getRawInputDataSize().


Concurring with @mbasmanova's comment - we should be clear about which scan limit has been exceeded. I recommend EXCEEDED_SCAN_RAW_BYTES_READ_LIMIT. The message can also be made more explicit.

Member Author

The message has been fixed to indicate that the scan bytes limit has been exceeded for this exception.

Contributor

@viczhang861 viczhang861 left a comment


Contributor

Is QueryStats::rawInputDataSize equivalent to scanPhysicalBytes? Is reading from a materialized intermediate table included? cc @arhimondr


Same comment as in SystemSessionProperties - let's use the same name as the actual metric - queryMaxRawInputBytes. In accordance with that, let's change the config name as well - query.max-raw-input-bytes.

@mbasmanova
Contributor

@fgwang7w Would you squash all 5 commits into one?


@mayankgarg1990 mayankgarg1990 left a comment


Overall, let's ensure that the logic works. In addition, let's use the term rawInputBytes instead of scannedBytes so that it is easy for users to understand which metric they exceeded.


Comment on lines 759 to 591


I can see a lot of issues with this logic:

  1. We are reading finalQueryInfo, which is published only when the query is finished - can you test and see whether this number is actually published while the query is running?
  2. For every stage that is a scan stage, we take the whole query's bytes read and add it. So if there are 2 scan stages, you will just return 2 * bytesReadByQuery.
  3. getAllStages does a DFS traversal every time - given that it will be called in a loop, this might be an expensive operation, and we don't really care about the ordering here.

In my opinion, we should do something similar to the existing logic for getTotalCpuTime. That will help ensure that the logics are similar and easy to change together if we ever decide to head down that path.

Member Author

OK, so this is newly implemented. The original proposal was a simple approach: obtain the scanned data size from queryInfo::queryStats, unless the final resolution is to do something similar to the existing logic with getTotalCpuTime.
DataSize scan = query.getQueryInfo().getQueryStats().getRawInputDataSize();
@mbasmanova what's your suggestion on this one?


query.getQueryInfo().getQueryStats() is a somewhat expensive method, since it aggregates all the stats and not just bytes read. We should keep this data collection as light as possible, in my opinion.

Contributor

Agree with @mayankgarg1990 -- this can be made to be less expensive and consistent with how we calculate CPU time.

Member Author

Thank you @mayankgarg1990 @tdcmeehan for the comments; I will revise this code to align with the general data collection logic.

Contributor

Were these comments addressed?

Member Author

Yes, this code block has been removed to keep it simple and similar to how CPU limit enforcement is handled.

@fgwang7w fgwang7w force-pushed the 14701 branch 2 times, most recently from 8f3188d to 0bb4527 on July 2, 2020 19:15
@fgwang7w
Member Author

fgwang7w commented Jul 2, 2020

@fgwang7w Would you squash all 5 commits into one?

done, squashed into 1 commit now

@mbasmanova
Contributor

@fgwang7w I'm in meetings all day today and won't have time to review/answer questions on this PR before the long weekend. I'll take a look early next week.

@fgwang7w
Member Author

fgwang7w commented Jul 2, 2020

@fgwang7w I'm in meetings all day today and won't have time to review/answer questions on this PR before the long weekend. I'll take a look early next week.

Sure, thank you. In the meantime I will do more testing to ensure the quality of the code.

@fgwang7w
Member Author

Hi @mbasmanova, please help review the revised commit, many thanks!

@mbasmanova
Contributor

@fgwang7w Looks good to me, but I'll defer to @mayankgarg1990 and @tdcmeehan to confirm that their comments have been addressed properly. The commit message needs to be updated to match the guidelines at https://chris.beams.io/posts/git-commit/ . For example,

Add a limit on total number of bytes read from storage in table scan

Add query.max-scan-physical-bytes configuration and query_max_scan_physical_bytes
session properties to limit the total number of bytes read from storage during table scan.
The default limit is 1PB.

@mbasmanova mbasmanova requested review from bhhari and yingsu00 July 20, 2020 14:06
@mbasmanova mbasmanova changed the title from "Support query level scan bytes limits" to "Add a limit on total number of bytes read from storage in table scan" Jul 20, 2020
@mbasmanova
Contributor

@mayankgarg1990 @tdcmeehan Mayank, Tim, would you take another look?

@mayankgarg1990

@mbasmanova, this is on my radar, I will get to it by tomorrow (7/22) :)

@mbasmanova
Contributor

Thank you, Mayank.

@fgwang7w fgwang7w force-pushed the 14701 branch 2 times, most recently from 3196c11 to 2f7fffb on July 21, 2020 21:33
@fgwang7w
Member Author

fgwang7w commented Jul 24, 2020

Hi @mayankgarg1990 @tdcmeehan, could you please help review the commit for the upcoming release merge? Many thanks!

@fgwang7w fgwang7w removed the request for review from bhhari July 24, 2020 21:55
@fgwang7w fgwang7w requested review from mayankgarg1990 and mbasmanova and removed request for yingsu00 July 24, 2020 21:55
@mayankgarg1990

@fgwang7w - As @tdcmeehan pointed out, my comments from my previous review are not addressed yet.

@fgwang7w
Member Author

@fgwang7w - As @tdcmeehan pointed out, my comments from my previous review are not addressed yet.

Yes, I resubmitted the squashed commit just now and replied to both @mayankgarg1990 and @tdcmeehan regarding the QueryStats perf issue for the getScannedBytes method and the default scan limit. Basically, I have simplified the code and reduced unnecessary method calls per the suggestions. The default scan limit is also removed for now. Please give it another round of review, many thanks!


Putting a new comment since the old comment was marked as resolved. Let's match this with the actual metric (rawInputBytes) and the exception name - QUERY_MAX_SCAN_RAW_INPUT_BYTES.

Member Author

Sure - actually, I would synchronize all method names and variables to QueryMaxScanRawInputBytes to map to rawInputBytes.


Same comment as in SystemSessionProperties - let's use the same name as the actual metric - queryMaxRawInputBytes. In accordance with that, let's change the config name as well - query.max-raw-input-bytes.

Comment on lines 334 to 335


These 2 are not being used - remove them.

Member Author

sure, it's removed


This is still a bit expensive, since getting QueryStats involves: for all stages, for all tasks, collect all metrics.

The way CPU does it is cheaper: all stages, all tasks, just the CPU - so we can do something similar here and make it even cheaper.

We can follow a similar flow here:

SqlQueryManager -> SqlQuerySchedulerInterface#getTotalCpuTime -> SqlStageExecution#getTotalCpuTime

and just sum up the bytes involved.
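The flow described above - the scheduler asks each stage, and each stage sums its tasks - can be sketched with hypothetical stand-in types (the real Presto classes have different shapes; only the method name getRawInputDataSize mirrors the PR):

```java
import java.util.List;

public class RawInputAggregation {
    // Stand-in for a task's lightweight stats: only the one counter we need.
    static class Task {
        final long rawInputBytes;
        Task(long rawInputBytes) { this.rawInputBytes = rawInputBytes; }
    }

    // Stand-in for SqlStageExecution: sums just raw input bytes over its tasks
    // instead of materializing a full QueryStats object.
    static class Stage {
        final List<Task> tasks;
        Stage(List<Task> tasks) { this.tasks = tasks; }

        long getRawInputDataSize() {
            long sum = 0;
            for (Task task : tasks) {
                sum += task.rawInputBytes;
            }
            return sum;
        }
    }

    // Stand-in for the scheduler: the query total is the sum over all stages.
    static long getRawInputDataSize(List<Stage> stages) {
        long sum = 0;
        for (Stage stage : stages) {
            sum += stage.getRawInputDataSize();
        }
        return sum;
    }

    public static void main(String[] args) {
        Stage scan = new Stage(List.of(new Task(400), new Task(600)));
        Stage intermediate = new Stage(List.of(new Task(0)));
        System.out.println(getRawInputDataSize(List.of(scan, intermediate))); // 1000
    }
}
```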

Member Author

I'm not certain that your approach of summing up bytes from SqlStageExecution#getTotalScanBytes is accurate... it's always best practice to inherit the scanned byte size from the existing QueryStats to ensure the result is correct. I suggest we maintain the current implementation with SqlQueryManager:getBasicQueryInfo -> BasicQueryStats:getQueryStats. How much cheaper would bypassing the existing logic be, given the risk of a false result?
My opinion is that rawInputDataSize should be the only reliable source of truth when checking against the hard scan limit.


I don't agree that BasicQueryStats:getQueryStats is the only trusted source. As you can see, we are already doing this for total CPU ms and memory limits, so I don't see why raw input bytes would be any different here. Again, my only concern is that this is a single thread already doing CPU and memory enforcement, and we should keep it as lightweight as possible; by keeping this trend, we ensure that new entries also follow this lightweight approach.

Member Author

Understood, thank you! I have implemented your approach; I think it is safer and cheaper to acquire the actual number via SqlQuerySchedulerInterface::getRawInputDataSize -> QueryExecution::getRawInputDataSize, and sum up all tasks' stats via SqlStageExecution::getRawInputDataSize. Please review the revised commit again, many thanks!

@fgwang7w fgwang7w force-pushed the 14701 branch 3 times, most recently from 0d8cff0 to 7837b6b on July 29, 2020 07:52

@mayankgarg1990 mayankgarg1990 left a comment


Looks good - just one last comment.


Nit - let's rename this to rawInputSize.

Member Author

fixed, thanks

Comment on lines 338 to 343


Let's add this check at the top to ensure that we are only considering table scan nodes:

if (planFragment.getTableScanSchedulingOrder().isEmpty()) {
    return new DataSize(0, BYTE);
}

Member Author

Agree, we need to bypass when the source is empty. Fixed.
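Folding that guard into the stage-level method looks roughly like this hedged sketch; a hypothetical hasTableScan flag stands in for !planFragment.getTableScanSchedulingOrder().isEmpty(), and Task is a stand-in type:

```java
import java.util.List;

public class StageRawInput {
    // Stand-in for a task's stats.
    static class Task {
        final long rawInputBytes;
        Task(long rawInputBytes) { this.rawInputBytes = rawInputBytes; }
    }

    // hasTableScan stands in for !planFragment.getTableScanSchedulingOrder().isEmpty().
    static long getRawInputDataSize(boolean hasTableScan, List<Task> tasks) {
        if (!hasTableScan) {
            return 0; // intermediate stages read no bytes from storage
        }
        long sum = 0;
        for (Task task : tasks) {
            sum += task.rawInputBytes;
        }
        return sum;
    }

    public static void main(String[] args) {
        System.out.println(getRawInputDataSize(true, List.of(new Task(123))));  // 123
        System.out.println(getRawInputDataSize(false, List.of(new Task(123)))); // 0
    }
}
```

The early return keeps exchange-input bytes of intermediate stages from being counted against a limit that is meant to cover only storage reads.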


@mayankgarg1990 mayankgarg1990 left a comment


Looks good - one last comment.


Let's set the default to a higher number to avoid unexpected failures when people deploy this new version. Every sysadmin should be able to set a reasonable value in their configuration. How about an exabyte (1000, PETABYTE)?

Member Author

Sure - this field is adjustable anytime by DBAs. The default is revised to 1000PB.

Add query.max-scan-physical-bytes configuration and query_max_scan_physical_bytes
session properties to limit the total number of bytes read from storage during table scan.
The default limit is 1PB.
object -> object);
}

public static PropertyMetadata<DataSize> dataSizeProperty(String name, String description, DataSize defaultValue, boolean hidden)
Contributor

Nice

@tdcmeehan tdcmeehan merged commit fb8bb9f into prestodb:master Jul 31, 2020
@caithagoras caithagoras mentioned this pull request Aug 14, 2020
@fgwang7w fgwang7w deleted the 14701 branch September 4, 2020 03:18