Finish query when client has consumed results #14122

electrum · 2022-09-14T03:16:50Z

This works around broken clients that don't fetch the final link.

Release notes

(x) This is not user-visible and no release notes are required.

martint · 2022-09-14T04:31:19Z

What does "when client has consumed results" mean, concretely (i.e., in terms of protocol interactions with the server). If I recall correctly, following the last link was the way for the client to signal to the server that it had received and seen the results successfully.

hashhar · 2022-09-14T06:13:00Z

I'd argue the clients are buggy and had not been following the protocol as defined - the recent change #13055 only exposed the fact that the clients are buggy.

electrum · 2022-09-14T07:27:19Z

@martint It means that all the results have been fetched from the exchange client and the client has (potentially) seen the final page of data. Meaning that we returned a response with the final page.

@hashhar Yes, they definitely are buggy, no argument there. That’s why the description says “This works around broken clients”.

Ideally, all clients would be immediately fixed and users would instantly update to the fixed versions. In practice, this is a serious regression on the server for anyone using such clients, as it results in queries hanging around (for five minutes by default) and tying up the max concurrency slots.

Related, we should probably decrease the default client query timeout, since five minutes seems way too long, and no client retires that long anyway.

findepi · 2022-09-14T07:43:12Z

(x) This is not user-visible and no release notes are required.

Ideally, all clients would be immediately fixed and users would instantly update to the fixed versions. In practice, this is a serious regression on the server for anyone using such clients

if this change is important for users, let's be sure to mention this in release notes.

arhimondr · 2022-09-14T15:51:36Z

The fix looks good, though there are many test failures. I wonder if in some cases the query is getting closed prematurely

arhimondr · 2022-09-14T15:53:38Z

core/trino-main/src/main/java/io/trino/server/protocol/Query.java

I'm afraid that might break the logic. I wonder if this boolean has to be set here or is it sufficient to call queryManager.resultsConsumed to let the query "finish"?

martint · 2022-09-14T16:02:12Z

It means that all the results have been fetched from the exchange client and the client has (potentially) seen the final page of data. Meaning that we returned a response with the final page.

One possible problem to consider is that this will also potentially means the query is marked as finished before the client thinks it's finished -- e.g., if there are any delays in a properly working client to when fetching the final link, or if the client is having problems fetching the final result set and is in the process of retrying.

This works around broken clients that don't fetch the final link.

findepi · 2022-09-15T08:04:12Z

One possible problem to consider is that this will also potentially means the query is marked as finished before the client thinks it's finished

is it a problem?

server is executing the query, so it's done when it's done
client is observing the results only

martint · 2022-09-15T15:25:48Z

is it a problem?

Some possible problems:

Incorrect reporting of query execution time
Early pruning of query state / results in clusters with large query volume (based on query.max-history)
Possibly other future weird behavior if the coordinator gets stricter about removing certain state for queries that have already finished.

electrum · 2022-09-15T18:50:26Z

Incorrect reporting of query execution time

I'm not sure it's incorrect. At that point, the client has requested the final page of data. In extremely rare cases, the client might not receive the response and need to retry, but the query has already finished executing on the server.

martint · 2022-09-15T20:54:35Z

This works around broken clients that don't fetch the final link.

I still don't understand what problem this is solving. How are those clients deciding they don't want to follow the final link? What if they stop two steps from the final link, or three, or N? How do they know there's no more data to be returned?

cla-bot bot added the cla-signed label Sep 14, 2022

arhimondr approved these changes Sep 14, 2022

View reviewed changes

arhimondr reviewed Sep 14, 2022

View reviewed changes

arhimondr self-requested a review September 14, 2022 15:54

Finish query when client has consumed results

fcb7774

This works around broken clients that don't fetch the final link.

electrum force-pushed the queryfinish branch from 7460594 to fcb7774 Compare September 14, 2022 20:09

arhimondr approved these changes Sep 15, 2022

View reviewed changes

electrum merged commit 4ad7177 into trinodb:master Sep 15, 2022

electrum deleted the queryfinish branch September 15, 2022 18:42

github-actions bot added this to the 396 milestone Sep 15, 2022

hashhar mentioned this pull request Sep 20, 2022

Acknowledge reception of data in TrinoResult trinodb/trino-python-client#220

Merged

findepi mentioned this pull request Sep 27, 2022

Queries stuck in "FINISHING" state #14298

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Finish query when client has consumed results #14122

Finish query when client has consumed results #14122

Uh oh!

electrum commented Sep 14, 2022

Uh oh!

martint commented Sep 14, 2022

Uh oh!

hashhar commented Sep 14, 2022

Uh oh!

electrum commented Sep 14, 2022

Uh oh!

findepi commented Sep 14, 2022

Uh oh!

arhimondr commented Sep 14, 2022

Uh oh!

arhimondr Sep 14, 2022

Uh oh!

martint commented Sep 14, 2022

Uh oh!

findepi commented Sep 15, 2022

Uh oh!

martint commented Sep 15, 2022

Uh oh!

electrum commented Sep 15, 2022

Uh oh!

martint commented Sep 15, 2022

Uh oh!

Reviewers

Assignees

Labels

Milestone

Development

Uh oh!

5 participants

Finish query when client has consumed results #14122

Finish query when client has consumed results #14122

Uh oh!

Conversation

electrum commented Sep 14, 2022

Release notes

Uh oh!

martint commented Sep 14, 2022

Uh oh!

hashhar commented Sep 14, 2022

Uh oh!

electrum commented Sep 14, 2022

Uh oh!

findepi commented Sep 14, 2022

Uh oh!

arhimondr commented Sep 14, 2022

Uh oh!

arhimondr Sep 14, 2022

Choose a reason for hiding this comment

Uh oh!

martint commented Sep 14, 2022

Uh oh!

findepi commented Sep 15, 2022

Uh oh!

martint commented Sep 15, 2022

Uh oh!

electrum commented Sep 15, 2022

Uh oh!

martint commented Sep 15, 2022

Uh oh!

Reviewers

Assignees

Labels

Milestone

Development

Uh oh!

5 participants