-
Notifications
You must be signed in to change notification settings - Fork 29k
[SPARK-53850][SDP] Define proto for Sinks and Rename DefineDataset to DefineOutput #52553
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[SPARK-53850][SDP] Define proto for Sinks and Rename DefineDataset to DefineOutput #52553
Conversation
9ace5ed to
cc15705
Compare
This reverts commit cc15705.
8ad194c to
f5a5fef
Compare
f5a5fef to
2541b01
Compare
| } | ||
|
|
||
| // Metadata that's only applicable to external sinks. | ||
| message ExternalSinkDetails { |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Any particular reason we need "external" in the name? Why not just call them "sink details"?
sql/connect/common/src/main/protobuf/spark/connect/pipelines.proto
Outdated
Show resolved
Hide resolved
sql/connect/common/src/main/protobuf/spark/connect/pipelines.proto
Outdated
Show resolved
Hide resolved
|
Also had to do some code related refactoring since |
sryza
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
One small comment – otherwise LGTM!
| // The type of output. | ||
| enum OutputType { | ||
| // Safe default value. Should not be used. | ||
| DATASET_TYPE_UNSPECIFIED = 0; |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Should this be OUTPUT_TYPE_UNSPECIFIED?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
good catch, fixed
|
+1, LGTM, too. |
…th `4.1.0-preview3` RC1 ### What changes were proposed in this pull request? This PR aims to update Spark Connect-generated Swift source code with Apache Spark `4.1.0-preview3` RC1. ### Why are the changes needed? There are many changes between Apache Spark 4.1.0-preview2 and preview3. - apache/spark#52685 - apache/spark#52613 - apache/spark#52553 - apache/spark#52532 - apache/spark#52517 - apache/spark#52514 - apache/spark#52487 - apache/spark#52328 - apache/spark#52200 - apache/spark#52154 - apache/spark#51344 To use the latest bug fixes and new messages to develop for new features of `4.1.0-preview3`. ``` $ git clone -b v4.1.0-preview3 https://github.com/apache/spark.git $ cd spark/sql/connect/common/src/main/protobuf/ $ protoc --swift_out=. spark/connect/*.proto $ protoc --grpc-swift_out=. spark/connect/*.proto // Remove empty GRPC files $ cd spark/connect $ grep 'This file contained no services' * | awk -F: '{print $1}' | xargs rm ``` ### Does this PR introduce _any_ user-facing change? Pass the CIs. ### How was this patch tested? Pass the CIs. I manually tested with `Apache Spark 4.1.0-preview3` (with the two SDP ignored tests). ``` $ swift test --no-parallel ... ✔ Test run with 203 tests in 21 suites passed after 19.088 seconds. ``` ``` ### Was this patch authored or co-authored using generative AI tooling? No. Closes #252 from dongjoon-hyun/SPARK-54043. Authored-by: Dongjoon Hyun <[email protected]> Signed-off-by: Dongjoon Hyun <[email protected]>
… DefineOutput ### What changes were proposed in this pull request? * Create the spark connect proto for SDP sinks * The encapsulating `DefineDataset` renamed to `DefineOutput` since sink isn't a "dataset". ### Why are the changes needed? To be able to issue these requests from client to server. ### Does this PR introduce _any_ user-facing change? No ### How was this patch tested? Proto changes ### Was this patch authored or co-authored using generative AI tooling? No Closes apache#52553 from JiaqiWang18/SPARK-53850-sdp-sinks-proto. Authored-by: Jacky Wang <[email protected]> Signed-off-by: Sandy Ryza <[email protected]>
What changes were proposed in this pull request?
DefineDatasetrenamed toDefineOutputsince sink isn't a "dataset".Why are the changes needed?
To be able to issue these requests from client to server.
Does this PR introduce any user-facing change?
No
How was this patch tested?
Proto changes
Was this patch authored or co-authored using generative AI tooling?
No