-
Notifications
You must be signed in to change notification settings - Fork 331
Changes to support Arrow 2.0 and Spark 3.0 #711
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
Does this mean we will need to ship a new version of MDA? Is it possible to get that minor change in now? |
src/csharp/Microsoft.Spark.E2ETest/IpcTests/Sql/DataFrameTests.cs
Outdated
Show resolved
Hide resolved
src/csharp/Microsoft.Spark.E2ETest/IpcTests/Sql/DataFrameTests.cs
Outdated
Show resolved
Hide resolved
src/csharp/Microsoft.Spark.Worker/Command/SqlCommandExecutor.cs
Outdated
Show resolved
Hide resolved
src/csharp/Microsoft.Spark.Worker.UnitTest/CommandExecutorTests.cs
Outdated
Show resolved
Hide resolved
src/csharp/Microsoft.Spark.Worker/Command/SqlCommandExecutor.cs
Outdated
Show resolved
Hide resolved
src/csharp/Microsoft.Spark.E2ETest/IpcTests/Sql/DataFrameTests.cs
Outdated
Show resolved
Hide resolved
src/csharp/Microsoft.Spark.Worker/Command/SqlCommandExecutor.cs
Outdated
Show resolved
Hide resolved
src/csharp/Microsoft.Spark.Worker/Command/SqlCommandExecutor.cs
Outdated
Show resolved
Hide resolved
src/csharp/Microsoft.Spark.Worker/Command/SqlCommandExecutor.cs
Outdated
Show resolved
Hide resolved
|
Done, I believe this is ready to review now. We're handling the spark version internally in the Spark worker, so no UDFs have to change. |
src/csharp/Microsoft.Spark.Worker/Command/SqlCommandExecutor.cs
Outdated
Show resolved
Hide resolved
eerhardt
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Changes look good to me. Thanks for the great work here, @pgovind.
You may want to change the PR from "Draft" now.
suhsteve
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM. Just a few minor comments. Thanks @pgovind !
src/csharp/Microsoft.Spark.Worker.UnitTest/CommandExecutorTests.cs
Outdated
Show resolved
Hide resolved
src/csharp/Microsoft.Spark.Worker.UnitTest/CommandExecutorTests.cs
Outdated
Show resolved
Hide resolved
src/csharp/Microsoft.Spark.Worker/Command/SqlCommandExecutor.cs
Outdated
Show resolved
Hide resolved
|
@pgovind would it be possible to add a E2E test that uses chained Arrow/FxDataFrame UDFs ? |
I think they exist already right? I see chained UDFs being tested in E2ETests/IpcTests/Sql/DataFrameTests.cs? |
Great, I see them now. |
suhsteve
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
some minor nits.
src/csharp/Microsoft.Spark.Worker/Command/SqlCommandExecutor.cs
Outdated
Show resolved
Hide resolved
|
|
||
| private RecordBatch WrapColumnsInStructIfApplicable(RecordBatch batch) | ||
| { | ||
| if (_version >= new Version(Versions.V3_0_0)) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
maybe this instead ?
| if (_version >= new Version(Versions.V3_0_0)) | |
| if (_version.Major >= 3) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Ha, that's actually what I had first, until I saw this:
| if (SparkSettings.Version < new Version(version)) |
Are you worried about the new ? I don't mind changing it!!
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
just a very minor micro optimization. We can keep, I don't mind either. 👍
|
Just so you know, I don't have merge permissions here :) So one of you will need to merge this PR |
imback82
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM, thanks for the great work @pgovind!
Fix for #654
Apache arrow changes are here: apache/arrow#8348
Alright, this is ready to review now!
cc @eerhardt @suhsteve