Skip to content

Conversation

@RussellSpitzer
Copy link
Member

To enable core Spark3 code to use distribution and ordering information when
performing writes or other operations. One example is to allow Sort Strategy
to transform SortOrder information so it can be used when rewriting data files.

@RussellSpitzer
Copy link
Member Author

@aokolnychyi + @rdblue I would like to have access to these Utilities for doing some work with the Sort Rewrite Action. If anyone has another Idea on how to get access to this functionality I would be glad to hear it. Ideally I would like to keep Scala and this code out of the Spark3 module, but I can't have an Action use code from within the Extensions module.

scala.srcDirs = ['src/main/scala', 'src/main/java']
java.srcDirs = []
}
// Compile test source against Spark 3.1 and main classes compiled against Spark 3.0
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Do we need to do anything with the spark31 sourceSet?

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Nope, we still use the "main" source set

build.gradle Outdated

sourceSets {
main {
// Use Scala Compiler for everything so we can do cross compilation within the module
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I don't think I understand why we can't use the default scala plugin settings. This is disabling Java compilation by removing srcDirs. Should we just remove the Java plugin?

@rdblue
Copy link
Contributor

rdblue commented Aug 24, 2021

I tried to convert some of these classes to Java today and didn't get anywhere. Some of the traits we need to implement just don't exist for some reason and I think it's probably better just to do it this way.

@RussellSpitzer
Copy link
Member Author

@rdblue Are we good to go with this?

To enable core Spark3 code to use distribution and ordering information when
performing writes or other operations. One example is to allow Sort Strategy
to transform SortOrder information so it can be used when rewriting data files.
@rdblue
Copy link
Contributor

rdblue commented Sep 8, 2021

Looks good to me. The other build settings weren't needed after all?

@RussellSpitzer
Copy link
Member Author

Looks good to me. The other build settings weren't needed after all?

Yep I'm not sure what I was thinking :)

@RussellSpitzer RussellSpitzer merged commit a84fa30 into apache:master Sep 9, 2021
@RussellSpitzer RussellSpitzer deleted the MoveSortToolsToSpark3 branch September 9, 2021 14:01
@RussellSpitzer
Copy link
Member Author

Thanks for the review @wypoon + @rdblue

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants