[SPARK-28755][R][TESTS] Increase tolerance in 'spark.mlp' SparkR test for JDK 11 #25478

HyukjinKwon · 2019-08-16T12:41:58Z

What changes were proposed in this pull request?

This PR proposes to increase the tolerance for the exact value comparison in spark.mlp test. I don't know the root cause but some tolerance is already expected. I suspect it is not a big deal considering all other tests pass.

The values are fairly close:

JDK 8:

-24.28415, 107.8701, 16.86376, 1.103736, 9.244488

JDK 11:

-24.33892, 108.0316, 16.89082, 1.090723, 9.260533

Why are the changes needed?

To fully support JDK 11. See, for instance, #25443 and #25423 for ongoing efforts.

Does this PR introduce any user-facing change?

No

How was this patch tested?

Manually tested on the top of #25472 with JDK 11

./build/mvn -DskipTests -Psparkr -Phadoop-3.2 package
./bin/sparkR

absoluteSparkPath <- function(x) {
  sparkHome <- sparkR.conf("spark.home")
  file.path(sparkHome, x)
}
df <- read.df(absoluteSparkPath("data/mllib/sample_multiclass_classification_data.txt"),
              source = "libsvm")
model <- spark.mlp(df, label ~ features, blockSize = 128, layers = c(4, 5, 4, 3),
                   solver = "l-bfgs", maxIter = 100, tol = 0.00001, stepSize = 1, seed = 1)
summary <- summary(model)
head(summary$weights, 5)

HyukjinKwon · 2019-08-16T12:44:38Z

@felixcheung and @shivaram, we're actually a bit rushing to let Hive 2.3.6 (with JDK 11 fix) release and Spark uses it to support JDK 11 in Spark 3. (see #25405 (comment), #25443 and #25423 for instance).

If you guys won't mind, I will just merge this. This is test-only and won't affect anything in main codes.

shivaram · 2019-08-16T15:26:51Z

@HyukjinKwon Sure it sounds fine to me. Lets file another JIRA to track where this error is coming from?

dongjoon-hyun · 2019-08-16T15:48:30Z

Retest this please.

dongjoon-hyun

+1, LGTM.

SparkQA · 2019-08-16T17:02:58Z

Test build #109216 has finished for PR 25478 at commit fcee12a.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

dongjoon-hyun · 2019-08-16T17:06:23Z

Thank you, @HyukjinKwon , @shivaram , and @srowen .
Merged to master.

HyukjinKwon · 2019-08-17T01:32:41Z

@HyukjinKwon Sure it sounds fine to me. Lets file another JIRA to track where this error is coming from?

Sure, let me file it in few days soon (after #25443)

… for JDK 11  ### What changes were proposed in this pull request?  This PR proposes to increase the tolerance for the exact value comparison in `spark.mlp` test. I don't know the root cause but some tolerance is already expected. I suspect it is not a big deal considering all other tests pass. The values are fairly close: JDK 8: ``` -24.28415, 107.8701, 16.86376, 1.103736, 9.244488 ``` JDK 11: ``` -24.33892, 108.0316, 16.89082, 1.090723, 9.260533 ``` ### Why are the changes needed?  To fully support JDK 11. See, for instance, apache#25443 and apache#25423 for ongoing efforts. ### Does this PR introduce any user-facing change?  No ### How was this patch tested?  Manually tested on the top of apache#25472 with JDK 11 ```bash ./build/mvn -DskipTests -Psparkr -Phadoop-3.2 package ./bin/sparkR ``` ```R absoluteSparkPath <- function(x) { sparkHome <- sparkR.conf("spark.home") file.path(sparkHome, x) } df <- read.df(absoluteSparkPath("data/mllib/sample_multiclass_classification_data.txt"), source = "libsvm") model <- spark.mlp(df, label ~ features, blockSize = 128, layers = c(4, 5, 4, 3), solver = "l-bfgs", maxIter = 100, tol = 0.00001, stepSize = 1, seed = 1) summary <- summary(model) head(summary$weights, 5) ``` Closes apache#25478 from HyukjinKwon/SPARK-28755. Authored-by: HyukjinKwon <[email protected]> Signed-off-by: Dongjoon Hyun <[email protected]>

Increase tolerance in 'spark.mlp' SparkR test for JDK 11

fcee12a

HyukjinKwon mentioned this pull request Aug 16, 2019

[SPARK-28723][SQL] Upgrade to Hive 2.3.6 for HiveMetastore Client and Hadoop-3.2 profile #25443

Closed

dongjoon-hyun approved these changes Aug 16, 2019

View reviewed changes

srowen approved these changes Aug 16, 2019

View reviewed changes

dongjoon-hyun added SPARKR TESTS labels Aug 16, 2019

dongjoon-hyun closed this in 7f44a6e Aug 16, 2019

HyukjinKwon deleted the SPARK-28755 branch March 3, 2020 01:18

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[SPARK-28755][R][TESTS] Increase tolerance in 'spark.mlp' SparkR test for JDK 11 #25478

[SPARK-28755][R][TESTS] Increase tolerance in 'spark.mlp' SparkR test for JDK 11 #25478

Uh oh!

HyukjinKwon commented Aug 16, 2019

Uh oh!

HyukjinKwon commented Aug 16, 2019

Uh oh!

shivaram commented Aug 16, 2019

Uh oh!

dongjoon-hyun commented Aug 16, 2019

Uh oh!

dongjoon-hyun left a comment

Uh oh!

SparkQA commented Aug 16, 2019

Uh oh!

dongjoon-hyun commented Aug 16, 2019 •

edited

Loading

Uh oh!

HyukjinKwon commented Aug 17, 2019

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

[SPARK-28755][R][TESTS] Increase tolerance in 'spark.mlp' SparkR test for JDK 11 #25478

[SPARK-28755][R][TESTS] Increase tolerance in 'spark.mlp' SparkR test for JDK 11 #25478

Uh oh!

Conversation

HyukjinKwon commented Aug 16, 2019

What changes were proposed in this pull request?

Why are the changes needed?

Does this PR introduce any user-facing change?

How was this patch tested?

Uh oh!

HyukjinKwon commented Aug 16, 2019

Uh oh!

shivaram commented Aug 16, 2019

Uh oh!

dongjoon-hyun commented Aug 16, 2019

Uh oh!

dongjoon-hyun left a comment

Choose a reason for hiding this comment

Uh oh!

SparkQA commented Aug 16, 2019

Uh oh!

dongjoon-hyun commented Aug 16, 2019 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

HyukjinKwon commented Aug 17, 2019

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

dongjoon-hyun commented Aug 16, 2019 •

edited

Loading