
In Boosting Assembler, wrapping each estimator into a subroutine causes performance degradation #152

Closed
izeigerman opened this issue Jan 21, 2020 · 6 comments
izeigerman (Member) commented Jan 21, 2020

I've recalled the real motivation behind not wrapping every individual estimator into its own subroutine: generating many nested function calls leads to performance degradation in Java. The observed difference reaches 4x for larger models (e.g., XGBoost with 1000 estimators). Here is the basic test I created (sorry about Scala):

@ import scala.util.Random
import scala.util.Random

@ import com.github.m2cgen.ModelOld
import com.github.m2cgen.ModelOld

@ import com.github.m2cgen.ModelNew
import com.github.m2cgen.ModelNew

@ def nextRandomData(): Array[Double] = (0 until 4).map(_ => Random.nextDouble).toArray
defined function nextRandomData

@ def testScore: Unit = {
    val start = System.currentTimeMillis()
    // substitute ModelNew or ModelOld for the model under test
    (0 until 100000).foreach(_ => <ModelNew|ModelOld>.score(nextRandomData()))
    println("Runtime: " + (System.currentTimeMillis() - start).toString)
  }

Results for ModelOld:

@ testScore
Runtime: 2973

For ModelNew:

@ testScore
Runtime: 10747
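
For readers who'd rather skip the Scala REPL, here is a plain-Java sketch of the same micro-benchmark. The ScoreBenchmark class name is mine; it assumes the generated classes live in com.github.m2cgen and expose a static score method taking a double[], as in the Scala snippet above. As with the Scala version, it measures a cold JVM with no warm-up, so treat the absolute numbers as rough.

import com.github.m2cgen.ModelNew;  // or com.github.m2cgen.ModelOld

import java.util.Random;

public class ScoreBenchmark {
    public static void main(String[] args) {
        Random random = new Random();
        long start = System.currentTimeMillis();
        for (int i = 0; i < 100000; i++) {
            // 4 random features, matching the iris input shape
            double[] input = new double[4];
            for (int j = 0; j < input.length; j++) {
                input[j] = random.nextDouble();
            }
            ModelNew.score(input);  // swap in ModelOld to compare
        }
        System.out.println("Runtime: " + (System.currentTimeMillis() - start));
    }
}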

The test model was trained on the sklearn.datasets.load_iris() dataset. The classifier was created as follows:

from xgboost import XGBClassifier

model = XGBClassifier(n_estimators=1000)

In the attached archive I've included the following:

  1. ModelNew.java - Java code generated with the most recent master.
  2. ModelOld.java - Java code generated with the 0.5.0 release.
  3. Models.jar - a jar containing both compiled sources.
  4. xgboost_model2 - the trained estimator in Pickle format.
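
For context, the structural difference between the two generated files boils down to roughly the following. This is a minimal hand-written sketch with made-up class names, thresholds, and leaf values; the real generated code has one branch-heavy block per estimator and different identifiers.

// ModelOld-style output (0.5.0): every estimator is evaluated inline
// inside a single score method.
class ModelOldShape {
    public static double score(double[] input) {
        double var0 = input[2] < 2.45 ? 0.43 : -0.22;  // estimator 0
        double var1 = input[3] < 1.65 ? 0.18 : -0.31;  // estimator 1
        // ... ~1000 such blocks in the same method ...
        return var0 + var1;
    }
}

// ModelNew-style output (master): each estimator lives in its own
// subroutine, so every score call becomes a chain of method calls.
class ModelNewShape {
    public static double score(double[] input) {
        return subroutine0(input) + subroutine1(input); // + ...
    }

    private static double subroutine0(double[] input) {
        return input[2] < 2.45 ? 0.43 : -0.22;
    }

    private static double subroutine1(double[] input) {
        return input[3] < 1.65 ? 0.18 : -0.31;
    }
}

The per-call overhead is small, but with 1000 estimators it is incurred 1000 times per score invocation, which is where the slowdown measured above comes from.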

CC: @StrikerRUS FYI

StrikerRUS (Member) commented Jan 22, 2020

A 4x slowdown is awful! However, the Java slowdown caused by wrapping estimators into subroutines should be less painful than the R slowdown caused by not wrapping them, because the slow Java code doesn't exceed the Travis time limit, while the R code does.
Do you have any ideas for a possible compromise yet?

izeigerman (Member, Author) commented Jan 22, 2020

Solving this issue on the assembler's end rather than in the interpreters certainly wasn't the best idea ever. So far I don't have any great ideas, but I'm sure I don't like penalizing one language in favor of the other.

UPD: We should also keep in mind that JVM usage is (and will likely remain) more widespread and has stricter execution time constraints.

UPD2: I actually do have one idea which I'm going to explore this upcoming weekend (or hopefully sooner).

@izeigerman izeigerman self-assigned this Jan 22, 2020
izeigerman (Member, Author) commented

Turns out I had accidentally generated ModelNew.java for the wrong model. After rerunning my tests, the numbers look a lot more reasonable for the current master:

@ testScore
Runtime: 3526

The difference is still significant but not nearly as dramatic. Attaching the updated archive: java_boosting_test.tar.gz

StrikerRUS (Member) commented

> ... but I'm sure I don't like penalizing one language in favor of the other.

I absolutely agree with you! I think we should try to support all our languages equally.

> UPD: We should also keep in mind that JVM usage is (and will likely remain) more widespread and has stricter execution time constraints.

Sorry, I have no experience in Java 🙁 .

> UPD2: I actually do have one idea which I'm going to explore this upcoming weekend (or hopefully sooner).

Sounds promising!

> The difference is still significant but not nearly as dramatic.

Still, this slowdown is not a good thing.

StrikerRUS (Member) commented

Can we close this?

izeigerman (Member, Author) commented

Yes, absolutely. Thank you!
