Add training and test functions to integrate the native XGBoost library #281
<version>0.5</version> | ||
<scope>system</scope> | ||
<systemPath>${basedir}/lib/xgboost4j-0.5-jar-with-dependencies.jar</systemPath> | ||
</dependency> |
Currently, no xgboost4j artifact exists in the Maven repository. I'll ask the author later why the library has not been uploaded there.
An obvious other problem is that the native
Also, |
I think |
I asked |
better to create our own package |
okay |
@@ -359,6 +359,21 @@ public static double getAsConstDouble(@Nonnull final ObjectInspector numberOI) | |||
+ TypeInfoUtils.getTypeInfoFromObjectInspector(numberOI)); | |||
} | |||
|
|||
@SuppressWarnings("unchecked") |
Please move this to Primitives; it is not related to Hive. Also, I think asFloat is not necessary, while asFloatArray is useful.
okay, I'll fix it.
84b4c80 to 6461121
I implemented an XGBoost multiclass classifier;
|
I implemented an XGBoost binary classifier;
|
7884c16 to 9080a9f
ref. to discussions #251 |
bfe447f to 88682d0
88682d0 to 4f5706c
How to use
|
Changes Unknown when pulling 73d8090 on maropu:XgboostIntegration into * on myui:master*. |
okay, I removed the dependencies ( |
@@ -9,13 +9,12 @@ | |||
<relativePath>../../pom.xml</relativePath> | |||
</parent> | |||
|
|||
<artifactId>hivemall-spark</artifactId> | |||
<name>Hivemall on Spark</name> | |||
<artifactId>hivemall-spark_2.11</artifactId> |
Why 2.11? hivemall-spark-v1 and hivemall-spark-v2 might be good artifact IDs.
This means the jar supports Spark 2.0 compiled with Scala 2.11. The naming rule follows that of the Spark jars.
Changes Unknown when pulling 0ad666f on maropu:XgboostIntegration into * on myui:master*. |
377d61c to 4c0e33c
Changes Unknown when pulling 4c0e33c on maropu:XgboostIntegration into * on myui:master*. |
e0cb90d to 0ad666f
Changes Unknown when pulling a8f4cf2 on maropu:XgboostIntegration into * on myui:master*. |
Changes Unknown when pulling 3296a82 on maropu:XgboostIntegration into * on myui:master*. |
3296a82 to 826b390
Changes Unknown when pulling 826b390 on maropu:XgboostIntegration into * on myui:master*. |
d13a3cf to e6889dc
Changes Unknown when pulling e6889dc on maropu:XgboostIntegration into * on myui:master*. |
fyi: If you get some errors in a |
@maropu LGTM. Merged. Well done. |
I'm working on the integration of XGBoost. As a first step, each UDTF training worker simply outputs a single xgboost model, and then a testing phase computes prediction values for each built model and averages them.
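
For illustration only, the averaging step in the testing phase could look roughly like the sketch below. It does not use the xgboost4j API or any code from this PR: `Scorer` is a hypothetical stand-in for one model produced by a training worker, and `ModelAveraging` and `averagePrediction` are names made up for this example.

```java
import java.util.Arrays;
import java.util.List;

public final class ModelAveraging {
    private ModelAveraging() {}

    /** Hypothetical stand-in for one trained model: maps a feature vector to a score. */
    public interface Scorer {
        double predict(double[] features);
    }

    /** Computes one prediction per model for the given row and returns their mean. */
    public static double averagePrediction(List<Scorer> models, double[] features) {
        if (models.isEmpty()) {
            throw new IllegalArgumentException("at least one model is required");
        }
        double sum = 0.0;
        for (Scorer model : models) {
            sum += model.predict(features);
        }
        return sum / models.size();
    }

    public static void main(String[] args) {
        // Three dummy models that return constant scores, as if built by three workers.
        List<Scorer> models = Arrays.asList(f -> 0.25, f -> 0.5, f -> 0.75);
        System.out.println(averagePrediction(models, new double[] {1.0, 2.0}));
    }
}
```

A plain unweighted mean is assumed here because the description only says the predictions are averaged; weighting models by, say, the number of rows each worker saw would be a different design.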