-
Notifications
You must be signed in to change notification settings - Fork 12
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
replace dask-xgboost with xgboost #50
Conversation
Check out this pull request on See visual diffs & provide feedback on Jupyter Notebooks. Powered by ReviewNB |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Looks great!! Found a couple small typos.
And regarding the start script/images, would it be possible to coordinate the release of images and examples together so that we can avoid the workaround in the start script? I'm afraid users could get confused seeing the longer start script, and they might even remove some of the lines, causing problems when they try to spin up the example resources.
"cell_type": "markdown", | ||
"metadata": {}, | ||
"source": [ | ||
"`xgboost.dask.predict()` can be used to create predictiosn on a Dask collection using an XGBoost model object. Note that this model object is just a regular XGBoost booster, not a special Dask-specific model object.\n", |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
"`xgboost.dask.predict()` can be used to create predictiosn on a Dask collection using an XGBoost model object. Note that this model object is just a regular XGBoost booster, not a special Dask-specific model object.\n", | |
"`xgboost.dask.predict()` can be used to create predictions on a Dask collection using an XGBoost model object. Note that this model object is just a regular XGBoost booster, not a special Dask-specific model object.\n", |
"cell_type": "markdown", | ||
"metadata": {}, | ||
"source": [ | ||
"`xgboost.dask.predict()` can be used to create predictiosn on a Dask collection using an XGBoost model object. Note that this model object is just a regular XGBoost booster, not a special Dask-specific model object.\n", |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
"`xgboost.dask.predict()` can be used to create predictiosn on a Dask collection using an XGBoost model object. Note that this model object is just a regular XGBoost booster, not a special Dask-specific model object.\n", | |
"`xgboost.dask.predict()` can be used to create predictions on a Dask collection using an XGBoost model object. Note that this model object is just a regular XGBoost booster, not a special Dask-specific model object.\n", |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
🚢
Now that xgboost 1.3.0 is available, I think it's time to remove uses of
dask-xgboost
in the example here and replace them withxgboost.dask
(the Dask interface built in toxgboost
). This is the direction the XGBoost + Dask community is moving (dask/community#104, dask/dask-xgboost#39).This PR proposes replacing
dask-xgboost
withxgboost
. I tested these changes on the 2020.11.30 release of Saturn (in an internal environment), using the imagesaturncloud/saturn:2020.11.30
.While I'm touching the XGBoost code, this PR also replaces cutting over all of the XGBoost examples to
"tree_method": "hist"
. We had previously used the slower"tree_method": "approx"
becausedask-xgboost
required pinning to an old XGBoost version that didn't supporthist
. Buthist
is available in 1.3.0 and can be much faster.Notes for Reviewers
In this PR, I'm proposing adding a
start
script forexamples-cpu
to deal with old XGBoost versions that are currently installed in the 2020.11.30 images. I'll 've added another PR to update the images (saturncloud/images#127), and then whenever those are rebuilt we can change the image used here and remove that start script. Please let me know what you think about this.