
XGBoost 0.90 Roadmap #4389

Closed
18 tasks done
hcho3 opened this issue Apr 21, 2019 · 56 comments

@hcho3
Collaborator

hcho3 commented Apr 21, 2019

This thread is to keep track of all the good things that will be included in the 0.90 release. It will be updated as the planned release date approaches (May 1, 2019, or as soon as Spark 2.4.3 is out).

@CodingCat
Member

as we are going to have breaking changes like #4349 and #4377

shall we bump version to 0.9?

@hcho3
Collaborator Author

hcho3 commented Apr 22, 2019

@CodingCat Sure, we can bump to 0.90, if the breaking change is significant. Can you do me a favor and write one-paragraph description of why #4349 was needed?

@CodingCat
Member

sure,

@alexvorobiev

* Spark 2.3 is reaching its end-of-life in a few months

Is there an official statement on that? They released 2.2.3 in January and 2.3.3 in February. Our vendor (MapR) still ships 2.3.1.

@CodingCat
Member

@alexvorobiev #4350, you can check with @srowen from Databricks

@srowen
Contributor

srowen commented Apr 22, 2019

This is not a question for Databricks but for the Spark project. The default policy is maintenance releases for branches for 18 months: https://spark.apache.org/versioning-policy.html That would put 2.3.x at EOL around July, so I wouldn't expect more 2.3.x releases after that from the OSS project.

@alexvorobiev

@srowen Thanks!

@hcho3 hcho3 changed the title XGBoost 0.83 Roadmap XGBoost 0.90 Roadmap Apr 22, 2019
@hcho3
Collaborator Author

hcho3 commented Apr 24, 2019

@srowen @CodingCat @alexvorobiev Let's also discuss the possibility of supporting Scala 2.12 / 2.13. Right now, XGBoost4J is compiled for Scala 2.11:

<scala.version>2.11.12</scala.version>
<scala.binary.version>2.11</scala.binary.version>

A user reported that XGBoost4J JARs compiled for Scala 2.11 are not binary compatible with Scala 2.12.

@hcho3 hcho3 pinned this issue Apr 24, 2019
@srowen
Contributor

srowen commented Apr 24, 2019

Yeah, 2.11 / 2.12 are still binary-incompatible, and Spark has two distributions. Both are supported in 2.4.x though 2.12 is the default from here on in 2.4.x. 3.0 will drop Scala 2.11 support.

It may just be a matter of compiling two versions rather than much or any code change. If you run into any funny errors in 2.12 let me know because I stared at lots of these issues when updating Spark.

2.13 is still not GA, and I think it will be a smaller change from 2.12->2.13 than 2.11->2.12 (the big difference here is a totally different representation of lambdas).


@CodingCat
Member

CodingCat commented Apr 25, 2019

the only issue is that we need to introduce a breaking change to the artifact name of xgboost in Maven: xgboost4j-spark => xgboost4j-spark_2.11 / xgboost4j-spark_2.12, like Spark does (https://mvnrepository.com/artifact/org.apache.spark/spark-core). We also need to double-check whether we have any transitive dependency on 2.11 (I think not).

Hi @srowen, though 2.12 is the default from here on in 2.4.x, I checked the branch-2.4 pom.xml; if you don't specify the scala-2.12 profile, you still get a 2.11 build, no?
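
For illustration, the Spark-style suffixing would look something like this on the consumer side (a hypothetical sketch of the coordinates, not the actual build change; `ml.dmlc` is XGBoost's Maven group ID):

```xml
<!-- Hypothetical dependency on a Scala-suffixed artifact, following
     Spark's spark-core_2.11 / spark-core_2.12 naming convention. -->
<dependency>
  <groupId>ml.dmlc</groupId>
  <artifactId>xgboost4j-spark_2.12</artifactId>
  <version>0.90</version>
</dependency>
```

On the producer side, the suffix could be driven by the existing `scala.binary.version` property, e.g. `<artifactId>xgboost4j-spark_${scala.binary.version}</artifactId>`, so a single pom can build both variants.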

@srowen
Contributor

srowen commented Apr 25, 2019

You could choose to only support 2.12 in 0.9x, and then you don't have to suffix the artifact name. If you support both, yeah, you'd really want to change the artifact name unfortunately and have _2.11 and _2.12 versions.

Yes the default Spark 2.4.x build will be for 2.11; -Pscala-2.12 gets the 2.12 build.

@CodingCat
Member

thanks, I'd stay conservative and hold off on supporting 2.12, at least for the coming version

as far as I know, most Spark users are still on 2.11, since they tend to follow previous versions of Spark

I may not have the bandwidth to go through every test I have for introducing 2.12 support

I would choose to support 2.12 + 2.11, or 2.12 only, in the 1.0 release...

@CodingCat
Member

@hcho3 FYI, I just removed the dense matrix support from the roadmap given the limited bandwidth

@trivialfis
Member

@hcho3 Could you take a look at dmlc/dmlc-core#514 when time allows? It might be worth merging before the next release hits.

@hcho3
Collaborator Author

hcho3 commented Apr 26, 2019

@trivialfis Will look at it

@hcho3
Collaborator Author

hcho3 commented Apr 28, 2019

@CodingCat I think we should push back the release date, as Spark 2.4.1 and 2.4.2 have issues. What do you think?

@srowen Do you know when Spark 2.4.3 would be out?

@CodingCat
Member

I think it’s fine to have some slight delay

@hcho3
Collaborator Author

hcho3 commented Apr 28, 2019

Okay, let’s wait until Spark 2.4.3 is out

@tovbinm
Contributor

tovbinm commented Apr 29, 2019

Would there be the last 0.83 release for Spark 2.3.x?

@hcho3
Collaborator Author

hcho3 commented Apr 29, 2019

@CodingCat What if we make two parallel releases, 0.83 and 0.90, where 0.83 includes all commits just before #4377? The 0.83 version would be released only as JVM packages, and the Python and R packages would get 0.90. It won't be any more work for me, since I have to write a release note for 0.90 anyway.

One issue, though, is the user experience with missing value handling. Maybe forcing everyone to use Spark 2.4.x will prevent them from messing up missing value handling (the issue that motivated #4349)
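
As a rough illustration of the missing value pitfall (my sketch of the general failure mode, not the exact #4349 code path): Spark can hand XGBoost either a dense or a sparse vector for the same row, and sparse encodings drop zeros, so whether 0.0 is "present" depends on the representation.

```python
import math

# Hypothetical sketch: a sparse vector keeps only explicit
# (index, value) pairs, silently dropping the zeros.
dense_row = [0.0, 1.0, 0.0, 2.0]
sparse_row = {i: v for i, v in enumerate(dense_row) if v != 0.0}
print(sparse_row)  # {1: 1.0, 3: 2.0}

# If absent entries are then treated as the "missing" value (say NaN),
# the booster sees NaN where the dense encoding would have shown 0.0:
reconstructed = [sparse_row.get(i, float("nan")) for i in range(4)]
assert math.isnan(reconstructed[0]) and reconstructed[1] == 1.0
```

The same training data can therefore produce different model inputs depending on which vector type Spark happened to emit, which is why pinning down the missing value semantics mattered.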

@CodingCat
Member

CodingCat commented Apr 29, 2019

@hcho3 I am a bit concerned about the inconsistency across versions in the availability of packages.

I can imagine questions like: hey, I found 0.83 in Maven and upgraded our Spark package, but why can't I use 0.83 in a notebook when exploring my new model setup on a small amount of data with the Python package?

I would suggest we either do a full maintenance release on the 0.8x branch or nothing

@hcho3
Collaborator Author

hcho3 commented Apr 29, 2019

@CodingCat Got it. We'll do consistent releases for all packages. What's your take on 0.83 release then? Should we do it?

@hcho3
Collaborator Author

hcho3 commented Apr 29, 2019

@CodingCat Actually, this will create work for other maintainers, we'll need to ask them first

@CodingCat
Member

CodingCat commented Apr 29, 2019

short answer from a personal view is yes in theory, but it might be more than cutting right before a commit (as you said, it will create work for others as well). I am also kind of hesitant to do this because of the limited resources in the community...

here is my 2 cents about how we should think about maintenance release like 0.8x

  1. the reason to have a maintenance release is to bring in critical bug fixes, like 2d875ec and 995698b

  2. on the other side, to keep the community sustainable rather than burning out all the committers, we should drop support for previous versions periodically

  3. the innovations and improvements should be brought to the users through a feature release (jump from 0.8 to 0.9)

if we decide to go with 0.83, we need to collect opinions from @RAMitchell and @trivialfis as well, and use their judgment to see whether there are important bug fixes (mostly about correctness) that they have noticed

and then create a 0.83 branch based on 0.82 and cherry-pick commits... a lot of work, actually

@RAMitchell
Member

If I understand correctly, 0.9 will not support older versions of Spark; hence the proposal to support a 0.83 version as well as 0.9, to continue supporting older Spark versions while including bug fixes?

Generally I am against anything that uses developer time. Aren't we busy enough already? I do see some value in having a stable version, however.

@hcho3
Collaborator Author

hcho3 commented May 1, 2019

@tovbinm You can build XGBoost with commit 711397d to use Spark 2.3.x.

@tovbinm
Contributor

tovbinm commented May 1, 2019

Great. So why not make a public release from that commit?

@hcho3
Collaborator Author

hcho3 commented May 1, 2019

As @CodingCat said, maintenance releases are not simply a matter of cutting before a commit. Also, making a public release is an implicit promise of support. I do not think the maintainers are up for supporting two new releases at this point in time.

I'll defer to @CodingCat as to whether we should make a release from 711397d

@hlbkin

hlbkin commented May 1, 2019

External memory with GPU predictor: would this mean the code no longer crashes with what(): std::bad_alloc: out of memory? (i.e., temporarily swap into RAM?)

Related issue, I guess: #4184. That was mainly about temporary bursts of memory; the fitting process itself never requires that much memory.

@hcho3
Collaborator Author

hcho3 commented May 1, 2019

@hlbkin You'll need to explicitly enable external memory, according to https://xgboost.readthedocs.io/en/latest/tutorials/external_memory.html
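
For reference, the opt-in described in that tutorial is a cache suffix on the data path rather than a flag. A minimal sketch (the file name and cache prefix are placeholders, and the DMatrix call is commented out since it needs an on-disk libsvm file):

```python
# Sketch of the external-memory URI convention: appending "#<prefix>"
# to a libsvm path tells XGBoost to stream the data from disk, writing
# cache pages under that prefix.
train_path = "train.libsvm"    # placeholder libsvm file on disk
cache_prefix = "dtrain.cache"  # placeholder cache-page prefix
external_memory_uri = f"{train_path}#{cache_prefix}"
print(external_memory_uri)  # train.libsvm#dtrain.cache

# import xgboost as xgb
# dtrain = xgb.DMatrix(external_memory_uri)  # streams batches from disk
```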

@CAM-Gerlach

I assume it's not possible to switch otherwise without a major version bump (i.e. 1.0), but when you do, could you consider supporting conformant PEP 440 version numbers (i.e. x.y.z), and preferably semantic versioning? The standard interpretation of 0.90 (rather than 0.9.0) is that it is the 90th minor release of the major version 0.x (i.e. pre-stable-release) series, and is no more significant than 0.83. Furthermore, this restricts you to a maximum of 9 point releases per minor version, and creates difficulties for some tools (and people) to interpret. Thanks!
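
To make the parsing point concrete, here is a small sketch of the numeric per-segment comparison that PEP 440 applies to release segments:

```python
# Each dot-separated segment is compared as an integer, so "0.90" is
# read as the 90th minor release of the 0.x series, not as "0.9.0".
def release_tuple(version: str) -> tuple:
    return tuple(int(part) for part in version.split("."))

print(release_tuple("0.90"))   # (0, 90)
print(release_tuple("0.9.0"))  # (0, 9, 0)

assert release_tuple("0.90") > release_tuple("0.83")
# The two spellings are not equivalent under numeric comparison:
assert release_tuple("0.90") > release_tuple("0.9.0")
```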

@CodingCat
Member

+1

@hcho3
Collaborator Author

hcho3 commented May 3, 2019

@CAM-Gerlach We'll consider it when we release 1.0. On the other hand, we don't want to rush to 1.0. We want 1.0 to be a milestone of some sort, in terms of features, stability, and performance.

@CAM-Gerlach

Thanks for the explanation, @hcho3 .

You probably want to make sure you set the python_requires argument to '>=3.5' in setup() to ensure users with Python 2 don't get upgraded to an incompatible version accidentally.
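
A minimal sketch of what that could look like (a hypothetical excerpt, not the actual setup.py; the setup() call is commented out so the fragment stands alone):

```python
# from setuptools import setup  # a real setup.py would import and call this

# python_requires is metadata that pip checks before installing, so
# users on Python 2 resolve to an older compatible release instead of
# getting a broken install.
setup_kwargs = dict(
    name="xgboost",
    version="0.90",
    python_requires=">=3.5",
)
print(setup_kwargs["python_requires"])  # >=3.5

# setup(**setup_kwargs)
```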

@hlbkin

hlbkin commented May 4, 2019

@hcho3 External memory is not available with GPU algorithms

@hcho3
Collaborator Author

hcho3 commented May 4, 2019

@hlbkin You are right. External memory will be available only for GPU predictor, not training.

@rongou @sriramch Am I correct that GPU training isn't available with external memory?

@sriramch
Contributor

sriramch commented May 6, 2019

@hcho3 Yes, you are correct. We are working on it. The changes are here if you are interested. I'll have to sync this change with master and write some tests.

@hcho3
Collaborator Author

hcho3 commented May 6, 2019

@sriramch Awesome! Should we aim to include external memory training in the 0.90 release, or should we come back to it after 0.90?

@CodingCat
Member

just my two cents: let's hold off on packing many new features into 0.x in a rush, and consider what should go into 1.0 as a milestone version

@hcho3
Collaborator Author

hcho3 commented May 6, 2019

@CodingCat I agree. FYI, I deleted distributed customized objective from 0.90 roadmap, since there was substantial disagreement in #4280. We'll consider it again after 0.90.

@sriramch Let's consider external memory training after 0.90 release. Thanks a lot for your hard work.

@RAMitchell
Member

This might be a good time to release the CUDA 9.0 binaries instead of 8.0. I think 9.0 will now be sufficiently supported by users' driver versions. Additionally, the 9.0 binaries will not need to be JIT compiled for the newer Volta architectures.

@CodingCat
Member

@hcho3 are we ready to go?

@hcho3
Collaborator Author

hcho3 commented May 10, 2019

Almost. I think #4438 should be merged.

@hcho3
Collaborator Author

hcho3 commented May 10, 2019

All good now. I will go ahead and start working on the next release. ETA: May 16, 2019

@hcho3
Collaborator Author

hcho3 commented May 11, 2019

@RAMitchell Should we use CUDA 9.0 or 9.2 for wheel releases?

@RAMitchell
Member

Let's use 9.2, as that is already set up on CI. The danger is that we require NVIDIA drivers that are too new. For reference, here is the table showing the correspondence between CUDA version and drivers: https://docs.nvidia.com/deploy/cuda-compatibility/index.html#binary-compatibility__table-toolkit-driver

As far as I know, this should not impact CPU algorithms in any way. If users begin to report issues, then we can address this in the future with better error messages around driver compatibility.

@hcho3
Collaborator Author

hcho3 commented May 11, 2019

Hmm, in that case I can try downgrading one of the CI workers to CUDA 9.0. Since we are using Docker containers extensively, it should not be too difficult.

@hcho3
Collaborator Author

hcho3 commented May 14, 2019

I'm going to prepare 0.90 release now. My goal is to have the release note complete by end of this week.

@hcho3
Collaborator Author

hcho3 commented May 20, 2019

Closed by #4475

@hcho3 hcho3 closed this as completed May 20, 2019
@hcho3 hcho3 unpinned this issue May 22, 2019
@lock lock bot locked as resolved and limited conversation to collaborators Aug 18, 2019