Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Decision forest random splitter #2270

Conversation

inteldimitrius
Copy link
Contributor

Description

Added random splitting strategy.

Changes proposed in this pull request:

  • Added random splitting strategy applied in computation in GPU.
  • Updated splitter class names..
  • Update descriptor and train_context classes.

@inteldimitrius
Copy link
Contributor Author

/intelci: run

@icfaust
Copy link
Contributor

icfaust commented Feb 2, 2023

Any chance you could run the CI Random Forest tests/examples with bootstrap=False and show the results?

@inteldimitrius
Copy link
Contributor Author

Any chance you could run the CI Random Forest tests/examples with bootstrap=False and show the results?

Hi, I added testcases for bootstrap_mode / splitter_mode switching.

Copy link
Contributor

@Vika-F Vika-F left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The main concern is that extra trees still process histogram bins in parallel to compute the split. This needs to be changed. And there are also several minor comments.

inteldimitrius added a commit to inteldimitrius/oneDAL that referenced this pull request Feb 8, 2023
Copy link
Contributor

@Vika-F Vika-F left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thank you for addressing part of the issues.
But the main change request remains the same as previously: need to remove threading by histogram bins (cgh.parallel_for) for extra trees.

inteldimitrius added a commit to inteldimitrius/oneDAL that referenced this pull request Feb 9, 2023
@inteldimitrius
Copy link
Contributor Author

Thank you for addressing part of the issues. But the main change request remains the same as previously: need to remove threading by histogram bins (cgh.parallel_for) for extra trees.

Yes, it is in progress.

inteldimitrius added a commit to inteldimitrius/oneDAL that referenced this pull request Feb 9, 2023
inteldimitrius added a commit to inteldimitrius/oneDAL that referenced this pull request Feb 15, 2023
Copy link
Contributor

@Vika-F Vika-F left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The overall logic of the random splitter is correct now, that is great.
But I have several comments regarding the implementation details that need to be addressed or proved that there is no need in fixing them.

inteldimitrius added a commit to inteldimitrius/oneDAL that referenced this pull request Mar 3, 2023
inteldimitrius added a commit to inteldimitrius/oneDAL that referenced this pull request Mar 3, 2023
inteldimitrius added a commit to inteldimitrius/oneDAL that referenced this pull request Mar 3, 2023
inteldimitrius added a commit to inteldimitrius/oneDAL that referenced this pull request Mar 3, 2023
@inteldimitrius inteldimitrius force-pushed the decision_forest_random_splitter branch from 7e2ebec to 1de2733 Compare March 3, 2023 18:05
@inteldimitrius
Copy link
Contributor Author

I have rebased onto master this branch and resolved all change requests. The implementation is ready for merge.

Copy link
Contributor

@Vika-F Vika-F left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

It seems the adding of using v1::splitter_mode was done a bit incorrectly.
Please see the comments.

Copy link
Contributor

@Vika-F Vika-F left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The changes look good to me.

@inteldimitrius inteldimitrius force-pushed the decision_forest_random_splitter branch from c7ad5ee to 3286808 Compare March 22, 2023 15:59
inteldimitrius added a commit to inteldimitrius/oneDAL that referenced this pull request Mar 22, 2023
inteldimitrius added a commit to inteldimitrius/oneDAL that referenced this pull request Mar 22, 2023
inteldimitrius added a commit to inteldimitrius/oneDAL that referenced this pull request Mar 22, 2023
inteldimitrius added a commit to inteldimitrius/oneDAL that referenced this pull request Mar 22, 2023
@Alexandr-Solovev Alexandr-Solovev merged commit 5e080cf into uxlfoundation:master Mar 23, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

9 participants