Fix thread-safety in C API's PredictSingleRow #3771

AlbertoEAF · 2021-01-16T20:11:32Z

The single row predictor methods in the C APi are not thread safe as reported in #3751 and #3675. This fixes both reports. Thank you https://github.com/tarunreddy1018 for the detailed reports!

By using a unique lock instead of the shared lock the timings are very similar, but predictions are correct.

Remarks for future work:

Even so, by designing a small C++ benchmark with a very simple LGBM model, more threads on a simple model are slower than the single-thread case. This is probably due to very small work units, the lock contention overhead increases.
We should in the future benchmark with more complex models to see if supporting
threading on these calls is worth it in performance gains.
If not, then we could choose to not to provide thread-safety and remove the locks altogether for maximal throughput (though people that didn't read the documentation carefully might be surprised that such methods were not thread-safe).
- We should in that case use non-shared resources so there is no synchronization required and multiple calls can be issued in parallel with no lock contention.

See #3751 for timings of the small benchmark and the benchmark gist https://gist.github.com/AlbertoEAF/5972db15a27c294bab65b97e1bc4c315

By using a unique lock instead of the shared lock the timings are very similar, but predictions are correct. Even so, by designing a small C++ benchmark with a very simple LGBM model,more threads on a simple model are slower than the single-thread case. This is probably due to very small work units, the lock contention overhead increases. We should in the future benchmark with more complex models to see if supporting threading on these calls is worth it in performance gains. If not, then we could choose to not to provide thread-safety and remove the locks altogether for maximal throughput. See microsoft#3751 for timings. See gist for benchmark code: https://gist.github.com/AlbertoEAF/5972db15a27c294bab65b97e1bc4c315

AlbertoEAF · 2021-01-19T11:32:59Z

Hello, can anyone merge this fix?

guolinke · 2021-01-26T03:57:58Z

Hi @AlbertoEAF
Is that possible to expose single row related APIs in python/R package?

AlbertoEAF · 2021-01-26T09:43:53Z

Hello @guolinke :)
It's totally possible, their API are very similar to the non-single row methods so it shouldn't be difficult at all.

In python's case we could replace the input pandas dataframe by something lighter like a numpy array.

github-actions · 2023-08-24T01:28:55Z

This pull request has been automatically locked since there has not been any recent activity since it was closed. To start a new related discussion, open a new issue at https://github.com/microsoft/LightGBM/issues including a reference to this.

AlbertoEAF requested review from btrotta, chivee and guolinke as code owners January 16, 2021 20:11

AlbertoEAF changed the title ~~Fix thread-safety in PredictSingleRow~~ Fix thread-safety in C API's PredictSingleRow Jan 16, 2021

StrikerRUS added the fix label Jan 16, 2021

AlbertoEAF mentioned this pull request Jan 16, 2021

Issue with call to c_api Predict functions in multi threaded way from golang #3751

Closed

guolinke approved these changes Jan 21, 2021

View reviewed changes

StrikerRUS merged commit 4ae4abb into microsoft:master Jan 21, 2021

StrikerRUS mentioned this pull request Feb 6, 2021

Getting random outputs when calling LightGBM Predict for single row function in a multi threaded environment (LGBM_BoosterPredictForMatSingleRowFast) method #3675

Closed

Ten0 mentioned this pull request Aug 4, 2023

Fix single row prediction performance in a multi-threaded environment #6021

Closed

github-actions bot locked as resolved and limited conversation to collaborators Aug 24, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix thread-safety in C API's PredictSingleRow #3771

Fix thread-safety in C API's PredictSingleRow #3771

AlbertoEAF commented Jan 16, 2021 •

edited

Loading

AlbertoEAF commented Jan 19, 2021

guolinke commented Jan 26, 2021

AlbertoEAF commented Jan 26, 2021

github-actions bot commented Aug 24, 2023

Fix thread-safety in C API's PredictSingleRow #3771

Fix thread-safety in C API's PredictSingleRow #3771

Conversation

AlbertoEAF commented Jan 16, 2021 • edited Loading

AlbertoEAF commented Jan 19, 2021

guolinke commented Jan 26, 2021

AlbertoEAF commented Jan 26, 2021

github-actions bot commented Aug 24, 2023

AlbertoEAF commented Jan 16, 2021 •

edited

Loading