As we state in our paper, DGBM is not yet competitive in terms of runtime, consistently exceeding the runtimes of other approaches by several orders of magnitude. From the experiments we have conducted so far, it appears that the automatic derivation of the Hessian poses a significant computational bottleneck. The reason is that when computing the Hessian (i.e., the gradient of a gradient) of a loss function with vector input (i.e., multi-parameter optimization), the second gradient appears to return the row-wise sum of the Hessian instead of the element-wise Hessian. For a more detailed discussion, we refer to tensorflow/tensorflow#29064.
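The behavior described above can be reproduced with a minimal sketch (the loss function here is a toy example, not the DGBM loss): for a loss whose Hessian has off-diagonal entries, a nested gradient-of-gradient returns the row sums of the Hessian, while `tape.jacobian` on the first gradient recovers the full element-wise Hessian.

```python
import tensorflow as tf

x = tf.Variable([1.0, 2.0, 3.0])

with tf.GradientTape(persistent=True) as outer:
    with tf.GradientTape() as inner:
        # Toy loss with a non-diagonal Hessian: (sum_i x_i)^2,
        # whose Hessian is the constant matrix 2 * ones((3, 3)).
        loss = tf.reduce_sum(x) ** 2
    grad = inner.gradient(loss, x)  # 2 * sum(x) in every component

# Gradient of the gradient: returns the ROW-WISE SUM of the Hessian,
# i.e. [6., 6., 6.] here, not the matrix itself.
row_sums = outer.gradient(grad, x)

# Jacobian of the gradient: the full element-wise Hessian, 2 * ones((3, 3)).
hessian = outer.jacobian(grad, x)
```

The `jacobian` call yields the correct matrix, but it is substantially more expensive than a plain gradient-of-gradient, which is where the runtime bottleneck arises.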
A question for the community
As we show in this notebook, we can circumvent the problem of incorrect Hessians. However, these implementations are not as efficient as the original nested `GradientTape` version.
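One possible workaround of this kind (a sketch under our own assumptions, not necessarily the notebook's implementation) is to differentiate each gradient component separately and assemble the element-wise second derivatives; the per-component loop is exactly what makes such variants slower than a single nested gradient call:

```python
import tensorflow as tf

x = tf.Variable([1.0, 2.0, 3.0])

with tf.GradientTape(persistent=True) as outer:
    with tf.GradientTape() as inner:
        # Toy loss: sum_i x_i**3, with diagonal Hessian diag(6 * x_i).
        loss = tf.reduce_sum(x ** 3)
    grad = inner.gradient(loss, x)        # [3 * x_i**2] = [3., 12., 27.]
    grad_components = tf.unstack(grad)    # unstack inside the outer tape

# Differentiate each scalar gradient component and pick out its own entry,
# giving the diagonal of the element-wise Hessian: [6., 12., 18.].
diag = tf.stack([outer.gradient(g, x)[i]
                 for i, g in enumerate(grad_components)])
```

The persistent outer tape is required because `outer.gradient` is called once per parameter, which is precisely the overhead that does not occur in the single nested gradient-of-gradient call.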
I am reaching out to the TensorFlow community asking for help and guidance on how to further improve the computational efficiency of gradients and hessians.