Hi, really cool implementation!

I noticed that when I run your examples, although both models converge, the accuracy of the forward-gradient method is always worse than that of regular backprop. The paper mentions that the accuracy of the forward gradient should be roughly comparable to that of backpropagation, so is this behavior expected?

I was able to improve things marginally by having the model perform several random perturbations per forward pass and averaging them for the parameter update (since this makes it likelier to find the direction of the true gradient), but I was never able to match backprop performance.
Hi @ilonadem, thank you for the kind words. Coming to your questions:
> I noticed that when I run your examples, although both models converge, the accuracy of the forward-gradient method is always worse than that of regular backprop. The paper mentions that the accuracy of the forward gradient should be roughly comparable to that of backpropagation, so is this behavior expected?
Yes, they should be pretty comparable, although it's something we have never measured. To this end, we could add some test functions that evaluate the trained models and report the results. If you already have something, you can also open a PR :)
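For reference, a minimal sketch of what such a test function could look like (the `model`, `test_loader`, and device handling here are hypothetical placeholders, not names from this repo):

```python
import torch

@torch.no_grad()
def test_accuracy(model, test_loader, device="cpu"):
    """Fraction of correctly classified samples on a held-out set (hypothetical helper)."""
    model.eval()
    correct, total = 0, 0
    for inputs, targets in test_loader:
        inputs, targets = inputs.to(device), targets.to(device)
        preds = model(inputs).argmax(dim=1)          # predicted class per sample
        correct += (preds == targets).sum().item()
        total += targets.numel()
    return correct / total

# e.g. run the same evaluation on a model trained with forward gradients
# and on one trained with backprop, then compare the two numbers.
```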
> I was able to improve things marginally by having the model perform several random perturbations per forward pass and averaging them for the parameter update (since this makes it likelier to find the direction of the true gradient), but I was never able to match backprop performance.
Yes, in our example we estimate the expected value with only one sample, and the more samples you use, the more precise the estimate becomes. This is also something that would be useful to have in our examples: we could expose the number of samples used in the estimation as a Hydra parameter. Again, feel free to open a PR in case :)
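As a rough illustration of what averaging over several tangent directions could look like, here is a minimal sketch assuming a scalar loss of a single flat parameter tensor and PyTorch ≥ 2.0 (for `torch.func.jvp`); it is not the repo's actual training loop:

```python
import torch
from torch.func import jvp

def forward_gradient(loss_fn, params, num_samples=1):
    """Estimate the gradient of loss_fn at params with forward-mode JVPs.

    Each sample draws a random direction v ~ N(0, I), computes the
    directional derivative d = grad(loss_fn) . v with a single JVP, and
    uses d * v as an unbiased single-sample gradient estimate; averaging
    over num_samples reduces the variance of the estimate.
    """
    estimate = torch.zeros_like(params)
    for _ in range(num_samples):
        v = torch.randn_like(params)            # random tangent direction
        _, d = jvp(loss_fn, (params,), (v,))    # d = grad . v (scalar)
        estimate += d * v
    return estimate / num_samples

# Toy check: for f(p) = sum(p**2) the true gradient is 2 * p, and the
# estimate approaches it as num_samples grows.
theta = torch.randn(10)
grad_est = forward_gradient(lambda p: (p ** 2).sum(), theta, num_samples=100)
```

Exposing `num_samples` through a Hydra config entry would then just be a matter of threading this argument into the training step.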