Distributional Reinforcement Learning with Quantile Regression #3
Comments
I think it should probably be something like:
After adding u = dist - expected_quant, I get:

TypeError Traceback (most recent call last)
in compute_td_loss(batch_size)
/home/--/anaconda2/envs/tensorflow4/lib/python2.7/site-packages/torch/tensor.pyc in __sub__(self, other)
TypeError: sub received an invalid combination of arguments - got (Variable), but expected one of:
Should be something like:
When I last looked at this, it ran after converting to a Variable:
Friend, this is a question.
It confused me.
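The snippet that accompanied this comment is not preserved here. As a rough sketch only, the conversion might look like the following. The traceback above suggests a plain tensor was on the left of the subtraction and a Variable on the right, which PyTorch versions before 0.4 rejected. The tensor shapes and values are hypothetical stand-ins, and which operand needed wrapping in the original notebook is an assumption.

```python
import torch
from torch.autograd import Variable

# Hypothetical stand-ins for the thread's tensors (shapes and values are made up).
dist = Variable(torch.zeros(2, 3), requires_grad=True)  # network output, tracked by autograd
expected_quant = torch.ones(2, 3)                        # plain tensor holding the targets

# Assumption: wrapping the plain tensor makes both operands Variables,
# which is what pre-0.4 PyTorch required for arithmetic between them.
expected_quant = Variable(expected_quant)

u = expected_quant - dist
```

From PyTorch 0.4 onward, Tensor and Variable are merged, so the wrapping is a no-op and the subtraction works without it.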
Hi, what does "u" mean in the following code snippet? It seems that "u" is not defined in the code. Thanks!
huber_loss = 0.5 * u.abs().clamp(min=0.0, max=k).pow(2)
huber_loss += k * (u.abs() - u.abs().clamp(min=0.0, max=k))
quantile_loss = (tau - (u < 0).float()).abs() * huber_loss
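For context, here is a minimal sketch of how `u` could be defined so that the three lines above run. It assumes `u` is the TD error between the target quantiles (`expected_quant`) and the predicted quantiles (`dist`), both of shape `(batch, num_quant)`. The QR-DQN paper defines this error as target minus prediction; the comments in this thread also mention `u = dist - expected_quant`, which flips the sign seen by the `(u < 0)` term. The function name `quantile_huber_loss`, the quantile midpoints used for `tau`, and the final sum-then-mean reduction are assumptions for illustration, not the repository's code.

```python
import torch

def quantile_huber_loss(dist, expected_quant, k=1.0):
    """Elementwise quantile Huber loss for (batch, num_quant) inputs (hypothetical helper)."""
    num_quant = dist.size(1)
    # Assumed quantile midpoints tau_i = (2i + 1) / (2N), shaped (1, num_quant) to broadcast.
    tau = ((2 * torch.arange(num_quant, dtype=dist.dtype) + 1) / (2.0 * num_quant)).view(1, -1)

    # Assumption: u is target minus prediction; the target is detached so no gradient flows into it.
    u = expected_quant.detach() - dist

    # Same Huber computation as in the question, split into named steps.
    abs_u = u.abs()
    clipped = abs_u.clamp(min=0.0, max=k)
    huber_loss = 0.5 * clipped.pow(2) + k * (abs_u - clipped)

    # Asymmetric quantile weighting |tau - 1{u < 0}|.
    quantile_loss = (tau - (u < 0).float()).abs() * huber_loss

    # Assumed reduction: sum over quantiles, mean over the batch.
    return quantile_loss.sum(dim=1).mean()

# Usage with made-up numbers: one sample, two quantiles.
dist = torch.zeros(1, 2, requires_grad=True)
expected_quant = torch.tensor([[0.5, -2.0]])
loss = quantile_huber_loss(dist, expected_quant)
loss.backward()
```

Note that this sketch pairs each predicted quantile with the target at the same index, matching the shapes in the question; the paper's loss compares every predicted quantile against every target sample.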