-
I have the question in https://nvlabs.github.io/sionna/examples/Autoencoder.html#Trainable-End-to-end-System:-RL-based-Training For the RL-based Training, why "-" is added in the tx_loss? It may cause the loss become negative value. Thank you ao much. |
Beta Was this translation helpful? Give feedback.
Replies: 1 comment
-
Hi, The code you are referring to implements the algorithm from this paper and more specifically equation (8) in the case where |
Beta Was this translation helpful? Give feedback.
Hi,
The code you are referring to implements the algorithm from this paper and more specifically equation (8) in the case where
\pi
corresponds to the density of a normal distribution. Taking the log of this density results in the implemented equation, where the "-" corresponds to the one that is in the exp(.) of the density.