Skip to content

Commit

Permalink
Fix issues with ASR notebooks (#2698)
Browse files Browse the repository at this point in the history
Signed-off-by: smajumdar <[email protected]>
  • Loading branch information
titu1994 authored and ericharper committed Aug 30, 2021
1 parent 407d955 commit 31e8349
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion tutorials/asr/Intro_to_Transducers.ipynb
Original file line number Diff line number Diff line change
Expand Up @@ -553,7 +553,7 @@
"\n",
"The Joint model config has several essential components which we discuss below :\n",
"\n",
"1) `log_softmax`: Due to the cost of computing softmax on such large tensors, the Numba CUDA implementation of RNNT loss will implicitly compute the log softmax when called (so its inputs should be logits). The CPU version of the loss doesn't face such memory issues so it requires log-probabilities instead. Since the behaviour is different for CPU-GPU, the `null` value will automatically switch behaviour dependent on whether the input tensor is on a CPU or GPU device.\n",
"1) `log_softmax`: Due to the cost of computing softmax on such large tensors, the Numba CUDA implementation of RNNT loss will implicitly compute the log softmax when called (so its inputs should be logits). The CPU version of the loss doesn't face such memory issues so it requires log-probabilities instead. Since the behaviour is different for CPU-GPU, the `None` value will automatically switch behaviour dependent on whether the input tensor is on a CPU or GPU device.\n",
"\n",
"2) `preserve_memory`: This flag will call `torch.cuda.empty_cache()` at certain critical sections when computing the Joint tensor. While this operation might allow us to preserve some memory, the empty_cache() operation is tremendously slow and will slow down training by an order of magnitude or more. It is available to use but not recommended.\n",
"\n",
Expand Down

0 comments on commit 31e8349

Please sign in to comment.