1 parent 4fedbb4 commit 312221a
tensor2tensor/layers/common_attention.py
@@ -423,7 +423,7 @@ def add_timing_signal_1d(x,
  memory inputs to attention.

  The use of relative position is possible because sin(x+y) and cos(x+y) can be
- experessed in terms of y, sin(x) and cos(x).
+ expressed in terms of y, sin(x) and cos(x).

  In particular, we use a geometric sequence of timescales starting with
  min_timescale and ending with max_timescale. The number of different
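
The docstring's two claims can be illustrated with a minimal pure-Python sketch. This is not the code from common_attention.py: the helper names `geometric_timescales`, `timing_signal`, and `shift_signal` and the default values are assumptions made for illustration. It builds a geometric sequence of timescales from min_timescale to max_timescale, then uses the angle-addition identities to obtain the signal at position x+y from y, sin(x), and cos(x) alone.

```python
import math


def geometric_timescales(num_timescales, min_timescale=1.0, max_timescale=1.0e4):
    """Return timescales spaced geometrically from min_timescale to max_timescale.

    Hypothetical helper; the defaults are illustrative assumptions.
    """
    if num_timescales == 1:
        return [min_timescale]
    # Constant ratio between neighbouring timescales.
    ratio = (max_timescale / min_timescale) ** (1.0 / (num_timescales - 1))
    return [min_timescale * ratio ** i for i in range(num_timescales)]


def timing_signal(position, timescales):
    """Return all sines followed by all cosines of position / timescale."""
    sines = [math.sin(position / t) for t in timescales]
    cosines = [math.cos(position / t) for t in timescales]
    return sines + cosines


def shift_signal(signal, offset, timescales):
    """Compute the signal at (position + offset) from the signal at position.

    Uses sin(x+y) = sin(x)cos(y) + cos(x)sin(y) and
         cos(x+y) = cos(x)cos(y) - sin(x)sin(y),
    so only the offset and the existing sin/cos values are needed.
    """
    n = len(timescales)
    sines, cosines = signal[:n], signal[n:]
    shifted_sines = [
        s * math.cos(offset / t) + c * math.sin(offset / t)
        for s, c, t in zip(sines, cosines, timescales)
    ]
    shifted_cosines = [
        c * math.cos(offset / t) - s * math.sin(offset / t)
        for s, c, t in zip(sines, cosines, timescales)
    ]
    return shifted_sines + shifted_cosines


timescales = geometric_timescales(4)
direct = timing_signal(8, timescales)
shifted = shift_signal(timing_signal(3, timescales), 5, timescales)
```

Here `shifted` agrees with `direct` up to floating-point error, which is the property the docstring relies on when it says relative position is usable.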