-
Notifications
You must be signed in to change notification settings - Fork 335
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[zh] cs-230-recurrent-neural-networks #181
base: master
Are you sure you want to change the base?
Conversation
Commit first time
Change the order between translated text and <br>
Thank you for all your work @HsinJhao! Please feel free to invite anyone you know to review the translation. |
**17. [Drawbacks, Computation being slow, Difficulty of accessing information from a long time ago, Cannot consider any future input for the current state]** | ||
|
||
⟶ | ||
[缺点, 计算缓慢, 难以访问长时间的历史信息, 难以考虑未来时间步的输入对当前状态的影响] |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
无法考虑未来时间步的输入对当前状态的影响
|
||
⟶ | ||
|
||
<br>误差分析 - 当所预测得到的翻译ˆy很差时,有人会想,为什么我们没有通过执行以下错误分析得到一个好的翻译y: |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
误差分析―当获得较差的预测翻译y时,可以通过执行以下错误分析来思考为什么我们没有得到好的翻译y *:
**14. where Wax,Waa,Wya,ba,by are coefficients that are shared temporally and g1,g2 activation functions.** | ||
|
||
⟶ | ||
其中Wax,Waa,Wya,ba是相关的系数矩阵, 在时间尺度上被整个网络共享;g1,g2是相关的激活函数。 |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
其中Wax,Waa,Wya,ba,by是在时间尺度上被整个网络共享系数矩阵,;g1,g2是激活函数。
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Applied!
**16. [Advantages, Possibility of processing input of any length, Model size not increasing with size of input, Computation takes into account historical information, Weights are shared across time]** | ||
|
||
⟶ | ||
[优点, 可处理任何长度的输入, 模型大小不会随输入大小增加, 计算考虑历史信息, 权重在时间尺度上被整个网络共享] |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
可以处理任何长度的输入,模型大小不会随输入大小的增加而增加,计算时会考虑历史信息,权重在整个时间尺度上共享
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Applied, thanks!
**28. Gradient clipping ― It is a technique used to cope with the exploding gradient problem sometimes encountered when performing backpropagation. By capping the maximum value for the gradient, this phenomenon is controlled in practice.** | ||
|
||
⟶ | ||
梯度裁剪 - 该方法是用于解决进行反向传播时时而出现梯度爆炸问题的技术。通过限制梯度的最大值, 这种现象在实际中得到了相应的控制。 |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
梯度裁剪 - 一种用于解决进行反向传播时时而出现梯度爆炸问题的方法。
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
To be consistent with the chinese version of book 《deep learning》(lan Goodfellow, et al),Gradient clipping will be translated to 梯度截断 rather than 梯度裁剪.
为与Ian Goodfellow等人编写的《deep learning》中文译本保持一致,将Gradient clipping翻译为梯度截断而不是梯度裁剪。
|
||
⟶ | ||
|
||
<br>嵌入矩阵 - 对于给定的词汇w, 将该词汇的one-hot表示ow映射至词嵌入表示ew的嵌入矩阵E满足下式: |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
??
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Without changing the original intention, it will be translated to:
'' 嵌入矩阵 - 对于给定的词汇w, 通过嵌入矩阵E可将该词汇的one-hot表示向量ow映射为词嵌入表示向量ew, E满足下式:''
|
||
⟶ | ||
|
||
<br>bleu分数 ― 双语评估替补(bilingual evaluation understudy, bleu)分数通过基于n-gram精度计算相似度分数来量化机器翻译的好坏。其定义如下: |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
双语评估替补???
通过基于n-gram精度计算相似性分数来量化机器翻译的质量。
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Through searching the related chinese information,'bilingual evaluation understudy score' will be translated to 双语评估替换分数.
通过查阅相关中文资料,将**'bilingual evaluation understudy score'翻译为双语评估替换分数**为宜.
Applied some suggestions from SpeakingTom. Reviewed some words: CBOW --> 连续词袋 LSTM --> 长短时记忆 Gradient clipping --> (梯度裁剪->梯度截断)...
Thank you @HsinJhao and @SpeakingTom for all your work! @HsinJhao: would it be possible to only keep the |
cs-230 recurrent neural networks translation is finished.