We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
您好,我在看代码的时候发现组成TransformerBlock的TransformerLayer的结构与您在论文中的图4给出的结构不一致。从图中的代码看残差结构应该是在最后一个dropout层之后,但是论文中给出的图4结构是这样的。作为一个刚接触深度学习的小白,我不太懂,也可能是我理解错了,希望您得空可以帮我解释一下,谢谢!
The text was updated successfully, but these errors were encountered:
No branches or pull requests
您好,我在看代码的时候发现组成TransformerBlock的TransformerLayer的结构与您在论文中的图4给出的结构不一致。从图中的代码看残差结构应该是在最后一个dropout层之后,但是论文中给出的图4结构是这样的。作为一个刚接触深度学习的小白,我不太懂,也可能是我理解错了,希望您得空可以帮我解释一下,谢谢!
The text was updated successfully, but these errors were encountered: