Skip to content
This repository has been archived by the owner on Nov 17, 2023. It is now read-only.

Feature request: GELU operator #12984

Closed
eric-haibin-lin opened this issue Oct 26, 2018 · 4 comments
Closed

Feature request: GELU operator #12984

eric-haibin-lin opened this issue Oct 26, 2018 · 4 comments

Comments

@hendrycks
Copy link

This nonlinearity is also used in BERT, the state-of-the-art NLP model for various tasks: https://arxiv.org/abs/1810.04805
(Disclosure: I'm on the GELU paper.)

@haojin2
Copy link
Contributor

haojin2 commented Mar 16, 2019

Working on it now...

@haojin2 haojin2 mentioned this issue Mar 16, 2019
8 tasks
@haojin2
Copy link
Contributor

haojin2 commented Mar 16, 2019

PR #14449 should address this issue @hendrycks Please give a review if you have time, thanks!

@sandeep-krishnamurthy
Copy link
Contributor

Closing as the PR is now merged. Thanks haojin

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Projects
None yet
Development

No branches or pull requests

4 participants