This repository has been archived by the owner on Nov 22, 2022. It is now read-only.
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Optimize Gelu operator for caffe2 export
Summary: TIL ONNX->Caffe2 is very memory inefficient, it creates an intermediate blob for each intermediate output. So, the Gelu operator creates a lot of intermediate ops since it does a bunch of math. Fix is to use the caffe2 Gelu operator, so all that computation is captured in a single op. https://pxl.cl/HzGf Differential Revision: D16849396 fbshipit-source-id: 4903c614833ae4ad8a84c6eddc2382b2a24872f3
- Loading branch information