fix and generate docs for FusedRMSNorm #1285

eqy · 2022-02-04T21:37:57Z

Followup to #1274, which left out (correct) documentation

crcrpar

The generated docs look nice

stas00 · 2022-02-10T22:40:14Z

apex/normalization/fused_layer_norm.py


    .. math::
-        y = \frac{x - \mathrm{E}[x]}{ \sqrt{\mathrm{Var}[x] + \epsilon}} * \gamma + \beta
+        y = \frac{x}{\mathrm{RMS}[x]} * \gamma


It's missing epsilon and I checked it's in the code.

Was left out as the syntax looked kind of involved... feel free to suggest a fix!

* FusedRMSNorm/"T5LayerNorm" based on FusedLayerNorm (NVIDIA#1274) * FusedRMSNorm based on FusedLayerNorm * refactor duplicated kernels * delete comments * delete comments * cleanup * cleanup * cleanup, fixed clobbering forward_affine_mixed_dtypes * fix pybind naming and add MixedFused test * undo skipping * check elementwise_affine * Update tests/L0/run_fused_layer_norm/test_fused_layer_norm.py Oof, nice catch, thanks Co-authored-by: Masaki Kozuki <masaki.kozuki.2014@gmail.com> Co-authored-by: Masaki Kozuki <masaki.kozuki.2014@gmail.com> * fix and generate docs for FusedRMSNorm (NVIDIA#1285) * [FusedRMSNorm doc] document where epsilon is added (NVIDIA#1295) * [FusedRMSNorm doc] add epsilon to formula * correct * better wording * Fix some bugs * Optimize HostRMSNormGradient and HostApplyRMSNorm for AMD GPUs * Fix NaN issues in FusedRMSNorm * Update test_fused_layer_norm.py * Skip test_fused_layer_norm.TestAutocastFusedRMSNorm on ROCm * Use at::cuda::warp_size() instead of at::cuda::getCurrentDeviceProperties()->warpSize Co-authored-by: eqy <eddiey@nvidia.com> Co-authored-by: Masaki Kozuki <masaki.kozuki.2014@gmail.com> Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>

fix and generate docs for FusedRMSNorm

fb3efa0

crcrpar approved these changes Feb 7, 2022

View reviewed changes

crcrpar merged commit a786ca0 into NVIDIA:master Feb 7, 2022

crcrpar mentioned this pull request Feb 10, 2022

FusedRMSNorm/"T5LayerNorm" based on FusedLayerNorm #1274

Merged

stas00 reviewed Feb 10, 2022

View reviewed changes

crcrpar added this to the 22.03 milestone Feb 22, 2022

hubertlu-tw pushed a commit to ROCm/apex that referenced this pull request Apr 15, 2022

fix and generate docs for FusedRMSNorm (NVIDIA#1285)

fceec07

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix and generate docs for FusedRMSNorm #1285

fix and generate docs for FusedRMSNorm #1285

Uh oh!

eqy commented Feb 4, 2022

Uh oh!

crcrpar left a comment

Uh oh!

stas00 Feb 10, 2022

Uh oh!

eqy Feb 11, 2022

Uh oh!

stas00 Feb 11, 2022

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

fix and generate docs for FusedRMSNorm #1285

fix and generate docs for FusedRMSNorm #1285

Uh oh!

Conversation

eqy commented Feb 4, 2022

Uh oh!

crcrpar left a comment

Choose a reason for hiding this comment

Uh oh!

stas00 Feb 10, 2022

Choose a reason for hiding this comment

Uh oh!

eqy Feb 11, 2022

Choose a reason for hiding this comment

Uh oh!

stas00 Feb 11, 2022

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants