[FusedRMSNorm doc] document where epsilon is added #1295

stas00 · 2022-02-11T01:31:43Z

This PR adds the missing epsilon to the formula so that it matches the code.

The denominator in this class is actually very unlikely to ever be zero: it is the root of the mean of the squared inputs, so it can only be zero if every input is zero.

I don't know whether adding epsilon is faster than checking whether all inputs are zero, in which case the output would be zero anyway.
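To make the all-zero case concrete, here is a minimal pure-Python sketch, not the apex kernel. The function names `rms_norm` and `rms_norm_no_eps` and the default `eps=1e-5` are assumptions chosen for illustration only. It shows that with epsilon added under the root, an all-zero input simply normalizes to zeros, while the version without epsilon fails on the same input.

```python
import math


def rms_norm(xs, eps=1e-5):
    """RMS-normalize a list of floats, adding eps under the square root.

    Hypothetical helper for illustration; eps=1e-5 is an assumed default.
    """
    mean_sq = sum(x * x for x in xs) / len(xs)
    inv_rms = 1.0 / math.sqrt(mean_sq + eps)
    return [x * inv_rms for x in xs]


def rms_norm_no_eps(xs):
    """Same normalization without eps; breaks when every input is zero."""
    mean_sq = sum(x * x for x in xs) / len(xs)
    rms = math.sqrt(mean_sq)
    # Python raises ZeroDivisionError here when rms == 0.0.
    return [x / rms for x in xs]


if __name__ == "__main__":
    print(rms_norm([0.0, 0.0, 0.0]))
    try:
        rms_norm_no_eps([0.0, 0.0, 0.0])
    except ZeroDivisionError as exc:
        print("without eps:", exc)
```

Python raises an exception for the division by zero. In IEEE floating-point arithmetic on a GPU, the same case would instead multiply zero by an infinite reciprocal square root and produce NaN.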

@eqy

eqy · 2022-02-11T03:05:02Z

I'm not sure this formula is accurate, as the epsilon value is used during the computation of the root-mean-square:

apex/csrc/layer_norm_cuda_kernel.cu

Line 378 in e1aa1fc

U c_invvar = rsqrt(sigma2 + epsilon);

rather than after.
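The distinction can be sketched in plain Python. This is an illustration under stated assumptions, not the kernel itself: it assumes `sigma2` holds the mean of the squared inputs, and the function names `inv_rms_eps_inside` and `inv_rms_eps_outside` are hypothetical. The first mirrors the quoted kernel line, the second adds epsilon after taking the root.

```python
import math


def mean_of_squares(xs):
    """Mean of the squared inputs (assumed meaning of sigma2)."""
    return sum(x * x for x in xs) / len(xs)


def inv_rms_eps_inside(xs, eps):
    """Epsilon added under the root, as in rsqrt(sigma2 + epsilon)."""
    return 1.0 / math.sqrt(mean_of_squares(xs) + eps)


def inv_rms_eps_outside(xs, eps):
    """Epsilon added after the root: 1 / (sqrt(sigma2) + epsilon)."""
    return 1.0 / (math.sqrt(mean_of_squares(xs)) + eps)


if __name__ == "__main__":
    eps = 1e-6
    for xs in ([3.0, 4.0], [0.0, 0.0]):
        print(xs, inv_rms_eps_inside(xs, eps), inv_rms_eps_outside(xs, eps))
```

For inputs of ordinary magnitude the two scale factors are nearly identical. They diverge as the inputs approach zero: with epsilon inside the root the factor is bounded by `1 / sqrt(eps)`, while with epsilon outside it is bounded by `1 / eps`.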

stas00 · 2022-02-11T03:33:22Z

Thank you for catching that, @eqy. You're correct - perhaps we just document this nuance as in my last edit?

crcrpar · 2022-02-11T18:36:39Z

thank you @stas00

* [FusedRMSNorm doc] add epsilon to formula
* correct
* better wording

* FusedRMSNorm/"T5LayerNorm" based on FusedLayerNorm (NVIDIA#1274)
* FusedRMSNorm based on FusedLayerNorm
* refactor duplicated kernels
* delete comments
* delete comments
* cleanup
* cleanup
* cleanup, fixed clobbering forward_affine_mixed_dtypes
* fix pybind naming and add MixedFused test
* undo skipping
* check elementwise_affine
* Update tests/L0/run_fused_layer_norm/test_fused_layer_norm.py Oof, nice catch, thanks

Co-authored-by: Masaki Kozuki <masaki.kozuki.2014@gmail.com>
Co-authored-by: Masaki Kozuki <masaki.kozuki.2014@gmail.com>

* fix and generate docs for FusedRMSNorm (NVIDIA#1285)
* [FusedRMSNorm doc] document where epsilon is added (NVIDIA#1295)
* [FusedRMSNorm doc] add epsilon to formula
* correct
* better wording
* Fix some bugs
* Optimize HostRMSNormGradient and HostApplyRMSNorm for AMD GPUs
* Fix NaN issues in FusedRMSNorm
* Update test_fused_layer_norm.py
* Skip test_fused_layer_norm.TestAutocastFusedRMSNorm on ROCm
* Use at::cuda::warp_size() instead of at::cuda::getCurrentDeviceProperties()->warpSize

Co-authored-by: eqy <eddiey@nvidia.com>
Co-authored-by: Masaki Kozuki <masaki.kozuki.2014@gmail.com>
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>

[FusedRMSNorm doc] add epsilon to formula

3925913

stas00 mentioned this pull request Feb 11, 2022

fix and generate docs for FusedRMSNorm #1285

Merged

correct

78d923c

stas00 changed the title ~~[FusedRMSNorm doc] add epsilon to formula~~ [FusedRMSNorm doc] document where epsilon is added Feb 11, 2022

better wording

2f01906

eqy approved these changes Feb 11, 2022

crcrpar approved these changes Feb 11, 2022

crcrpar merged commit c8c00ef into NVIDIA:master Feb 11, 2022

stas00 deleted the patch-1 branch February 11, 2022 18:38

crcrpar added this to the 22.03 milestone Feb 22, 2022

hubertlu-tw pushed a commit to ROCm/apex that referenced this pull request Apr 15, 2022

[FusedRMSNorm doc] document where epsilon is added (NVIDIA#1295)

4792170

* [FusedRMSNorm doc] add epsilon to formula
* correct
* better wording
