Cache FP8 weight and transpose only at the first micro-batch in each validation and test routine #7483

Merged

Commits on Sep 22, 2023

  1. Cache FP8 weight and transpose only at the first micro-batch in each validation and test routine (#7470)
    
    * Cache weight and transpose only in the first batch in all training, val, and test runs
    
    Signed-off-by: Sangkug Lym <[email protected]>
    
    * [pre-commit.ci] auto fixes from pre-commit.com hooks
    
    for more information, see https://pre-commit.ci
    
    ---------
    
    Signed-off-by: Sangkug Lym <[email protected]>
    Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
    2 people authored and web-flow committed Sep 22, 2023
    45e541b
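
The commit title describes casting and transposing the weight once, at the first micro-batch of a routine, and reusing the cached result for the remaining micro-batches. The sketch below illustrates that caching pattern only; it is not the code from this PR. `CachedWeightLayer`, `run_routine`, and the `is_first_microbatch` flag as used here are hypothetical names for illustration, and rounding stands in for a real FP8 cast.

```python
class CachedWeightLayer:
    """Toy layer that keeps a cached low-precision copy of its weight."""

    def __init__(self, weight):
        self.weight = weight      # list of rows (full precision)
        self._cache = None        # (cast weight, transposed cast weight)
        self.cast_count = 0       # how many times the cast/transpose ran

    def _cast_and_transpose(self):
        # Stand-in for an FP8 cast: rounding is NOT real FP8 quantization.
        self.cast_count += 1
        cast = [[round(x, 1) for x in row] for row in self.weight]
        transposed = [list(col) for col in zip(*cast)]
        return cast, transposed

    def forward(self, x, is_first_microbatch):
        # Refresh the cache only on the first micro-batch of a routine
        # (or if nothing has been cached yet); otherwise reuse it.
        if is_first_microbatch or self._cache is None:
            self._cache = self._cast_and_transpose()
        cast, _ = self._cache
        return [sum(w * v for w, v in zip(row, x)) for row in cast]


def run_routine(layer, microbatches):
    """Run one validation or test routine over a list of micro-batches."""
    outputs = []
    for i, mb in enumerate(microbatches):
        outputs.append(layer.forward(mb, is_first_microbatch=(i == 0)))
    return outputs
```

Under these assumptions, each call to `run_routine` triggers exactly one cast and transpose regardless of how many micro-batches it contains, so a weight that changed between routines is picked up at the start of the next one.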