Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Sample H100 job #1235

Open
wants to merge 5 commits into
base: main
Choose a base branch
from
Open

Sample H100 job #1235

wants to merge 5 commits into from

Conversation

msaroufim
Copy link
Member

@msaroufim msaroufim commented Nov 7, 2024

Based off #1232 by @jeanschmidt

The runner name is linux.aws.h100

We can't use our H100 runners on every commit cause the queue times will be insane so this is a sample job that just runs our fp8 tests - options are trigger on either

  1. nightly
  2. On some label like h100
  3. Manual workflow dispatch

Fp8 nightly test feels like the first useful h100 test we can do but opening this PR mostly as an example so people can write their own jobs

Copy link

pytorch-bot bot commented Nov 7, 2024

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/ao/1235

Note: Links to docs will display an error until the docs builds have been completed.

❌ 3 New Failures

As of commit defeb5b with merge base 9b748de (image):

NEW FAILURES - The following jobs have failed:

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@facebook-github-bot facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Nov 7, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants