Improve performance of prefill mode FP8 Grouped Gemm #8381

Triggered via pull request on December 31, 2024 at 00:36
Status: Success
Total duration: 2h 20m 40s
Artifacts: 6

build_wheels_linux_x86.yml

on: pull_request
generate-matrix / generate: 8s
Matrix: pytorch/FBGEMM / build
Matrix: build / upload / upload

Annotations

2 warnings
generate-matrix / generate
ubuntu-latest pipelines will use ubuntu-24.04 soon. For more details, see https://github.com/actions/runner-images/issues/10636
Deprecation notice: v1, v2, and v3 of the artifact actions
The following artifacts were uploaded using a version of actions/upload-artifact that is scheduled for deprecation: "pytorch_FBGEMM__3.9_cpu_x86_64", "pytorch_FBGEMM__3.9_cu118_x86_64", "pytorch_FBGEMM__3.9_cu124_x86_64", "pytorch_FBGEMM__3.9_cu126_x86_64", "pytorch_FBGEMM__3.9_rocm6.2.4_x86_64", "pytorch_FBGEMM__3.9_rocm6.3_x86_64". Please update your workflow to use v4 of the artifact actions. Learn more: https://github.blog/changelog/2024-04-16-deprecation-notice-v3-of-the-artifact-actions/

Artifacts

Produced during runtime
Name                                    Size       Status
pytorch_FBGEMM__3.9_cpu_x86_64          3.67 MB    Expired
pytorch_FBGEMM__3.9_cu118_x86_64        294 MB     Expired
pytorch_FBGEMM__3.9_cu124_x86_64        393 MB     Expired
pytorch_FBGEMM__3.9_cu126_x86_64        391 MB     Expired
pytorch_FBGEMM__3.9_rocm6.2.4_x86_64    75.6 MB    Expired
pytorch_FBGEMM__3.9_rocm6.3_x86_64      72.7 MB    Expired