@chilo-ms
Contributor

BF16 requires an NVIDIA GPU with the Ampere architecture or later, i.e. compute capability 8.0 or higher.
If trt_bf16_enable = true and the device's compute capability is below 8.0, the TRT EP resets trt_bf16_enable to false.
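
The check described above can be sketched roughly as follows. This is only an illustration in Python, not the EP's actual C++ code: `TrtOptions`, `resolve_bf16`, and `MIN_BF16_COMPUTE_MAJOR` are hypothetical names, and the compute capability major version is assumed to be passed in by the caller rather than queried from the device.

```python
from dataclasses import dataclass, replace

# Assumption: compute capability 8.0 (Ampere) is the minimum for BF16,
# as stated in the PR description.
MIN_BF16_COMPUTE_MAJOR = 8


@dataclass(frozen=True)
class TrtOptions:
    """Hypothetical stand-in for the EP's options; only one flag is modeled."""
    trt_bf16_enable: bool = False


def resolve_bf16(options: TrtOptions, compute_major: int) -> TrtOptions:
    """Return options with BF16 turned off if the device cannot support it."""
    if options.trt_bf16_enable and compute_major < MIN_BF16_COMPUTE_MAJOR:
        # The request is overridden rather than rejected, matching the
        # "make trt_bf16_enable = false" behavior described above.
        return replace(options, trt_bf16_enable=False)
    return options


if __name__ == "__main__":
    # Compute capability 7.x: the BF16 request is dropped.
    print(resolve_bf16(TrtOptions(trt_bf16_enable=True), compute_major=7))
    # Compute capability 8.x: the BF16 request is kept.
    print(resolve_bf16(TrtOptions(trt_bf16_enable=True), compute_major=8))
```

The sketch overrides the flag instead of raising an error, so a configuration written for a newer GPU still loads on an older one, just without BF16.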

@chilo-ms chilo-ms merged commit 89a2ff9 into main Jun 1, 2025
88 checks passed
@chilo-ms chilo-ms deleted the chi/address_trt_bf16_check branch June 1, 2025 23:16
quic-ankus pushed a commit to CodeLinaro/onnxruntime that referenced this pull request Nov 25, 2025