@keshavv27
Contributor

Description

Set the compute capability only on the Turing architecture.

Motivation and Context

Setting the native compute capability was causing a regression in performance.
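As a rough illustration of the rule described above, the check could be sketched as follows. This is not the code from the PR: the `ComputeCapability` struct and both helper names are hypothetical, and the only outside fact relied on is that Turing GPUs report CUDA compute capability 7.5. The sketch assumes that on every other architecture the provider simply leaves the setting at its default.

```cpp
// Hypothetical sketch; none of these names are taken from the PR diff.
struct ComputeCapability {
  int sm_major;  // e.g. 7 for Turing
  int sm_minor;  // e.g. 5 for Turing
};

// Turing GPUs report CUDA compute capability 7.5.
constexpr bool IsTuring(ComputeCapability cc) {
  return cc.sm_major == 7 && cc.sm_minor == 5;
}

// Assumed policy: pass an explicit compute capability to the engine builder
// only on Turing; on all other architectures leave the default untouched.
constexpr bool ShouldSetComputeCapability(ComputeCapability cc) {
  return IsTuring(cc);
}
```

Under this assumption, a caller would query the device's compute capability once and skip the explicit setting whenever `ShouldSetComputeCapability` returns false.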

@gaugarg-nv @ishwar-raut1 @ankan-ban

@ankan-ban
Contributor

Did we ask the TRT-RTX guys why setting the "current" profile causes a perf regression?

@keshavv27 changed the title from "Set Compute Capability only on Turing architecture" to "[NV RTX EP] Set Compute Capability only on Turing architecture" on Jul 18, 2025
@keshavv27
Contributor Author

keshavv27 commented Jul 18, 2025

> Did we ask the TRT-RTX guys why setting the "current" profile causes a perf regression?

No response yet on the root cause. For the current state of TRT RTX, this was the suggested change.

@snnn
Contributor

snnn commented Jul 18, 2025

/azp run Linux QNN CI Pipeline, Win_TRT_Minimal_CUDA_Test_CI, Windows ARM64 QNN CI Pipeline, Windows x64 QNN CI Pipeline, Windows GPU Doc Gen CI Pipeline

@azure-pipelines

Azure Pipelines successfully started running 5 pipeline(s).

@snnn requested a review from jywu-msft on July 18, 2025 21:41
@ishwar-raut1
Contributor

ishwar-raut1 commented Jul 19, 2025

> > Did we ask the TRT-RTX guys why setting the "current" profile causes a perf regression?
>
> No response yet on the root cause. For the current state of TRT RTX, this was the suggested change.

Yes, the change should be made in TRT RTX, but this WAR (workaround) is fine for now.

@ankan-ban
Contributor

OK, the WAR makes sense then. Thanks.

@jywu-msft merged commit 033ca86 into microsoft:main on Jul 19, 2025
84 checks passed
qti-yuduo pushed a commit to CodeLinaro/onnxruntime that referenced this pull request Aug 8, 2025
…soft#25446)
sanketkaleoss pushed a commit to sanketkaleoss/onnxruntime that referenced this pull request Aug 11, 2025
…soft#25446)