Disable CUDA graph for Phi LongRoPE models with IF nodes on TRT-RTX #1921

Merged
kunal-vaishnavi merged 2 commits into microsoft:main from anujj:phi_cg_disable
Dec 17, 2025

anujj commented Dec 16, 2025

Disable CUDA graph for Phi LongRoPE models with IF nodes on TRT-RTX

This change disables CUDA graph for the following
models when targeting TRT-RTX:

  • Phi3MiniLongRoPEModel
  • Phi3SmallLongRoPEModel
  • Phi3MoELongRoPEModel
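
A minimal sketch of the kind of guard this description implies. It is not the code from this PR: the function name `resolve_cuda_graph`, the constant names, and the `"trt-rtx"` execution-provider string are all hypothetical, chosen only for illustration. Only the three model class names come from the list above.

```python
# Hypothetical illustration; none of these names are taken from base.py.
# Model classes listed in the PR description as needing CUDA graph disabled.
LONGROPE_MODELS_WITH_IF_NODES = frozenset({
    "Phi3MiniLongRoPEModel",
    "Phi3SmallLongRoPEModel",
    "Phi3MoELongRoPEModel",
})

# Assumed identifier for the TRT-RTX execution provider.
TRT_RTX_EP = "trt-rtx"


def resolve_cuda_graph(model_class_name: str, ep: str, requested: bool = True) -> bool:
    """Return whether CUDA graph should stay enabled for this model/EP pair.

    The caller's requested setting is overridden to False only when both
    conditions hold: the target is TRT-RTX and the model is one of the
    listed LongRoPE variants. Every other combination is left untouched.
    """
    if ep == TRT_RTX_EP and model_class_name in LONGROPE_MODELS_WITH_IF_NODES:
        return False
    return requested


if __name__ == "__main__":
    for name in sorted(LONGROPE_MODELS_WITH_IF_NODES):
        print(name, resolve_cuda_graph(name, TRT_RTX_EP))
```

Keeping the affected classes in one set means a further LongRoPE variant could be covered by adding a single entry, and other execution providers keep whatever setting the caller requested.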

anujj commented Dec 16, 2025

@gaugarg-nv for review

anujj commented Dec 17, 2025

@kunal-vaishnavi @baijumeswani for review

Comment thread src/python/py/models/builders/base.py Outdated
kunal-vaishnavi enabled auto-merge (squash) December 17, 2025 19:32
kunal-vaishnavi merged commit e22c89f into microsoft:main Dec 17, 2025
15 checks passed
apsonawane pushed a commit that referenced this pull request Dec 19, 2025
…1921)
