-
Notifications
You must be signed in to change notification settings - Fork 3.7k
Ovep Feature Rel 1.23 #25435
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Ovep Feature Rel 1.23 #25435
Conversation
* Add pass to perform QDQ stripping and propagate scales * Fix disconnected outptu node * Fixes to support session.disable_quant_qdq output, remove dangling nodes and duplicate DQ nodes * Fix lack of scales updates and remove stray QDQ nodes in certain models * Address issues with Linux CI * Fix for double QDQ issue
Co-authored-by: Ankit Maheshkar <[email protected]>
* Fix the model copies and redefinitions for CPU fallback * OV compatibility is not needed --------- Co-authored-by: sfatimar <[email protected]>
|
@jywu-msft please review |
onnxruntime/core/providers/openvino/qdq_transformations/qdq_scales_fix.cpp
Fixed
Show fixed
Hide fixed
|
fix lint errors. |
|
/azp run Linux QNN CI Pipeline, Win_TRT_Minimal_CUDA_Test_CI, Windows ARM64 QNN CI Pipeline, Windows x64 QNN CI Pipeline, Windows GPU Doc Gen CI Pipeline |
|
Azure Pipelines successfully started running 5 pipeline(s). |
Fixed |
|
@adrianlizarraga can you please review. |
|
Hi there! We haven't cut the release branch for this version yet, so I'm removing the |
### Description Add QDQ scale propagation pass Enable dynamic path for NPU when enable_causallm is enabled Allow zero-element tensors to get set Fix the model copies and redefinitions for CPU fallback Added support for 2025.2 and enabled SimplifiedLayerNormalization op ### Motivation and Context <!-- - Why is this change required? What problem does it solve? - If it fixes an open issue, please link to the issue here. --> --------- Co-authored-by: Javier Martinez <[email protected]> Co-authored-by: Ryan Metcalfe <[email protected]> Co-authored-by: Ankit Maheshkar <[email protected]> Co-authored-by: Preetha Veeramalai <[email protected]> Co-authored-by: n1harika <[email protected]>
### Description Add QDQ scale propagation pass Enable dynamic path for NPU when enable_causallm is enabled Allow zero-element tensors to get set Fix the model copies and redefinitions for CPU fallback Added support for 2025.2 and enabled SimplifiedLayerNormalization op ### Motivation and Context <!-- - Why is this change required? What problem does it solve? - If it fixes an open issue, please link to the issue here. --> --------- Co-authored-by: Javier Martinez <[email protected]> Co-authored-by: Ryan Metcalfe <[email protected]> Co-authored-by: Ankit Maheshkar <[email protected]> Co-authored-by: Preetha Veeramalai <[email protected]> Co-authored-by: n1harika <[email protected]>
### Description Add QDQ scale propagation pass Enable dynamic path for NPU when enable_causallm is enabled Allow zero-element tensors to get set Fix the model copies and redefinitions for CPU fallback Added support for 2025.2 and enabled SimplifiedLayerNormalization op ### Motivation and Context <!-- - Why is this change required? What problem does it solve? - If it fixes an open issue, please link to the issue here. --> --------- Co-authored-by: Javier Martinez <[email protected]> Co-authored-by: Ryan Metcalfe <[email protected]> Co-authored-by: Ankit Maheshkar <[email protected]> Co-authored-by: Preetha Veeramalai <[email protected]> Co-authored-by: n1harika <[email protected]>
Description
Add QDQ scale propagation pass
Enable dynamic path for NPU when enable_causallm is enabled
Allow zero-element tensors to get set
Fix the model copies and redefinitions for CPU fallback
Added support for 2025.2 and enabled SimplifiedLayerNormalization op
Motivation and Context