@sfatimar
Contributor

Description

Add QDQ scale propagation pass

Enable dynamic path for NPU when enable_causallm is enabled

Allow zero-element tensors to be set
Fix the model copies and redefinitions for CPU fallback
Added support for OpenVINO 2025.2 and enabled the SimplifiedLayerNormalization op
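
For readers unfamiliar with QDQ, here is a minimal sketch of what a QuantizeLinear/DequantizeLinear pair computes for a single value, following the formulas in the ONNX operator spec. It is not the pass added in this PR (that code is not shown in this conversation). The helper names and the int8 range are assumptions chosen for illustration. It only shows why the scale matters: the pair both rounds and clamps, so a pass that strips Q/DQ nodes has to keep track of the scale some other way.

```python
def quantize_linear(x, scale, zero_point=0, qmin=-128, qmax=127):
    """ONNX-style QuantizeLinear for one value: saturate(round(x / scale) + zero_point)."""
    # Python's round() rounds half to even, matching the ONNX spec.
    q = round(x / scale) + zero_point
    return max(qmin, min(qmax, q))


def dequantize_linear(q, scale, zero_point=0):
    """ONNX-style DequantizeLinear for one value: (q - zero_point) * scale."""
    return (q - zero_point) * scale


def fake_quant(x, scale, zero_point=0):
    """Effect of a back-to-back Q -> DQ pair on a float value (hypothetical helper)."""
    return dequantize_linear(quantize_linear(x, scale, zero_point), scale, zero_point)


scale = 0.5  # illustrative value, not taken from any model
values = [0.26, 1.0, 100.0]
roundtrip = [fake_quant(v, scale) for v in values]
print(roundtrip)  # 0.26 snaps to the 0.5 grid; 100.0 saturates at 127 * scale
```

With `scale = 0.5` the largest representable value is `127 * 0.5 = 63.5`, so the scale alone fixes both the resolution and the range of the tensor.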

Motivation and Context

javier-intel and others added 5 commits July 17, 2025 19:10
* Add pass to perform QDQ stripping and propagate scales

* Fix disconnected output node

* Fixes to support session.disable_quant_qdq output, remove dangling nodes and duplicate DQ nodes

* Fix lack of scales updates and remove stray QDQ nodes in certain models

* Address issues with Linux CI

* Fix for double QDQ issue
* Fix the model copies and redefinitions for CPU fallback

* OV compatibility is not needed
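
Several of the commits above mention removing dangling nodes and duplicate DQ nodes. The sketch below is a toy illustration of that kind of cleanup on a plain list-of-dicts graph. It is not the code from this PR. The graph representation, the function name `remove_dangling`, and the assumption that nodes arrive in topological order are all hypothetical.

```python
def remove_dangling(nodes, graph_outputs):
    """Drop nodes whose outputs never reach a graph output.

    nodes: list of dicts with 'name', 'inputs', 'outputs' (tensor names),
           assumed to be topologically sorted.
    graph_outputs: tensor names the graph must still produce.
    """
    live = set(graph_outputs)
    kept = []
    # Walk consumers before producers so liveness flows backwards.
    for node in reversed(nodes):
        if any(out in live for out in node["outputs"]):
            kept.append(node)
            live.update(node["inputs"])
    kept.reverse()
    return kept


# Toy graph: dq2 duplicates dq1 and nothing consumes its output "b".
toy_nodes = [
    {"name": "q", "op": "QuantizeLinear", "inputs": ["x"], "outputs": ["xq"]},
    {"name": "dq1", "op": "DequantizeLinear", "inputs": ["xq"], "outputs": ["a"]},
    {"name": "dq2", "op": "DequantizeLinear", "inputs": ["xq"], "outputs": ["b"]},
    {"name": "relu", "op": "Relu", "inputs": ["a"], "outputs": ["y"]},
]
cleaned = remove_dangling(toy_nodes, ["y"])
kept_names = [n["name"] for n in cleaned]
print(kept_names)
```

A single backward sweep is enough here only because of the topological-order assumption. A real graph pass would also have to handle initializers, subgraphs, and nodes with side effects.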

---------

Co-authored-by: sfatimar <[email protected]>
…714)

* Added support for 2025.2 and SimplifiedLayerNormalization op

* [OVEP] Update OV version to 2025.2.0

* Revert "[OVEP] Update OV version to 2025.2.0"

This reverts commit d129250.
@sfatimar
Contributor Author

sfatimar commented Jul 17, 2025

@jywu-msft please review

@jywu-msft
Member

fix lint errors.

@jywu-msft
Member

/azp run Linux QNN CI Pipeline, Win_TRT_Minimal_CUDA_Test_CI, Windows ARM64 QNN CI Pipeline, Windows x64 QNN CI Pipeline, Windows GPU Doc Gen CI Pipeline

@azure-pipelines

Azure Pipelines successfully started running 5 pipeline(s).

@jatinwadhwa921
Contributor

> fix lint errors.

Fixed

@sfatimar
Contributor Author

@adrianlizarraga can you please review.

@jywu-msft jywu-msft merged commit 551fa6f into microsoft:main Jul 18, 2025
84 checks passed
@snnn
Contributor

snnn commented Jul 25, 2025

Hi there! We haven't cut the release branch for this version yet, so I'm removing the release:1.23.0 label for now to keep things tidy. Thanks so much for your contribution! We'll make sure this gets included when the release is prepared. 🤖

qti-yuduo pushed a commit to CodeLinaro/onnxruntime that referenced this pull request Aug 8, 2025
### Description
Add QDQ scale propagation pass 

Enable dynamic path for NPU when enable_causallm is enabled 

Allow zero-element tensors to get set 
Fix the model copies and redefinitions for CPU fallback 
Added support for 2025.2 and enabled SimplifiedLayerNormalization op 

### Motivation and Context
<!-- - Why is this change required? What problem does it solve?
- If it fixes an open issue, please link to the issue here. -->

---------

Co-authored-by: Javier Martinez <[email protected]>
Co-authored-by: Ryan Metcalfe <[email protected]>
Co-authored-by: Ankit Maheshkar <[email protected]>
Co-authored-by: Preetha Veeramalai <[email protected]>
Co-authored-by: n1harika <[email protected]>
sanketkaleoss pushed a commit to sanketkaleoss/onnxruntime that referenced this pull request Aug 11, 2025
Kotomi-Du pushed a commit to Kotomi-Du/onnxruntime that referenced this pull request Oct 30, 2025
9 participants