Round 2 of cherry-picks into rel-1.21.0 #23899

amarin16 · 2025-03-05T14:18:39Z

The second round of cherry-picks into rel-1.21.0. The first one was done in #23846.

…vior (#23856)

### Description Resolve #23817 ### Motivation and Context

### Description #### Background From code search, the following EPs use `onnxruntime::GetCpuPreferredNodes()` in their `GetCapabilities()` methods: - CANN - CUDA - DML - JS - ROCM - WebGPU However, the source file that implements `onnxruntime::GetCpuPreferredNodes()` is excluded when minimal build is ON: https://github.com/microsoft/onnxruntime/blob/6df0973e58ba5399fcaa98686f70ed9a9e59aaef/cmake/onnxruntime_framework.cmake#L38-L42 This means that all EPs mentioned above is not able to compile with minimal build. #### Solution The excluded file `core/framework/fallback_cpu_capability.cc` cannot build in minimal build because some of its dependencies are not included in the minimal build. However, in extended minimal build mode, all dependencies are available. This PR looses the restrict and allows to compile this file when it is extended minimal build. After this change, those EPs are able to compile in extended minimal build.

### Description Add `dawn` to ThirdPartyNotices.

…#23892) ### Description When using the enable_htp_shared_memory feature, we see that the address of the buffer passed to rpcmem_free is incorrect. So the rpc buffers are not freed leading to memory exhaustion. ### Motivation and Context When using the enable_htp_shared_memory_allocator feature for QNN in GenAI extensions, it leads to inference failures during the second prompt. As GenAI memory asks are higher, it surfaces sooner in gen AI use cases. Co-authored-by: Ashish Garg <[email protected]>

jambayk

Thanks for adding the changes to the release!

jambayk and others added 6 commits March 5, 2025 06:12

Quant tool: Add nodes_to_exclude in get_qnn_qdq_config (#23779)

a6840dc

Quant tool: Consistent get_qdq_config and get_qnn_qdq_config beha…

51cecbb

…vior (#23856)

[js/common] allows using Uint16Array as data for float16 tensor (#23827)

4130ca1

### Description Resolve #23817 ### Motivation and Context

Add dawn to ThirdPartyNotices (#23876)

792e096

### Description Add `dawn` to ThirdPartyNotices.

amarin16 requested review from HectorSVC, fs-eire and jambayk March 5, 2025 16:00

jambayk approved these changes Mar 5, 2025

View reviewed changes

HectorSVC approved these changes Mar 5, 2025

View reviewed changes

fs-eire approved these changes Mar 5, 2025

View reviewed changes

amarin16 merged commit e0b66ca into rel-1.21.0 Mar 6, 2025
111 of 113 checks passed

amarin16 deleted the emarin/rel1.21/cherry_picks_round2 branch March 6, 2025 00:37

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Round 2 of cherry-picks into rel-1.21.0 #23899

Round 2 of cherry-picks into rel-1.21.0 #23899

amarin16 commented Mar 5, 2025

jambayk left a comment

Round 2 of cherry-picks into rel-1.21.0 #23899

Round 2 of cherry-picks into rel-1.21.0 #23899

Conversation

amarin16 commented Mar 5, 2025

jambayk left a comment

Choose a reason for hiding this comment