onnxruntime 1.25.0#278543
Merged
BrewTestBot merged 4 commits intomainfrom Apr 21, 2026
Merged
Conversation
iMichka
approved these changes
Apr 20, 2026
onnxruntime: add compatibility_version Signed-off-by: Rui Chen <rui@chenrui.dev>
Signed-off-by: Rui Chen <rui@chenrui.dev>
2b6fe22 to
61f935f
Compare
chenrui333
approved these changes
Apr 21, 2026
Contributor
|
Caution Please do not push to this PR branch before the bottle commits have been pushed, as this results in a state that is difficult to recover from. If you need to resolve a merge conflict, please use a merge commit. Do not force-push to this PR branch. |
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Created by
brew bumpCreated with
brew bump-formula-pr.Details
release notes
Execution Provider Changes
--use_armnnbuild flags and migrate to the MLAS/KleidiAI-backed CPU EP or QNN EP for Qualcomm hardware. (#27447)API Version
🔒 Security Fixes
onnx::TensorProtodata size before allocation (#27547)✨ New Features
🔌 Execution Provider Plugin API & CUDA Plugin EP
ORT 1.25.0 introduces the CUDA Plugin EP — the first core implementation that enables third-party CUDA-backed EPs to be delivered as dynamically loaded plugins without rebuilding ORT.
OpKernelInfo::GetConfigOptions,LoggingManager::HasDefaultLogger()(#26879, #26919, #27540, #27541, #27587)🔧 Core APIs
enable_profilingin RunOptions (#26846)OrtModelinput support for Compile API (#27332)📊 New ONNX Ops & Opset Coverage
🖥️ Execution Provider Updates
NVIDIA CUDA EP
router_weightsinput to QMoE (#27687)NVIDIA TensorRT RTX EP
Qualcomm QNN EP
Other EPs
🌐 Web & JavaScript
WebGPU EP — Performance
WebGPU EP — New Op Support
WebGPU EP — Stability
WebNN EP
Node.js & React Native
🧠 CPU & Core Optimizations
MLAS / KleidiAI / Quantization
DQ→MatMulNBits Fusion
Extended to cover significantly more quantized LLM inference scenarios on CPU:
Model Optimizer & Fusions
🔌 Language Bindings
Python
OrtDeviceVendorIdenum for vendor-awareOrtDevicealiases (#27594)GetCompatibilityInfoFromModel/GetCompatibilityInfoFromModelBytes(#27565)OrtValue.from_dlpackrejecting zero-size tensors as non-contiguous (#27451)C#
GetCompatibilityInfoFromModel/GetCompatibilityInfoFromModelBytes(#27565)Java
onnxruntime.native.path(#27668)🐛 Bug Fixes
Critical Fixes
count_include_padcomputation — silent correctness issue (#27375)Operator & Graph Fixes
RemoveDuplicateCastTransformer(#27363)EP-Specific Fixes
--build_wasm_static_libimplicitly enabling--build_wasm(#27342)🙏 Contributors
Thanks to our 72 contributors for this release!
@tianleiwu, @fs-eire, @edgchen1, @titaiwangms, @hariharans29, @eserscor, @Rishi-Dave, @guschmue, @adrianlizarraga, @jambayk, @qjia7, @skottmckay, @adrastogi, @sanaa-hamel-microsoft, @yuslepukhin, @ingyukoh, @Jiawei-Shao, @vraspar, @xhcao, @chilo-ms, @Honry, @JonathanC-ARM, @kunal-vaishnavi, @ShirasawaSama, @chaya2350, @derdeljan-msft, @gedoensmax, @HectorSVC, @milpuz01, @quic-calvnguy, @xenova, @akholodnamdcom, @AlekseiNikiforovIBM, @amd-genmingz, @ashrit-ms, @bachelor-dou, @BODAPATIMAHESH, @Colm-in-Arm, @daijh, @dodokw, @fanchenkong1, @ivarusic-amd, @JanSellner, @jchen10, @jiafatom, @jnagi-intel, @johannes-rehm-snkeos, @justinchuby, @keshavv27, @Kevin-Taha, @kevinlam92, @kpkbandi, @Laan33, @melkap01-Arm, @miaobin, @n-v-k, @nico-martin, @patryk-kaiser-ARM, @praneshgo, @prathikr, @qc-tbhardwa, @sagarbhure-msft, @sdotpeng, @the0cp, @TsofnatMaman, @umangb-09, @walidbr, @wenqinI, @xadupre, @xhan65, @xiaofeihan1
Full Changelog: v1.24.4...v1.25.0
View the full release notes at https://github.com/microsoft/onnxruntime/releases/tag/v1.25.0.