CoreML: Add support for Pad with 'reflect' for ML Program by skottmckay · Pull Request #28073 · microsoft/onnxruntime

skottmckay · 2026-04-15T04:40:57Z

Description

Add support for 'reflect' if ML Program is enabled. Uses the CoreML implementation directly.

Motivation and Context

#28022

Copilot

Pull request overview

Note

Copilot was unable to run its full agentic suite in this review.

Adds CoreML EP (ML Program) support for Pad with mode="reflect" and introduces regression coverage around CoreML model loading/padding behavior.

Changes:

Enable Pad op builder support for ML Program and implement MIL pad op wiring for constant/reflect.
Tighten/adjust Pad support checks (mode gating, reflect constraints).
Add a new CoreML EP test covering Pad(mode=reflect) with ML Program.

Reviewed changes

Copilot reviewed 2 out of 2 changed files in this pull request and generated 4 comments.

File	Description
onnxruntime/test/providers/coreml/coreml_basic_test.cc	Adds a new ML Program `Pad(mode=reflect)` regression test alongside existing CoreML tests.
onnxruntime/core/providers/coreml/builders/impl/pad_op_builder.cc	Implements ML Program path for Pad (including `reflect`) and updates support checks/initializer skipping.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

github-actions

You can commit the suggested changes from lintrunner.

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>

…ssion test Agent-Logs-Url: https://github.com/microsoft/onnxruntime/sessions/93f6c70a-6288-4790-a265-f95f6f793843 Co-authored-by: skottmckay <979079+skottmckay@users.noreply.github.com>

Agent-Logs-Url: https://github.com/microsoft/onnxruntime/sessions/93f6c70a-6288-4790-a265-f95f6f793843 Co-authored-by: skottmckay <979079+skottmckay@users.noreply.github.com>

skottmckay · 2026-04-15T09:45:48Z

/azp run Windows ARM64 QNN CI Pipeline,Windows GPU Doc Gen CI Pipeline,Win_TRT_Minimal_CUDA_Test_CI,Linux QNN CI Pipeline

azure-pipelines · 2026-04-15T09:46:04Z

Azure Pipelines successfully started running 4 pipeline(s).

skottmckay · 2026-04-16T00:58:13Z

/azp run Windows GPU Doc Gen CI Pipeline

azure-pipelines · 2026-04-16T00:58:23Z

Azure Pipelines successfully started running 1 pipeline(s).

skottmckay · 2026-04-17T00:16:03Z

@copilot resolve the merge conflicts in this pull request

# Conflicts: # onnxruntime/test/providers/coreml/coreml_basic_test.cc Co-authored-by: skottmckay <979079+skottmckay@users.noreply.github.com>

Copilot · 2026-04-17T00:20:36Z

@copilot resolve the merge conflicts in this pull request

Resolved in b6e58ca by merging latest origin/main and fixing the conflict in onnxruntime/test/providers/coreml/coreml_basic_test.cc so both the external-data regression test and the Pad ML Program tests are preserved. UI screenshot: N/A (no UI changes).

skottmckay · 2026-04-17T01:04:19Z

/azp run Windows ARM64 QNN CI Pipeline,Windows GPU Doc Gen CI Pipeline,Linux QNN CI Pipeline

azure-pipelines · 2026-04-17T01:04:36Z

Azure Pipelines successfully started running 3 pipeline(s).

…xruntime into skottmckay/GH28022

…han two dimensions only supports constant mode"

maxwbuckley · 2026-04-22T08:29:24Z

Verified this works for me in my application. Thanks a lot! :)

…uting Bundles CoreML graph rewrites, GPU-accelerated pipeline work, Windows CUDA fixes, and Mac/Windows runtime routing into a single drop. CoreML (Apple Silicon): - Decompose Pad(reflect) → Slice+Concat in inswapper_128 so the model runs in one CoreML partition instead of 14 (TEMPORARY: fixed upstream in microsoft/onnxruntime#28073, drop when ORT >= 1.26.0). - Fold Shape/Gather chains to constants in det_10g (21ms → 4ms). - Decompose Split(axis=1) → Slice pairs in GFPGAN (155ms → 89ms). - Route detection model to GPU so the ANE is free for the swap model. - Centralize provider/config selection in create_onnx_session. Pipeline (all platforms): - Parallelize face landmark + recognition post-detection; skip landmark_2d_106 when only face_swapper is active. - Pipeline face detection with swap for ANE overlap. - GPU-accelerated paste_back, MJPEG capture, zero-copy display path. - Standalone pipeline benchmark script. Windows / CUDA: - CUDA graphs + FP16 model + all-GPU pipeline for 1080p 60 FPS. - Auto-detect GPU provider and fix DLL discovery for Windows CUDA execution. Cross-platform: - platform_info helper for Mac/Windows runtime routing. - GFPGAN 30 fps + MSMF camera 60 fps with adaptive pipeline tuning. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

CoreML: Add support for Pad with 'reflect' for ML Program

7242927

skottmckay requested a review from Copilot April 15, 2026 04:40

Copilot AI reviewed Apr 15, 2026

View reviewed changes

github-actions Bot reviewed Apr 15, 2026

View reviewed changes

Comment thread onnxruntime/core/providers/coreml/builders/impl/pad_op_builder.cc Outdated

Comment thread onnxruntime/core/providers/coreml/builders/impl/pad_op_builder.cc Outdated

Update onnxruntime/test/providers/coreml/coreml_basic_test.cc

d24abeb

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

Copilot started work on behalf of skottmckay April 15, 2026 05:00 View session

skottmckay and others added 5 commits April 15, 2026 15:00

Update onnxruntime/core/providers/coreml/builders/impl/pad_op_builder.cc

b48fda8

Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>

Update onnxruntime/core/providers/coreml/builders/impl/pad_op_builder.cc

5ac13af

Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>

CoreML test: add missing pads format comment for consistency

25596b4

Agent-Logs-Url: https://github.com/microsoft/onnxruntime/sessions/93f6c70a-6288-4790-a265-f95f6f793843 Co-authored-by: skottmckay <979079+skottmckay@users.noreply.github.com>

Copilot finished work on behalf of skottmckay April 15, 2026 05:32

Copilot started reviewing on behalf of skottmckay April 15, 2026 06:15 View session

Copilot started work on behalf of skottmckay April 15, 2026 08:37 View session

Copilot stopped work on behalf of skottmckay due to an error April 15, 2026 09:35
The session was cancelled by the user.

Address PR comment

2429ebc

skottmckay added 2 commits April 16, 2026 09:58

Merge remote-tracking branch 'origin/main' into skottmckay/GH28022

401233b

Remove test from other branch that was incorrectly added.

1f21b60

skottmckay requested a review from edgchen1 April 16, 2026 06:04

Copilot started work on behalf of skottmckay April 17, 2026 00:16 View session

Merge remote-tracking branch 'origin/main' into skottmckay/GH28022

b6e58ca

# Conflicts: # onnxruntime/test/providers/coreml/coreml_basic_test.cc Co-authored-by: skottmckay <979079+skottmckay@users.noreply.github.com>

Copilot finished work on behalf of skottmckay April 17, 2026 00:21

edgchen1 reviewed Apr 18, 2026

View reviewed changes

skottmckay added 5 commits April 20, 2026 11:18

Address PR comments

81cf802

Merge branch 'skottmckay/GH28022' of https://github.com/microsoft/onn…

a5db5d1

…xruntime into skottmckay/GH28022

Add 1D support

17081dc

Update test to run model

f9c8159

Add back restriction based on CoreML error output "Padding for more t…

60dc4bd

…han two dimensions only supports constant mode"

edgchen1 approved these changes Apr 20, 2026

View reviewed changes

skottmckay merged commit fb13eb3 into main Apr 20, 2026
89 checks passed

skottmckay deleted the skottmckay/GH28022 branch April 20, 2026 23:25

maxwbuckley mentioned this pull request Apr 22, 2026

CoreML EP: Pad(mode=reflect) falls back to CPU, causing 14 partition round-trips on Apple Silicon #28022

Closed

maxwbuckley mentioned this pull request Apr 22, 2026

Apple Silicon + Windows CUDA perf: 4-5x FPS, wider capture, platform routing hacksider/Deep-Live-Cam#1775

Merged

9 tasks

BrewTestBot mentioned this pull request May 8, 2026

onnxruntime 1.26.0 Homebrew/homebrew-core#281672

Merged

Conversation

skottmckay commented Apr 15, 2026

Description

Motivation and Context

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

github-actions Bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

skottmckay commented Apr 15, 2026

Uh oh!

azure-pipelines Bot commented Apr 15, 2026

Uh oh!

skottmckay commented Apr 16, 2026

Uh oh!

azure-pipelines Bot commented Apr 16, 2026

Uh oh!

skottmckay commented Apr 17, 2026

Uh oh!

Copilot AI commented Apr 17, 2026

Uh oh!

skottmckay commented Apr 17, 2026

Uh oh!

azure-pipelines Bot commented Apr 17, 2026

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

maxwbuckley commented Apr 22, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants