Skip to content

Conversation

@amarin16
Copy link
Contributor

@amarin16 amarin16 commented Jun 6, 2025

Add Int4 and UInt4 support for Cast.

There were a few QNN pipelines failing for this PR, which are fixed by this onnx PR Update input and output tensors in pb files to match the model. The problem is that onnxruntime, which uses onnx as a submodule in cmake/external/onnx, points to the latest release of onnx (1.18.0), but in order to have the pipeline run with my onnx fix, we would need to point to a newer version.

Since we can't update the onnx submodule to point to a non-release commit, waiting for a new onnx release might take a long time, and creating a patch under onnxruntime/cmake/patches/onnx with the changes in my onnx PR is tricky because that fix changes some binary files, this PR skips the tests which are currently failing in the QNN pipelines.

Copy link
Contributor

@github-actions github-actions bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

You can commit the suggested changes from lintrunner.

@jywu-msft jywu-msft requested a review from hariharans29 June 6, 2025 16:55
Copy link
Contributor

@github-actions github-actions bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

You can commit the suggested changes from lintrunner.

Copy link
Contributor

@github-actions github-actions bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

You can commit the suggested changes from lintrunner.

@amarin16 amarin16 merged commit 51b678f into main Jul 22, 2025
90 of 92 checks passed
@amarin16 amarin16 deleted the dev/emarin/cast_int4 branch July 22, 2025 21:04
sanketkaleoss pushed a commit to sanketkaleoss/onnxruntime that referenced this pull request Aug 11, 2025
Add Int4 and UInt4 support for `Cast`.

There were a few QNN pipelines
[failing](https://aiinfra.visualstudio.com/PublicPackages/_build/results?buildId=841810&view=logs&j=9d976e38-31ec-50dd-b1f8-279fbf889fca&t=85ed5ad3-b72a-52c3-abe0-a87b66004fd0&l=1773)
for this PR, which are fixed by this onnx PR [Update input and output
tensors in pb files to match the
model](onnx/onnx#7074). The problem is that
onnxruntime, which uses onnx as a submodule in `cmake/external/onnx`,
points to the latest release of onnx (1.18.0), but in order to have the
pipeline run with my onnx fix, we would need to point to a newer
version.

Since we can't update the onnx submodule to point to a non-release
commit, waiting for a new onnx release might take a long time, and
creating a patch under
[onnxruntime/cmake/patches/onnx](https://github.com/microsoft/onnxruntime/tree/main/cmake/patches/onnx)
with the changes in my onnx PR is tricky because that fix changes some
binary files, this PR skips the tests which are currently failing in the
QNN pipelines.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

7 participants