
Enforce static dimensions in generation of flow.tensor.transfer #205

Merged

Conversation

sogartar (Contributor) commented Oct 8, 2024

This solves the problem in iree-org/iree#18283.
The issue is that we generate casts to/from dynamic tensors that later lowering in IREE chokes on. My assumption is that IREE should be able to digest this IR, since it is of the form:

    %2 = torch_c.to_builtin_tensor %arg0 : !torch.vtensor<[2,3,11,13],f32> -> tensor<2x3x11x13xf32>
    %cast = tensor.cast %2 : tensor<2x3x11x13xf32> to tensor<?x?x?x?xf32>
    %c0 = arith.constant 0 : index
    %dim = tensor.dim %cast, %c0 : tensor<?x?x?x?xf32>
    %c1 = arith.constant 1 : index
    %dim_0 = tensor.dim %cast, %c1 : tensor<?x?x?x?xf32>
    %c2 = arith.constant 2 : index
    %dim_1 = tensor.dim %cast, %c2 : tensor<?x?x?x?xf32>
    %c3 = arith.constant 3 : index
    %dim_2 = tensor.dim %cast, %c3 : tensor<?x?x?x?xf32>
    %3 = flow.tensor.transfer %cast : tensor<?x?x?x?xf32>{%dim, %dim_0, %dim_1, %dim_2} to #hal.device.promise<@__device_0>
    %cast_3 = tensor.cast %3 : tensor<?x?x?x?xf32> to tensor<2x3x11x13xf32>
    %4 = torch_c.from_builtin_tensor %cast_3 : tensor<2x3x11x13xf32> -> !torch.vtensor<[2,3,11,13],f32>

It essentially casts to a dynamic `tensor<...>` in order to perform `flow.tensor.transfer` and then casts back to a static `torch.vtensor`, so it should be fine.

With this change we get:

    %2 = torch_c.to_builtin_tensor %arg0 : !torch.vtensor<[2,3,11,13],f32> -> tensor<2x3x11x13xf32>
    %3 = flow.tensor.transfer %2 : tensor<2x3x11x13xf32> to #hal.device.promise<@__device_0>
    %4 = torch_c.from_builtin_tensor %3 : tensor<2x3x11x13xf32> -> !torch.vtensor<[2,3,11,13],f32>
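The effect of the change can be sketched in a few lines of Python. The helper names here are illustrative, not the actual iree-turbine implementation:

```python
def mlir_tensor_type(shape, elem="f32"):
    # Render an MLIR ranked tensor type; None marks a dynamic dimension.
    dims = "x".join("?" if d is None else str(d) for d in shape)
    return f"tensor<{dims}x{elem}>"

def transfer_op(ssa_value, shape, device, elem="f32"):
    # With fully static dims, the transfer is emitted directly on the
    # static type, with no tensor.cast / tensor.dim round trip.
    ty = mlir_tensor_type(shape, elem)
    return f"flow.tensor.transfer {ssa_value} : {ty} to #hal.device.promise<@{device}>"
```

For the example above, `transfer_op("%2", [2, 3, 11, 13], "__device_0")` reproduces the `flow.tensor.transfer` line in the post-change IR.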

I am not convinced that specializing all dimensions is correct. What should we do if we want to keep some dimensions dynamic, and how should that be represented?
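One possible answer, based purely on the op syntax already visible in the pre-change IR (`tensor<?x?x?x?xf32>{%dim, ...}`): keep `?` only for the genuinely dynamic dims and pass SSA size operands only for those. A hedged sketch with a hypothetical helper, not actual iree-turbine code:

```python
def transfer_op_mixed(ssa_value, shape, dyn_sizes, device, elem="f32"):
    # shape: ints for static dims, None for dynamic ones.
    # dyn_sizes: SSA size values (e.g. tensor.dim results), one per None, in order.
    dims = "x".join("?" if d is None else str(d) for d in shape)
    operands = "{" + ", ".join(dyn_sizes) + "}" if dyn_sizes else ""
    return (f"flow.tensor.transfer {ssa_value} : tensor<{dims}x{elem}>"
            f"{operands} to #hal.device.promise<@{device}>")
```

With all dims dynamic this degenerates to the pre-change form; with none dynamic, to the post-change form.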

sogartar force-pushed the flow-tensor-transfer-enforce-static-dims branch from d23a84e to 657ec91 on October 8, 2024 22:05
sogartar changed the title from "WIP Enforce static dimensions in generation of flow.tensor.transfer" to "Enforce static dimensions in generation of flow.tensor.transfer" on Oct 8, 2024
sogartar marked this pull request as ready for review on October 8, 2024 22:08
sogartar (Contributor, Author) commented Oct 8, 2024

The CI seems to be failing with unrelated errors.

sogartar merged commit 586b9af into iree-org:main on Oct 9, 2024
6 of 8 checks passed
sogartar added a commit to sogartar/sharktank that referenced this pull request Oct 9, 2024
The fix iree-org/iree-turbine#205 solves the
issue with this test.

Xfail the Unet Resnet block test with maybe low accuracy.
sogartar added a commit to nod-ai/shark-ai that referenced this pull request Oct 9, 2024
The fix iree-org/iree-turbine#205 solves the
issue with this test.

Xfail the Unet Resnet block test with maybe low accuracy.
stellaraccident pushed a commit that referenced this pull request Oct 13, 2024
(The commit message repeats the PR description above verbatim.)
Signed-off-by: Boian Petkantchin <[email protected]>