# TTIR view_layout op #587

Add a new op to TTIR called `view_layout`, which effectively casts an existing layout to another one. Note: use the copy tile op for testing.
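For reference, a minimal sketch of what such a cast looks like in IR, reproducing the operand form and layout attributes from the PR description further below (the concrete layout attributes are illustrative only):

```mlir
#layout = #tt.metal_layout<8192x128x1, undef, <1x1>, memref<64x128xf32, #system>>
#layout1 = #tt.metal_layout<8192x128x1, undef, <1x1>, memref<64x128xf32, #l1_>>
// Reinterpret %arg0 under #layout1 without eagerly moving or converting any data.
%1 = "ttir.view_layout"(%arg0, %0) : (tensor<64x128xf32, #layout>, tensor<64x128xf32, #layout1>) -> tensor<64x128xf32, #layout1>
```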
nsmithtt added a commit that referenced this issue on Feb 28, 2025:
ViewLayout operation: nearly identical to the ToLayout operation, but with the difference that this op is not eagerly evaluated. It is also not capable of changing the data type / tile type of the tensor. Its primary use case is to allow generic ops to take a view of some remote memory as a means of forming a stream. This is useful for streaming data in small chunks when a tensor is too large to fit wholly in L1 memory. All ViewLayout ops can trivially be converted to ToLayout ops. Closes #587
nsmithtt added a commit that referenced this issue on Mar 2, 2025:
This change adds two new TTIR layout-related ops and makes a few refactors to better share common interface and verifier code between them. The verifiers are also significantly improved and check for many more illegal cases.

## StreamLayout Operation

StreamLayout operation: similar to the ToLayout operation, but with the difference that this op is not eagerly evaluated and is instead used as a means of defining a stream. Its primary use cases include streaming a large tensor out of DRAM via a small L1 buffer, and forming reduce or gather multicast operations. A stream definition includes:
- The tensor to be streamed.
- The storage buffer to be used for streaming.
- Backing memory for a list of DMA transactions to be filled in by the backend.
- A result, which is also able to take a view over the input, i.e. the same semantics as the ViewLayout op.

Additional constraints:
- It is not capable of changing the data type nor the memory space of the tensor.

```mlir
%alloc = memref.alloc() {alignment = 64 : i64} : memref<2x4x4x6x!tt.tile<32x32, f32>, #l1_>
%alloc_0 = memref.alloc() {alignment = 64 : i64} : memref<2x4x1x1x!tt.tile<32x32, f32>, #l1_>
// NOTE: the #tt.stream result type below is truncated in the original message.
%stream = "ttir.stream_layout"(%arg0, %alloc_0) : (memref<2x4x4x6x!tt.tile<32x32, f32>, #l1_>, memref<2x4x1x1x!tt.tile<32x32, f32>, #l1_>) -> memref<2x4x4x6x!tt.tile<32x32, f32>, #tt.stream<(d0, d1, d2, d3)
```

## ViewLayout Operation

ViewLayout operation: nearly identical to the ToLayout operation, but with the difference that this op is not eagerly evaluated. Its primary use case is to allow reinterpreting the layout of a tensor without actually moving the data.

Additional notes/constraints:
- It is not capable of changing the data type nor the memory space of the tensor.
- All ViewLayout ops can trivially be converted to ToLayout ops (see the sketch after this message).

```mlir
#layout = #tt.metal_layout<8192x128x1, undef, <1x1>, memref<64x128xf32, #system>>
#layout1 = #tt.metal_layout<8192x128x1, undef, <1x1>, memref<64x128xf32, #l1_>>
%1 = "ttir.view_layout"(%arg0, %0) : (tensor<64x128xf32, #layout>, tensor<64x128xf32, #layout1>) -> tensor<64x128xf32, #layout1>
```

Closes #587
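Since every ViewLayout op can trivially be converted to a ToLayout op, the rewrite amounts to swapping the op name. A minimal sketch, assuming `ttir.to_layout` takes the same (input, output) operand pair as `ttir.view_layout` (an assumption based on the ops being described above as nearly identical):

```mlir
// Non-eager view: only reinterprets the layout of %arg0.
%1 = "ttir.view_layout"(%arg0, %0) : (tensor<64x128xf32, #layout>, tensor<64x128xf32, #layout1>) -> tensor<64x128xf32, #layout1>

// Eager equivalent: materializes the layout change (assumed operand form).
%2 = "ttir.to_layout"(%arg0, %0) : (tensor<64x128xf32, #layout>, tensor<64x128xf32, #layout1>) -> tensor<64x128xf32, #layout1>
```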
nsmithtt added a commit that referenced this issue on Mar 2, 2025, with the same message as above.
nsmithtt added a commit that referenced this issue on Mar 3, 2025, with the same message as above.