Tilize with val padding results in L1 cache OOM #16633

nardoTT · 2025-01-10T22:10:46Z

Ticket

Link to Github Issue #15950

Problem description

In the current implementation of multi-core tilize with val padding, the parallelization is only over the columns, meaning the whole row is being passed to the same core. This causes L1 OOM when the tensor has a large width

What's changed

In multi-core tilize, we are calculating the maximum available L1 and the estimated cb size. We are running the single core implementation if there isn't enough space. The multi-core implementation will be improved in the future to cover row and column parallelization

Checklist

Post commit CI passes https://github.com/tenstorrent/tt-metal/actions/runs/12698362359
Blackhole Post commit (if applicable)
Model regression CI testing passes (if applicable)
Device performance regression CI testing passes (if applicable)
(For models and ops writers) Full new models tests passes
New/Existing tests provide coverage for changes

nardoTT added 3 commits January 8, 2025 21:27

run sc for cb overflow

8e43393

fix rt args

0580bed

revert rt and fix cb size

db830f7

nardoTT requested review from ayerofieiev-tt, dmakoviichuk-tt, rfurko-tt, cfjchu, TT-BrianLiu, razorback3, dongjin-na, bbradelTT, ntarafdar, sjameelTT, jaykru-tt, yugi957, jvegaTT and llongTT as code owners January 10, 2025 22:10

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Tilize with val padding results in L1 cache OOM #16633

Tilize with val padding results in L1 cache OOM #16633

nardoTT commented Jan 10, 2025 •

edited

Loading

Tilize with val padding results in L1 cache OOM #16633

Are you sure you want to change the base?

Tilize with val padding results in L1 cache OOM #16633

Conversation

nardoTT commented Jan 10, 2025 • edited Loading

Ticket

Problem description

What's changed

Checklist

nardoTT commented Jan 10, 2025 •

edited

Loading