
Conversation


@adityachatter commented Oct 17, 2025

  • Adds functional support for the FP8 Chunk Prefill kernel
  • Supports the FP8 E4M3FN and E5M2 datatypes. Expects Q, K, and V in FP8 precision, with descale factors for Q, K, and V in FP32 precision of shape (batch size, number of KV heads); see the sketch after this list
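For illustration, here is a minimal sketch (not taken from this PR) of producing FP8 Q/K/V together with the FP32 descale factors in the shape described above. The quantization granularity and the commented-out chunk_prefill_fp8 entry point are assumptions, not the PR's actual API; consult the kernel's real signature in sgl-kernel-xpu.

import torch

FP8_DTYPE = torch.float8_e4m3fn                # E5M2 via torch.float8_e5m2
FP8_MAX = torch.finfo(FP8_DTYPE).max           # 448.0 for E4M3FN

def quantize_fp8(x: torch.Tensor):
    """Quantize a (batch, seqlen, heads, head_dim) tensor to FP8 with one
    FP32 descale factor per (batch, head), matching the expected
    (batch size, number of KV heads) descale shape."""
    amax = x.abs().amax(dim=(1, 3)).clamp(min=1e-12)   # (B, H) per-head max
    scale = FP8_MAX / amax                             # maps values into FP8 range
    x_fp8 = (x * scale[:, None, :, None]).clamp(-FP8_MAX, FP8_MAX).to(FP8_DTYPE)
    return x_fp8, (1.0 / scale).float()                # descale stays in FP32

B, S, H, D = 2, 128, 8, 64                     # H = number of KV heads here
q_fp8, q_descale = quantize_fp8(torch.randn(B, S, H, D))
k_fp8, k_descale = quantize_fp8(torch.randn(B, S, H, D))
v_fp8, v_descale = quantize_fp8(torch.randn(B, S, H, D))
assert q_descale.shape == (B, H)               # (batch size, number of KV heads)
# out = chunk_prefill_fp8(q_fp8, k_fp8, v_fp8,            # hypothetical entry point
#                         q_descale=q_descale, k_descale=k_descale,
#                         v_descale=v_descale)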

Run FP8 Chunk Prefill unit tests:

cd sgl-kernel-xpu/tests
python3 -m pytest -v -s test_flash_attention.py -k dtype1
96 passed, 182 skipped, 278 deselected
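For context, a hedged sketch of the kind of reference check FP8 attention tests typically perform: dequantize the FP8 inputs with their descale factors and compare the kernel output against a plain float32 attention reference with loose tolerances. The actual test logic lives in test_flash_attention.py; everything below, including the tolerances, is illustrative.

import torch

def reference_attention(q8, k8, v8, dq, dk, dv):
    # Dequantize FP8 inputs with their (B, H) FP32 descale factors, then
    # run plain float32 attention as ground truth for the kernel output.
    q = q8.float() * dq[:, None, :, None]      # (B, S, H, D)
    k = k8.float() * dk[:, None, :, None]
    v = v8.float() * dv[:, None, :, None]
    scores = torch.einsum("bqhd,bkhd->bhqk", q, k) / q.shape[-1] ** 0.5
    return torch.einsum("bhqk,bkhd->bqhd", scores.softmax(dim=-1), v)

# Tolerances must be loose, since FP8 carries only 2-3 mantissa bits, e.g.:
# torch.testing.assert_close(out.float(), ref, atol=2e-2, rtol=2e-2)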

@adityachatter force-pushed the achatter/fp8_chunk_prefill branch from d20fff8 to 06ae0d8 on October 27, 2025 07:08
Signed-off-by: Aditya Chatterjee <[email protected]>
@adityachatter marked this pull request as ready for review on October 29, 2025 08:49
Signed-off-by: Aditya Chatterjee <[email protected]>
@deepvars self-requested a review on November 6, 2025 04:31