Skip to content

potential bug in fp4 quantize kernel #2021

@Qiaolin-Yu

Description

@Qiaolin-Yu
Image

I found it extremely slow. In trtllm, it should be 1~2 us. Therefore, I think there's potential bug in this kernel (fp4_quantize)

Metadata

Metadata

Assignees

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions