quantize_activation_per_token_absmax use general quant primitives #841

Annotations

1 warning

test (CUDA 2.3, linux.g5.12xlarge.nvidia.gpu, torch==2.3.0, cuda, 12.1)  /  linux-job

succeeded May 3, 2024 in 19m 28s