quantize_activation_per_token_absmax use general quant primitives #841

Annotations

1 warning

test (CUDA 2.2.2, linux.g5.12xlarge.nvidia.gpu, torch==2.2.2, cuda, 12.1)  /  linux-job

succeeded May 3, 2024 in 13m 33s