Skip to content

quantize_activation_per_token_absmax use general quant primitives #841

quantize_activation_per_token_absmax use general quant primitives

quantize_activation_per_token_absmax use general quant primitives #841

Annotations

1 warning

test (CUDA 2.4.0.dev20240428, linux.g5.12xlarge.nvidia.gpu, --pre torch==2.4.0.dev20240428+cu121 ...  /  linux-job

succeeded May 3, 2024 in 23m 13s