What is the purpose of weight quantization and activation quantization? #3176
Unanswered
CoinCheung asked this question in Q&A
Hi,
I could not find enough documentation on weight/activation quantization. I can think of two uses for it: the first is to speed up training with 8-bit or 4-bit computation, for which we need to quantize the weights or activations. The second is so-called quantization-aware training (QAT), in which we want the model to adapt to the quantization precision during the training process. Could you tell me which of these is the purpose of weight/activation quantization in DeepSpeed?
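
To make the distinction concrete, here is a minimal sketch of what I mean by each usage. This is plain PyTorch, not DeepSpeed's implementation; the helper names (`fake_quantize`, `int8_matmul`) are my own, and it assumes simple per-tensor symmetric quantization:

```python
import torch

def fake_quantize(x: torch.Tensor, num_bits: int = 8) -> torch.Tensor:
    """Usage 2 (QAT-style): quantize-dequantize. The compute stays in float,
    but values are snapped to the quantization grid so the model adapts to
    low precision during training. (Real QAT wraps the round in a
    straight-through estimator so gradients can flow through it.)"""
    qmax = 2 ** (num_bits - 1) - 1                    # e.g. 127 for 8 bits
    scale = x.abs().max().clamp(min=1e-8) / qmax
    q = torch.clamp(torch.round(x / scale), -qmax, qmax)
    return q * scale                                   # dequantize back to float

def int8_matmul(a: torch.Tensor, b: torch.Tensor) -> torch.Tensor:
    """Usage 1 (speed-up style): actually run the matmul on integer tensors
    and rescale afterwards. Real speed-ups come from hardware int8 GEMM
    kernels; this only illustrates the arithmetic, on CPU."""
    qmax = 127
    sa = a.abs().max().clamp(min=1e-8) / qmax
    sb = b.abs().max().clamp(min=1e-8) / qmax
    qa = torch.clamp(torch.round(a / sa), -qmax, qmax).to(torch.int32)
    qb = torch.clamp(torch.round(b / sb), -qmax, qmax).to(torch.int32)
    # accumulate in int32, then rescale the result back to float
    return (qa @ qb).float() * (sa * sb)
```

In the first case the quantized weights/activations are fed to low-precision kernels for speed; in the second the network only simulates quantization so it is robust to it at inference time. My question is which of these DeepSpeed's weight/activation quantization is meant for.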