INT4-AWQ PPL results for LLaMA-3 model are not as expected #257

Open
lisuying214 opened this issue Jan 23, 2025 · 0 comments
lisuying214 commented Jan 23, 2025

I have been following the latest code. I evaluated both the real-quantized model and the pseudo-quantized model for Llama-3-8B-Instruct (w4, g128) on wikitext-2-raw-v1 (test split), and in both cases the PPL is 8.5. (GPU: A100; the test data was downloaded from https://huggingface.co/datasets/Salesforce/wikitext/tree/main/wikitext-2-raw-v1.)
Could you confirm whether this result is correct, and if not, why the results differ from what is expected?
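
For context, here is a minimal sketch of a standard sliding-window wikitext-2 perplexity evaluation of this kind. It assumes the quantized checkpoint can be loaded through `transformers`; the model path and the 2048-token window are placeholders, not necessarily the exact setup used above.

```python
# Minimal sliding-window perplexity evaluation on wikitext-2-raw-v1 (test split).
# The model path and 2048-token window are illustrative assumptions.
import torch
from datasets import load_dataset
from transformers import AutoModelForCausalLM, AutoTokenizer

model_path = "path/to/llama-3-8b-instruct-w4-g128"  # placeholder checkpoint path
seqlen = 2048

tokenizer = AutoTokenizer.from_pretrained(model_path)
model = AutoModelForCausalLM.from_pretrained(
    model_path, torch_dtype=torch.float16, device_map="cuda"
)
model.eval()

# Concatenate the whole test split and tokenize it once.
test = load_dataset("wikitext", "wikitext-2-raw-v1", split="test")
enc = tokenizer("\n\n".join(test["text"]), return_tensors="pt")
input_ids = enc.input_ids.to("cuda")

nlls = []
n_chunks = input_ids.shape[1] // seqlen
with torch.no_grad():
    for i in range(n_chunks):
        chunk = input_ids[:, i * seqlen : (i + 1) * seqlen]
        # labels == inputs: the model shifts them internally for next-token loss.
        out = model(chunk, labels=chunk)
        nlls.append(out.loss * seqlen)

ppl = torch.exp(torch.stack(nlls).sum() / (n_chunks * seqlen))
print(f"wikitext-2 PPL: {ppl.item():.4f}")
```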
