[Model] Optional FP8 lm_head compression for Llama and Mistral#35696
Open
lucaspirola wants to merge 4 commits intovllm-project:mainfrom
Open
[Model] Optional FP8 lm_head compression for Llama and Mistral#35696lucaspirola wants to merge 4 commits intovllm-project:mainfrom
lucaspirola wants to merge 4 commits intovllm-project:mainfrom