Add FP8 quantization ignored_layers support in llama#6592
Closed
cli99 wants to merge 2 commits intovllm-project:mainfrom
Closed
Add FP8 quantization ignored_layers support in llama#6592cli99 wants to merge 2 commits intovllm-project:mainfrom
ignored_layers support in llama#6592cli99 wants to merge 2 commits intovllm-project:mainfrom
Commits
Commits on Jul 19, 2024
- committed
- committed