@ibsidorenko

This is a refactoring to follow the new rule: "keep mlc-llm and tvm as clean as possible".
This commit removes several helper functions that were moved to the slm/interface part.

@ibsidorenko ibsidorenko requested a review from sunggg April 30, 2024 13:54
@ponytaill

Hi, I tested it and want to ask whether the SmoothQuant quantization really works. I ran `mlc_llm convert_weight ./dist/llama2 --quantization smq_q8i8f16_0 -o dist/$llama2-smq-MLC/` and got an unquantized model. Weird.
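An aside on the command itself: in most shells, `$llama2` inside an unquoted or double-quoted argument is parsed as a variable reference, so if no `llama2` variable is set the output path silently collapses to `dist/-smq-MLC/`. This does not explain the unquantized weights, but it is worth checking. A minimal sketch of the expansion (the directory names are only illustrative):

```shell
# $llama2 is parsed as a variable name; -smq-MLC is literal text after it.
# With llama2 unset, the variable expands to the empty string:
unset llama2
out="dist/$llama2-smq-MLC/"
echo "$out"          # prints dist/-smq-MLC/ -- not the intended directory

# Single quotes (or simply dropping the $) keep the literal name:
out='dist/llama2-smq-MLC/'
echo "$out"          # prints dist/llama2-smq-MLC/
```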

@ibsidorenko
Author

> Hi, I tested it and want to ask whether the SmoothQuant quantization really works. I ran `mlc_llm convert_weight ./dist/llama2 --quantization smq_q8i8f16_0 -o dist/$llama2-smq-MLC/` and got an unquantized model. Weird.

Hi @ponytaill! It works, but not through mlc-llm.

@ponytaill

> Hi, I tested it and want to ask whether the SmoothQuant quantization really works. I ran `mlc_llm convert_weight ./dist/llama2 --quantization smq_q8i8f16_0 -o dist/$llama2-smq-MLC/` and got an unquantized model. Weird.

> Hi @ponytaill! It works, but not through mlc-llm.
Hi, thanks for your reply. Could you explain a little about how to use the SmoothQuant code without going through mlc-llm? I have just started learning mlc-llm.

@ibsidorenko
Author

ibsidorenko commented May 28, 2024

> Hi, I tested it and want to ask whether the SmoothQuant quantization really works. I ran `mlc_llm convert_weight ./dist/llama2 --quantization smq_q8i8f16_0 -o dist/$llama2-smq-MLC/` and got an unquantized model. Weird.

> Hi @ponytaill! It works, but not through mlc-llm.

> Hi, thanks for your reply. Could you explain a little about how to use the SmoothQuant code without going through mlc-llm? I have just started learning mlc-llm.

You need access to the private repo. If you don't have access, there is no way to run SmoothQuant in this case.

@sunggg sunggg merged commit 0739fab into mlc-serve-v0.2.0 Jun 14, 2024
@ibsidorenko ibsidorenko deleted the ibsidorenko/smq-refactoring branch June 14, 2024 05:57
Lunderberg pushed a commit to Lunderberg/mlc-llm that referenced this pull request Jul 25, 2024
This PR refactors mlc-chat into a formal package.
Some follow-up TODOs remain on cleaning up
the rest and the gradio API.