feat(cli,model,llm-runtime): unify LLM runtimes #285

pinglin · 2025-06-23T15:58:26Z

Because

LLM runtimes are rapidly evolving, and mainstream tools like Transformers, vLLM, and MLC LLM now support advanced LLM features.

This commit

refactored CLI codebase
unified Dockerfile for multi-platform builds, currently supporting CPU on both amd64 and arm64 and GPU on amd64 only
added unit testing for CLI commands
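
For reviewers who want the platform support stated above in checkable form, here is a minimal sketch. It only encodes the CPU/GPU and architecture combinations listed in this description; the names `SUPPORTED_TARGETS`, `is_supported`, and `validate_target` are hypothetical and are not taken from the SDK or the CLI.

```python
# Hypothetical support matrix copied from the PR description:
# CPU images build for amd64 and arm64, GPU images for amd64 only.
# This is an illustration, not the SDK's actual implementation.
SUPPORTED_TARGETS = {
    "cpu": {"amd64", "arm64"},
    "gpu": {"amd64"},
}


def is_supported(device, arch):
    """Return True if the (device, arch) pair appears in the matrix."""
    return arch in SUPPORTED_TARGETS.get(device, set())


def validate_target(device, arch):
    """Raise ValueError for a combination the matrix does not list."""
    if not is_supported(device, arch):
        raise ValueError(
            "unsupported build target: device=%s, arch=%s" % (device, arch)
        )


if __name__ == "__main__":
    for device, arch in [("cpu", "amd64"), ("cpu", "arm64"), ("gpu", "arm64")]:
        print(device, arch, is_supported(device, arch))
```

A check like this could run before a build is started, so that an unsupported request such as GPU on arm64 fails early with a clear message.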

linear · 2025-06-23T16:25:00Z

INS-8084 [python-sdk] Add support for mlc-llm

Because

- LLM runtimes are rapidly evolving, and mainstream tools like `Transformers`, `vLLM`, and `MLC LLM` now support advanced LLM features.

This commit

- refactored CLI codebase
- unified Dockerfile for multi-platform builds, currently supporting CPU on both `amd64` and `arm64` and GPU on `amd64` only
- added unit testing for CLI commands

🤖 I have created a release *beep* *boop*

---

## [0.18.0](v0.17.2...v0.18.0) (2025-06-24)

### Features

* **cli,model,llm-runtime:** unify LLM runtimes ([#285](#285)) ([9950b8a](9950b8a))

### Miscellaneous

* **deps-dev:** bump setuptools from 74.1.2 to 78.1.1 ([#279](#279)) ([33fda4f](33fda4f))
* **deps-dev:** bump tornado from 6.4.2 to 6.5.1 ([#280](#280)) ([0caa2fe](0caa2fe))
* **deps:** bump protobuf from 4.25.3 to 4.25.8 ([#286](#286)) ([4b122cf](4b122cf))
* **deps:** bump requests from 2.32.3 to 2.32.4 ([#282](#282)) ([970f693](970f693))
* **domain:** update production domain ([3f0efe0](3f0efe0))
* **mypy:** fix make check errors ([971e640](971e640))
* **pacakge:** upgrade versions ([4e45717](4e45717))
* **poetry:** lock ray version on 2.47.0 ([da75851](da75851))
* release v0.18.0 ([f267cad](f267cad))
* **release-please:** update config.json ([#281](#281)) ([13cbace](13cbace))

### Tests

* **instill:** improve unit test coverage ([#287](#287)) ([6f38ea9](6f38ea9))

---

This PR was generated with [Release Please](https://github.com/googleapis/release-please). See [documentation](https://github.com/googleapis/release-please#release-please).

pinglin added 4 commits June 17, 2025 17:30

feat(cli): add mlc-llm support

1e2bd89

feat(build): add many LLM runtimes

d7c01ca

feat(model,dockerfile,llm-runtime): unify llm runtimes

4ee20b6

ci(python): build test covering 3.8 to 3.12

6193db2

pinglin requested a review from GeorgeWilliamStrong as a code owner June 23, 2025 15:58

pinglin merged commit ebaa93f into main Jun 23, 2025
10 checks passed

pinglin deleted the pinglin/ins-8084-python-sdk-add-support-for-mlc-llm branch June 23, 2025 16:00

droplet-bot mentioned this pull request Jun 23, 2025

chore(main): release 0.18.0 #283

Merged
