@pinglin pinglin commented Jun 23, 2025

Because

  • LLM runtimes are rapidly evolving, and mainstream tools like Transformers, vLLM, and MLC LLM now support advanced LLM features.

This commit

  • refactored CLI codebase
  • unified the Dockerfile for multi-platform builds, currently supporting CPU on both amd64 and arm64 and GPU on amd64 only
  • added unit testing for CLI commands
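
The platform support described above can be summarized as a small lookup. This is only a minimal sketch: `SUPPORTED_TARGETS`, `is_supported`, and `docker_platform` are hypothetical names for illustration and are not taken from this repository's CLI or Dockerfile.

```python
# Hypothetical support matrix mirroring the PR description:
# CPU images for amd64 and arm64, GPU images for amd64 only.
SUPPORTED_TARGETS = {
    "cpu": {"amd64", "arm64"},
    "gpu": {"amd64"},
}


def is_supported(device: str, arch: str) -> bool:
    """Return True if the (device, arch) pair is in the support matrix."""
    return arch in SUPPORTED_TARGETS.get(device, set())


def docker_platform(device: str, arch: str) -> str:
    """Return a Docker platform string such as 'linux/amd64', or raise if unsupported."""
    if not is_supported(device, arch):
        raise ValueError(f"unsupported target: device={device!r}, arch={arch!r}")
    return f"linux/{arch}"
```

A caller could check `is_supported("gpu", "arm64")` before attempting a build, so that an unsupported combination fails early with a clear message.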

@pinglin pinglin merged commit ebaa93f into main Jun 23, 2025
10 checks passed
@pinglin pinglin deleted the pinglin/ins-8084-python-sdk-add-support-for-mlc-llm branch June 23, 2025 16:00
linear bot commented Jun 23, 2025

pinglin added a commit that referenced this pull request Jun 23, 2025
Because

- LLM runtimes are rapidly evolving, and mainstream tools like
`Transformers`, `vLLM`, and `MLC LLM` now support advanced LLM features.

This commit

- refactored CLI codebase
- unified the Dockerfile for multi-platform builds, currently supporting CPU on
both `amd64` and `arm64` and GPU on `amd64` only
- added unit testing for CLI commands
pinglin pushed a commit that referenced this pull request Jun 24, 2025
🤖 I have created a release *beep* *boop*
---


## [0.18.0](v0.17.2...v0.18.0) (2025-06-24)


### Features

* **cli,model,llm-runtime:** unify LLM runtimes
([#285](#285))
([9950b8a](9950b8a))


### Miscellaneous

* **deps-dev:** bump setuptools from 74.1.2 to 78.1.1
([#279](#279))
([33fda4f](33fda4f))
* **deps-dev:** bump tornado from 6.4.2 to 6.5.1
([#280](#280))
([0caa2fe](0caa2fe))
* **deps:** bump protobuf from 4.25.3 to 4.25.8
([#286](#286))
([4b122cf](4b122cf))
* **deps:** bump requests from 2.32.3 to 2.32.4
([#282](#282))
([970f693](970f693))
* **domain:** update production domain
([3f0efe0](3f0efe0))
* **mypy:** fix make check errors
([971e640](971e640))
* **pacakge:** upgrade versions
([4e45717](4e45717))
* **poetry:** lock ray version on 2.47.0
([da75851](da75851))
* release v0.18.0
([f267cad](f267cad))
* **release-please:** update config.json
([#281](#281))
([13cbace](13cbace))


### Tests

* **instill:** improve unit test coverage
([#287](#287))
([6f38ea9](6f38ea9))

---
This PR was generated with [Release
Please](https://github.com/googleapis/release-please). See
[documentation](https://github.com/googleapis/release-please#release-please).