feat: add CLI tools by rafvasq · Pull Request #52 · IBM/vllm

rafvasq · 2024-06-21T14:40:19Z

Replaces ✨ add tgis-cli tools #16
Related to
- [Feature] vLLM CLI for serving and querying OpenAI compatible server vllm-project/vllm#5090
- Previously, Add vllm serve to wrap vllm.entrypoints.openai.api_server vllm-project/vllm#4167

Signed-off-by: Rafael Vasquez <rafvasq21@gmail.com>
Co-authored-by: Prashant Gupta <prashantgupta@us.ibm.com>

Signed-off-by: Rafael Vasquez <rafvasq21@gmail.com>

prashantgupta24 · 2024-07-17T23:22:17Z

Closing in favor of opendatahub-io/vllm#92

This PR cleans and simplifies the code.

### Changes:

- removed right padding since it is not used
- removed the dict of `seq_ids`, since on `AIU` there is only **one** `seq_id` **per** `request_id` (no beam search or other multi-sequence decoding)
- removed the for loop over the single `seq_id` (always 1 per `request_id`) during decoding
- the batch padding mask and position ids are now deleted after decode has finished instead of being overwritten
- merged main into this branch to resolve merge conflicts

The code has been run in client/server mode for the `llama 194m` and `granite 3b` on `AIU` and `CPU`.

This PR cleans and simplifies the code.

### Changes:

- simplified warmup by using a function call to remove duplicated lines
- moved the mask and `position_ids` from `SENDNNCasualLM` to `SENDNNModelRunner`
- fixed an error in `pyproject.toml`
- already merged PR #52 and main into this branch for easier merge

The code has been run in client/server mode for the `llama 194m` and `granite 3b` on `AIU` and `CPU`.

Add TGIS CLI integrated with upstream CLI

10d1d22

Signed-off-by: Rafael Vasquez <rafvasq21@gmail.com>
Co-authored-by: Prashant Gupta <prashantgupta@us.ibm.com>

rafvasq force-pushed the cli-tools branch from 125c801 to 10d1d22 on June 28, 2024 18:34

rafvasq changed the title ~~add CLI Tools~~ feat: add CLI tools Jun 28, 2024

Add tests for hub

0bacb8c

Signed-off-by: Rafael Vasquez <rafvasq21@gmail.com>

prashantgupta24 mentioned this pull request Jul 11, 2024

✨ add tgis-cli tools #16

Closed

Brings in flexargparse and rebases

fe882a8

Signed-off-by: Rafael Vasquez <rafvasq21@gmail.com>

rafvasq marked this pull request as ready for review July 15, 2024 17:14

prashantgupta24 closed this Jul 17, 2024

rafvasq mentioned this pull request Jul 26, 2024

feat: Add model-util CLI opendatahub-io/vllm-tgis-adapter#59

Merged

rafvasq wants to merge 3 commits into IBM:main from rafvasq:cli-tools

2 participants
