Refactor system architecture #82

Merged: WoosukKwon merged 27 commits into main from refactor-arch on May 9, 2023


WoosukKwon (Collaborator) commented May 7, 2023

This PR includes extensive refactoring of the system.

Major changes are:

  • Moved parallel_utils into model_executor
  • Moved simple_frontend to frontend
  • Moved gradio_webserver and test_cli_client to the root
  • Removed plot
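For anyone updating imports after this refactor, the moves listed above can be sketched as a small path-rewriting helper. This is only an illustration under stated assumptions: the names `MOVES`, `REMOVED`, and `rewrite_module_path` are hypothetical, paths are relative to the package root (whose name the description does not give), and "moved simple_frontend to frontend" is read here as "into the `frontend` package", which the description leaves ambiguous. The `gradio_webserver` and `test_cli_client` moves are omitted because their original locations are not stated.

```python
from typing import Optional

# Hypothetical mapping built only from the bullet list in the PR description.
# Keys and values are module paths relative to the (unnamed) package root.
MOVES = {
    "parallel_utils": "model_executor.parallel_utils",
    # Assumption: "to frontend" means "into the frontend package".
    "simple_frontend": "frontend.simple_frontend",
}

# Modules the description says were removed outright.
REMOVED = {"plot"}


def rewrite_module_path(path: str) -> Optional[str]:
    """Return the post-refactor dotted path, or None if the module was removed."""
    # Only the first path component is affected by the moves listed above.
    head, _, rest = path.partition(".")
    if head in REMOVED:
        return None
    new_head = MOVES.get(head, head)
    return f"{new_head}.{rest}" if rest else new_head
```

Paths not mentioned in the description pass through unchanged, so the helper can be run over every import in a codebase without special-casing.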

WoosukKwon requested a review from zhuohan123 May 7, 2023 23:46
This was referenced May 8, 2023
WoosukKwon merged commit 7c041ab into main May 9, 2023
WoosukKwon deleted the refactor-arch branch May 9, 2023 22:30