-
Notifications
You must be signed in to change notification settings - Fork 836
[NPU] Update Dockerfile and docs for v0.14.0 #671
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Merged
Merged
Changes from all commits
Commits
Show all changes
14 commits
Select commit
Hold shift + click to select a range
4fd4695
[Docs][NPU] Add Qwen3-Omni into supported list
gcanlin 8fc3ac1
add docs for vae
gcanlin 338452a
Update Dockerfile.npu
gcanlin 34c7de3
Add a3 Dockerfile
gcanlin 523955f
update docs
gcanlin 39fd3e3
Merge branch 'main' into docs-qwen3-omni
gcanlin f67a22c
update to v0.14.0
gcanlin 9f73683
update
gcanlin 0946689
Merge branch 'main' into docs-qwen3-omni
gcanlin 75e5c6b
update
gcanlin f80952a
update
gcanlin dfcae69
update
gcanlin 1d376e0
fix
gcanlin 78497b9
Merge branch 'main' into docs-qwen3-omni
gcanlin File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file was deleted.
Oops, something went wrong.
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| @@ -0,0 +1,19 @@ | ||
| ARG VLLM_ASCEND_IMAGE=quay.io/ascend/vllm-ascend | ||
| ARG VLLM_ASCEND_TAG=v0.14.0rc1 | ||
| FROM ${VLLM_ASCEND_IMAGE}:${VLLM_ASCEND_TAG} | ||
|
|
||
| ARG APP_DIR=/vllm-workspace/vllm-omni | ||
| WORKDIR ${APP_DIR} | ||
|
|
||
| COPY . . | ||
|
|
||
| # Remove this replace when the dispatch of requirements is ready | ||
| RUN sed -i -E 's/^([[:space:]]*)"fa3-fwd==0\.0\.1",/\1# "fa3-fwd==0.0.1",/' pyproject.toml \ | ||
| && sed -i -E 's/\bonnxruntime\b/onnxruntime-cann/g' pyproject.toml | ||
|
|
||
| # Install vllm-omni with dev dependencies | ||
| RUN pip install --no-cache-dir -e . | ||
|
|
||
| ENV VLLM_WORKER_MULTIPROC_METHOD=spawn | ||
|
|
||
| ENTRYPOINT [] |
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| @@ -0,0 +1,19 @@ | ||
| ARG VLLM_ASCEND_IMAGE=quay.io/ascend/vllm-ascend | ||
| ARG VLLM_ASCEND_TAG=v0.14.0rc1-a3 | ||
| FROM ${VLLM_ASCEND_IMAGE}:${VLLM_ASCEND_TAG} | ||
|
|
||
| ARG APP_DIR=/vllm-workspace/vllm-omni | ||
| WORKDIR ${APP_DIR} | ||
|
|
||
| COPY . . | ||
|
|
||
| # Remove this replace when the dispatch of requirements is ready | ||
| RUN sed -i -E 's/^([[:space:]]*)"fa3-fwd==0\.0\.1",/\1# "fa3-fwd==0.0.1",/' pyproject.toml \ | ||
| && sed -i -E 's/\bonnxruntime\b/onnxruntime-cann/g' pyproject.toml | ||
|
|
||
| # Install vllm-omni with dev dependencies | ||
| RUN pip install --no-cache-dir -e . | ||
|
|
||
| ENV VLLM_WORKER_MULTIPROC_METHOD=spawn | ||
|
|
||
| ENTRYPOINT [] |
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Here I paste the Q1 roadmap link, so that NPU users can get the latest information here.