[Misc] Various simplifications and typing fixes by njhill · Pull Request #5368 · vllm-project/vllm

njhill · 2024-06-09T22:58:00Z

Noticed while working on other features, thought it would be cleaner to split into a separate PR.

njhill · 2024-06-09T23:02:20Z

vllm/worker/model_runner.py

-                     dim=0,
-                     dtype=query_start_loc.dtype,
-                     out=query_start_loc[1:])
-


These tensors aren't used in the flashinfer case
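For context, the removed lines appear to be the tail of a cumulative-sum call that writes into `query_start_loc[1:]`, i.e. per-sequence start offsets with a leading zero. A minimal sketch of that idea follows, in plain Python rather than torch. The names `build_query_start_loc` and `use_flashinfer` are hypothetical and are not taken from the vLLM source.

```python
from itertools import accumulate
from typing import List, Optional


def build_query_start_loc(query_lens: List[int],
                          use_flashinfer: bool) -> Optional[List[int]]:
    """Return start offsets for each query, or None when they are not needed.

    Hypothetical helper for illustration only; the real code operates on
    torch tensors and writes the cumulative sum into a preallocated buffer.
    """
    if use_flashinfer:
        # Assumption: this backend does not consume the offsets,
        # so the work is skipped entirely.
        return None
    # Leading 0, then the running total of the query lengths.
    return [0, *accumulate(query_lens)]


offsets = build_query_start_loc([3, 1, 4], use_flashinfer=False)
```

Here query `i` would span `offsets[i]` up to (but not including) `offsets[i + 1]` in the flattened token sequence.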

DarkLight1337

Sorry I approved too early. Looks like you broke the speculative decoding code.

njhill · 2024-06-10T14:42:53Z

Thanks @DarkLight1337! It was a small mistake, have now pushed a fix.

* upstream/main: (126 commits)
  [Bugfix][Frontend] Cleanup "fix chat logprobs" (vllm-project#5026)
  [Bugfix] OpenAI entrypoint limits logprobs while ignoring server defined --max-logprobs (vllm-project#5312)
  [Misc] Various simplifications and typing fixes (vllm-project#5368)
  [ci] Fix Buildkite agent path (vllm-project#5392)
  [Doc] Add documentation for FP8 W8A8 (vllm-project#5388)
  Bump version to v0.5.0 (vllm-project#5384)
  [Docs] Alphabetically sort sponsors (vllm-project#5386)
  [Docs] Add Docs on Limitations of VLM Support (vllm-project#5383)
  [ci] Mount buildkite agent on Docker container to upload benchmark results (vllm-project#5330)
  [ci] Use small_cpu_queue for doc build (vllm-project#5331)
  [Bugfix] Fix LLaVA-NeXT (vllm-project#5380)
  [Feature][Frontend]: Continued `stream_options` implementation also in CompletionRequest (vllm-project#5319)
  [Model] Initial support for LLaVA-NeXT (vllm-project#4199)
  [Misc] Improve error message when LoRA parsing fails (vllm-project#5194)
  [misc][typo] fix typo (vllm-project#5372)
  [Frontend][Misc] Enforce Pixel Values as Input Type for VLMs in API Server (vllm-project#5374)
  [Misc] Update to comply with the new `compressed-tensors` config (vllm-project#5350)
  [Bugfix] Fix KeyError: 1 When Using LoRA adapters (vllm-project#5164)
  [Kernel][Misc] Use TORCH_LIBRARY instead of PYBIND11_MODULE for custom ops (vllm-project#5047)
  [mis][ci/test] fix flaky test in test_sharded_state_loader.py (vllm-project#5361)
  ...

[Misc] Various simplifications and typing fixes

0895001

Noticed while working on other features.

DarkLight1337 approved these changes Jun 10, 2024

DarkLight1337 requested changes Jun 10, 2024

njhill added 2 commits June 10, 2024 07:41

Fix missed dimension

df3b30c

Avoid a tensor copy in rejection sampler

d334579

DarkLight1337 approved these changes Jun 11, 2024

DarkLight1337 merged commit a008629 into vllm-project:main Jun 11, 2024

njhill deleted the some-cleanup branch June 11, 2024 02:30

robertgshaw2-redhat pushed a commit to neuralmagic/nm-vllm that referenced this pull request Jun 12, 2024

[Misc] Various simplifications and typing fixes (vllm-project#5368)

c098739

joerunde pushed a commit to joerunde/vllm that referenced this pull request Jun 17, 2024

[Misc] Various simplifications and typing fixes (vllm-project#5368)

6599db4

njhill mentioned this pull request Jun 20, 2024

[Misc][Speculative Decoding] Improve top1_proposal output tensor initialization. #5706

Closed

xjpang pushed a commit to xjpang/vllm that referenced this pull request Jun 27, 2024

[Misc] Various simplifications and typing fixes (vllm-project#5368)

2b166f5

xjpang pushed a commit to xjpang/vllm that referenced this pull request Jul 8, 2024

[Misc] Various simplifications and typing fixes (vllm-project#5368)

5906637

xjpang pushed a commit to xjpang/vllm that referenced this pull request Jul 24, 2024

[Misc] Various simplifications and typing fixes (vllm-project#5368)

8b81e6e
