
Update README.md #306

Merged
zhuohan123 merged 1 commit into vllm-project:main from bm777:patch-1
Jun 29, 2023

Conversation

Contributor

bm777 commented Jun 29, 2023

Updated README.md: corrected the pip usage command.
Member

zhuohan123 left a comment


LGTM! Thanks for catching the errors.

zhuohan123 merged commit 9d27b09 into vllm-project:main Jun 29, 2023
michaelfeil pushed a commit to michaelfeil/vllm that referenced this pull request Jul 1, 2023
Contributor Author

bm777 commented Jul 9, 2023

My pleasure :)

hongxiayang pushed a commit to hongxiayang/vllm that referenced this pull request Feb 13, 2024
billishyahao pushed a commit to billishyahao/vllm that referenced this pull request Dec 31, 2024
Triton's autotuner had a bug (fixed in triton-lang/triton@184fb53) that we happen to trigger; this PR is a temporary workaround until we patch/update Triton.
wuhuikx pushed a commit to wuhuikx/vllm that referenced this pull request Mar 27, 2025
Remove useless code that was wrongly introduced by the multi-step PR.

Signed-off-by: new-TonyWang <wangtonyyu222@gmail.com>
jikunshang added a commit to jikunshang/vllm that referenced this pull request Aug 15, 2025
* load gpt-oss int4

Signed-off-by: Yan Ma <yan.ma@intel.com>

* fix int4 path

* fix

* fix

* fix

* try padding at weight create

* fix

* test

* test

* use linear

* fix

* fix

* fix loading [1/N]

* fix loading

* fix o/qkv

* fix router

* remove log

* address comments

---------

Signed-off-by: Yan Ma <yan.ma@intel.com>
Co-authored-by: Yan Ma <yan.ma@intel.com>
yiliu30 pushed a commit to yiliu30/vllm-fork that referenced this pull request Aug 20, 2025
Update SHA for extension to: [Security] Fix: remove dead code (vllm-project#306) (vllm-project#310)
yma11 pushed a commit to yma11/vllm that referenced this pull request Sep 2, 2025
add gpt oss tp support (vllm-project#307)

* add fused kernel

* always pack on k dim

* disable padding

* fix interface

* fix shape

* qzero

* test padding

* hidden_size padding

* fix tp1

* ep

* layer index

* bias shape

* fix ep

* update tp

* update tp

* add log

* refine log

* log create weight

* fix ep rank/offset

* fix w2 bias, only add on rank 0

* address comments

fix group size 128 (vllm-project#318)

add env var for use_marlin (vllm-project#319)

* add env var for use_marlin

* use_marlin force true

* fix group size

2 participants