Fixed gpt-oss _load_weights_other() parameter position bug#28715
Fixed gpt-oss _load_weights_other() parameter position bug#28715heheda12345 merged 1 commit intovllm-project:mainfrom
Conversation
There was a problem hiding this comment.
Code Review
This pull request correctly fixes a bug where parameters were passed in the wrong order to the _load_weights_other function. The change is correct and addresses the issue described. I've added one high-severity comment regarding maintainability to prevent similar bugs in the future by making the code more robust.
| ep_rank_start, | ||
| ep_rank_end, |
There was a problem hiding this comment.
While this change correctly fixes the argument order, the root cause of the bug is the inconsistent parameter order between _load_weights_other and _load_weights_mxfp4. To improve readability and prevent similar bugs in the future, it's highly recommended to use keyword arguments for this call. This makes the code more explicit and robust against parameter reordering. For example:
return self._load_weights_other(
ep_rank_start=ep_rank_start,
ep_rank_end=ep_rank_end,
heads_per_rank=heads_per_rank,
head_start=head_start,
weights=weights,
stacked_params_mapping=stacked_params_mapping,
)eee7487 to
722f941
Compare
722f941 to
6aba28e
Compare
…ect#28715) Summary: Signed-off-by: Dezhan Tu <dztu@meta.com> For `_load_weights_other()`, `ep_rank_start` and `ep_rank_end` positions are wrongly placed, leading to the failure of loading expert data in expert parallelism Test Plan: ``` from vllm import LLM llm = LLM("openai/gpt-oss-120b", tensor_parallel_size=2, enable_expert_parallel) output = llm.generate("Hi, vLLM is a") ``` Reviewed By: helunwencser, jackm321 Differential Revision: D87021773
6aba28e to
dc52155
Compare
…ect#28715) Co-authored-by: Dezhan Tu <dztu@meta.com>
…ect#28715) Co-authored-by: Dezhan Tu <dztu@meta.com>
Summary:
Signed-off-by: Dezhan Tu dezhantu@gmail.com
For
_load_weights_other(),ep_rank_startandep_rank_endpositions are wrongly placed, leading to the failure of loading expert data in expert parallelim. This diff fixed this bug.Test Plan:
Reviewed By: helunwencser, jackm321
Differential Revision: D87021773