[MoE Refactor] Remove SharedFusedMoE class #35782
robertgshaw2-redhat merged 5 commits into vllm-project:main
This pull request has merge conflicts that must be resolved before it can be merged.
Code Review
This pull request is a significant and well-executed refactoring of the Mixture of Experts (MoE) implementation. By removing the SharedFusedMoE and ZeroExpertFusedMoE classes and introducing a more modular design with MoERunner and SharedExperts classes, the code becomes more maintainable and extensible. The new tests for the zero expert functionality are also a great addition. I've found one critical issue in the new chunking runner that could lead to a crash when processing zero tokens, which I've detailed in a specific comment.
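The zero-token concern can be illustrated with a minimal sketch. Everything here is hypothetical: `ChunkedRunnerSketch`, `experts_fn` and `shared_experts_fn` are illustrative stand-ins, not the PR's actual `MoERunner` or `SharedExperts` API. Plain lists stand in for tensors, and summing the routed and shared outputs is an assumption of the sketch. It shows a runner that composes a shared-experts callable instead of subclassing, and returns early on an empty batch so that the final concatenation never sees zero chunks.

```python
from functools import reduce
from typing import Callable, List, Optional

Rows = List[List[float]]  # stand-in for a [num_tokens, hidden_size] tensor


class ChunkedRunnerSketch:
    """Hypothetical chunking runner; NOT the vLLM implementation."""

    def __init__(
        self,
        experts_fn: Callable[[Rows], Rows],
        shared_experts_fn: Optional[Callable[[Rows], Rows]] = None,
        chunk_size: int = 2,
    ) -> None:
        if chunk_size <= 0:
            raise ValueError("chunk_size must be positive")
        self.experts_fn = experts_fn
        self.shared_experts_fn = shared_experts_fn  # composed, not inherited
        self.chunk_size = chunk_size

    def run(self, rows: Rows) -> Rows:
        # Guard: with zero tokens there are no chunks, and the reduce()
        # below would raise TypeError on an empty list of chunk outputs.
        if not rows:
            return []

        chunk_outputs: List[Rows] = []
        for start in range(0, len(rows), self.chunk_size):
            chunk = rows[start:start + self.chunk_size]
            out = self.experts_fn(chunk)
            if self.shared_experts_fn is not None:
                shared = self.shared_experts_fn(chunk)
                # Assumption: shared and routed outputs are summed per element.
                out = [[a + b for a, b in zip(r, s)] for r, s in zip(out, shared)]
            chunk_outputs.append(out)

        # Concatenate per-chunk outputs back into one batch.
        return reduce(lambda acc, nxt: acc + nxt, chunk_outputs)


def double(rows: Rows) -> Rows:
    """Toy routed-experts function: scales every value by 2."""
    return [[2 * x for x in r] for r in rows]


def identity(rows: Rows) -> Rows:
    """Toy shared-experts function: returns a copy of the input."""
    return [list(r) for r in rows]


runner = ChunkedRunnerSketch(double, identity, chunk_size=2)
result = runner.run([[1.0], [2.0], [3.0]])
empty_result = runner.run([])
```

Without the early return, `run([])` would reach `reduce` with no chunk outputs and fail, which is the list-based analogue of concatenating an empty list of tensors.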
Signed-off-by: Bill Nell <bnell@redhat.com>
10c87e0 to f4b2e72
Signed-off-by: Bill Nell <bnell@redhat.com>
Co-authored-by: hongbolv <33214277+hongbolv@users.noreply.github.com>
Signed-off-by: Bill Nell <bnell@redhat.com>
Signed-off-by: Yifan <yzong@redhat.com>
Signed-off-by: Bill Nell <bnell@redhat.com>
Signed-off-by: Avinash Singh <avinashsingh.rcoem@gmail.com>
Purpose
The SharedFusedMoE class is no longer needed. Its functionality has been moved into FusedMoE, SharedExperts and MoERunnerBase.
Test Plan
CI tests + MoE refactoring tests
Test Result
cc @yzong-rh