Merge upstream/main into matthias.awq_gemv#18
Merged
mgehre-amd merged 437 commits intomatthias.awq_gemvfrom Mar 24, 2026
Merged
Merge upstream/main into matthias.awq_gemv#18mgehre-amd merged 437 commits intomatthias.awq_gemvfrom
mgehre-amd merged 437 commits intomatthias.awq_gemvfrom
Conversation
Signed-off-by: juliendenize <julien.denize@mistral.ai> Signed-off-by: Julien Denize <40604584+juliendenize@users.noreply.github.com> Co-authored-by: root <root@h200-bar-196-227.slurm-bar-compute.tenant-slurm.svc.cluster.local> Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com> Co-authored-by: Cyrus Leung <tlleungac@connect.ust.hk>
Signed-off-by: Isotr0py <mozf@mail2.sysu.edu.cn>
…mats (vllm-project#35109) Signed-off-by: seanmamasde <seanmamasde@gmail.com>
Signed-off-by: Santino Ramos <elsantinoramos@gmail.com>
…llm-project#37040) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
…ject#32384) Signed-off-by: Karan Bansal <karanb192@gmail.com> Co-authored-by: Inokinoki <inoki@inoki.cc>
…7062) Signed-off-by: Nick Hill <nickhill123@gmail.com>
Signed-off-by: Russell Bryant <rbryant@redhat.com>
Signed-off-by: Isotr0py <mozf@mail2.sysu.edu.cn>
…lm-project#34614) Signed-off-by: hasethuraman <hsethuraman@microsoft.com>
Signed-off-by: zjy0516 <riverclouds.zhu@qq.com> Co-authored-by: Roger Wang <hey@rogerw.io>
Signed-off-by: Lalithnarayan C <Lalithnarayan.C@amd.com> Signed-off-by: Tyler Michael Smith <tlrmchlsmth@gmail.com> Co-authored-by: Chinmay-Kulkarni-AMD <Chinmay.Kulkarni@amd.com> Co-authored-by: Tyler Michael Smith <tyler@neuralmagic.com> Co-authored-by: Tyler Michael Smith <tlrmchlsmth@gmail.com> Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>
…roject#37126) Signed-off-by: Andrew Xia <axia@meta.com>
…iio_connector to restore P/D functionality (vllm-project#34907) Signed-off-by: Randall Smith <Randall.Smith@amd.com>
Signed-off-by: yitingw1 <yiting.wang@intel.com>
…xtral test (vllm-project#37138) Signed-off-by: Andreas Karatzas <akaratza@amd.com>
…subclasses in schema fuzz tests (vllm-project#37127) Signed-off-by: Andreas Karatzas <akaratza@amd.com>
Signed-off-by: jiang1.li <jiang1.li@intel.com>
…roject#37107) Signed-off-by: bigshanedogg <bigshane319@gmail.com>
Signed-off-by: wzhao18 <wzhao18.sz@gmail.com> Signed-off-by: Leo Tian <lctian@nvidia.com> Co-authored-by: wzhao18 <wzhao18.sz@gmail.com> Co-authored-by: Stefano Castagnetta <scastagnetta@nvidia.com> Co-authored-by: root <root@lyris0267.lyris.clusters.nvidia.com>
Signed-off-by: Woosuk Kwon <woosuk@inferact.ai>
Signed-off-by: chaunceyjiang <chaunceyjiang@gmail.com>
…vllm-project#36845) Signed-off-by: Andreas Karatzas <akaratza@amd.com>
…akiness (vllm-project#36442) Signed-off-by: Andreas Karatzas <akaratza@amd.com>
…odel loading (vllm-project#37136) Signed-off-by: esmeetu <jasonailu87@gmail.com>
…de_stack guards instead of previous hacks (vllm-project#36204) Signed-off-by: Laith Sakka <lsakka@meta.com>
…deo inputs (vllm-project#37147) Signed-off-by: Isotr0py <mozf@mail2.sysu.edu.cn>
…well (vllm-project#36987) Signed-off-by: Vadim Gimpelson <vadim.gimpelson@gmail.com>
Signed-off-by: Xiao Fu <xiaofu@meta.com>
Signed-off-by: Jee Jee Li <pandaleefree@gmail.com>
…roject#37612) Signed-off-by: sfeng33 <4florafeng@gmail.com>
…dates (vllm-project#37523) Signed-off-by: Yuxiang Liang <yuxiang.liang@intel.com> Signed-off-by: Yuxiang Liang <yuliang@habana.ai> Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
…ct#37364) Signed-off-by: Giancarlo Delfin <gdelfin@inferact.ai>
…ject#37634) Signed-off-by: huanxing <huanxing.shen@intel.com>
…llm-project#37593) Signed-off-by: sfeng33 <4florafeng@gmail.com>
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
…project#37293) Signed-off-by: Wangbei25 <wangbei41@huawie.com> Signed-off-by: Wangbei25 <wangbei41@huawei.com> Co-authored-by: Wangbei25 <wangbei41@huawie.com>
Signed-off-by: Kunshang Ji <kunshang.ji@intel.com>
…llm-project#37461) Signed-off-by: root <root@prenyx0169.a51.clusters.nvidia.com> Signed-off-by: wzhao18 <wzhao18.sz@gmail.com> Signed-off-by: <> Co-authored-by: root <root@prenyx0169.a51.clusters.nvidia.com> Co-authored-by: root <root@prenyx0042.a51.clusters.nvidia.com>
Signed-off-by: Andreas Karatzas <akaratza@amd.com>
…lay (vllm-project#37639) Signed-off-by: Giancarlo Delfin <gdelfin@inferact.ai>
…project#37537) Signed-off-by: wang.yuqi <yuqi.wang@daocloud.io>
…llm-project#37619) Signed-off-by: Andreas Karatzas <akaratza@amd.com>
Signed-off-by: Andreas Karatzas <akaratza@amd.com>
… attention backend (vllm-project#37611) Signed-off-by: Andreas Karatzas <akaratza@amd.com>
…vllm-project#37595) Signed-off-by: sfeng33 <4florafeng@gmail.com>
…ints/ (vllm-project#37500) Signed-off-by: sfeng33 <4florafeng@gmail.com>
Brings in 333 upstream commits. Conflicts resolved in: - CMakeLists.txt (arch ordering) - kernels/linear/__init__.py (both new kernels kept) - fused_moe/runner/default_moe_runner.py (kept our init + upstream refactor) - layers/utils.py (kept our tracing + upstream aiter support) - models/qwen2_5_vl.py (kept our attn backend check + upstream model_tag removal) Signed-off-by: Matthias Gehre <matthias.gehre@amd.com>
2 tasks
- Remove stale ensure_dp_chunking_init() call (renamed to _maybe_init_dp_chunking and moved to __init__ upstream) - Update use_fi_all2allv_kernels -> use_fi_nvl_two_sided_kernels in hip_w4a16_experts.py and exllama_moe.py to match upstream rename Signed-off-by: Matthias Gehre <matthias.gehre@amd.com>
eble-amd
reviewed
Mar 20, 2026
eble-amd
approved these changes
Mar 20, 2026
Collaborator
eble-amd
left a comment
There was a problem hiding this comment.
I reviewed the files that you listed as having conflicts, but I didn't notice changes to any code that I am familiar with, so LGTM.
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Summary
Test plan