Skip to content

UPSTREAM PR #18442: Patch perf regression for mmq kernels in ROCm#736

Open
loci-dev wants to merge 1 commit intomainfrom
upstream-PR18442-branch_jiachengjason-fix/jiachengjason/rocm7.x_regression
Open

UPSTREAM PR #18442: Patch perf regression for mmq kernels in ROCm#736
loci-dev wants to merge 1 commit intomainfrom
upstream-PR18442-branch_jiachengjason-fix/jiachengjason/rocm7.x_regression

Conversation

@loci-dev
Copy link

Mirrored from ggml-org/llama.cpp#18442

Recover performance regression for ggml-org/llama.cpp#17917 for RDNA3 and 4 by choosing more performant configs for mmq kernels

@loci-review
Copy link

loci-review bot commented Dec 29, 2025

Explore the complete analysis inside the Version Insights

I've generated a summary report for your project. Here are the key highlights:

Summary Report for llama.cpp PR #736

Project Information:

Performance Analysis Results:

No Significant Performance Impact Detected

The analysis found that no modified functions showed performance changes greater than 2% in either:

  • Response Time (execution time)
  • Throughput (processing rate)

Conclusion:

This pull request is performance-neutral and can proceed without performance concerns. All changes maintain stable performance characteristics, suggesting the modifications are either non-performance-critical updates, equivalent refactoring, or feature additions with minimal runtime impact.

@loci-dev loci-dev force-pushed the main branch 27 times, most recently from ac31769 to 7aa8b1c Compare January 1, 2026 12:15
@loci-dev loci-dev force-pushed the main branch 30 times, most recently from 1de1e20 to 4be6e0f Compare January 7, 2026 13:20
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants