UPSTREAM PR #17077: HIP: RDNA4 tensor core support for MMF #118
Mirrored from ggml-org/llama.cpp#17077
Add RDNA4 tensor core support for MMF. Honestly, the performance is lower than expected. The model is at https://huggingface.co/Mungert/DeepSeek-R1-0528-Qwen3-8B-GGUF