memory : add llama_memory_hybrid_iswa #18601
Merged
ggerganov merged 2 commits into ggml-org:master on Jan 21, 2026
Conversation
Contributor (Author)
@ggerganov, #18641 works on top of this PR and shows that it works. Please let me know if anything else is needed to merge. UPD: this PR doesn't affect any existing models or functionality.
Member
ggerganov approved these changes on Jan 15, 2026 and left a comment:
Rebase and let's wait for CI before merge.
Force-pushed b40e4f2 to 99e3092
Force-pushed 99e3092 to d6a45a4
Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
Contributor (Author)
Addressed the feedback, rebased, and tested with #18641.
CISC approved these changes on Jan 21, 2026
shaofeiqi pushed a commit to qualcomm/llama.cpp that referenced this pull request on Feb 6, 2026:
memory : add llama_memory_hybrid_iswa

* memory : add llama_memory_hybrid_iswa
* Update src/llama-memory-hybrid-iswa.cpp

Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
Upcoming models use hybrid caches with sliding window attention. I tried modifying the existing llama_memory_hybrid to use llama_kv_cache_iswa, but the change was very intrusive.

I don't like the code much; it's essentially a copy-paste done by Claude. But it works! I'm not very confident with the memory subsystem and am open to suggestions and feedback on how to improve it.
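
For context, here is a minimal sketch of the composition the description implies, assuming the new class mirrors llama_memory_hybrid but swaps the plain KV cache for llama_kv_cache_iswa. All `_sketch` types and the simplified signatures below are illustrative placeholders, not the PR's actual API:

```cpp
// Editor's sketch: hypothetical, simplified stand-ins for the real
// llama.cpp memory interfaces -- not the code from this PR.
#include <cstdint>
#include <memory>

using llama_seq_id = int32_t;
using llama_pos    = int32_t;

struct kv_cache_iswa_sketch {              // attention layers: full + sliding-window caches
    void clear() { /* reset both the base and the SWA cache */ }
    bool seq_rm(llama_seq_id seq, llama_pos p0, llama_pos p1) { return true; }
};

struct memory_recurrent_sketch {           // recurrent layers: per-sequence state slots
    void clear() { /* reset recurrent states */ }
    bool seq_rm(llama_seq_id seq, llama_pos p0, llama_pos p1) { return true; }
};

// The hybrid wrapper owns both sub-memories and forwards each operation
// to both of them, mirroring how llama_memory_hybrid composes its children.
struct memory_hybrid_iswa_sketch {
    std::unique_ptr<kv_cache_iswa_sketch>    mem_attn;
    std::unique_ptr<memory_recurrent_sketch> mem_recr;

    memory_hybrid_iswa_sketch()
        : mem_attn(std::make_unique<kv_cache_iswa_sketch>()),
          mem_recr(std::make_unique<memory_recurrent_sketch>()) {}

    void clear() {
        mem_attn->clear();
        mem_recr->clear();
    }

    // an operation succeeds only if both sub-memories can apply it
    bool seq_rm(llama_seq_id seq, llama_pos p0, llama_pos p1) {
        return mem_attn->seq_rm(seq, p0, p1) &&
               mem_recr->seq_rm(seq, p0, p1);
    }
};
```

The point of this composition is that all iSWA-specific logic stays inside the attention sub-cache, so the hybrid wrapper only routes calls; that is likely why a parallel class was less intrusive than threading iSWA support through llama_memory_hybrid itself.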