Skip to content

memory : add llama_memory_hybrid_iswa#18601

Merged
ggerganov merged 2 commits intoggml-org:masterfrom
tdakhran:tarek/feat/memory-hybrid-iswa
Jan 21, 2026
Merged

memory : add llama_memory_hybrid_iswa#18601
ggerganov merged 2 commits intoggml-org:masterfrom
tdakhran:tarek/feat/memory-hybrid-iswa

Conversation

@tdakhran
Copy link
Contributor

@tdakhran tdakhran commented Jan 4, 2026

Upcoming models use hybrid caches with sliding window attention.

Tried modifying the existing llama_memory_hybrid to use llama_kv_cache_iswa, but the change was very intrusive.

I don't like the code much, it's essentially a copy-paste done by Claude. But it works! Not very confident with the memory subsystem and open to suggestions and feedback on how to improve it.

@tdakhran
Copy link
Contributor Author

tdakhran commented Jan 8, 2026

@ggerganov , #18641 works on top of this PR, and shows that it works. Please let me know if something else is needed to merge.

UPD: PR doesn't affect any existing models or functionality.

Copy link
Member

@ggerganov ggerganov left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Rebase and lets wait for CI before merge

@tdakhran tdakhran force-pushed the tarek/feat/memory-hybrid-iswa branch 2 times, most recently from b40e4f2 to 99e3092 Compare January 16, 2026 08:40
@tdakhran tdakhran force-pushed the tarek/feat/memory-hybrid-iswa branch from 99e3092 to d6a45a4 Compare January 21, 2026 10:12
Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
@tdakhran
Copy link
Contributor Author

Addressed the feedback, rebased, and tested with #18641.
@ggerganov, @CISC, please let me know if anything else is needed to merge.

@ggerganov ggerganov merged commit ad8d85b into ggml-org:master Jan 21, 2026
78 checks passed
shaofeiqi pushed a commit to qualcomm/llama.cpp that referenced this pull request Feb 6, 2026
* memory : add llama_memory_hybrid_iswa

* Update src/llama-memory-hybrid-iswa.cpp

Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>

---------

Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants