Skip to content

Fix crashes during multi-turn conversation due to KV cache serialization#9

Merged
JohannesGaessler merged 1 commit intoJohannesGaessler:ggml-meta-backend-8from
gaugarg-nv:fix_kv_serialization
Feb 28, 2026
Merged

Fix crashes during multi-turn conversation due to KV cache serialization#9
JohannesGaessler merged 1 commit intoJohannesGaessler:ggml-meta-backend-8from
gaugarg-nv:fix_kv_serialization

Conversation

@gaugarg-nv
Copy link

KV cache serialization requires non-zero offsets on the tensor. Add support in the meta backend to set/get a tensor with a non-zero offset.

KV cache serialization requires non-zero offsets on the tensor. Add support in the meta backend to set/get a tensor with a non-zero offset.
@gaugarg-nv gaugarg-nv changed the title Fix crashes due to KV cache serialization Fix crashes during multi-turn conversation due to KV cache serialization Feb 28, 2026
@JohannesGaessler JohannesGaessler merged commit 2f338d6 into JohannesGaessler:ggml-meta-backend-8 Feb 28, 2026
JohannesGaessler pushed a commit that referenced this pull request Mar 7, 2026
KV cache serialization requires non-zero offsets on the tensor. Add support in the meta backend to set/get a tensor with a non-zero offset.
JohannesGaessler pushed a commit that referenced this pull request Mar 8, 2026
KV cache serialization requires non-zero offsets on the tensor. Add support in the meta backend to set/get a tensor with a non-zero offset.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants