info: cached info message #21

VJHack · 2025-01-05T08:51:46Z

When a completion is pulled from the cache, we want to display metrics related to the cache instead of the original response metrics. On a cache hit, the info message is shown in the following format:

C: number of elements currently in the cache / cache size
t: total FIM time

For example:

 ... world\n");     | C: 3 / 250 | t: 0.66 ms
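As a rough illustration of the format above, the cache-hit suffix could be built like this. This is a Python sketch, not the plugin's actual Vimscript; the function name `format_cache_hit_info` and its parameters are hypothetical.

```python
def format_cache_hit_info(cached_count: int, cache_size: int, fim_time_ms: float) -> str:
    """Build the info suffix for a completion served from the cache.

    Hypothetical helper: the name and parameters are illustrative only.
    """
    # C: elements currently in the cache / cache size
    # t: total FIM time in milliseconds, two decimals as in the example
    return f"| C: {cached_count} / {cache_size} | t: {fim_time_ms:.2f} ms"


print(format_cache_hit_info(3, 250, 0.66))
```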

Changes in this PR:

Changed the info message shown when there is a cache hit.
Optimized cache usage by storing only the completion content and not the other metrics, which reduces the size of the cache.
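The second change can be sketched as a bounded cache that keeps only the completion text. This is a Python illustration under stated assumptions, not the plugin's Vimscript: the class name `ContentCache`, the `"content"` key, and the oldest-first eviction policy are all assumptions and are not taken from this PR.

```python
from collections import OrderedDict


class ContentCache:
    """Bounded cache that stores only the completion text (hypothetical sketch)."""

    def __init__(self, max_size: int = 250):
        self.max_size = max_size
        self._items = OrderedDict()

    def put(self, key: str, response: dict) -> None:
        # Keep only the content; timings and other metrics are dropped.
        self._items[key] = response.get("content", "")
        self._items.move_to_end(key)
        # Assumed policy: evict the oldest entries once the cache is full.
        while len(self._items) > self.max_size:
            self._items.popitem(last=False)

    def get(self, key: str):
        # Returns None on a miss, including when the cache is empty.
        return self._items.get(key)

    def __len__(self) -> int:
        return len(self._items)
```

Because each entry is a single string rather than a full response object, the memory cost per entry is roughly the size of the completion text itself.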

This suggestion was mentioned here: #18 (review)

ggerganov

The llama-server now has the option to filter response fields, so another idea for improvement is to use this feature to minimize the amount of information transported from the server to the client, by requesting only the fields that llama.vim needs:

ggml-org/llama.cpp#10819
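A minimal sketch of what such a request might look like, in Python rather than Vimscript. It assumes the filter is passed as a `response_fields` list in the request body, which should be checked against the server documentation; the helper name `build_fim_request` and the default field list are hypothetical.

```python
import json


def build_fim_request(prefix: str, suffix: str, fields=("content",)) -> dict:
    """Build a FIM request body that asks the server for selected fields only.

    Assumption: the server accepts a "response_fields" list naming the
    fields to keep in the response.
    """
    return {
        "input_prefix": prefix,
        "input_suffix": suffix,
        "response_fields": list(fields),
    }


body = build_fim_request('printf("hello', "\n}")
print(json.dumps(body))
```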

VJHack added 3 commits January 5, 2025 01:39

added cached info message

1cfa96a

optimize space is cache

11cf5e9

comment clarification

c1543c5

ggerganov approved these changes Jan 5, 2025

handle empty caches

322b9e6

ggerganov merged commit 3cc84b0 into ggml-org:master Jan 6, 2025

VJHack mentioned this pull request Jan 14, 2025

llama.vim: filter server response fields #24

Merged
