
[Core] implement redis cache mode #1222

Merged
merged 17 commits into microsoft:main on Jan 20, 2024
Conversation

@vijaykramesh (Contributor) commented Jan 12, 2024

Why are these changes needed?

This adds a Redis mode to the cache. This way I can have multiple processes in separate containers running the same application (that is using autogen) and they can share the LLM cache (whereas with the current disk cache implementation, the SQLite instance ends up being machine-local and can't easily be shared across multiple containers/pods).

The Redis caching uses pickling, the same as the disk cache implementation, so the cache should be functionally equivalent to the disk cache version.
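For reference, here's a minimal sketch of what a pickle-backed Redis cache looks like. It assumes the redis-py client; the class shape and key prefix are illustrative, not necessarily the exact code in this PR.

import pickle

import redis


class RedisCacheSketch:
    def __init__(self, seed, redis_url):
        # One namespace per seed, mirroring how cache_seed partitions the disk cache.
        self.seed = seed
        self.cache = redis.Redis.from_url(redis_url)

    def _prefixed_key(self, key):
        # Hypothetical key prefix, shown only to illustrate per-seed namespacing.
        return f"autogen:{self.seed}:{key}"

    def get(self, key, default=None):
        result = self.cache.get(self._prefixed_key(key))
        if result is None:
            return default
        # Values are pickled, so anything the disk cache can store round-trips here too.
        return pickle.loads(result)

    def set(self, key, value):
        self.cache.set(self._prefixed_key(key), pickle.dumps(value))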

Docs are added inline and also in agent_chat.md:

LLM Caching

Legacy Disk Cache

By default, you can specify a cache_seed in your llm_config to take advantage of a local DiskCache-backed cache. This cache stores the results of your LLM calls and returns the cached result when the same input is seen again, without making another call to the LLM. This is useful for saving compute costs and for speeding up inference.

assistant = AssistantAgent(
    "coding_agent",
    llm_config={
        "cache_seed": 42,
        "config_list": OAI_CONFIG_LIST,
        "max_tokens": 1024,
    },
)

Setting this cache_seed param to None will disable the cache.
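For example, to disable caching entirely:

assistant = AssistantAgent(
    "coding_agent",
    llm_config={
        "cache_seed": None,  # disable the LLM cache
        "config_list": OAI_CONFIG_LIST,
        "max_tokens": 1024,
    },
)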

Configurable Context Manager

A new configurable context manager lets you easily turn the LLM cache on and off, using either DiskCache or Redis. All LLM agents inside the context manager use the same cache.

from autogen.cache.cache import Cache

# Use Redis as the cache backend, shared across processes:
with Cache.redis(cache_seed=42, redis_url="redis://localhost:6379/0") as cache_client:
    user.initiate_chat(assistant, message=coding_task, cache_client=cache_client)

# Or use the machine-local disk cache:
with Cache.disk(cache_seed=42, cache_dir=".cache") as cache_client:
    user.initiate_chat(assistant, message=coding_task, cache_client=cache_client)

Here's an example of the new integration test running in CI. (Note: I had to set up my fork to get it to run; I think it will only run when merging into main. In my fork the other tests fail because my OAI_CONFIG_LIST is not correct.)

[Screenshot: integration test run in CI, 2024-01-12]

Integration test coverage for the new code I added:

➜  autogen git:(vr/redis_cache) ✗ coverage run -a -m pytest test/agentchat/test_cache.py
=============================================================================================================== test session starts ===============================================================================================================
platform darwin -- Python 3.11.4, pytest-7.4.4, pluggy-1.3.0
rootdir: /Users/vijay/oss/autogen
configfile: pyproject.toml
plugins: Faker-19.3.0, anyio-3.7.1
collected 3 items


test/agentchat/test_cache.py ...                                                                                                                                                                                                            [100%]

=============================================================================================================== 3 passed in 47.20s ================================================================================================================

[Screenshot: integration test coverage output, 2024-01-14]

And then per PR feedback I added some unit tests for the cache implementations.

➜  autogen git:(vr/redis_cache) ✗ coverage run -a -m pytest test/cache
=============================================================================================================== test session starts ===============================================================================================================
platform darwin -- Python 3.11.4, pytest-7.4.4, pluggy-1.3.0
rootdir: /Users/vijay/oss/autogen
configfile: pyproject.toml
plugins: Faker-19.3.0, anyio-3.7.1
collected 14 items


test/cache/test_cache.py ....                                                                                                                                                                                                               [ 28%]
test/cache/test_disk_cache.py .....                                                                                                                                                                                                         [ 64%]
test/cache/test_redis_cache.py .....                                                                                                                                                                                                        [100%]

=============================================================================================================== 14 passed in 1.24s ================================================================================================================
[Screenshot: unit test coverage output, 2024-01-14]
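For reference, here's a representative unit test in the spirit of test/cache/test_redis_cache.py. Redis is mocked so no server is needed; the patch target and the RedisCache import path and constructor signature are assumptions based on this PR's docs, not necessarily the exact merged code.

import pickle
from unittest.mock import MagicMock, patch

from autogen.cache.redis_cache import RedisCache


@patch("redis.Redis.from_url")
def test_redis_cache_pickles_values(mock_from_url):
    # Stand in for a live Redis connection.
    mock_client = MagicMock()
    mock_from_url.return_value = mock_client

    cache = RedisCache(seed=42, redis_url="redis://localhost:6379/0")
    cache.set("key", "value")

    # The value should be pickled before being written to redis.
    stored_key, stored_value = mock_client.set.call_args[0]
    assert pickle.loads(stored_value) == "value"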


@vijaykramesh (Contributor, Author) commented:
@microsoft-github-policy-service agree company="Regrello"

@davorrunje (Collaborator) commented:

Redis cache and disk cache have different behaviors after being closed:
...
My thinking is that the cache should detach itself from every agent instance once it exits the with context.

I agree. This will certainly cause problems in the future if not unified.

@ekzhu (Collaborator) commented Jan 20, 2024

@sonichi @vijaykramesh @davorrunje

Sorry, I was incorrect to say that Redis and DiskCache have different exit behaviors. I tested it myself and there is no difference: both caches stay alive and re-open once you use them again.

@ekzhu (Collaborator) commented Jan 20, 2024

I pushed another commit to make sure a_run_chat has the same handling for cache as run_chat.
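For context, the "save previous client cache and reset it after send/a_send" change follows a save-and-restore pattern along these lines; the function and attribute names here are illustrative, not the exact merged code.

def run_chat_sketch(sender, recipient, message, cache_client=None):
    # Remember whatever cache the client had before this chat...
    previous_cache = getattr(sender.client, "cache", None)
    sender.client.cache = cache_client
    try:
        sender.send(message, recipient)
    finally:
        # ...and restore it afterwards, so the context-managed cache does not
        # leak past its with block. a_run_chat now does the same.
        sender.client.cache = previous_cache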

@sonichi added this pull request to the merge queue Jan 20, 2024
Merged via the queue into microsoft:main with commit ee6ad8d Jan 20, 2024
97 checks passed
corleroux pushed a commit to corleroux/autogen that referenced this pull request Jan 30, 2024:
* implement redis cache mode, if redis_url is set in the llm_config then
it will try to use this.  also adds a test to validate both the existing
and the redis cache behavior.

* PR feedback, add unit tests

* more PR feedback, move the new style cache to a context manager

* Update agent_chat.md

* more PR feedback, remove tests from contrib and have them run with the normal jobs

* doc

* updated

* Update website/docs/Use-Cases/agent_chat.md

Co-authored-by: Chi Wang <[email protected]>

* update docs

* update docs; let openaiwrapper to use cache object

* typo

* Update website/docs/Use-Cases/enhanced_inference.md

Co-authored-by: Chi Wang <[email protected]>

* save previous client cache and reset it after send/a_send

* a_run_chat

---------

Co-authored-by: Vijay Ramesh <[email protected]>
Co-authored-by: Eric Zhu <[email protected]>
Co-authored-by: Chi Wang <[email protected]>
whiskyboy pushed a commit to whiskyboy/autogen that referenced this pull request Apr 17, 2024 (same commit message as above)