Add Azure CosmosDB MongoDB vCore option for agent memory #3187

tyler-suard-parker · 2024-07-22T17:29:43Z

This is pretty much an exact duplication of the current teachability code, except that it uses MongoDB vCore instead of ChromaDB. Why? Because ChromaDB stores all its information in memory, so it is not ideal for long conversations or for containers/VMs that delete everything on restart. This change lets memories be stored and recalled via a permanent database rather than an ephemeral one, and because the data is not held in memory on the virtual machine, it will not slow down execution of the agent workflow.

@sonichi @victordibia

Why are these changes needed?

Currently, AutoGen stores memories using ChromaDB. While easy to use and capable, ChromaDB must store information in memory, which is a problem for low-memory environments like Azure Web Apps or Function Apps (they have around 500 MB). These files allow AutoGen to store memories in an external vector database using Azure CosmosDB for MongoDB vCore and to perform vector searches on them.

The code automatically sets up the vector databases and creates vector indexes on them.
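As an illustration of what that setup might involve, here is a minimal sketch (not the PR's actual code) of the two requests typically sent to Azure Cosmos DB for MongoDB vCore: a `createIndexes` command that declares a `cosmosSearch` vector index, and a `$search` aggregation stage that queries it. The helper names, the `contentVector` field, the index name, and the default parameters are assumptions chosen for illustration. The builders only return plain dictionaries, so they can be inspected without a database connection.

```python
from typing import Any, Dict, List, Sequence


def build_vector_index_command(
    collection_name: str,
    dimensions: int,
    vector_field: str = "contentVector",    # hypothetical field name
    index_name: str = "vectorSearchIndex",  # hypothetical index name
    num_lists: int = 1,
    similarity: str = "COS",
) -> Dict[str, Any]:
    """Build a createIndexes command declaring an IVF vector index."""
    # The command name must be the first key of the document.
    return {
        "createIndexes": collection_name,
        "indexes": [
            {
                "name": index_name,
                "key": {vector_field: "cosmosSearch"},
                "cosmosSearchOptions": {
                    "kind": "vector-ivf",
                    "numLists": num_lists,
                    "similarity": similarity,
                    "dimensions": dimensions,
                },
            }
        ],
    }


def build_vector_search_pipeline(
    query_vector: Sequence[float],
    k: int = 3,
    vector_field: str = "contentVector",  # must match the indexed field
) -> List[Dict[str, Any]]:
    """Build an aggregation pipeline returning the k nearest documents."""
    return [
        {
            "$search": {
                "cosmosSearch": {
                    "vector": list(query_vector),
                    "path": vector_field,
                    "k": k,
                },
                "returnStoredSource": True,
            }
        },
        # Expose the similarity score alongside the stored document.
        {
            "$project": {
                "similarityScore": {"$meta": "searchScore"},
                "document": "$$ROOT",
            }
        },
    ]
```

With `pymongo`, these would presumably be used as `db.command(build_vector_index_command("memos", 1536))` and `db["memos"].aggregate(build_vector_search_pipeline(embedding))`, where the collection name `memos` and the dimension 1536 are again placeholders that depend on the embedding model in use.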

Related issue number

Closes Issue #3066

Checks

I've included any doc changes needed for https://microsoft.github.io/autogen/. See https://microsoft.github.io/autogen/docs/Contribute#documentation to build and test documentation locally.
I've added tests (if relevant) corresponding to the changes introduced in this PR.
I've made sure all auto checks have passed.

sonichi · 2024-07-25T18:23:00Z

I like the idea of extending to different vector DBs. Can we do it without code duplication? @thinkall does the current vector DB abstraction allow an extensible design here?

tyler-suard-parker · 2024-07-25T18:25:33Z

@sonichi thank you for your response. I would love to do this without any code duplication, but the original memory module has ChromaDB entangled throughout the code, with no abstractions or interfaces at all. I had to pretty much rewrite everything from scratch.

thinkall

Since we have a vectordb abstraction here: https://github.com/microsoft/autogen/tree/main/autogen/agentchat/contrib/vectordb, it would be nice to update the teachability code to leverage it. Then we could easily support different vector DBs, as we've done with the RAG agent.
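To make the suggestion concrete, here is a rough sketch of how a memory module could depend on a small storage interface instead of a specific database. It does not reproduce AutoGen's actual `vectordb` interface or the teachability code; `MemoStore`, `InMemoryMemoStore`, `MemoryModule`, and `toy_embed` are all hypothetical names, and the letter-count embedding is only a stand-in for a real embedding model.

```python
import math
from typing import Callable, List, Protocol, Sequence, Tuple


class MemoStore(Protocol):
    """Hypothetical storage interface; not AutoGen's actual VectorDB API."""

    def add(self, embedding: Sequence[float], input_text: str, output_text: str) -> None: ...

    def nearest(self, embedding: Sequence[float], k: int) -> List[Tuple[str, str, float]]: ...


def _cosine_distance(a: Sequence[float], b: Sequence[float]) -> float:
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return 1.0 - dot / norm if norm else 1.0


class InMemoryMemoStore:
    """Reference backend; a ChromaDB or Cosmos DB backend would expose the same two methods."""

    def __init__(self) -> None:
        self._rows: List[Tuple[List[float], str, str]] = []

    def add(self, embedding: Sequence[float], input_text: str, output_text: str) -> None:
        self._rows.append((list(embedding), input_text, output_text))

    def nearest(self, embedding: Sequence[float], k: int) -> List[Tuple[str, str, float]]:
        scored = [(inp, out, _cosine_distance(embedding, vec)) for vec, inp, out in self._rows]
        return sorted(scored, key=lambda row: row[2])[:k]


def toy_embed(text: str) -> List[float]:
    """Stand-in embedding: counts of each letter a-z (illustration only)."""
    lowered = text.lower()
    return [float(lowered.count(ch)) for ch in "abcdefghijklmnopqrstuvwxyz"]


class MemoryModule:
    """Memory logic written once against MemoStore, independent of the backend."""

    def __init__(self, store: MemoStore, embed: Callable[[str], Sequence[float]]) -> None:
        self._store = store
        self._embed = embed

    def remember(self, input_text: str, output_text: str) -> None:
        self._store.add(self._embed(input_text), input_text, output_text)

    def recall(self, query: str, k: int = 1) -> List[Tuple[str, str]]:
        hits = self._store.nearest(self._embed(query), k)
        return [(inp, out) for inp, out, _distance in hits]
```

Under this kind of design, adding a new database means writing one more class with `add` and `nearest`, while the remember/recall logic stays in a single place rather than being duplicated per backend.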

rysweet · 2024-10-12T00:32:12Z

Hi @tyler-suard-parker - it looks like things have evolved quite a bit since you sent in this valuable contribution. Think about whether you would like to update with @thinkall's suggestions or perhaps engage with us on the new 0.4 codebase. Closing for now and please reopen if you want to revive this work.

tyler-suard-parker · 2024-10-14T18:10:21Z

@rysweet That is the nicest closing message I have ever received on a pull request. Thank you!

sonichi requested review from thinkall and rickyloynd-microsoft July 25, 2024 18:18

Merge branch 'main' into main

b22cc04

sonichi had a problem deploying to openai1 with GitHub Actions (failure), August 24, 2024 18:21

thinkall reviewed Sep 24, 2024

ekzhu changed the base branch from main to 0.2 October 2, 2024 18:27

jackgerrits added the "0.2" label (Issues which were filed before re-arch to 0.4) Oct 4, 2024

rysweet added the "awaiting-op-response" label (Issue or PR has been triaged or responded to and is now awaiting a reply from the original poster) Oct 10, 2024

Merge branch '0.2' into main

d57522e

rysweet closed this Oct 12, 2024
