Using Semantic Kernel Memory #297

aaronpowell · 2024-04-08T04:00:33Z

This moves away from the custom solution for vector searching on Postgres and uses the SK memory feature instead. Added some new SK dependencies for the memory (and PG memory store), then removed the Embedding column from the current data model, as there is a new table that holds all of that.

Refactored the seed logic to load the memory store using the previously generated embeddings.

Changed the CatalogAPI route to use the ISemanticTextMemory search feature to search memory, rather than the custom SQL query. This does mean we don't get the distance surfaced, and pagination is currently lost because SK memory doesn't support it (we could roll that ourselves if we want).

Included a fix so that AOAI can be deployed (issue #280), and pgadmin for easier debugging of the data in the database.

Fixes #282
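
The description mentions that pagination could be rolled by hand on top of the memory search. A minimal sketch of one way to do that, assuming the search returns results in ranked order and accepts a result limit: over-fetch up to the end of the requested page, then skip the earlier pages. `SearchRankedAsync` and `GetPageAsync` are hypothetical names used only for illustration, and the fake search below stands in for the real ISemanticTextMemory call.

```csharp
using System;
using System.Collections.Generic;
using System.Threading.Tasks;

// Request the second page (zero-based index 1) with five items per page.
var secondPage = await GetPageAsync(pageIndex: 1, pageSize: 5);
Console.WriteLine(string.Join(", ", secondPage)); // prints "6, 7, 8, 9, 10"

// Hypothetical stand-in for a ranked search: yields item ids, best match first.
// In the PR this role would be played by the memory search call.
async IAsyncEnumerable<int> SearchRankedAsync(int limit)
{
    for (var id = 1; id <= limit; id++)
    {
        await Task.Yield(); // simulate asynchronous work
        yield return id;
    }
}

// Over-fetch everything up to the end of the requested page,
// then keep only the items that fall after the skipped pages.
async Task<List<int>> GetPageAsync(int pageIndex, int pageSize)
{
    var skip = pageIndex * pageSize;
    var page = new List<int>(pageSize);
    var seen = 0;
    await foreach (var id in SearchRankedAsync(limit: skip + pageSize))
    {
        if (seen++ >= skip)
        {
            page.Add(id);
        }
    }
    return page;
}
```

The trade-off of this approach is that deeper pages cost more, since every earlier result is fetched and discarded, and it does not yield a total item count.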

roji · 2024-04-10T09:28:40Z

src/Catalog.API/Apis/CatalogApi.cs

-            itemsOnPage = itemsWithDistance.Select(i => i.Item).ToList();
-        }
-        else
+        await foreach (var item in itemsWithDistance)


Noting that in addition to losing pagination and the distance, this performs an additional database roundtrip for each result: for a page size of 5, this does one query via SearchMemoryAsync and then 5 FindAsync invocations, for a total of 6 roundtrips. That's not great in terms of performance.
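
One possible way to avoid the per-result roundtrips described above is to collect the ids returned by the search and load them in a single set-based lookup, then restore the ranking order in memory. The sketch below is an assumption-laden illustration, not the PR's code: `FindMany`, `rows`, and `roundtrips` are hypothetical stand-ins for a database query and its table, using an in-memory dictionary so the example runs on its own.

```csharp
using System;
using System.Collections.Generic;
using System.Linq;

// Hypothetical in-memory stand-in for the catalog table.
var rows = new Dictionary<int, string>
{
    [1] = "Tent",
    [2] = "Backpack",
    [3] = "Headlamp",
};
var roundtrips = 0; // counts simulated database roundtrips

// Ids in the order the vector search ranked them (most relevant first).
int[] rankedIds = { 3, 1, 2 };

// One lookup for all ids, then restore the ranking order,
// which a set-based query does not guarantee.
var byId = FindMany(rankedIds);
var ordered = rankedIds
    .Where(id => byId.ContainsKey(id))
    .Select(id => byId[id])
    .ToList();

Console.WriteLine(string.Join(", ", ordered)); // prints "Headlamp, Tent, Backpack"
Console.WriteLine($"roundtrips: {roundtrips}"); // prints "roundtrips: 1"

// Analogous to a single "WHERE Id IN (...)" query: one roundtrip for any number of ids.
Dictionary<int, string> FindMany(IEnumerable<int> ids)
{
    roundtrips++;
    return ids
        .Where(id => rows.ContainsKey(id))
        .ToDictionary(id => id, id => rows[id]);
}
```

With EF Core, the equivalent of `FindMany` would typically be a `Where(c => ids.Contains(c.Id))` query, which brings the example above from 6 roundtrips down to 2 (one search plus one lookup).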

roji

@aaronpowell can I ask for a bit more context on the goal of this PR?

I think that generally, eShop is supposed to provide a sample of how an actual real-world app would be written (at least I think of it that way). If I were to write such an app, and had chosen pgvector as my vector store solution, I think I'd code directly against it, as the code currently does, rather than introduce an abstraction, which in this particular case seems to cause more friction than it removes... Of course, the abstraction would 100% make sense in other scenarios, e.g. when using the vector DB within the SK stack (where it would be necessary), but I'm not sure it makes sense here...

What do you think?

aaronpowell added 3 commits April 8, 2024 13:59

Removing old embedding generator method as SK Memory handles that now

2f0658a

Using config to enable/disable AI

77b2a8a

This means fewer code changes during testing; run dotnet user-secrets set EnableAI true instead.

roji reviewed Apr 10, 2024

adityamandaleeka deleted the branch dotnet:aspire-preview5 April 12, 2024 01:11

adityamandaleeka closed this Apr 12, 2024
