-
Notifications
You must be signed in to change notification settings - Fork 4.2k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
feat: add support for DataStax HCD vector store #3728
Conversation
This pull request is automatically being deployed by Amplify Hosting (learn more). |
41ae08d
to
1503b0c
Compare
class HCDVectorStoreComponent(LCVectorStoreComponent): | ||
display_name: str = "Hyper-Converged Database" | ||
description: str = "Implementation of Vector Store using Hyper-Converged Database (HCD) with search capabilities" | ||
documentation: str = "https://python.langchain.com/docs/integrations/vectorstores/astradb" |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Is there any specific HCD documentation we want to link to?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Good point, we can link to our install docs. It's worth noting that we are using AstraDB langchain integration so that URL is valid despite it talking to HCD.
|
||
|
||
class HCDVectorStoreComponent(LCVectorStoreComponent): | ||
display_name: str = "Hyper-Converged Database" |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Can you clarify how we'll use HCD re: naming? Do we want it to be so strongly associated with Datastax that we don't need to specify "Datastax HCD"?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I asked about this internally and the decision was to keep it as Hyper-Converged Database
. Thanks for checking!
name="api_endpoint", | ||
display_name="HCD API Endpoint", | ||
info="API endpoint URL for the HCD service.", | ||
value="HCD_API_ENDPOINT", |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Does langchain-astradb support interpreting this env variable as the endpoint (instead of ASTRA_DB_API_ENDPOINT
?). I think this naming makes sense either way though, and should be supported in langchain-astradb if not.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Unfortunately no, langchain-astradb only looks for ASTRA_DB_API_ENDPOINT
. There is not associated HCD_API_ENDPOINT
, instead we set an environment parameter during client initialization environment=Environment.HCD
.
), | ||
HandleInput( | ||
name="embedding", | ||
display_name="Embedding or Astra Vectorize", |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Is vectorize supported in HCD?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I believe the code paths are present, but I am unclear on how it is configured at this time.
Added support for DataStax Hyper-Converged Database (HCD) backed vector store.