-
Notifications
You must be signed in to change notification settings - Fork 4.2k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
ref: adds multi threading to the AI/ML embeddings component #2959
Conversation
Pull Request Validation ReportThis comment is automatically generated by Conventional PR Whitelist Report
Result Pull request does not satisfy any enabled whitelist criteria. Pull request will be validated. Validation Report
Result Pull request satisfies all enabled pull request rules. Last Modified at 25 Jul 24 17:55 UTC |
This pull request is automatically being deployed by Amplify Hosting (learn more). |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM
…nting for better code clarit
…-ai#2959) * Use http client for requests and split texts naively * update models list * prints * multithread requests to aiml embeddings * remove comment * [autofix.ci] apply automated fixes * style(AIMLEmbeddingsImpl.py): improve code formatting and add type hinting for better code clarit --------- Co-authored-by: autofix-ci[bot] <114827586+autofix-ci[bot]@users.noreply.github.com> Co-authored-by: Gabriel Luiz Freitas Almeida <[email protected]> (cherry picked from commit 55693c9)
The Langchain implementations of the embeddings classes obviously offer advanced parallelization / retries / etc. Our naive since request call was significantly slower.
This approach is still slower, but by much less of a factor. In the very few tests I had time to run, I was seeing maybe a 50% increase in time required to embed the
cosmos
documents into AstraDB using OpenAI vs. AI/ML embedding components with this approach. (Previously, we couldn't even wait long enough for the AI/ML embedding component to embed successfully).The AI/ML Team is going to figure out why the Langchain OpenAI Implementation with base_url is not working.
Testing: