Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

ref: adds multi threading to the AI/ML embeddings component #2959

Merged
merged 7 commits into from
Jul 25, 2024

Conversation

jordanrfrazier
Copy link
Collaborator

The Langchain implementations of the embeddings classes obviously offer advanced parallelization / retries / etc. Our naive since request call was significantly slower.

This approach is still slower, but by much less of a factor. In the very few tests I had time to run, I was seeing maybe a 50% increase in time required to embed the cosmos documents into AstraDB using OpenAI vs. AI/ML embedding components with this approach. (Previously, we couldn't even wait long enough for the AI/ML embedding component to embed successfully).

The AI/ML Team is going to figure out why the Langchain OpenAI Implementation with base_url is not working.

Testing:

  • verified that the embeddings stored match between AI/ML and OpenAI (so no out of ordering was done via the multi-threading).

@dosubot dosubot bot added size:L This PR changes 100-499 lines, ignoring generated files. enhancement New feature or request python Pull requests that update Python code labels Jul 25, 2024
Copy link
Contributor

Pull Request Validation Report

This comment is automatically generated by Conventional PR

Whitelist Report

Whitelist Active Result
Pull request is a draft and should be ignored
Pull request is made by a whitelisted user and should be ignored
Pull request is submitted by a bot and should be ignored
Pull request is submitted by administrators and should be ignored

Result

Pull request does not satisfy any enabled whitelist criteria. Pull request will be validated.

Validation Report

Validation Active Result
All commits in this pull request has valid messages
Pull request does not introduce too many changes
Pull request has a valid title
Pull request has mentioned issues
Pull request has valid branch name
Pull request should have a non-empty body

Result

Pull request satisfies all enabled pull request rules.

Last Modified at 25 Jul 24 17:55 UTC

Copy link

This pull request is automatically being deployed by Amplify Hosting (learn more).

Access this pull request here: https://pr-2959.dmtpw4p5recq1.amplifyapp.com

Copy link
Contributor

@ogabrielluiz ogabrielluiz left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@dosubot dosubot bot added the lgtm This PR has been approved by a maintainer label Jul 25, 2024
@ogabrielluiz ogabrielluiz merged commit 55693c9 into main Jul 25, 2024
50 checks passed
@ogabrielluiz ogabrielluiz deleted the aiml-perf branch July 25, 2024 21:50
nicoloboschi pushed a commit to datastax/ragstack-ai-langflow that referenced this pull request Jul 30, 2024
…-ai#2959)

* Use http client for requests and split texts naively

* update models list

* prints

* multithread requests to aiml embeddings

* remove comment

* [autofix.ci] apply automated fixes

* style(AIMLEmbeddingsImpl.py): improve code formatting and add type hinting for better code clarit

---------

Co-authored-by: autofix-ci[bot] <114827586+autofix-ci[bot]@users.noreply.github.com>
Co-authored-by: Gabriel Luiz Freitas Almeida <[email protected]>
(cherry picked from commit 55693c9)
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request lgtm This PR has been approved by a maintainer python Pull requests that update Python code size:L This PR changes 100-499 lines, ignoring generated files.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants