ref: adds multi threading to the AI/ML embeddings component #2959

jordanrfrazier · 2024-07-25T17:55:05Z

The Langchain implementations of the embeddings classes obviously offer advanced parallelization / retries / etc. Our naive since request call was significantly slower.

This approach is still slower, but by much less of a factor. In the very few tests I had time to run, I was seeing maybe a 50% increase in time required to embed the cosmos documents into AstraDB using OpenAI vs. AI/ML embedding components with this approach. (Previously, we couldn't even wait long enough for the AI/ML embedding component to embed successfully).

The AI/ML Team is going to figure out why the Langchain OpenAI Implementation with base_url is not working.

Testing:

verified that the embeddings stored match between AI/ML and OpenAI (so no out of ordering was done via the multi-threading).

github-actions · 2024-07-25T17:55:57Z

Pull Request Validation Report

This comment is automatically generated by Conventional PR

Whitelist Report

Whitelist	Active	Result
Pull request is a draft and should be ignored	✅	❌
Pull request is made by a whitelisted user and should be ignored	❌	❌
Pull request is submitted by a bot and should be ignored	✅	❌
Pull request is submitted by administrators and should be ignored	❌	❌

Result

Pull request does not satisfy any enabled whitelist criteria. Pull request will be validated.

Validation Report

Validation	Active	Result
All commits in this pull request has valid messages	❌	✅
Pull request does not introduce too many changes	❌	✅
Pull request has a valid title	✅	✅
Pull request has mentioned issues	❌	✅
Pull request has valid branch name	❌	✅
Pull request should have a non-empty body	✅	✅

Result

Pull request satisfies all enabled pull request rules.

_{Last Modified at 25 Jul 24 17:55 UTC}

aws-amplify-sa-east-1 · 2024-07-25T17:58:33Z

This pull request is automatically being deployed by Amplify Hosting (learn more).

Access this pull request here: https://pr-2959.dmtpw4p5recq1.amplifyapp.com

ogabrielluiz

LGTM

…nting for better code clarit

…-ai#2959) * Use http client for requests and split texts naively * update models list * prints * multithread requests to aiml embeddings * remove comment * [autofix.ci] apply automated fixes * style(AIMLEmbeddingsImpl.py): improve code formatting and add type hinting for better code clarit --------- Co-authored-by: autofix-ci[bot] <114827586+autofix-ci[bot]@users.noreply.github.com> Co-authored-by: Gabriel Luiz Freitas Almeida <[email protected]> (cherry picked from commit 55693c9)

jordanrfrazier added 5 commits July 25, 2024 09:58

Use http client for requests and split texts naively

62adc1a

update models list

5b8d381

prints

f7c065f

multithread requests to aiml embeddings

e946f61

remove comment

d6b31d5

dosubot bot added size:L This PR changes 100-499 lines, ignoring generated files. enhancement New feature or request python Pull requests that update Python code labels Jul 25, 2024

[autofix.ci] apply automated fixes

a0eadb9

ogabrielluiz approved these changes Jul 25, 2024

View reviewed changes

dosubot bot added the lgtm This PR has been approved by a maintainer label Jul 25, 2024

style(AIMLEmbeddingsImpl.py): improve code formatting and add type hi…

9dc18f5

…nting for better code clarit

ogabrielluiz merged commit 55693c9 into main Jul 25, 2024
50 checks passed

ogabrielluiz deleted the aiml-perf branch July 25, 2024 21:50

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ref: adds multi threading to the AI/ML embeddings component #2959

ref: adds multi threading to the AI/ML embeddings component #2959

jordanrfrazier commented Jul 25, 2024

github-actions bot commented Jul 25, 2024

aws-amplify-sa-east-1 bot commented Jul 25, 2024

ogabrielluiz left a comment

ref: adds multi threading to the AI/ML embeddings component #2959

ref: adds multi threading to the AI/ML embeddings component #2959

Conversation

jordanrfrazier commented Jul 25, 2024

github-actions bot commented Jul 25, 2024

Pull Request Validation Report

Whitelist Report

Validation Report

aws-amplify-sa-east-1 bot commented Jul 25, 2024

ogabrielluiz left a comment

Choose a reason for hiding this comment