
[8.15] [Inference API] Add Amazon Bedrock support to Inference API (#110248) #110544

Closed
markjhoy wants to merge 1 commit into elastic:8.15 from markjhoy:backport/8.15/pr-110248



Contributor

@markjhoy markjhoy commented Jul 5, 2024

Backport

This will backport the following commits from main to 8.15:

* [Inference API] Add Amazon Bedrock support to Inference API (#110248)

Questions?

Please refer to the Backport tool documentation.

* Initial commit; setup Gradle; start service

* initial commit

* minor cleanups, builds green; needs tests

* bug fixes; tested working embeddings & completion

* use custom json builder for embeddings request

* Ensure auto-close; fix forbidden API

* start of adding unit tests; abstraction layers

* adding additional tests; cleanups

* add requests unit tests

* all tests created

* fix cohere embeddings response

* fix cohere embeddings response

* fix lint

* better test coverage for secrets; inference client

* update thread-safe syncs; make dims/tokens + int

* add tests for dims and max tokens positive integer

* use requireNonNull;override settings type;cleanups

* use r/w lock for client cache

* remove client reference counting

* update locking in cache; client errors; noop doc

* remove extra block in internalGetOrCreateClient

* remove duplicate dependencies; cleanup

* add fxn to get default embeddings similarity

* use async calls to Amazon Bedrock; cleanups

* use Clock in cache; simplify locking; cleanups

* cleanups around executor; remove some instanceof

* cleanups; use EmbeddingRequestChunker

* move max chunk size to constants

* oof - swapped transport vers w/ master node req

* use XContent instead of Jackson JsonFactory

* remove gradle versions; do not allow dimensions

(cherry picked from commit 52e591d)
@markjhoy markjhoy requested a review from a team as a code owner July 5, 2024 17:38
@markjhoy markjhoy requested review from a team and jonathan-buttner July 5, 2024 17:39
@markjhoy markjhoy added the >non-issue, :ml (Machine learning), Team:ML (Meta label for the ML team), :EnterpriseSearch/Application (Enterprise Search), Team:Enterprise Search (Meta label for Enterprise Search team), and v8.15.1 labels Jul 5, 2024
Contributor Author

markjhoy commented Jul 5, 2024

Closing this to let HOMER do the auto-backport.

@markjhoy markjhoy closed this Jul 5, 2024

Labels

backport, :EnterpriseSearch/Application (Enterprise Search), :ml (Machine learning), >non-issue, Team:Enterprise Search (Meta label for Enterprise Search team), Team:ML (Meta label for the ML team), v8.15.1
