Added embeddings endpoint to server #374

Merged: 18 commits from server-embeddings-endpoint into main on Sep 6, 2023

Conversation

@Montagon (Contributor) commented Sep 1, 2023

This PR includes the endpoint for embeddings, supported in the same way that OpenAI supports it.
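For reference, OpenAI's embeddings API accepts a POST to /v1/embeddings with a JSON body carrying "input" and "model" fields and returns a list of embedding vectors. Below is a minimal, hypothetical sketch (JVM stdlib only) of calling a server that mirrors that contract; the localhost host/port, the path, and the model name are illustrative assumptions, not taken from this PR:

```kotlin
import java.net.URI
import java.net.http.HttpClient
import java.net.http.HttpRequest
import java.net.http.HttpResponse

fun main() {
    // Hypothetical request body; the field names mirror OpenAI's
    // public /v1/embeddings contract.
    val body = """{"input": "Hello world", "model": "text-embedding-ada-002"}"""
    val request = HttpRequest.newBuilder()
        .uri(URI.create("http://localhost:8081/v1/embeddings")) // assumed host/port/path
        .header("Content-Type", "application/json")
        .POST(HttpRequest.BodyPublishers.ofString(body))
        .build()
    val response = HttpClient.newHttpClient()
        .send(request, HttpResponse.BodyHandlers.ofString())
    // An OpenAI-compatible response has the shape:
    // {"object":"list","data":[{"object":"embedding","index":0,"embedding":[...]}],...}
    println(response.body())
}
```

This only sketches the wire contract the PR description refers to; the actual server routes live in the PR's diff, not shown in this conversation.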

@javipacheco (Contributor) left a comment

In order to finish this endpoint, we should make sure that the embeddings are working correctly. For that, we are using the SpaceCraftLocal example.

We should change this example to use the local embeddings, something like this:

  val localOpenAi = OpenAI(host = "http://localhost:8081/")

  val model = localOpenAi.DEFAULT_SERIALIZATION

  val scope =
    Conversation(LocalVectorStore(OpenAIEmbeddings(localOpenAi.DEFAULT_EMBEDDING)))

....

@Intex32 (Member) commented Sep 1, 2023

@Montagon have you tested streaming? When I tested it earlier, it hung forever. Probably something was just wrong with my setup.

@Montagon (Contributor, Author) commented Sep 1, 2023

> @Montagon have you tested streaming? When I tested it earlier, it hung forever. Probably something was just wrong with my setup.

In which example did you try it? I tried SpaceCraftLocal with the new endpoint and it fails; not sure if it's related. But I'm checking it. Thanks!

@Intex32 (Member) commented Sep 1, 2023

> @Montagon have you tested streaming? When I tested it earlier, it hung forever. Probably something was just wrong with my setup.
>
> In which example did you try it? I tried SpaceCraftLocal with the new endpoint and it fails; not sure if it's related. But I'm checking it. Thanks!

I used Postman; the response hung and never finished.

@Intex32 (Member) commented Sep 4, 2023

> @Montagon have you tested streaming? When I tested it earlier, it hung forever. Probably something was just wrong with my setup.

As I found out, whether the request hangs or not depends on the length of the response from OpenAI: prompting "Write an essay about climate change" always hangs, since an essay is a long text, while short prompts work as expected. The threshold appears to be around 80,000 characters (or bytes, I'm not sure), more or less.

I haven't figured out why this is.
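The hang can be reproduced without Postman. A hypothetical JVM sketch that streams the response line by line; the host, port, path, and model name are illustrative assumptions, with the request shape following OpenAI's public chat completions streaming contract:

```kotlin
import java.net.URI
import java.net.http.HttpClient
import java.net.http.HttpRequest
import java.net.http.HttpResponse

fun main() {
    // Hypothetical long-response prompt with "stream": true, mirroring
    // OpenAI's chat completions API; field names follow the public contract.
    val body = """
        {"model": "gpt-3.5-turbo", "stream": true,
         "messages": [{"role": "user", "content": "Write an essay about climate change"}]}
    """.trimIndent()
    val request = HttpRequest.newBuilder()
        .uri(URI.create("http://localhost:8081/v1/chat/completions")) // assumed host/port/path
        .header("Content-Type", "application/json")
        .POST(HttpRequest.BodyPublishers.ofString(body))
        .build()
    // Print each server-sent-event line as it arrives; if the bug
    // reproduces, the stream stalls once the response grows long enough.
    HttpClient.newHttpClient()
        .send(request, HttpResponse.BodyHandlers.ofLines())
        .body()
        .forEach(::println)
}
```

Printing lines as they arrive makes the stall point visible, which is how one could narrow down the rough character threshold mentioned above.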

@Intex32 (Member) previously approved these changes Sep 5, 2023 and left a comment

I think we're set. Let's go!

My issue with long prompts will be addressed in a different PR.

@javipacheco (Contributor) left a comment

🚀

@Montagon Montagon merged commit 4a8d3d0 into main Sep 6, 2023
@Montagon Montagon deleted the server-embeddings-endpoint branch September 6, 2023 06:31
5 participants