This repository has been archived by the owner on Nov 13, 2024. It is now read-only.

Add Anyscale Embedding model support #198

Merged
5 commits merged into pinecone-io:main from the ae-embedding branch on Dec 13, 2023

Conversation

@kylehh (Contributor) commented on Dec 5, 2023

Problem

No support for open-source embedding models.

Solution

Added Anyscale Embedding model support.

Type of Change

  • [ ] Bug fix (non-breaking change which fixes an issue)
  • [x] New feature (non-breaking change which adds functionality)
  • [ ] Breaking change (fix or feature that would cause existing functionality to not work as expected)
  • [ ] This change requires a documentation update
  • [ ] Infrastructure change (CI configs, etc)
  • [ ] Non-code change (docs, etc)
  • [ ] None of the above: (explain here)

Test Plan

  • Unit test: tests/system/record_encoder/test_anyscale_record_encoder.py
  • CLI check: canopy test config/anyscale.yaml
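
For readers who want to try the new encoder outside the test suite, here is a minimal sketch. The import paths, the AnyscaleRecordEncoder class name, and the encode_documents/encode_queries interface are assumptions modelled on Canopy's existing OpenAIRecordEncoder rather than taken from this PR's diff; the unit test above is the authoritative usage.

```python
# Minimal sketch (not from the PR): exercise the new Anyscale encoder directly.
# Import paths, class name, and the RecordEncoder methods used here are assumptions
# based on Canopy's existing OpenAIRecordEncoder interface.
import os

from canopy.knowledge_base.models import KBDocChunk
from canopy.knowledge_base.record_encoder import AnyscaleRecordEncoder
from canopy.models.data_models import Query

os.environ.setdefault("ANYSCALE_API_KEY", "<your-anyscale-api-key>")

encoder = AnyscaleRecordEncoder()  # assumed to default to an Anyscale embedding model

chunks = [
    KBDocChunk(id="doc1_0", text="Canopy is a RAG framework by Pinecone.",
               document_id="doc1", source="example"),
]
encoded_chunks = encoder.encode_documents(chunks)
print(len(encoded_chunks[0].values))   # embedding dimensionality of the documents

encoded_queries = encoder.encode_queries([Query(text="What is Canopy?")])
print(len(encoded_queries[0].values))  # should match the document dimensionality
```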

@igiloh-pinecone (Contributor) left a comment

@kylehh LGTM!

Please see one clarification question below.


```diff
 chat_engine:
   llm: &llm
     type: AnyscaleLLM  # Options: [OpenAILLM, AnyscaleLLM]
     params:
-      model_name: meta-llama/Llama-2-7b-chat-hf  # The name of the model to use.
+      model_name: HuggingFaceH4/zephyr-7b-beta  # The name of the model to use.
```

@kylehh why do you want to change the llama2 version?

@igiloh-pinecone igiloh-pinecone added this pull request to the merge queue Dec 10, 2023
@igiloh-pinecone igiloh-pinecone removed this pull request from the merge queue due to a manual request Dec 10, 2023
We have a hacky workaround in the CLI that selects the batch_size of documents according to the expected batch_size of the underlying chunks to encode.
However, this mechanism used a hardcoded number instead of the actual encoder's batch_size.
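
The sketch below (illustrative only, not the actual Canopy CLI code) shows the idea: derive the documents-per-batch count from the encoder's own batch_size instead of a hardcoded constant. The function name and the expected_chunks_per_document heuristic are assumptions made for illustration.

```python
# Illustrative sketch only -- not the Canopy CLI implementation. Pick the number of
# documents per upsert batch from the encoder's own batch_size instead of hardcoding it.
def docs_per_upsert_batch(encoder_batch_size: int,
                          expected_chunks_per_document: int = 10) -> int:
    """Choose a document batch size so the resulting chunks roughly fill
    one encoder batch."""
    return max(1, encoder_batch_size // expected_chunks_per_document)


# e.g. an encoder that embeds 400 chunks per request -> about 40 documents per batch
print(docs_per_upsert_batch(encoder_batch_size=400))  # 40
```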
@igiloh-pinecone (Contributor) left a comment

LGTM

@igiloh-pinecone igiloh-pinecone added this pull request to the merge queue Dec 12, 2023
@github-merge-queue github-merge-queue bot removed this pull request from the merge queue due to failed status checks Dec 12, 2023
@igiloh-pinecone igiloh-pinecone added this pull request to the merge queue Dec 13, 2023
Merged via the queue into pinecone-io:main with commit 7844f31 Dec 13, 2023
10 checks passed
@kylehh kylehh deleted the ae-embedding branch December 13, 2023 15:24
@grpinto commented on Jan 21, 2024

I don't understand how this is going to change the initial variable definition, or how it should be provided. I already provided my Anyscale API key and still cannot use this command:

canopy new test

This is the error that I get:

```
(canopy-env) kingsize@Goncalos-MacBook-Pro-3 EdGenAI % canopy new test
Canopy is going to create a new index named canopy--test with the following initialization parameters:
{}

Do you want to continue? [y/N]: y
Error: Failed to create a new index. Reason:
Canopy has failed to infer vectors' dimensionality using the selected encoder: OpenAIRecordEncoder. You can provide the dimension manually, try using a different encoder, or fix the underlying error:
Failed to enconde documents using OpenAIRecordEncoder. Error: Your OpenAI account seem to have reached the rate limit. Details: You exceeded your current quota, please check your plan and billing details. For more information on this error, read the docs: https://platform.openai.com/docs/guides/error-codes/api-errors.
```
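
A note on the error above: canopy new infers the index dimension by encoding a sample document with whichever record encoder the active config selects, which in the default config is OpenAIRecordEncoder, so setting an Anyscale API key alone does not switch encoders; you would need to point the CLI at the Anyscale config file added in this PR (see the Canopy README for how to pass a config file). The sketch below, with the class name and dimension property assumed from Canopy's encoder interface, shows how the dimensionality check would go through the Anyscale encoder without any OpenAI call.

```python
# Hedged sketch: the class name and the `dimension` property are assumptions based on
# Canopy's RecordEncoder interface. It only illustrates that dimensionality inference
# goes through whichever encoder the config selects, not through the API key you set.
import os

from canopy.knowledge_base.record_encoder import AnyscaleRecordEncoder

os.environ.setdefault("ANYSCALE_API_KEY", "<your-anyscale-api-key>")

encoder = AnyscaleRecordEncoder()
print(encoder.dimension)  # the dimension Canopy would use when creating the index
```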
