Compatibility with AzureOpenAI #67
Conversation
feat: added azure model name matching to find_model
Thanks for this work @NP4567-dev, I'll be reviewing it next week! 🙏
Before merging, can you add some tests for AzureOpenAI clients? You can copy and adapt the tests from test_openai.py and add them to the same file. The objective is to make sure that we stay compatible with Azure's client as well.
To create the tests, you will need an Azure account, which I assume you have. Add your API key and endpoint as the environment variables AZURE_OPENAI_API_KEY and AZURE_OPENAI_ENDPOINT. Also add fake env vars in conftest.py. Run the tests with make test-record as stated in the newly added contribution docs.
Don't hesitate to comment if you have issues adding these tests! 🤗
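A rough sketch of what such a test could look like, adapted from the pattern described above. The @pytest.mark.vcr marker (pytest-recording), the EcoLogits.init() call, the API version, and the impacts assertions are assumptions based on how the existing OpenAI tests are described in this thread, not code taken from the repository:

```python
import os

import pytest
from openai import AzureOpenAI

from ecologits import EcoLogits


@pytest.mark.vcr  # assumed: cassettes recorded/replayed via pytest-recording
def test_azure_openai_chat():
    EcoLogits.init()  # assumed tracer setup, mirroring the existing OpenAI tests
    client = AzureOpenAI(
        api_key=os.getenv("AZURE_OPENAI_API_KEY"),
        azure_endpoint=os.getenv("AZURE_OPENAI_ENDPOINT"),
        api_version="2024-02-01",  # hypothetical API version, adjust to the deployment
    )
    response = client.chat.completions.create(
        model="gpt-35-turbo",  # Azure-style name: no dot, unlike "gpt-3.5-turbo"
        messages=[{"role": "user", "content": "Hello World!"}],
    )
    # Assumed: EcoLogits attaches an `impacts` attribute to the response
    assert len(response.choices) > 0
    assert response.impacts.energy.value > 0
```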
ecologits/model_repository.py (Outdated)

-        for model in self.__models:
-            # To handle specific LiteLLM calling (e.g., mistral/mistral-small)
-            if model.provider == provider and model.name in model_name:
-                return model
-        return None
+        provider_models = [model for model in self.__models if model.provider == provider]
+        return next(
+            (
+                model
+                for model in provider_models
+                if (model.name == model_name or model.name.replace(".", "") == model_name)
+            ),
+            next((model for model in provider_models if model.name in model_name), None),
+        )
Instead of changing this function, I would rather add the models to the models.csv file. For instance, gpt-35-turbo will be a duplicated line of gpt-3.5-turbo. We plan to refactor the model repository to introduce aliases in the future.
Ok perfect, adding some kind of aliasing mechanism was something I thought of but felt was a bit overkill for just Azure.
This function still needs to be changed at some point though: currently it seems like gpt-4-turbo and gpt-4o match with gpt-4. (I can move this to another PR or open an issue if needed, as this is a separate problem.) Aliases could fix this too.
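For illustration, the over-matching comes from the substring check (`model.name in model_name`) in the current lookup shown above:

```python
# The stored name "gpt-4" is a substring of both requested names,
# so the substring-based lookup can return the gpt-4 entry for them too.
print("gpt-4" in "gpt-4-turbo")  # True
print("gpt-4" in "gpt-4o")       # True
```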
Agree, I think this bug was introduced when adding litellm, will get it fixed in the next release for sure! 🙏
Adding the tests is proving to be a bit more complicated than I anticipated.
pytest-recording (or the underlying vcrpy) always seems to be trying to overwrite some cassettes. Running make test raises vcr.errors.CannotOverwriteExistingCassetteException on the google, huggingface and litellm tests. Even after cloning and reinstalling from scratch, this still happens.
I did manage to generate a cassette for Azure OpenAI, but whenever I try to run the tests using it, the same error is raised.
Any idea on how to solve this?
Ok, thanks for letting me know. That is indeed strange behavior... Can you push your code as it is right now, so I can try on my side and see if I have the same issue?
@NP4567-dev I think I have found your work on your fork (branch feature/azure-tests).
The issue you have is probably because the names of the tests you've added for Azure OpenAI are identical to the tests for OpenAI. If you change the function names from test_openai_chat to test_azure_openai_chat in the test_azure_openai.py file, it should work properly. :)
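In other words (a sketch; the exact cassette path is an assumption, but since cassettes are named after the test function, duplicate names collide):

```python
import pytest


# Before: same function name as in test_openai.py, so both tests resolve
# to the same cassette and the new one tries to overwrite it.
@pytest.mark.vcr
def test_openai_chat():
    ...


# After: a unique name gives the Azure test its own cassette
# (e.g. cassettes/test_azure_openai_chat.yaml; exact path assumed).
@pytest.mark.vcr
def test_azure_openai_chat():
    ...
```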
Yeah, this was another problem I had already fixed locally.
I did find the issue though: we have a proxy that prevented URL matching on the cassettes, hence the recording attempt.
It is fixed now; I just have a few final verifications to make and it should be ready to merge.
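For anyone hitting the same proxy issue: one common workaround (not necessarily what was done here) is to relax the request matchers through pytest-recording's vcr_config fixture, so cassettes are matched on path and query rather than the full URL:

```python
# conftest.py (sketch): match recorded requests on method, path and query only,
# so a proxy rewriting the host no longer breaks cassette playback;
# also scrub credentials from the recordings.
import pytest


@pytest.fixture(scope="module")
def vcr_config():
    return {
        "match_on": ["method", "path", "query"],
        "filter_headers": ["authorization", "api-key"],
    }
```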
* revert: roll back find_model and deleted related tests, updated data csv
* feat: review changes
* feat: added tests and slight fix
* fix: removed extra csv rows
* fix: rolled back env specific changes
* fix: rolled back env specific changes
Addressing: #60
Context:
Though the wrapped class is the same for Azure OpenAI, there is a difference in the model names: Azure does not use dots in the name (gpt-3.5-turbo in the OpenAI API becomes gpt-35-turbo on Azure).
Impacts:
Rework of the find_model method in two steps (see the sketch below).
This prevents gpt-4-turbo matching gpt-4, for example.
The choice was made not to add Azure as a provider and to consider that a model running on either Azure or OpenAI infrastructure has the same impacts; this could be discussed.
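A sketch of the two-step lookup described above (an illustrative, standalone version of the diff earlier in the thread; note that, per the commit list, this rework was later rolled back in favor of adding the Azure-style names to models.csv):

```python
def find_model(models, provider, model_name):
    provider_models = [m for m in models if m.provider == provider]
    # Step 1: exact match, also accepting Azure-style names with the dots removed,
    # so "gpt-35-turbo" resolves to the "gpt-3.5-turbo" entry.
    for m in provider_models:
        if m.name == model_name or m.name.replace(".", "") == model_name:
            return m
    # Step 2: fall back to substring matching, kept for LiteLLM-style names
    # such as "mistral/mistral-small".
    for m in provider_models:
        if m.name in model_name:
            return m
    return None
```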
Side effects:
As of right now, there are no side effects (no false-positive matching), but further down the line too many provider-specific matching rules could become a hassle to maintain.