
Python: Verify local models in Ollama and LM Studio are compatible with the OpenAI connector #6973

Open · wants to merge 11 commits into main
Conversation

TaoChenOSU
Copy link
Contributor

Motivation and Context

Related to #6498

The use of local models presents a twofold benefit for developers: increased flexibility and reduced costs. Ollama and LM Studio are two well-known platforms that facilitate the hosting of models locally, both of which offer compatibility with OpenAI endpoints. As such, it is imperative that our OpenAI connector functions correctly when users are operating models on these platforms.

Description

  1. Verify that the OpenAI connector works as expected with models hosted locally using Ollama and LM Studio.
  2. Add three new samples (Ollama/chat, LM Studio/chat, LM Studio/embedding) under /concepts/local_models to show how to use local models with the OpenAI connector.
  3. Fix a bug in test_sample_utils.py where the input was never reset when a test case was retried.
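The bug in item 3 is easy to reproduce in the abstract: if a retry wrapper lets a failed attempt consume or mutate the shared input, later attempts see stale state. A minimal sketch of the fix (hypothetical names; not the actual test_sample_utils.py code):

```python
# Minimal illustration of the retry/reset bug described above.
# Names are hypothetical; the real fix lives in test_sample_utils.py.

def run_with_retry(func, inputs, retries=3):
    """Retry func, restoring the original inputs before every attempt."""
    for attempt in range(retries):
        attempt_inputs = list(inputs)  # reset: fresh copy so func can't corrupt the original
        try:
            return func(attempt_inputs)
        except Exception:
            if attempt == retries - 1:
                raise

calls = {"n": 0}

def flaky(inp):
    calls["n"] += 1
    inp.pop(0)  # consumes the input destructively
    if calls["n"] < 2:
        raise RuntimeError("transient failure")
    return inp

result = run_with_retry(flaky, ["a", "b", "c"])
print(result)  # ['b', 'c'] — the second attempt saw the full input again
```

Without the `list(inputs)` reset, the second attempt would start from `["b", "c"]` and produce the wrong result.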

Contribution Checklist

@TaoChenOSU TaoChenOSU self-assigned this Jun 26, 2024
@TaoChenOSU TaoChenOSU requested a review from a team as a code owner June 26, 2024 20:59
@markwallace-microsoft markwallace-microsoft added the python Pull requests for the Python Semantic Kernel label Jun 26, 2024
@github-actions github-actions bot changed the title Verify local models in Ollama and LM Studio are compatible with the OpenAI connector Python: Verify local models in Ollama and LM Studio are compatible with the OpenAI connector Jun 26, 2024
@markwallace-microsoft
Copy link
Member

markwallace-microsoft commented Jun 26, 2024

Py3.10 Test Coverage

Python 3.10 Test Coverage Report

File  | Stmts | Miss | Cover | Missing
TOTAL | 6602  | 827  | 87%   |

report-only-changed-files is enabled. No files were changed during this commit :)

Python 3.10 Unit Test Overview

Tests | Skipped | Failures | Errors | Time
1558  | 1 💤    | 0 ❌     | 0 🔥    | 24.544s ⏱️

Copy link
Member

@eavanvalkenburg eavanvalkenburg left a comment


small note, let's make it so that you can just use the Completion service directly instead of having to create your own client!


When testing this sample locally with semantic-kernel==1.1.2, I also needed to add a (fake) api_key to the OpenAIChatCompletion call on line 39. Otherwise I get a parameter validation error. Maybe I'm wrong, but please verify.

Copy link
Contributor Author


That shouldn't happen as the api_key is not a model field. Could you provide more information, such as the validation error message?


Running your ollama_chat_completion.py with semantic-kernel 1.1.2 and openai 1.35.7 on macOS 14.5 produces the traceback below. Note: I had to change phi3 to phi3:mini for my computer.

Traceback (most recent call last):
  File "/opt/homebrew/Caskroom/miniconda/base/envs/sk/lib/python3.12/site-packages/semantic_kernel/connectors/ai/open_ai/services/open_ai_chat_completion.py", line 51, in __init__
    openai_settings = OpenAISettings.create(
                      ^^^^^^^^^^^^^^^^^^^^^^
  File "/opt/homebrew/Caskroom/miniconda/base/envs/sk/lib/python3.12/site-packages/semantic_kernel/kernel_pydantic.py", line 56, in create
    return cls(**data)
           ^^^^^^^^^^^
  File "/opt/homebrew/Caskroom/miniconda/base/envs/sk/lib/python3.12/site-packages/pydantic_settings/main.py", line 140, in __init__
    super().__init__(
  File "/opt/homebrew/Caskroom/miniconda/base/envs/sk/lib/python3.12/site-packages/pydantic/main.py", line 176, in __init__
    self.__pydantic_validator__.validate_python(data, self_instance=self)
pydantic_core._pydantic_core.ValidationError: 1 validation error for OpenAISettings
api_key
  Field required [type=missing, input_value={'chat_model_id': 'phi3:mini'}, input_type=dict]
    For further information visit https://errors.pydantic.dev/2.7/v/missing

The above exception was the direct cause of the following exception:

Traceback (most recent call last):
  File "/Users/andi/Projects/sk/ollama_chat_completion.py", line 39, in <module>
    kernel.add_service(OpenAIChatCompletion(service_id=service_id, ai_model_id="phi3:mini", async_client=openAIClient))
  File "/opt/homebrew/Caskroom/miniconda/base/envs/sk/lib/python3.12/site-packages/semantic_kernel/connectors/ai/open_ai/services/open_ai_chat_completion.py", line 59, in __init__
    raise ServiceInitializationError("Failed to create OpenAI settings.", ex) from ex
semantic_kernel.exceptions.service_exceptions.ServiceInitializationError: ('Failed to create OpenAI settings.', 1 validation error for OpenAISettings
api_key
  Field required [type=missing, input_value={'chat_model_id': 'phi3:mini'}, input_type=dict]
    For further information visit https://errors.pydantic.dev/2.7/v/missing)

If I change line 39 by adding the fake API key as a second argument, i.e. kernel.add_service(OpenAIChatCompletion(service_id=service_id, api_key="fake-key", ...)), the error goes away and the code works.

Hope this helps, Andi

Copy link
Contributor


Hi @TaoChenOSU, yes, this is true, as @AndreasKunar pointed out. In OpenAISettings the model has api_key as a required attribute. See here.

Ollama local models don't require an API key, so when none is provided, this model validation fails. That is why Andi needed to provide a fake key to get it to pass.
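The failure mode is just required-field validation at construction time. A stdlib-only sketch of the same behavior (hypothetical class names; the real OpenAISettings is a pydantic-settings model, not a dataclass):

```python
# Stdlib-only sketch of required-field validation, mirroring why
# OpenAISettings rejects a missing api_key. Class name is hypothetical.
from dataclasses import dataclass

@dataclass
class FakeOpenAISettings:
    chat_model_id: str
    api_key: str  # required: no default, so omitting it raises at construction

try:
    FakeOpenAISettings(chat_model_id="phi3:mini")  # no api_key -> fails
except TypeError as ex:
    print("validation failed:", ex)

# Supplying any placeholder key satisfies the check, which is why the
# fake-key workaround makes the sample run against a local Ollama server:
ok = FakeOpenAISettings(chat_model_id="phi3:mini", api_key="fake-key")
print(ok.api_key)  # fake-key
```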

Copy link
Contributor Author


I see. Then I guess the api_key shouldn't be a required parameter; if neither a key nor an async_client is provided, it will be caught here.
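The proposal above amounts to making api_key optional and validating it lazily: only raise when neither a key nor a preconfigured async client is available. A minimal sketch (hypothetical names; not the actual Semantic Kernel connector code):

```python
# Sketch of lazy api_key validation: accept either an api_key or a
# ready-made client, and fail only when both are missing.
# Names are hypothetical, not the actual connector implementation.

class ServiceInitializationError(Exception):
    pass

class ChatService:
    def __init__(self, ai_model_id, api_key=None, async_client=None):
        if async_client is None:
            if api_key is None:
                # Caught here, as proposed: no key and no client to fall back on.
                raise ServiceInitializationError(
                    "Either api_key or async_client must be provided."
                )
            async_client = object()  # stand-in for building a real client from the key
        self.ai_model_id = ai_model_id
        self.client = async_client

# A preconfigured client (e.g. one pointing at a local Ollama server)
# then needs no API key at all:
svc = ChatService(ai_model_id="phi3:mini", async_client=object())
print(svc.ai_model_id)  # phi3:mini
```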

Copy link
Contributor Author


I was able to run the sample without error because I had the env file.

Copy link
Contributor


Should we make the change to the OpenAI settings?

Copy link

@AndreasKunar AndreasKunar left a comment


Still fails for me because of the missing API-key parameter in line 39 (see my earlier comment for details on the validation error).


Labels: documentation, python (Pull requests for the Python Semantic Kernel)
Projects: Status: No status
6 participants