
feat: implement dynamic model detection support for inference providers using litellm #2886

Merged
ashwinb merged 1 commit into llamastack:main from mattf:add-check-model-avail-for-litellm on Jul 28, 2025

Conversation

@mattf (Collaborator) commented Jul 24, 2025

What does this PR do?

This enhancement allows inference providers using LiteLLMOpenAIMixin to validate model availability against LiteLLM's official provider model listings, improving reliability and user experience when working with different AI service providers.

  • Add litellm_provider_name parameter to LiteLLMOpenAIMixin constructor
  • Add check_model_availability method to LiteLLMOpenAIMixin using litellm.models_by_provider
  • Update Gemini, Groq, and SambaNova inference adapters to pass litellm_provider_name
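
The bullets above describe the new check. A minimal sketch of what such a check might look like (the names and the stubbed provider table are illustrative; in the real mixin the mapping would come from `litellm.models_by_provider`, LiteLLM's dict of provider name to model IDs, and the actual llama-stack code may differ):

```python
# Stub of litellm.models_by_provider for illustration; the real mapping
# is provided by the litellm package and covers many providers/models.
models_by_provider: dict[str, list[str]] = {
    "gemini": ["gemini-1.5-pro", "gemini-1.5-flash"],  # illustrative entries
}

class LiteLLMOpenAIMixin:
    def __init__(self, litellm_provider_name: str):
        # The provider name keys into LiteLLM's model listings.
        self.litellm_provider_name = litellm_provider_name

    async def check_model_availability(self, model: str) -> bool:
        # Unknown provider or unlisted model -> not available.
        available = models_by_provider.get(self.litellm_provider_name, [])
        return model in available
```

With this shape, an adapter constructed with `litellm_provider_name="gemini"` can validate a requested model ID before attempting inference, rather than failing later at the provider API.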

Test Plan

standard CI.

api_key_from_config: str | None,
provider_data_api_key_field: str,
openai_compat_api_base: str | None = None,
litellm_provider_name: str | None = None,
Contributor:

can we make this required?

mattf (Collaborator, author):

we can. if i make it required right now it'll step on other related changes in flight by new developers.

ok to file a follow up issue to make it required?

ashwinb (Contributor) commented Jul 25, 2025:

I think that is fine because in my opinion check_model_availability cannot be half-assed and say "I can't do my job because you did not provide me with very important information earlier". I also do need this in my own work. Let's make this required.

mattf (Collaborator, author):

@ashwinb ok, i've made it required.

@r3v5 this will supersede your PR.

Contributor:

Thanks, @mattf

r3v5 (Contributor) commented Jul 25, 2025

This new infrastructure with litellm for check_model_availability() is fine. If the PR is merged, I can rebase and very quickly implement that for Anthropic in #2879, and afterwards for all models from the list in #2504.

mattf (Collaborator, author) commented Jul 26, 2025

[screenshot omitted] this is weird.

@mattf mattf force-pushed the add-check-model-avail-for-litellm branch 2 times, most recently from b83c24a to 3272221 Compare July 26, 2025 10:54
@mattf mattf requested a review from ashwinb July 26, 2025 11:01
feat: implement dynamic model detection support for inference providers using litellm

This enhancement allows inference providers using LiteLLMOpenAIMixin to validate
model availability against LiteLLM's official provider model listings, improving
reliability and user experience when working with different AI service providers.

- Add litellm_provider_name parameter to LiteLLMOpenAIMixin constructor
- Add check_model_availability method to LiteLLMOpenAIMixin using litellm.models_by_provider
- Update Anthropic, OpenAI, Llama, Gemini, Groq, and SambaNova inference adapters to pass litellm_provider_name
@mattf mattf force-pushed the add-check-model-avail-for-litellm branch from 3272221 to 923c08c Compare July 28, 2025 14:05
ashwinb (Contributor) left a comment:

thanks!

@ashwinb ashwinb merged commit 47c078f into llamastack:main Jul 28, 2025
77 checks passed
eranco74 added a commit to eranco74/assisted-chat that referenced this pull request Jul 30, 2025
Specify the default model and provider in the LSC config

This is required because, following llamastack/llama-stack#2886, llama-stack discovers
the models dynamically; if we don't pass a provider and model, it falls
back to the first model it finds (regardless of what we have in the
llama-stack config.yaml).
Note that it's also listing vertexAI models (which work with the
vertexAI API)!

Signed-off-by: Eran Cohen <eranco@redhat.com>

Labels: CLA Signed (this label is managed by the Meta Open Source bot)

4 participants