Skip to content

feat(registry): make the Stack query providers for model listing#2862

Merged
ashwinb merged 7 commits intollamastack:mainfrom
ashwinb:regis_2
Jul 24, 2025
Merged

feat(registry): make the Stack query providers for model listing#2862
ashwinb merged 7 commits intollamastack:mainfrom
ashwinb:regis_2

Conversation

@ashwinb
Copy link
Contributor

@ashwinb ashwinb commented Jul 22, 2025

This flips #2823 and #2805 by making the Stack periodically query the providers for models rather than the providers going behind the back and calling "register" on to the registry themselves. This also adds support for model listing for all other providers via ModelRegistryHelper. Once this is done, we do not need to manually list or register models via run.yaml and it will remove both noise and annoyance (setting INFERENCE_MODEL environment variables, for example) from the new user experience.

In addition, it adds a configuration variable allowed_models which can be used to optionally restrict the set of models exposed from a provider.

@facebook-github-bot facebook-github-bot added the CLA Signed This label is managed by the Meta Open Source bot. label Jul 22, 2025
@ashwinb
Copy link
Contributor Author

ashwinb commented Jul 22, 2025

cc @mattf particularly, since you may have thoughts given your recent changes -- the addition of query_model_availability() for example.

@ashwinb
Copy link
Contributor Author

ashwinb commented Jul 22, 2025

cc @cdoern the post-training tests are failing, what might I be doing wrong?

Copy link
Contributor

@ehhuang ehhuang left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LG

@ashwinb
Copy link
Contributor Author

ashwinb commented Jul 22, 2025

I will make a PR after this which removes explicit model registration from templates like starter since it won't be necessary anymore. It will also remove the requirement for handling various kinds of disabled annotations from model IDs, etc.

@ashwinb ashwinb merged commit 1463b79 into llamastack:main Jul 24, 2025
77 checks passed
@ashwinb ashwinb deleted the regis_2 branch July 24, 2025 17:39
ChristianZaccaria pushed a commit to ChristianZaccaria/llama-stack that referenced this pull request Jul 28, 2025
…mastack#2862)

This flips llamastack#2823 and llamastack#2805 by making the Stack periodically query the
providers for models rather than the providers going behind the back and
calling "register" on to the registry themselves. This also adds support
for model listing for all other providers via `ModelRegistryHelper`.
Once this is done, we do not need to manually list or register models
via `run.yaml` and it will remove both noise and annoyance (setting
`INFERENCE_MODEL` environment variables, for example) from the new user
experience.

In addition, it adds a configuration variable `allowed_models` which can
be used to optionally restrict the set of models exposed from a
provider.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

CLA Signed This label is managed by the Meta Open Source bot.

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants