
Commit bb5aa7c

lcore-330: complete documents about OpenAI provider
Signed-off-by: Haoyu Sun <[email protected]>
1 parent d89f7a3


README.md

Lines changed: 112 additions & 56 deletions
<!-- vim-markdown-toc GFM -->

- [lightspeed-stack](#lightspeed-stack)
- [About The Project](#about-the-project)
- [Architecture](#architecture)
- [Prerequisites](#prerequisites)
- [Installation](#installation)
- [Run LCS locally](#run-lcs-locally)
- [Configuration](#configuration)
- [LLM Compatibility](#llm-compatibility)
- [Set LLM provider and model](#set-llm-provider-and-model)
- [Integration with Llama Stack](#integration-with-llama-stack)
- [Llama Stack as separate server](#llama-stack-as-separate-server)
- [MCP Server and Tool Configuration](#mcp-server-and-tool-configuration)
- [Configuring MCP Servers](#configuring-mcp-servers)
- [Configuring MCP Headers](#configuring-mcp-headers)
- [Llama Stack project and configuration](#llama-stack-project-and-configuration)
- [Check connection to Llama Stack](#check-connection-to-llama-stack)
- [Llama Stack as client library](#llama-stack-as-client-library)
- [Llama Stack version check](#llama-stack-version-check)
- [User data collection](#user-data-collection)
- [System prompt](#system-prompt)
- [Safety Shields](#safety-shields)
- [Authentication](#authentication)
- [K8s based authentication](#k8s-based-authentication)
- [JSON Web Keyset based authentication](#json-web-keyset-based-authentication)
- [No-op authentication](#no-op-authentication)
- [CORS](#cors)
- [Default values](#default-values)
- [Allow credentials](#allow-credentials)
- [RAG Configuration](#rag-configuration)
- [Example configurations for inference](#example-configurations-for-inference)
- [Usage](#usage)
- [Make targets](#make-targets)
- [Running Linux container image](#running-linux-container-image)
- [Building Container Images](#building-container-images)
- [Llama-Stack as Separate Service (Server Mode)](#llama-stack-as-separate-service-server-mode)
- [Llama-Stack as Library (Library Mode)](#llama-stack-as-library-library-mode)
- [Verify it's running properly](#verify-its-running-properly)
- [Endpoints](#endpoints)
- [OpenAPI specification](#openapi-specification)
- [Readiness Endpoint](#readiness-endpoint)
- [Liveness Endpoint](#liveness-endpoint)
- [Publish the service as Python package on PyPI](#publish-the-service-as-python-package-on-pypi)
- [Generate distribution archives to be uploaded into Python registry](#generate-distribution-archives-to-be-uploaded-into-python-registry)
- [Upload distribution archives into selected Python registry](#upload-distribution-archives-into-selected-python-registry)
- [Packages on PyPI and Test PyPI](#packages-on-pypi-and-test-pypi)
- [Contributing](#contributing)
- [Testing](#testing)
- [License](#license)
- [Additional tools](#additional-tools)
- [Utility to generate OpenAPI schema](#utility-to-generate-openapi-schema)
- [Path](#path)
- [Usage](#usage-1)
- [Utility to generate documentation from source code](#utility-to-generate-documentation-from-source-code)
- [Path](#path-1)
- [Usage](#usage-2)
- [Data Export Integration](#data-export-integration)
- [Quick Integration](#quick-integration)
- [Documentation](#documentation)
- [Project structure](#project-structure)
- [Configuration classes](#configuration-classes)
- [REST API](#rest-api)
<!-- vim-markdown-toc -->

Installation steps depend on the operating system. Please look at the instructions for your system:

- [Linux installation](https://lightspeed-core.github.io/lightspeed-stack/installation_linux)
- [macOS installation](https://lightspeed-core.github.io/lightspeed-stack/installation_macos)

# Run LCS locally

To quickly get hands-on with LCS, run it using the default configuration provided in this repository (a sketch of the settings checked in steps 1 and 4 follows this list):

0. Install dependencies using [uv](https://docs.astral.sh/uv/getting-started/installation/): `uv sync --group dev --group llslibdev`
1. Check the Llama Stack settings in [run.yaml](run.yaml): make sure the provider and the model are accessible, and that the server listens on port 8321.
2. Export the LLM token environment variable that Llama Stack requires. For OpenAI, set it with `export OPENAI_API_KEY=sk-xxxxx`.
3. Start the Llama Stack server: `uv run llama stack run run.yaml`
4. Check the LCS settings in [lightspeed-stack.yaml](lightspeed-stack.yaml); `llama_stack.url` should be `url: http://localhost:8321`.
5. Start the LCS server: `make run`
6. Access the LCS web UI at [http://localhost:8080/](http://localhost:8080/)
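Steps 1 and 4 each hinge on one setting; here is a minimal sketch of both, shown together for orientation. The exact layout of the `server` section in `run.yaml` can vary between Llama Stack versions, so verify the placement against your local files:

```yaml
# run.yaml (Llama Stack) -- step 1: the server should listen on port 8321.
# NOTE: the exact position of this section is version-dependent; verify locally.
server:
  port: 8321

# lightspeed-stack.yaml (LCS) -- step 4: point LCS at that Llama Stack server.
llama_stack:
  url: http://localhost:8321
```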
# Configuration
## LLM Compatibility

Lightspeed Core Stack (LCS) supports large language models from the providers listed below.

| Provider | Model                                           | Tool Calling | provider_type  | Example |
| -------- | ----------------------------------------------- | ------------ | -------------- | ------- |
| OpenAI   | gpt-5, gpt-4o, gpt-4-turbo, gpt-4.1, o1, o3, o4 | Yes          | remote::openai | [1](examples/openai-faiss-run.yaml) [2](examples/openai-pgvector-run.yaml) |
| OpenAI   | gpt-3.5-turbo, gpt-4                            | No           | remote::openai | |

The `provider_type` value is used in the Llama Stack configuration file when referring to the provider (see the configuration example in the next section).

For details of OpenAI model capabilities, please refer to https://platform.openai.com/docs/models/compare
## Set LLM provider and model

The LLM provider and model are set in the configuration file for Llama Stack. This repository has a Llama Stack configuration file, [run.yaml](examples/run.yaml), that can serve as a good example.

The LLM providers are set in the section `providers.inference`. The example below adds an inference provider "openai" to the Llama Stack. To use environment variables as configuration values, use the syntax `${env.ENV_VAR_NAME}`.

For more details, please refer to the [Llama Stack documentation](https://llama-stack.readthedocs.io/en/latest/distributions/configuration.html#providers). Here is a list of Llama Stack supported providers and their configuration details: [llama stack providers](https://llama-stack.readthedocs.io/en/latest/providers/inference/index.html#providers)

```yaml
inference:
  - provider_id: openai
    provider_type: remote::openai
    config:
      api_key: ${env.OPENAI_API_KEY}
      url: ${env.SERVICE_URL}
```
The section `models` is a list of models offered by the inference provider. Note that the field `model_id` is a user-chosen name for referring to the model locally, while the field `provider_model_id` refers to the model name on the provider side. The field `provider_id` must refer to one of the inference providers defined in the provider list above.

```yaml
models:
  - model_id: gpt-4-turbo
    provider_id: openai
    model_type: llm
    provider_model_id: gpt-4-turbo
```
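In the example above the local alias and the provider-side name happen to be identical, which hides the distinction. A sketch with a deliberately different, made-up alias (`my-gpt` is hypothetical) shows how the three fields relate:

```yaml
providers:
  inference:
    - provider_id: openai              # referenced by models[].provider_id below
      provider_type: remote::openai
      config:
        api_key: ${env.OPENAI_API_KEY}

models:
  - model_id: my-gpt                   # user-chosen local alias (hypothetical)
    provider_id: openai                # must match the provider_id defined above
    model_type: llm
    provider_model_id: gpt-4-turbo     # the model's actual name on the OpenAI side
```

Requests would then refer to the model locally as `my-gpt`, while `gpt-4-turbo` is the name used toward OpenAI.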
## Integration with Llama Stack
