feat(embed_text): Support LM Studio as a provider #5103

Conversation
Greptile Summary
This PR adds LM Studio as a new text embedding provider to the Daft AI framework. LM Studio is a local AI server that exposes an OpenAI-compatible API for running embedding models locally, making it an attractive option for users who want to avoid external API calls or run custom models.
Because LM Studio maintains API compatibility with OpenAI, the implementation extends the existing OpenAI provider architecture. The key changes include:
- A new LMStudioProvider class in daft/ai/openai/__init__.py that inherits from OpenAIProvider but configures defaults for local usage (localhost:1234/v1) and sets a dummy API key, since LM Studio doesn't require authentication
- An LMStudioTextEmbedderDescriptor that dynamically discovers embedding dimensions by making a probe request to the server, which is necessary because LM Studio can load arbitrary models with varying dimensions
- Provider registration in daft/ai/provider.py that adds 'lm_studio' to the available providers list
- Test coverage in tests/ai/test_lm_studio.py that validates the provider functionality with proper mocking
The base_url handling automatically appends '/v1' when it is not already present, ensuring compatibility with LM Studio's expected endpoint format; a sketch of that normalization follows below. The implementation keeps the same interface as the other providers while accommodating LM Studio's local deployment and variable model dimensions.
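As a rough illustration of that '/v1' handling, a normalization helper might look like the sketch below. The function and constant names here are hypothetical, not the PR's actual code:

```python
# Hypothetical sketch of the base_url normalization described above.
# Names are illustrative; the PR's actual implementation may differ.
DEFAULT_LM_STUDIO_BASE_URL = "http://localhost:1234/v1"


def normalize_base_url(base_url: str | None) -> str:
    """Return a base URL ending in '/v1', as LM Studio's OpenAI-compatible API expects."""
    if base_url is None:
        return DEFAULT_LM_STUDIO_BASE_URL
    base_url = base_url.rstrip("/")
    return base_url if base_url.endswith("/v1") else f"{base_url}/v1"
```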
Confidence score: 3/5
- This PR introduces architectural complexity with the dynamic dimension discovery mechanism, which could fail in various scenarios
- The score reflects concerns about network calls during descriptor initialization and potential reliability issues with the probe-request approach
- Pay close attention to the dimension discovery logic in the LMStudioTextEmbedderDescriptor.get_dimensions() method
4 files reviewed, 3 comments
def get_dimensions(self) -> EmbeddingDimensions:
    try:
        client = OpenAI(**self.provider_options)
        response = client.embeddings.create(
            input="dimension probe",
            model=self.model_name,
            encoding_format="float",
        )
        size = len(response.data[0].embedding)
        return EmbeddingDimensions(size=size, dtype=DataType.float32())
    except Exception as ex:
        raise ValueError("Failed to determine embedding dimensions from LM Studio.") from ex
style: The dimension probing creates a new OpenAI client and makes a network request during descriptor creation. This could be expensive if called repeatedly and may fail if the LM Studio server is temporarily unavailable. Consider caching the dimensions or moving this logic to instantiation time.
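One way the suggested caching could look, sketched with functools.cached_property. This is an illustration of the reviewer's suggestion rather than the PR's code, and it assumes the descriptor is a plain class whose instances allow attribute assignment; the import paths for EmbeddingDimensions and DataType are also assumptions based on the snippet above:

```python
from functools import cached_property

from openai import OpenAI

# Assumed imports: exact module paths depend on the PR's layout.
from daft import DataType
from daft.ai.typing import EmbeddingDimensions


class LMStudioTextEmbedderDescriptor:
    """Abbreviated sketch: only the dimension-probing parts are shown."""

    def __init__(self, provider_options: dict, model_name: str):
        self.provider_options = provider_options
        self.model_name = model_name

    @cached_property
    def _probed_dimensions(self) -> EmbeddingDimensions:
        # Probe the server once; later calls on this instance reuse the result.
        client = OpenAI(**self.provider_options)
        response = client.embeddings.create(
            input="dimension probe",
            model=self.model_name,
            encoding_format="float",
        )
        size = len(response.data[0].embedding)
        return EmbeddingDimensions(size=size, dtype=DataType.float32())

    def get_dimensions(self) -> EmbeddingDimensions:
        return self._probed_dimensions
```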
This is a local network request
"I am not concerned"
    except Exception as ex:
        raise ValueError("Failed to determine embedding dimensions from LM Studio.") from ex
style: The broad exception catch could mask specific connection errors. Consider catching more specific exceptions, such as OpenAIError or connection-related exceptions, to provide better error messages.
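As a sketch of that suggestion (not the PR's code), the probe could separate connection failures from other API errors using the OpenAI SDK's exception types:

```python
from openai import APIConnectionError, OpenAIError

try:
    response = client.embeddings.create(
        input="dimension probe",
        model=self.model_name,
        encoding_format="float",
    )
except APIConnectionError as ex:
    # The server is unreachable: LM Studio likely isn't running at the configured base_url.
    raise ValueError(
        "Could not connect to LM Studio; check that the server is running and the base_url is correct."
    ) from ex
except OpenAIError as ex:
    # Any other API-level failure, e.g. the requested model isn't loaded.
    raise ValueError("Failed to determine embedding dimensions from LM Studio.") from ex
```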
So cool!!
Codecov Report
❌ Patch coverage is
Additional details and impacted files
@@            Coverage Diff             @@
##             main    #5103      +/-   ##
==========================================
- Coverage   76.10%   76.07%   -0.04%
==========================================
  Files         950      953       +3
  Lines      130488   130628     +140
==========================================
+ Hits        99310    99377      +67
- Misses      31178    31251      +73
## Changes Made

Add LM Studio as a text embedding provider. For example:

```
import daft
from daft.ai.provider import load_provider
from daft.functions.ai import embed_text

# This base_url parameter is optional if you're using the defaults for LM Studio.
# You can modify this as needed.
provider = load_provider("lm_studio", base_url="http://127.0.0.1:1234")

# Select a text embedding model that you've loaded into LM Studio.
model = "text-embedding-nomic-embed-text-v1.5"

(
    daft.read_huggingface("Open-Orca/OpenOrca")
    .with_column("embedding", embed_text(daft.col("response"), provider=provider, model=model))
    .show()
)
```