feat: add support for querying available backend devices #5877
Merged
This change introduces a new `get_devices` method to the `llamacpp_extension` engine that allows the frontend to query and display a list of available devices (e.g., Vulkan, CUDA, SYCL) from the compiled `llama-server` binary.

- Added a `DeviceList` interface to represent GPU/device metadata.
- Implemented a `getDevices(): Promise<DeviceList[]>` method that splits the `version/backend` key, ensures the backend is ready, and invokes the new Tauri command `get_devices`.
- Introduced a new `get_devices` Tauri command that parses `llama-server --list-devices` output to extract available devices with memory info.
- Introduced a `DeviceInfo` struct (`id`, `name`, `mem`, `free`) and exposed it via serialization.
- Added robust parsing logic using string processing (non-regex) to locate memory stats.
- Registered the new command in the `tauri::Builder` in `lib.rs`.
- Fixed logic to correctly parse multiple devices from the `llama-server` output.
- Handles common failure modes: binary not found, malformed memory info, etc.

This sets the foundation for device selection, memory-aware model loading, and improved diagnostics in Jan AI engine setup flows.
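The device parsing described above can be sketched as follows. This is not the PR's actual code: the `--list-devices` output format, the sample device lines, and the exact field handling are assumptions based on the description (`id`, `name`, `mem`, `free`, with memory stats located by plain string processing rather than a regex):

```rust
// Hypothetical sketch of parsing `llama-server --list-devices` output.
// The line format "<id>: <name> (<mem> MiB, <free> MiB free)" is an
// assumption; the real parser in server.rs may differ.

#[derive(Debug, PartialEq)]
struct DeviceInfo {
    id: String,
    name: String,
    mem: u64,  // total memory, MiB
    free: u64, // free memory, MiB
}

fn parse_devices(output: &str) -> Vec<DeviceInfo> {
    let mut devices = Vec::new();
    for line in output.lines() {
        let line = line.trim();
        // Skip headers and anything not shaped like "<id>: <rest>".
        let Some((id, rest)) = line.split_once(": ") else { continue };
        // The device name may itself contain "(", so split at the LAST " (".
        let Some((name, mem_part)) = rest.rsplit_once(" (") else { continue };
        let Some(mem_part) = mem_part.strip_suffix(" MiB free)") else { continue };
        let Some((total, free)) = mem_part.split_once(" MiB, ") else { continue };
        let (Ok(mem), Ok(free)) = (total.parse::<u64>(), free.parse::<u64>()) else {
            continue; // malformed memory info: skip rather than fail
        };
        devices.push(DeviceInfo {
            id: id.to_string(),
            name: name.to_string(),
            mem,
            free,
        });
    }
    devices
}

fn main() {
    let sample = "\
Available devices:
  Vulkan0: NVIDIA GeForce RTX 3090 (24576 MiB, 23800 MiB free)
  Vulkan1: AMD Radeon RX 7900 XTX (24560 MiB, 24000 MiB free)";
    let devices = parse_devices(sample);
    assert_eq!(devices.len(), 2);
    assert_eq!(devices[0].id, "Vulkan0");
    assert_eq!(devices[1].free, 24000);
    println!("{devices:#?}");
}
```

Skipping malformed lines instead of returning an error matches the description's goal of handling "malformed memory info" gracefully while still reporting the devices that did parse.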
Changes requested ❌

Reviewed everything up to 0b9354f in 2 minutes and 31 seconds.

- Reviewed 342 lines of code in 3 files
- Skipped 0 files when reviewing
- Skipped posting 2 draft comments; view those below
1. `src-tauri/src/core/utils/extensions/inference_llamacpp_extension/server.rs:148`
   - Draft comment: The Windows branch uses a raw string literal in `trim_start_matches` (e.g. `r"\\?\"`) which may not compile as expected (raw strings cannot end with a backslash). Consider using an escaped standard string or an alternative approach.
   - Reason this comment was not posted: comment was not on a location in the diff, so it can't be submitted as a review comment.
2. `src-tauri/src/core/utils/extensions/inference_llamacpp_extension/server.rs:414`
   - Draft comment: Typographical issue: the raw string literal `r"\\?\"` appears to be incorrectly terminated. In Rust, a raw string that contains a quote needs to use a different delimiter (e.g., `r#"\\?\"#`) or proper escaping. Please review and correct this to ensure the intended UNC prefix is properly trimmed.
   - Reason this comment was not posted: comment looked like it was already resolved.
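Both draft comments concern trimming the Windows UNC prefix `\\?\`. A minimal, self-contained sketch of that trimming (the path value is hypothetical, and whether the reviewer bot's concern applies is worth checking directly; Rust raw strings take backslashes literally, so `r"\\?\"` denotes the four-character prefix):

```rust
fn main() {
    // Hypothetical UNC-prefixed path, of the kind canonicalize()
    // returns on Windows.
    let path = r"\\?\C:\Users\jan\llama-server.exe";
    // Raw strings have no escapes: r"\\?\" is the literal prefix \\?\ .
    let trimmed = path.trim_start_matches(r"\\?\");
    assert_eq!(trimmed, r"C:\Users\jan\llama-server.exe");
    println!("{trimmed}");
}
```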
src-tauri/src/core/utils/extensions/inference_llamacpp_extension/server.rs
Co-authored-by: ellipsis-dev[bot] <65095814+ellipsis-dev[bot]@users.noreply.github.com>
louis-menlo approved these changes on Jul 23, 2025.
Barecheck - Code coverage report: Total: 36.95%. Your code coverage diff: 0.01% ▴ ✅ All code changes are covered.
urmauur approved these changes on Jul 23, 2025.
Important

Adds support for querying available backend devices in the llamacpp extension via a new Tauri command and parsing logic.

- Adds a `getDevices()` method in `index.ts` to query available devices (Vulkan, CUDA, SYCL) from `llama-server`.
- Adds a `get_devices` Tauri command in `server.rs` to parse `llama-server --list-devices` output.
- Adds a `DeviceList` interface in `index.ts` and a `DeviceInfo` struct in `server.rs` for device metadata.
- Registers the `get_devices` command in `lib.rs`.

This description was created automatically for 0b9354f and will update as commits are pushed.
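Before invoking the command, the `getDevices()` flow described above splits the stored `version/backend` key to determine which backend binary to query. A minimal Rust sketch of that split (the key value `b4567/vulkan` is hypothetical; the `version/backend` format comes from the PR description):

```rust
fn main() {
    // Hypothetical backend key in the "version/backend" format.
    let key = "b4567/vulkan";
    let (version, backend) = key
        .split_once('/')
        .expect("expected a version/backend key");
    assert_eq!(version, "b4567");
    assert_eq!(backend, "vulkan");
    println!("version={version}, backend={backend}");
}
```

Splitting on the first `/` keeps the version prefix intact even if a backend name ever contained a separator, which is one reason `split_once` is preferable to `split` here.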