feat: Add model compatibility check and memory estimation #6243

qnixsynapse · 2025-08-20T06:29:36Z

Describe Your Changes

This commit introduces a new feature to check if a given model is supported based on available device memory.

The change includes:

- A new `estimateKVCache` method that calculates the required memory for the model's KV cache. It uses GGUF metadata such as `block_count`, `head_count`, `key_length`, and `value_length` to perform the calculation.
- An `isModelSupported` method that combines the model file size and the estimated KV cache size to determine the total memory required. It then checks if any available device has sufficient free memory to load the model.
- An updated error message for the `version_backend` check to be more user-friendly, suggesting a stable internet connection as a potential solution for backend setup failures.

This functionality helps prevent the application from attempting to load models that would exceed the device's memory capacity, leading to more stable and predictable behavior.
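
As a rough illustration of the KV cache estimate described above, the sketch below multiplies the per-token K and V widths by the layer count and the context length. The `KvMeta` shape, the function name `estimateKvCacheBytes`, and the default of 2 bytes per element (an f16 cache) are assumptions for illustration, not the code in `index.ts`.

```typescript
// Hypothetical metadata shape; the field names only mirror the GGUF keys named above.
interface KvMeta {
  blockCount: number // number of transformer layers (block_count)
  headCount: number // attention heads (head_count)
  keyLength: number // per-head K width (key_length)
  valueLength: number // per-head V width (value_length)
}

// Estimate the KV cache size in bytes for a given context length.
// Assumes an f16 cache (2 bytes per element) unless told otherwise.
function estimateKvCacheBytes(
  meta: KvMeta,
  ctxSize: number,
  bytesPerElement = 2
): number {
  // Elements stored per token in one layer: K and V for every head.
  const perTokenPerLayer = meta.headCount * (meta.keyLength + meta.valueLength)
  return meta.blockCount * ctxSize * perTokenPerLayer * bytesPerElement
}

// Example: 32 layers, 32 heads, 128-wide K and V, 4096-token context.
const exampleKvBytes = estimateKvCacheBytes(
  { blockCount: 32, headCount: 32, keyLength: 128, valueLength: 128 },
  4096
) // 2147483648 bytes, i.e. 2 GiB
```

Models that use grouped-query attention store fewer KV heads than attention heads, so a head-count-based estimate like this one may overestimate for them.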

Fixes Issues

Closes feat: Indicate model compatibility with local system specs #5505

Self Checklist

Added relevant comments, esp in complex areas
Updated docs (for bug fixes / features)
Created issues for follow-up changes or refactoring needed

Important

Add model compatibility check and memory estimation to ensure models fit within device memory constraints, with UI feedback on compatibility status.

Behavior:
- Adds estimateKVCache and isModelSupported methods in index.ts to calculate memory requirements and check model compatibility.
- Updates error message for version_backend check in index.ts.
UI Components:
- Adds ModelSupportStatus and ModelInfoHoverCard components to display model compatibility status.
- Updates DropdownModelProvider and ChatInput to integrate new components.
Backend:
- Modifies read_gguf_metadata in commands.rs to support remote files.
- Updates Cargo.toml to include reqwest dependency.
Tests:
- Adds tests for isModelSupported in models.test.ts.

This description was created by Ellipsis for 5ea5ca5. You can customize this summary. It will automatically update as commits are pushed.

ellipsis-dev

Caution

Changes requested ❌

Reviewed everything up to 10e85de in 2 minutes and 30 seconds. Click for details.

Reviewed 70 lines of code in 1 files
Skipped 0 files when reviewing.
Skipped posting 6 draft comments. View those below.
Modify your settings and rules to customize what types of comments Ellipsis leaves. And don't forget to react with 👍 or 👎 to teach Ellipsis.

1. extensions/llamacpp-extension/src/index.ts:1594

Draft comment:
The revised error message is more user‐friendly but hides the original backend info. Consider logging cfg.version_backend for debugging.
Reason this comment was not posted:
Confidence changes required: 33% <= threshold 50% None

2. extensions/llamacpp-extension/src/index.ts:1699

Draft comment:
Consider improving the doc comment wording: e.g. 'Estimate KVCache size from given metadata' instead of 'estimate KVCache size of from a given metadata'.
Reason this comment was not posted:
Confidence changes required: 33% <= threshold 50% None

3. extensions/llamacpp-extension/src/index.ts:1597

Draft comment:
Typo: The error message text says "Please restart the app in a stable internet connection." This phrasing is slightly awkward. Consider replacing it with "Please restart the app with a stable internet connection." for improved clarity.
Reason this comment was not posted:
Decided after close inspection that this draft comment was likely wrong and/or not actionable: usefulness confidence = 10% vs. threshold = 50% This is a very minor grammatical suggestion. While "with" may be slightly more natural than "in", both versions are understandable and convey the same meaning. The comment is about UI text/copy rather than actual code functionality. The change would have minimal impact on user experience. The comment does point out a real, if minor, grammatical improvement. English style guides generally prefer "with a connection" over "in a connection". While technically correct, this level of nitpicking on error message wording is not valuable enough to warrant a PR comment. The existing message is perfectly understandable. Delete this comment. While the suggested change is technically correct, it's too minor to be worth a PR comment. The existing error message is clear enough.

4. extensions/llamacpp-extension/src/index.ts:1696

Draft comment:
Typo in the comment: "estimate KVCache size of from a given metadata". Consider rephrasing to "Estimate KVCache size from given metadata" or similar.
Reason this comment was not posted:
Decided after close inspection that this draft comment was likely wrong and/or not actionable: usefulness confidence = 10% vs. threshold = 50% While the comment is technically correct about the typo, fixing comment typos is a very minor issue that doesn't affect functionality. The comment is clear enough as is. We should focus on more substantive issues. The rules say not to make purely informative comments or comments about obvious/unimportant things. The typo does make the comment slightly less professional looking. Clear documentation is important for maintainability. While clear documentation is good, this is an extremely minor typo that doesn't significantly impact readability or understanding. The meaning is still clear. Delete this comment as it's too minor of an issue to be worth addressing. The comment is still perfectly understandable despite the small typo.

5. extensions/llamacpp-extension/src/index.ts:1717

Draft comment:
The comment "can extension by checking cache-type-v and cache-type-k" seems grammatically incorrect. Perhaps it should be "can be extended by checking cache-type-v and cache-type-k".
Reason this comment was not posted:
Decided after close inspection that this draft comment was likely wrong and/or not actionable: usefulness confidence = 10% vs. threshold = 50% While the grammar correction is technically accurate, code comments are informal documentation and minor grammatical issues don't impact code functionality or understanding. The meaning of the comment is still clear despite the minor grammatical error. This kind of nitpicking about grammar in comments creates noise in the PR review process. The comment is technically correct - there is a grammatical error. And some might argue that maintaining high quality even in comments is important for professionalism. While professionalism is important, focusing on minor grammatical issues in comments distracts from more substantive code review. The original comment's meaning is clear enough for its purpose. Delete this comment as it focuses on a minor grammatical issue that doesn't meaningfully impact code quality or understanding.

6. extensions/llamacpp-extension/src/index.ts:1734

Draft comment:
Typographical errors in the comment: "model weighst + kvcache" should be corrected to "model weights + KVCache".
Reason this comment was not posted:
Decided after close inspection that this draft comment was likely wrong and/or not actionable: usefulness confidence = 10% vs. threshold = 50% While the comment correctly identifies typos, fixing typos in comments is a very minor issue that doesn't affect functionality. The comment is technically about changed code since it's in the new section, but it's an extremely low priority change that doesn't meaningfully improve code quality or readability. The typos are real and the comment is technically correct. Consistent capitalization and spelling can make code more professional. While technically correct, this level of nitpicking about comment typos creates noise in the PR review process and distracts from more important issues. The meaning is still clear despite the typos. Delete this comment. While it points out real typos, commenting on minor spelling issues in comments is too trivial and creates unnecessary noise in the PR review process.

Workflow ID: wflow_6kJxH89xcwRwPrSC

You can customize Ellipsis by changing your verbosity settings, reacting with 👍 or 👎, replying to comments, or adding code review rules.

extensions/llamacpp-extension/src/index.ts

github-actions · 2025-08-20T06:35:43Z

Barecheck - Code coverage report

Total: 36.11%

Your code coverage diff: -0.27% ▾

Uncovered files and lines

File	Lines
web-app/src/containers/ChatInput.tsx	88-90, 112-113, 115-117, 133-134, 175-176, 198-200, 207-208, 211-214, 217-227, 230, 232-234, 240, 242, 244-245, 247-251, 254-255, 258, 260-263, 265-269, 271-283, 285-295, 297-306, 308-311, 314-315, 317-320, 323-324, 327-331, 334-335, 337-340, 343-345, 348-350, 353-356, 358-359, 361-365, 367-369, 372-373, 376-378, 380-382, 384-385, 387-388, 390-395, 398-403, 405-409, 445-453, 455-460, 462-464, 466-468, 470-471, 529, 557-571, 587-589, 594-598, 613-617, 632-646, 649-663, 673, 691, 708, 710-713
web-app/src/containers/DropdownModelProvider.tsx	43-50, 53-61, 90-94, 103, 109-111, 113-120, 123-128, 130-138, 147-150, 152-157, 159-160, 163-171, 173-178, 206-215, 218-219, 224-227, 229-232, 237-238, 248, 250-256, 258, 260-262, 265-266, 268-274, 290, 298-305, 307-311, 322-323, 329-332, 335-336, 338, 346-353, 356-361, 363-364, 367-375, 403-405, 418-422, 454-460, 467-469, 474, 476-480, 483-488, 490-500, 502-515, 517-518, 520-521, 526, 531-533, 535, 537-540, 543-559, 561-566, 569, 571, 573-579, 581-591, 593-596, 598-599, 601-605, 607-608, 610, 612
web-app/src/containers/ModelInfoHoverCard.tsx	1, 6, 8, 19-35, 37-41, 43-46, 48-53, 55-59, 61-62, 64-68, 70-71, 73-77, 79-80, 82-87, 89-90, 92-101, 103-105, 107-116, 119-124, 126-132, 134-140, 142-144, 146-157, 159, 161-164, 166-170, 172-173, 175-181, 184-186, 188-192, 194-195, 197-199, 201-202, 204-205, 209-215, 217-218, 220-224, 226
web-app/src/containers/ModelSupportStatus.tsx	39-45, 47, 64, 66, 70, 72, 80, 82, 88, 105-107, 110-111
web-app/src/routes/hub/$modelId.tsx	1-2, 8, 14-21, 29-33, 39, 41-46, 48-52, 54-59, 62-63, 66, 68, 70-72, 74-83, 85-87, 89-91, 94-104, 107-121, 124-128, 130-142, 145-147, 150-152, 155-158, 160, 162-177, 180-181, 183-184, 186-191, 193-198, 201-203, 206-208, 210-228, 230-238, 240-250, 252, 254-261, 263-268, 270-272, 274-279, 282-284, 287-291, 293-301, 303, 306-317, 319-322, 324-325, 329-334, 336-339, 341, 344-351, 353-357, 359-360, 362-363, 365-367, 369-385, 388-392, 395-399, 401-404, 406-425, 428-444, 446, 448-453, 455, 457, 459, 461-476, 478, 480-482, 484-488, 492-496, 498-499, 501-503, 505-518, 520-523, 525-526, 528-529, 531, 533-537, 539
web-app/src/routes/hub/index.tsx	2-7, 15-20, 27-28, 34-37, 43, 50-56, 64, 66-71, 73-75, 77-84, 86-88, 90, 92-102, 104-111, 113-114, 116-121, 124-131, 133-135, 138, 140-145, 147-148, 150-151, 153-158, 160-171, 173-184, 187-191, 193-195, 197-200, 202-204, 206-210, 212-213, 215-238, 240-241, 243-253, 255, 257-260, 262-276, 278-280, 283-285, 288-291, 293, 295-296, 298-311, 313-314, 316-327, 329-330, 332, 334-350, 352, 354-362, 364-369, 371-377, 379-383, 385-386, 388-393, 395-396, 398, 400-410, 412-413, 416-425, 427-428, 430-446, 448-452, 455-456, 459-460, 462-479, 482, 484-490, 492-493, 495-505, 507-521, 523, 525-558, 560-563, 565-590, 592-601, 603-619, 621-635, 638-644, 647-656, 658-661, 664-671, 674-684, 686-690, 692-741, 743-760, 762-771, 773-777, 779-783, 785-821, 823-841, 843-844, 846-853, 855-856, 859, 862-863, 865-882, 884-890, 892-896, 898-900, 902, 904-908, 910-916, 919-921, 923, 925-941, 943-947, 949-950, 952, 954-955, 957-961, 963-967, 969
web-app/src/services/models.ts	137-139, 201, 203-207, 271-279, 283-285, 287-288, 290, 292, 294, 296-302, 305-325, 327-328, 331-340, 425, 443-444, 446-447, 459-463, 465-466, 469-470, 473-476, 478-479, 482, 484-504, 507-512, 514-523, 525, 527-537, 539-553, 555-561, 570-571, 574-581

ellipsis-dev

Important

Looks good to me! 👍

Reviewed b6b8f4b in 1 minute and 2 seconds. Click for details.

Reviewed 40 lines of code in 1 files
Skipped 0 files when reviewing.
Skipped posting 2 draft comments. View those below.

1. extensions/llamacpp-extension/src/index.ts:1697

Draft comment:
Improved parameter formatting in estimateKVCache enhances readability. Consider using a more specific type for 'meta' if possible.
Reason this comment was not posted:
Confidence changes required: 20% <= threshold 50% None

2. extensions/llamacpp-extension/src/index.ts:1742

Draft comment:
The isModelSupported method now gracefully falls back to checking system memory via getSystemUsage when no devices are found. Verify that the memory units (MB) are consistent with assumed conversions.
Reason this comment was not posted:
Confidence changes required: 30% <= threshold 50% None

Workflow ID: wflow_1VPPx8TyqfuCqfQh

ellipsis-dev

Important

Looks good to me! 👍

Reviewed 84b9420 in 1 minute and 25 seconds. Click for details.

Reviewed 40 lines of code in 1 files
Skipped 0 files when reviewing.
Skipped posting 4 draft comments. View those below.

1. extensions/llamacpp-extension/src/index.ts:35

Draft comment:
New import 'getSystemUsage' added. Ensure its returned memory values are in MB as assumed for correct byte conversion.
Reason this comment was not posted:
Confidence changes required: 0% <= threshold 50% None

2. extensions/llamacpp-extension/src/index.ts:1696

Draft comment:
Reformatted 'estimateKVCache' parameters improves readability. No issues found.
Reason this comment was not posted:
Confidence changes required: 0% <= threshold 50% None

3. extensions/llamacpp-extension/src/index.ts:1741

Draft comment:
Enhanced isModelSupported fallback now uses system memory via getSystemUsage when no devices are found. Confirm that 'sys.total_memory' and 'sys.used_memory' are in MB (as expected) to ensure accurate conversion.
Reason this comment was not posted:
Confidence changes required: 33% <= threshold 50% None

4. extensions/llamacpp-extension/src/index.ts:1696

Draft comment:
Typographical error: The comment 'estimate KVCache size of from a given metadata' appears to have extraneous words. Consider revising it to either 'estimate KVCache size from a given metadata' or an alternative phrasing.
Reason this comment was not posted:
Comment was on unchanged code.
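
The unit concern raised in these draft comments can be made concrete. The sketch below sums free memory across devices and falls back to system RAM when no device is reported, converting MB to bytes in one place. The `Device` and `SystemUsage` shapes, the `freeMb` field, and the function name are assumptions; only `total_memory` and `used_memory` are named in the review text, and whether they are really in MB is exactly what the comments ask to verify.

```typescript
const BYTES_PER_MB = 1024 * 1024

// Hypothetical shapes for illustration only.
interface Device {
  freeMb: number // free device memory, assumed to be reported in MB
}
interface SystemUsage {
  total_memory: number // assumed MB
  used_memory: number // assumed MB
}

// Return the memory budget in bytes: the sum over devices, or free system RAM as a fallback.
function availableBytes(devices: Device[], sys: SystemUsage): number {
  if (devices.length > 0) {
    const freeMb = devices.reduce((sum, d) => sum + d.freeMb, 0)
    return freeMb * BYTES_PER_MB
  }
  // Clamp at zero in case the reported usage exceeds the reported total.
  const freeMb = Math.max(0, sys.total_memory - sys.used_memory)
  return freeMb * BYTES_PER_MB
}
```

Keeping the conversion behind a single constant means a unit mismatch would only need to be fixed in one place.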

Workflow ID: wflow_kAyKg6jmXy35hmA7

ellipsis-dev

Caution

Changes requested ❌

Reviewed 8a2924e in 3 minutes and 22 seconds. Click for details.

Reviewed 241 lines of code in 4 files
Skipped 0 files when reviewing.
Skipped posting 3 draft comments. View those below.

1. src-tauri/plugins/tauri-plugin-llamacpp/Cargo.toml:26

Draft comment:
NOTE: The reqwest dependency is added with features ["json", "blocking", "stream"]. Verify that the 'blocking' feature is necessary, as the plugin code uses async requests.
Reason this comment was not posted:
Confidence changes required: 50% <= threshold 50% None

2. src-tauri/plugins/tauri-plugin-llamacpp/src/gguf/commands.rs:10

Draft comment:
INFO: The remote GGUF metadata retrieval using 2MB range requests is implemented correctly. Consider confirming that target servers support the Range header for reliable parsing.
Reason this comment was not posted:
Confidence changes required: 33% <= threshold 50% None

3. src-tauri/plugins/tauri-plugin-llamacpp/src/gguf/helpers.rs:7

Draft comment:
INFO: The GGUF metadata parsing helpers enforce reasonable limits (e.g. string length max 1MB, array length limit) which is good. Verify these limits match expected inputs and security considerations.
Reason this comment was not posted:
Confidence changes required: 33% <= threshold 50% None
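
To illustrate the range-request approach noted in draft comment 2, here is a minimal sketch that requests only the first 2 MB of a remote file and checks for the GGUF magic bytes before any parsing. The actual implementation is in Rust (`commands.rs`) and is not shown in this thread; this TypeScript version, the `RangeFetcher` type, and all function names are hypothetical.

```typescript
const PREFIX_BYTES = 2 * 1024 * 1024 // 2 MB, the chunk size mentioned in the review

// Injected transport so the sketch does not depend on a specific HTTP client.
type RangeFetcher = (
  url: string,
  headers: Record<string, string>
) => Promise<{ status: number; body: Uint8Array }>

// Build an HTTP Range header value for `length` bytes starting at `start`.
function rangeHeader(start: number, length: number): string {
  return `bytes=${start}-${start + length - 1}`
}

// GGUF files begin with the ASCII bytes "GGUF".
function hasGgufMagic(bytes: Uint8Array): boolean {
  const magic = [0x47, 0x47, 0x55, 0x46]
  return bytes.length >= 4 && magic.every((b, i) => bytes[i] === b)
}

async function fetchGgufPrefix(
  url: string,
  fetcher: RangeFetcher
): Promise<Uint8Array> {
  const res = await fetcher(url, { Range: rangeHeader(0, PREFIX_BYTES) })
  // 206 means the server honored the range; 200 means it ignored it and sent the whole file.
  if (res.status !== 206 && res.status !== 200) {
    throw new Error(`unexpected HTTP status ${res.status}`)
  }
  const prefix = res.body.subarray(0, PREFIX_BYTES)
  if (!hasGgufMagic(prefix)) throw new Error('not a GGUF file')
  return prefix
}
```

Treating a 200 response as acceptable covers servers that do not support the `Range` header, at the cost of downloading more than needed.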

Workflow ID: wflow_vNUYBneaTK6WdxOI

extensions/llamacpp-extension/src/index.ts

louis-menlo

LGTM

ellipsis-dev

Caution

Changes requested ❌

Reviewed ca2d9fc in 3 minutes and 20 seconds. Click for details.

Reviewed 63 lines of code in 1 files
Skipped 0 files when reviewing.
Skipped posting 0 draft comments. View those below.

Workflow ID: wflow_KwQCRvcpzkQS9NtN

extensions/llamacpp-extension/src/index.ts

This commit introduces a new feature to check if a given model is supported based on available device memory.

The change includes:

- A new `estimateKVCache` method that calculates the required memory for the model's KV cache. It uses GGUF metadata such as `block_count`, `head_count`, `key_length`, and `value_length` to perform the calculation.
- An `isModelSupported` method that combines the model file size and the estimated KV cache size to determine the total memory required. It then checks if any available device has sufficient free memory to load the model.
- An updated error message for the `version_backend` check to be more user-friendly, suggesting a stable internet connection as a potential solution for backend setup failures.

This functionality helps prevent the application from attempting to load models that would exceed the device's memory capacity, leading to more stable and predictable behavior.

fixes: #5505

Co-authored-by: ellipsis-dev[bot] <65095814+ellipsis-dev[bot]@users.noreply.github.com>

This commit refactors the logic for checking if a model is supported by a system's available memory.

**Key changes:**

- **Remote model support**: The `read_gguf_metadata` function can now fetch metadata from a remote URL by reading the file in chunks.
- **Improved KV cache size calculation**: The KV cache size is now estimated more accurately by using `attention.key_length` and `attention.value_length` from the GGUF metadata, with a fallback to `embedding_length`.
- **Granular memory check statuses**: The `isModelSupported` function now returns a more specific status (`'RED'`, `'YELLOW'`, `'GREEN'`) to indicate whether the model weights or the KV cache are too large for the available memory.
- **Consolidated logic**: The logic for checking local and remote models has been consolidated into a single `isModelSupported` function, improving code clarity and maintainability.

These changes provide more robust and informative model compatibility checks, especially for models hosted on remote servers.

Co-authored-by: ellipsis-dev[bot] <65095814+ellipsis-dev[bot]@users.noreply.github.com>
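
One plausible reading of the three statuses in the commit message above is sketched here: `'RED'` when the weights alone exceed the available memory, `'YELLOW'` when the weights fit but weights plus KV cache do not, and `'GREEN'` when both fit. This mapping is an assumption inferred from the commit message, and the function name and parameters are hypothetical.

```typescript
type SupportStatus = 'RED' | 'YELLOW' | 'GREEN'

// Classify a model against a memory budget; all sizes are in bytes.
function classifySupport(
  modelBytes: number,
  kvCacheBytes: number,
  freeBytes: number
): SupportStatus {
  // The weights alone do not fit.
  if (modelBytes > freeBytes) return 'RED'
  // The weights fit, but not together with the KV cache at this context size.
  if (modelBytes + kvCacheBytes > freeBytes) return 'YELLOW'
  return 'GREEN'
}
```

Under this reading, a `'YELLOW'` result could turn `'GREEN'` with a smaller context size, since only the KV cache term depends on it.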

ellipsis-dev

Important

Looks good to me! 👍

Reviewed b3ea19c in 2 minutes and 26 seconds. Click for details.

Reviewed 22 lines of code in 1 files
Skipped 0 files when reviewing.
Skipped posting 5 draft comments. View those below.

1. web-app/src/containers/ModelSupportStatus.tsx:80

Draft comment:
GREEN tooltip message updated to 'Works Well on your device'. This clarifies the message well.
Reason this comment was not posted:
Confidence changes required: 0% <= threshold 50% None

2. web-app/src/containers/ModelSupportStatus.tsx:82

Draft comment:
YELLOW tooltip message now says 'Might work on your device', which softens the limitation warning effectively.
Reason this comment was not posted:
Confidence changes required: 0% <= threshold 50% None

3. web-app/src/containers/ModelSupportStatus.tsx:84

Draft comment:
RED tooltip text contains an extra space before the parenthesis; consider removing it for consistency.
Reason this comment was not posted:
Confidence changes required: 33% <= threshold 50% None

4. web-app/src/containers/ModelSupportStatus.tsx:86

Draft comment:
LOADING tooltip now specifies 'Checking device compatibility...', which improves clarity for the user.
Reason this comment was not posted:
Confidence changes required: 0% <= threshold 50% None

5. web-app/src/containers/ModelSupportStatus.tsx:84

Draft comment:
Typographical note: There is an extra space in "Doesn't work on your device (ctx: ${contextSize})". Consider removing one extra space before the opening parenthesis to maintain consistency.
Reason this comment was not posted:
Decided after close inspection that this draft comment was likely wrong and/or not actionable: usefulness confidence = 10% vs. threshold = 50% While the comment is technically correct about the inconsistency, it's an extremely minor formatting issue that doesn't affect functionality. The change being reviewed is about updating tooltip messages to include "on your device", and this extra space is just a tiny typographical issue. Such minor formatting issues often don't warrant a PR comment and can be seen as noise in the review process. The extra space could theoretically cause visual inconsistency in the UI tooltip. Also, maintaining consistent formatting is generally good practice. However, this is such a minor issue that it creates more overhead in the review process than the value it provides. The visual impact of an extra space in a tooltip is negligible. This comment should be deleted as it focuses on an extremely minor formatting issue that doesn't meaningfully impact the code or user experience.
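
The tooltip strings quoted in these draft comments could come from a single mapping function, sketched below. The function name and the `TooltipStatus` union are assumptions; the four strings are taken from the draft comments above, with the RED string written with a single space before the parenthesis.

```typescript
type TooltipStatus = 'GREEN' | 'YELLOW' | 'RED' | 'LOADING'

// Map a compatibility status to its tooltip text.
function tooltipFor(status: TooltipStatus, contextSize: number): string {
  switch (status) {
    case 'GREEN':
      return 'Works Well on your device'
    case 'YELLOW':
      return 'Might work on your device'
    case 'RED':
      return `Doesn't work on your device (ctx: ${contextSize})`
    case 'LOADING':
      return 'Checking device compatibility...'
  }
}
```

Because the switch is exhaustive over the union, adding a new status without a matching case would be flagged by the TypeScript compiler.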

Workflow ID: wflow_lq1Cpv8J9TKhJFvK

ellipsis-dev

Important

Looks good to me! 👍

Reviewed 5ea5ca5 in 2 minutes and 24 seconds. Click for details.

Reviewed 48 lines of code in 1 files
Skipped 0 files when reviewing.
Skipped posting 5 draft comments. View those below.

1. web-app/src/containers/ModelInfoHoverCard.tsx:51

Draft comment:
Good: Adding 'shrink-0' prevents the spinner from shrinking in the flex container, ensuring consistent size.
Reason this comment was not posted:
Confidence changes required: 0% <= threshold 50% None

2. web-app/src/containers/ModelInfoHoverCard.tsx:58

Draft comment:
Added 'shrink-0' on green status indicator to enforce fixed size within flex layout. Looks good.
Reason this comment was not posted:
Confidence changes required: 0% <= threshold 50% None

3. web-app/src/containers/ModelInfoHoverCard.tsx:67

Draft comment:
The yellow indicator now includes 'shrink-0', ensuring it doesn’t collapse in flex containers.
Reason this comment was not posted:
Confidence changes required: 0% <= threshold 50% None

4. web-app/src/containers/ModelInfoHoverCard.tsx:76

Draft comment:
Red indicator now has 'shrink-0' to maintain consistent sizing in its flex container.
Reason this comment was not posted:
Confidence changes required: 0% <= threshold 50% None

5. web-app/src/containers/ModelInfoHoverCard.tsx:85

Draft comment:
The Unknown status indicator now includes 'shrink-0', matching other states for fixed sizing.
Reason this comment was not posted:
Confidence changes required: 0% <= threshold 50% None

Workflow ID: wflow_bZTgzo4KcYenFR4K

louis-menlo

LGTM

github-project-automation bot added this to Jan Aug 20, 2025

github-actions bot assigned qnixsynapse Aug 20, 2025

ellipsis-dev bot reviewed Aug 20, 2025

extensions/llamacpp-extension/src/index.ts (outdated, resolved)

extensions/llamacpp-extension/src/index.ts (outdated, resolved)

qnixsynapse requested review from dinhlongviolin1, louis-menlo and urmauur August 20, 2025 06:39

qnixsynapse force-pushed the feat/5505 branch from 7a1e37a to b6b8f4b Compare August 20, 2025 07:29

ellipsis-dev bot reviewed Aug 20, 2025

qnixsynapse force-pushed the feat/5505 branch from b6b8f4b to 84b9420 Compare August 20, 2025 07:41

ellipsis-dev bot reviewed Aug 20, 2025

extensions/llamacpp-extension/src/index.ts (outdated, resolved)

louis-menlo approved these changes Aug 21, 2025

urmauur force-pushed the feat/5505 branch from b1fe6d5 to c13ee09 Compare August 21, 2025 02:12

ellipsis-dev bot reviewed Aug 21, 2025

extensions/llamacpp-extension/src/index.ts (resolved)

extensions/llamacpp-extension/src/index.ts (resolved)

qnixsynapse and others added 11 commits August 21, 2025 16:43

Update extensions/llamacpp-extension/src/index.ts

e9f1bd1

Co-authored-by: ellipsis-dev[bot] <65095814+ellipsis-dev[bot]@users.noreply.github.com>

Update extensions/llamacpp-extension/src/index.ts

b62995f

Co-authored-by: ellipsis-dev[bot] <65095814+ellipsis-dev[bot]@users.noreply.github.com>

Extend this to available system RAM if GGML device is not available

5f6355d

Update extensions/llamacpp-extension/src/index.ts

9604577

Co-authored-by: ellipsis-dev[bot] <65095814+ellipsis-dev[bot]@users.noreply.github.com>

Make ctx_size optional and use sum free memory across ggml devices

56f2881

feat: hub and dropdown model selection handle model compatibility

ae12574

feat: update bage model info color

fceeace

chore: enable detail page to get compatibility model

6503fce

chore: update copy

b3ea19c

urmauur force-pushed the feat/5505 branch from 0dabf47 to b3ea19c Compare August 21, 2025 09:45

ellipsis-dev bot reviewed Aug 21, 2025

chore: update shrink indicator UI

5ea5ca5

ellipsis-dev bot reviewed Aug 21, 2025

louis-menlo approved these changes Aug 21, 2025

qnixsynapse merged commit 510c70b into dev Aug 21, 2025
16 checks passed

qnixsynapse deleted the feat/5505 branch August 21, 2025 10:43

github-project-automation bot moved this to QA in Jan Aug 21, 2025

github-actions bot added this to the v0.6.9 milestone Aug 21, 2025
