ui: Add max image size option by stduhpf · Pull Request #22849 · ggml-org/llama.cpp

stduhpf · 2026-05-08T19:06:31Z

Overview

Adds a way to scale down the size of images above some threshold threshold when sending multimodal prompts to the server, as very high resolution images can take forever to encode and use a lot of memory.

Additional information

I refactored ChatService.convertDbMessageToApiChatMessageData()to be async, which could be a breaking change.
The max resolution is set as a maximum total pixel count (width*height), expressed in megapixels. If the count is 0 (or rather less than a single pixel), the feature is disabled.

Requirements

I have read and agree with the contributing guidelines
AI usage disclosure: YES (navigating the project structure + inline completions were enabled)

allozaur · 2026-05-16T08:36:45Z

Please rebase this on latest commit on master and solve conflicts.

allozaur

Last small cosmetics and then we good to go

ServeurpersoCom · 2026-05-20T08:51:15Z

Good feature. Pushing a 50 megapixel image to the server just to have it encoded there is a waste of memory and time, so capping on the client before upload is the right place to do it, and defaulting to 0 means nobody who ignores the setting sees any change. Solid.
One thing I'd tweak: The resize path runs every image through the canvas even when it's already under the threshold. An already small JPEG gets redrawn and re-exported, which is a second lossy pass on top of the compression it already had, so it loses quality with nothing gained and its EXIF is dropped. Worse, toDataURL can only output PNG, JPEG and WEBP, so a GIF falls back to PNG: same pixels, but the file ends up heavier than what came in, the opposite of what this feature is for (and the server decodes GIF natively anyway through stb_image, alongside jpeg, png, bmp, tga and others, so there's no compatibility reason to convert it). An early return of the original data URL when the pixel count is already within budget keeps it touching only the images that actually need shrinking.
Small change, the rest is good!

ServeurpersoCom

LGTM

CISC · 2026-05-20T11:00:04Z

An already small JPEG gets redrawn and re-exported, which is a second lossy pass on top of the compression it already had, so it loses quality with nothing gained and its EXIF is dropped.

BTW, in case this was forgotten: #20870

* webui: Add max image size option * remove magic numbers * support all image formats * use const * Move regex to match b64 images to constants * use SETTINGS_KEYS to get max image resolution setting * Do not touch the image if already under the size threshold

* origin/master: (138 commits) fix(flash-attn): replace f32 with kv_type and q_type (ggml-org#23372) tests : move save-load-state from examples to tests (ggml-org#23336) server: expose prompt token counts in /slots endpoint (ggml-org#23454) metal : optimize concat kernel and fix set kernel threads (ggml-org#23411) server : free draft/MTP resources on sleep to fix VRAM leak (ggml-org#23461) server: re-inject subcommand when router spawns children under unified binary (ggml-org#23442) app : add batched-bench, fit-params, quantize & perplexity (ggml-org#23459) mtp: use inp_out_ids for skipping logit computation (ggml-org#23433) vocab : add Carbon-3B (HybridDNATokenizer) support (ggml-org#23410) doc: fix spec mtp typo (ggml-org#23435) ui: Improve Git Hooks for UI development (ggml-org#23403) ggml : Check the right iface method before using the fallback 2d get (ggml-org#23306) llama-graph: fix null-buffer crash in llm_graph_input_attn_kv_iswa for SWA-only models (ggml-org#23131) hexagon: ssm-conv fix for large prompts (ggml-org#23307) app : show version (ggml-org#23426) mtmd, model : merge HunyuanOCR into HunyuanVL and fix OCR vision precision (ggml-org#23329) ui: Add max image size option (ggml-org#22849) Move to backend sampling for MTP draft path (ggml-org#23287) opencl: refactor backend initilization (ggml-org#23318) common/speculative : fix nullptr crash in get_devices_str (ggml-org#23386) ...

* webui: Add max image size option * remove magic numbers * support all image formats * use const * Move regex to match b64 images to constants * use SETTINGS_KEYS to get max image resolution setting * Do not touch the image if already under the size threshold

* upstream/HEAD: (38 commits) vocab : add Carbon-3B (HybridDNATokenizer) support (ggml-org#23410) doc: fix spec mtp typo (ggml-org#23435) ui: Improve Git Hooks for UI development (ggml-org#23403) ggml : Check the right iface method before using the fallback 2d get (ggml-org#23306) llama-graph: fix null-buffer crash in llm_graph_input_attn_kv_iswa for SWA-only models (ggml-org#23131) hexagon: ssm-conv fix for large prompts (ggml-org#23307) app : show version (ggml-org#23426) mtmd, model : merge HunyuanOCR into HunyuanVL and fix OCR vision precision (ggml-org#23329) ui: Add max image size option (ggml-org#22849) Move to backend sampling for MTP draft path (ggml-org#23287) opencl: refactor backend initilization (ggml-org#23318) common/speculative : fix nullptr crash in get_devices_str (ggml-org#23386) mtmd : DeepSeek-OCR image processing fixes, img_tool::resize padding refactor (ggml-org#23345) vulkan: optimize operations in the IM2COL shader (ggml-org#22685) feat: Add WAV MIME type variants and improve audio format detection (ggml-org#23396) hexagon: HMX quantized matmul rework (ggml-org#23368) Programmatic Dependent Launch (PDL) for more performance on newer NVIDIA GPUs (Hopper+) (ggml-org#22522) app : introduce the llama unified executable (ggml-org#23296) refactor: Move text attachments up before the message content in chat completions payload (ggml-org#23406) mtmd: fit_params now take into account mmproj (ggml-org#21489) ...

* webui: Add max image size option * remove magic numbers * support all image formats * use const * Move regex to match b64 images to constants * use SETTINGS_KEYS to get max image resolution setting * Do not touch the image if already under the size threshold

stduhpf requested a review from a team as a code owner May 8, 2026 19:06

github-actions Bot added server/webui examples server labels May 8, 2026

allozaur requested changes May 12, 2026

View reviewed changes

Comment thread tools/ui/src/lib/utils/cap-png-size.ts Outdated

Comment thread tools/ui/src/lib/utils/cap-png-size.ts Outdated

stduhpf force-pushed the cap-img-sz branch 2 times, most recently from 808d6ed to e756328 Compare May 16, 2026 19:36

github-actions Bot added the server/ui label May 16, 2026

allozaur self-assigned this May 17, 2026

allozaur changed the title ~~webui: Add max image size option~~ ui: Add max image size option May 17, 2026

allozaur requested changes May 17, 2026

View reviewed changes

Comment thread tools/ui/src/lib/services/chat.service.ts Outdated

Comment thread tools/ui/src/lib/services/chat.service.ts Outdated

Comment thread tools/ui/src/lib/utils/cap-png-size.ts Outdated

stduhpf force-pushed the cap-img-sz branch from 7d7c91d to 13a8be4 Compare May 18, 2026 16:01

allozaur requested changes May 20, 2026

View reviewed changes

Comment thread tools/ui/src/lib/utils/cap-img-size.ts Outdated

Comment thread tools/ui/src/lib/services/chat.service.ts Outdated

allozaur requested a review from ServeurpersoCom May 20, 2026 08:08

stduhpf force-pushed the cap-img-sz branch from 13a8be4 to 6e769e3 Compare May 20, 2026 10:25

allozaur approved these changes May 20, 2026

View reviewed changes

ServeurpersoCom approved these changes May 20, 2026

View reviewed changes

stduhpf added 7 commits May 20, 2026 14:28

webui: Add max image size option

f1af3dc

remove magic numbers

8a87e09

support all image formats

ae8b9be

use const

605244d

Move regex to match b64 images to constants

81934ed

use SETTINGS_KEYS to get max image resolution setting

7f1bcec

Do not touch the image if already under the size threshold

b673902

stduhpf force-pushed the cap-img-sz branch from 6e769e3 to b673902 Compare May 20, 2026 12:30

allozaur approved these changes May 20, 2026

View reviewed changes

allozaur merged commit 3a479c9 into ggml-org:master May 20, 2026
5 of 6 checks passed

nyo16 mentioned this pull request May 21, 2026

Bump llama.cpp to 52fb93a2b (30 commits) nyo16/llama_cpp_ex#42

Merged

4 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ui: Add max image size option#22849

ui: Add max image size option#22849
allozaur merged 7 commits into
ggml-org:masterfrom
stduhpf:cap-img-sz

stduhpf commented May 8, 2026 •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

allozaur commented May 16, 2026

Uh oh!

Uh oh!

Uh oh!

Uh oh!

allozaur left a comment

Uh oh!

Uh oh!

Uh oh!

ServeurpersoCom commented May 20, 2026 •

edited

Loading

Uh oh!

ServeurpersoCom left a comment

Uh oh!

CISC commented May 20, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

stduhpf commented May 8, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Overview

Additional information

Requirements

Uh oh!

Uh oh!

Uh oh!

allozaur commented May 16, 2026

Uh oh!

Uh oh!

Uh oh!

Uh oh!

allozaur left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

ServeurpersoCom commented May 20, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

ServeurpersoCom left a comment

Choose a reason for hiding this comment

Uh oh!

CISC commented May 20, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

stduhpf commented May 8, 2026 •

edited

Loading

ServeurpersoCom commented May 20, 2026 •

edited

Loading