Optimize nemotron VL image/video preprocessing by netanel-haber · Pull Request #40283 · vllm-project/vllm

netanel-haber · 2026-04-19T12:09:35Z

Purpose

Compile and reorganize image/video preprocessing for nemotron nano VL, reducing the amount of CPU time and memory needed.

Fused resize+normalize+cast under @torch.compile — CPU kernel for permute → bicubic → /255 → (x-mean)/std → dtype.
dtype conversion integrated in the fusion to avoid a later separate autocast
contiguous fused to avoid a later separate H2H copy
Skip torch.cat on the single-image / single-video path to avoid a redundant copy
Batched tokenizer call for video frame separators

  1 video of 512x512x512, H100
Before:     apply_hf_processor_ms 898.57 898.63 4.58 905.18
After:       apply_hf_processor_ms 254.21 254.56 3.35 260.79

@netanel-haber:
LGTM. I ran evals. VoxPopuli (audio+text), InfoVQA_VAL (image+text) and DailyOmni (video+audio+text) are on par before and after.
Originally @milesial's pr: #40093 - I moved it to my fork to just fix DCO and push it through, since he is currently AFK. Otherwise, there are no changes.

Signed-off-by: milesial <milesial@users.noreply.github.com>

claude

Claude Code Review

This pull request is from a fork — automated review is disabled. A repository maintainer can comment @claude review to run a one-time review.

gemini-code-assist

Code Review

This pull request optimizes the nano_nemotron_vl processor by introducing a compiled _bicubic_resize_and_normalize function that fuses resizing, normalization, and dtype casting. It also adds _pil_to_nhwc_tensor for efficient image conversion and refactors get_video_repl to use batch tokenization for frame separators. Preprocessing logic for both images and videos has been updated to reduce unnecessary tensor concatenations and support broader configuration of normalization parameters. I have no feedback to provide.

tomeras91

Thanks for the optimizations!

Signed-off-by: milesial <milesial@users.noreply.github.com> Co-authored-by: milesial <milesial@users.noreply.github.com>

Signed-off-by: milesial <milesial@users.noreply.github.com> Co-authored-by: milesial <milesial@users.noreply.github.com> Signed-off-by: Avinash Singh <avinashsingh.rcoem@gmail.com>

Signed-off-by: milesial <milesial@users.noreply.github.com> Co-authored-by: milesial <milesial@users.noreply.github.com> Signed-off-by: Adrian <info@zzit.ch>

Signed-off-by: milesial <milesial@users.noreply.github.com> Co-authored-by: milesial <milesial@users.noreply.github.com>

Optimize nemotron VL image/video preprocessing

5e2d480

Signed-off-by: milesial <milesial@users.noreply.github.com>

claude Bot reviewed Apr 19, 2026

View reviewed changes

gemini-code-assist Bot reviewed Apr 19, 2026

View reviewed changes

tomeras91 approved these changes Apr 19, 2026

View reviewed changes

tomeras91 added the ready ONLY add when PR is ready to merge/full CI is needed label Apr 19, 2026

tomeras91 enabled auto-merge (squash) April 19, 2026 12:40

Merge branch 'main' into optimize-nemotron-image-video-preprocessing

29963c9

tomeras91 merged commit 982beae into vllm-project:main Apr 19, 2026
47 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Optimize nemotron VL image/video preprocessing#40283

Optimize nemotron VL image/video preprocessing#40283
tomeras91 merged 2 commits into
vllm-project:mainfrom
netanel-haber:optimize-nemotron-image-video-preprocessing

netanel-haber commented Apr 19, 2026 •

edited

Loading

Uh oh!

claude Bot left a comment

Uh oh!

gemini-code-assist Bot left a comment

Uh oh!

tomeras91 left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Uh oh!

Conversation

netanel-haber commented Apr 19, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Purpose

Uh oh!

claude Bot left a comment

Choose a reason for hiding this comment

Claude Code Review

Uh oh!

gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

tomeras91 left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

netanel-haber commented Apr 19, 2026 •

edited

Loading