Skip to content

[WIP][Diffusion] LingBot-VA world-action model support#2885

Draft
KevinZeng08 wants to merge 5 commits intovllm-project:mainfrom
KevinZeng08:feat/lingbot-va-pipe2
Draft

[WIP][Diffusion] LingBot-VA world-action model support#2885
KevinZeng08 wants to merge 5 commits intovllm-project:mainfrom
KevinZeng08:feat/lingbot-va-pipe2

Conversation

@KevinZeng08
Copy link
Copy Markdown

@KevinZeng08 KevinZeng08 commented Apr 17, 2026

Purpose

This PR enables LingBot-VA world-action model inference in vllm-omni, currently work in progress.

LingBot-VA is an autoregressive world-action model that jointly predicts future video latents and robot actions.

Related issue: #1987 #2162

Roadmap

  • I2VA (Image to Video-Action) mode for offline inference
  • Server mode for online inference
  • Performance
    • Support CFG parallel
    • Support TP
  • Refactor internal state management
    • Expose KV cache and related inference-state data to upper layers
    • Explore integration with vLLM KV cache management

Test Plan

  • I2VA mode offline inference video-action output
  • Server mode output

Test Result

Text prompt: Grab the medium-sized white mug, rotate it, place it on the table, and hook it onto the smooth dark gray rack.

vllm-omni (I2VA offline inference) LingBot-VA official repo (I2VA)
vllm-omni-lingbot-va.mp4
lingbot-va-official.mp4

Future Work

  • Support for more world-action models (e.g. FastWAM)
  • Explore to unify the framework of autoregressive world-action models, which share design similarities
wam_compare

Source: FastWAM

@TKONIY TKONIY mentioned this pull request Apr 17, 2026
20 tasks
@hsliuustc0106
Copy link
Copy Markdown
Collaborator

This PR is marked as WIP. Ready for full review when draft status is removed.

Note: DCO check is currently failing. Please resolve before requesting review.

@KevinZeng08 KevinZeng08 marked this pull request as draft April 18, 2026 02:46
boyu.zbw and others added 5 commits April 18, 2026 22:37
Signed-off-by: kevinzeng <2538015266@qq.com>
Signed-off-by: kevinzeng <2538015266@qq.com>
Signed-off-by: kevinzeng <2538015266@qq.com>
Signed-off-by: kevinzeng <2538015266@qq.com>
commit 016dfa0
Author: kevinzeng <2538015266@qq.com>
Date:   Sat Apr 18 22:31:40 2026 +0800

    add cfg example

commit be01379
Author: boyu.zbw <boyu.zbw@antgroup.com>
Date:   Sat Apr 18 22:15:52 2026 +0800

    init cfg parallel

Signed-off-by: kevinzeng <2538015266@qq.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants