[VLM] optimize VLM processing #1234

Merged
zhuzilin merged 11 commits into main from feat/optimize-vlm
Dec 27, 2025

nanjiangwill (Collaborator) commented Dec 26, 2025

Follow-up to #1232.

  1. Remove extra processing of multimodal train inputs
  2. Add CI for VLM FSDP
  3. Update --rotary-base for VL model

nanjiangwill changed the title from "optimize VLM processing" to "[VLM] optimize VLM processing" on Dec 27, 2025
zhuzilin (Contributor) commented

I think it's a bit hard to understand the meaning of multimodal tensor. Could you rename the multimodal tensors to something like multimodal train inputs, while still calling the original field multimodal inputs?

zhuzilin merged commit 63e63ef into main on Dec 27, 2025
12 checks passed
zhuzilin deleted the feat/optimize-vlm branch on December 27, 2025 02:11
kafkayu pushed a commit to kafkayu/slime that referenced this pull request Jan 8, 2026
jind11 pushed a commit to eigen-ai-labs/slime that referenced this pull request Feb 24, 2026