Add support for CUDA and CPU arch for Qwen-2.5-VL and Fara-7B#1919

Merged: apsonawane merged 7 commits into main from asonawane/vlm on Dec 17, 2025

Conversation

apsonawane (Contributor) commented Dec 12, 2025

Add CUDA and CPU architecture support for the Qwen-2.5-VL and Fara-7B models.
Validated that the NPU model also works with this change.

{"accuracy": 0.8765493306891423, "task_name": "ScienceQA_Visual"}
{"accuracy": 0.8244818652849741, "task_name": "ai2d_test"}
{"accuracy": 0.8108, "task_name": "chart_qa_test"}
{"accuracy": 0.4825291181364393, "task_name": "intergps_test"}
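
For reviewers who want the numbers above side by side, here is a minimal sketch that parses them as JSON Lines and prints each accuracy as a percentage. The `parse_results` helper and the `RESULTS` constant are hypothetical names used only for illustration; they are not part of this PR or of the evaluation harness that produced the output.

```python
import json

# Evaluation lines copied from the PR description (one JSON object per line).
RESULTS = """\
{"accuracy": 0.8765493306891423, "task_name": "ScienceQA_Visual"}
{"accuracy": 0.8244818652849741, "task_name": "ai2d_test"}
{"accuracy": 0.8108, "task_name": "chart_qa_test"}
{"accuracy": 0.4825291181364393, "task_name": "intergps_test"}
"""


def parse_results(text):
    """Return {task_name: accuracy} from text holding one JSON object per line."""
    scores = {}
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue  # tolerate blank lines between records
        record = json.loads(line)
        scores[record["task_name"]] = record["accuracy"]
    return scores


scores = parse_results(RESULTS)
for task, acc in scores.items():
    # Left-align the task name and show accuracy with one decimal place.
    print(f"{task:<18}{acc:6.1%}")
```

The same helper could be pointed at a results file by passing its contents as `text`, assuming the file keeps the one-object-per-line layout shown above.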

apsonawane requested a review from tianleiwu December 16, 2025 00:22
apsonawane enabled auto-merge (squash) December 16, 2025 22:08
Comment thread src/models/model.cpp
Comment thread test/python/test_qwen_fara_models.py
Comment thread test/python/test_qwen_fara_models.py
apsonawane merged commit f41b3cc into main Dec 17, 2025
15 checks passed
apsonawane deleted the asonawane/vlm branch December 17, 2025 21:08
apsonawane added a commit that referenced this pull request Dec 19, 2025

3 participants