Support molmo series vlm #2260
Conversation
swift/llm/utils/model.py
Outdated
@@ -690,6 +695,7 @@ class LoRATM(NamedTuple):
     ]
     # compat
     llama2 = llama
+    molmo = 'molmo'
This is the compat section; please move the new entry somewhere else.
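For example (a minimal sketch; the `llama` value is a placeholder for illustration, only the diff context around it is real):

```python
from typing import NamedTuple


class LoRATM(NamedTuple):
    # ... existing target-module definitions ...
    llama = ['q_proj', 'k_proj', 'v_proj']  # placeholder for the real list
    molmo = 'molmo'  # new entry placed before the compat aliases
    # compat
    llama2 = llama
```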
swift/llm/utils/model.py
Outdated
    model, tokenizer = get_model_tokenizer_from_repo(model_dir, torch_dtype, model_kwargs, load_model, **kwargs)
    tokenizer.processor = processor
    # fix bug for molmoe-1b
    from types import MethodType
Better to move this import outside the function, to module level.
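A sketch of the suggested change (the function name here is a hypothetical placeholder for the molmo loader added in this PR):

```python
# swift/llm/utils/model.py -- keep the import at module level with the others
from types import MethodType


def get_model_tokenizer_molmo(model_dir, torch_dtype, model_kwargs, load_model, **kwargs):
    # (hypothetical name) the body can then use MethodType directly,
    # without a local `from types import MethodType`.
    ...
```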
swift/llm/utils/template.py
Outdated
@@ -7,7 +7,7 @@
 from datetime import datetime
 from functools import partial, wraps
 from types import MethodType
-from typing import Any, Callable, Dict, List, Literal, Optional, Tuple, TypeVar, Union
+from typing import Any, Callable, Dict, List, Literal, Optional, Tuple, TypeVar, Union, Optional
Remove the duplicated `Optional`.
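i.e. keep the line as it was before this change:

```python
from typing import Any, Callable, Dict, List, Literal, Optional, Tuple, TypeVar, Union
```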
swift/llm/utils/model.py
Outdated
@@ -6924,7 +7022,11 @@ def get_additional_saved_files(model_type: str) -> List[str]:
     'qwen-vl': ['SimSun.ttf'],
     'qwen-audio': ['mel_filters.npz'],
     'yi-vl': ['vit'],
-    'minicpm-v-v2_6-chat': ['modeling_navit_siglip.py']
+    'minicpm-v-v2_6-chat': ['modeling_navit_siglip.py'],
+    'molmoe-1b': ['modeling_molmoe.py'],
How about writing this as a loop?
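Something along these lines (only the `molmoe-1b` entry comes from this diff; the commented-out model names are hypothetical placeholders for the rest of the series):

```python
# Extra remote-code files that must be saved alongside each checkpoint.
files_mapping = {
    'qwen-vl': ['SimSun.ttf'],
    'qwen-audio': ['mel_filters.npz'],
    'yi-vl': ['vit'],
    'minicpm-v-v2_6-chat': ['modeling_navit_siglip.py'],
}

# Add the molmo series with a loop instead of one literal entry per model_type.
molmo_extra_files = {
    'molmoe-1b': 'modeling_molmoe.py',
    # 'molmo-7b-d': 'modeling_molmo.py',  # hypothetical further entries
}
for model_type, modeling_file in molmo_extra_files.items():
    files_mapping[model_type] = [modeling_file]
```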
                                **kwargs):
    from transformers import AutoProcessor
    processor = AutoProcessor.from_pretrained(model_dir, trust_remote_code=True)
    model, tokenizer = get_model_tokenizer_from_repo(model_dir, torch_dtype, model_kwargs, load_model, **kwargs)
The `attention_type` handling for flash_attn support is missing here.
Please give it a run.
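A rough sketch of the missing piece, assuming `use_flash_attn` arrives via `**kwargs` and that `get_model_tokenizer_from_repo` accepts a `model_config` argument (both assumptions), with the `attention_type` values also being placeholders:

```python
from transformers import AutoConfig, AutoProcessor


def get_model_tokenizer_molmo(model_dir, torch_dtype, model_kwargs, load_model, **kwargs):
    # Hypothetical sketch: route the flash-attention switch into the molmo
    # config before the model is instantiated.
    model_config = AutoConfig.from_pretrained(model_dir, trust_remote_code=True)
    use_flash_attn = kwargs.pop('use_flash_attn', False)
    # The molmo remote code selects its kernel via an attention_type-style
    # config field (per the review comment); the value names are assumptions.
    model_config.attention_type = 'flash' if use_flash_attn else 'sdpa'

    processor = AutoProcessor.from_pretrained(model_dir, trust_remote_code=True)
    # get_model_tokenizer_from_repo is defined elsewhere in this module.
    model, tokenizer = get_model_tokenizer_from_repo(
        model_dir, torch_dtype, model_kwargs, load_model,
        model_config=model_config, **kwargs)
    tokenizer.processor = processor
    return model, tokenizer
```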
PR type
PR information
Support molmo series vlm