Skip to content

v3.0.3

Compare
Choose a tag to compare
@Jintao-Huang Jintao-Huang released this 22 Jan 15:27
· 74 commits to main since this release

中文版

新特性

  1. 支持多模态大模型SequenceClassification架构用于多模态分类任务,参考这里
  2. 支持多模态大模型reward model训练。

新模型

  1. Shanghai_AI_Laboratory/internlm3-8b-instruct
  2. OpenBMB/MiniCPM-o-2_6
  3. deepseek-ai/DeepSeek-R1, deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B系列
  4. bytedance-research/Valley-Eagle-7B
  5. LLM-Research/phi-4
  6. Qwen/Qwen2.5-Math-PRM-7B, Qwen/Qwen2.5-Math-PRM-72B
  7. MiniMaxAI/MiniMax-Text-01, MiniMaxAI/MiniMax-VL-01

English Version

New Features

  1. Support multi-modal large model SequenceClassification architecture for multi-modal classification tasks, see here.
  2. Support training of multi-modal reward model.

New Models

  1. Shanghai_AI_Laboratory/internlm3-8b-instruct
  2. OpenBMB/MiniCPM-o-2_6
  3. deepseek-ai/DeepSeek-R1, deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B series
  4. bytedance-research/Valley-Eagle-7B
  5. LLM-Research/phi-4
  6. Qwen/Qwen2.5-Math-PRM-7B, Qwen/Qwen2.5-Math-PRM-72B
  7. MiniMaxAI/MiniMax-Text-01, MiniMaxAI/MiniMax-VL-01

What's Changed

Full Changelog: v3.0.2...v3.0.3