v3.1.0

Latest

Latest

Jintao-Huang released this 07 Feb 12:38

· 44 commits to main since this release

adb6f8f

中文版

新特性

支持swift sample命令进行数据采样，参考这里。
支持强化微调训练，目前已支持拒绝采样微调，参考这里。
Grounding任务自定义数据格式重构，参考这里。
swift infer支持输出推理速度和ACC/ROUGE/BLEU指标。

新模型

Qwen/Qwen2.5-VL-3B-Instruct系列
Qwen/Qwen2.5-7B-Instruct-1M系列
deepseek-ai/Janus-Pro-1B系列
bytedance-research/UI-TARS-2B-SFT系列

新数据集

ServiceNow-AI/R1-Distill-SFT
bespokelabs/Bespoke-Stratos-17k
open-thoughts/OpenThoughts-114k

English Version

New Features

Supports the swift sample command for data sampling; refer to here.
Supports reinforcement fine-tuning training, with current support for rejection sampling fine-tuning; refer to here.
3Grounding task custom data format restructuring; refer to here.
swift infer supports outputting inference speed and ACC/ROUGE/BLEU metrics.

New Models

Qwen/Qwen2.5-VL-3B-Instruct Series
Qwen/Qwen2.5-7B-Instruct-1M Series
deepseek-ai/Janus-Pro-1B Series
bytedance-research/UI-TARS-2B-SFT Series

New Datasets

ServiceNow-AI/R1-Distill-SFT
bespokelabs/Bespoke-Stratos-17k
open-thoughts/OpenThoughts-114k

What's Changed

add "enable_prefix_caching" args for vllm engine. by @Leoyzen in #2939
Fix vllm docs link & fix web-ui by @Jintao-Huang in #2970
Fix sample by @tastelikefeet in #2971
support merge-lora & quant by @Jintao-Huang in #2973
support create_checkpoint_symlink by @Jintao-Huang in #2975
Sampling and RFT by @tastelikefeet in #2977
support auto dataset mapping by @Jintao-Huang in #2976
support qwen2_5 long by @Jintao-Huang in #2982
sys_prompt from file by @lxline in #2980
support bytedance-research/UI-TARS-2B-SFT series by @Jintao-Huang in #2987
support Qwen/Qwen2.5-VL-3B-Instruct series model by @Jintao-Huang in #2996
fix qwen2_5-vl by @Jintao-Huang in #2998
support Qwen/Qwen2.5-VL-72B-Instruct by @Jintao-Huang in #2999
refactor grounding by @Jintao-Huang in #3000
compatible with trl v0.13 by @hjh0119 in #2992
update R1 dataset by @Jintao-Huang in #3005
fix qwen2.5-vl grounding (refactor) by @Jintao-Huang in #2979
fix deploy by @Jintao-Huang in #3007
support infer metric: acc/rouge or bleu by @Jintao-Huang in #3008
support deepseek janus pro by @Jintao-Huang in #3009
update readme by @Jintao-Huang in #3011
fix parse_dict by @Jintao-Huang in #3012
update docs by @Jintao-Huang in #3015
Fix readme & update docs by @Jintao-Huang in #3018
fix push to hub by @tastelikefeet in #3024
Fix bugs by @Jintao-Huang in #3025
fix bugs by @Jintao-Huang in #3026
Fix qwen tool template to official format by @Leoyzen in #2988
fix message merging strategy when multi-turn tool calling. by @Leoyzen in #2986

New Contributors

@Leoyzen made their first contribution in #2939

Full Changelog: v3.0.3...v3.1.0

Contributors

Leoyzen, Jintao-Huang, and 3 other contributors

Assets 2