English Version
New Features
- Supports the `swift sample` command for data sampling; see the documentation.
- Supports reinforcement fine-tuning (RFT) training; rejection sampling fine-tuning is currently supported. See the documentation.
- Restructured the custom data format for grounding tasks; see the documentation.
- `swift infer` now supports reporting inference speed and ACC/ROUGE/BLEU metrics.
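
To illustrate the idea behind rejection sampling fine-tuning mentioned above, here is a minimal, generic sketch: draw several candidate responses per prompt, score them, and keep only the best candidate when it clears a threshold. It is not ms-swift's implementation. The function name `rejection_sample`, the `generate` and `score` callables, the `query`/`response` keys, and the toy data are all assumptions made for illustration.

```python
import random
from typing import Callable, Dict, List


def rejection_sample(
    prompts: List[str],
    generate: Callable[[str], str],
    score: Callable[[str, str], float],
    num_candidates: int = 4,
    threshold: float = 0.5,
) -> List[Dict[str, str]]:
    """Keep the best-scoring candidate per prompt if it clears the threshold.

    `generate` and `score` are hypothetical stand-ins for a sampling model
    and a reward model (or rule-based checker).
    """
    kept = []
    for prompt in prompts:
        # Draw several candidate responses for the same prompt.
        candidates = [generate(prompt) for _ in range(num_candidates)]
        # Score each candidate once, then pick the highest-scoring one.
        scored = [(score(prompt, c), c) for c in candidates]
        best_score, best = max(scored, key=lambda pair: pair[0])
        # Reject the prompt entirely if even the best candidate is too weak.
        if best_score >= threshold:
            kept.append({"query": prompt, "response": best})
    return kept


if __name__ == "__main__":
    # Toy data: reference answers used by a rule-based scorer (illustrative only).
    reference = {"2 + 2": "4", "3 * 3": "9"}
    rng = random.Random(0)

    def toy_generate(prompt: str) -> str:
        # Pretend model: guesses one of a few answers at random.
        return rng.choice(["4", "9", "7"])

    def toy_score(prompt: str, response: str) -> float:
        # Exact-match reward: 1.0 if the response equals the reference answer.
        return 1.0 if reference[prompt] == response else 0.0

    sft_rows = rejection_sample(list(reference), toy_generate, toy_score)
    print(sft_rows)
```

The surviving rows form an ordinary supervised fine-tuning dataset, so the training step itself can be the same as standard SFT; only the data selection differs.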
New Models
- Qwen/Qwen2.5-VL-3B-Instruct Series
- Qwen/Qwen2.5-7B-Instruct-1M Series
- deepseek-ai/Janus-Pro-1B Series
- bytedance-research/UI-TARS-2B-SFT Series
New Datasets
- ServiceNow-AI/R1-Distill-SFT
- bespokelabs/Bespoke-Stratos-17k
- open-thoughts/OpenThoughts-114k
What's Changed
- add "enable_prefix_caching" args for vllm engine. by @Leoyzen in #2939
- Fix vllm docs link & fix web-ui by @Jintao-Huang in #2970
- Fix sample by @tastelikefeet in #2971
- support merge-lora & quant by @Jintao-Huang in #2973
- support create_checkpoint_symlink by @Jintao-Huang in #2975
- Sampling and RFT by @tastelikefeet in #2977
- support auto dataset mapping by @Jintao-Huang in #2976
- support qwen2_5 long by @Jintao-Huang in #2982
- sys_prompt from file by @lxline in #2980
- support bytedance-research/UI-TARS-2B-SFT series by @Jintao-Huang in #2987
- support Qwen/Qwen2.5-VL-3B-Instruct series model by @Jintao-Huang in #2996
- fix qwen2_5-vl by @Jintao-Huang in #2998
- support Qwen/Qwen2.5-VL-72B-Instruct by @Jintao-Huang in #2999
- refactor grounding by @Jintao-Huang in #3000
- compatible with trl v0.13 by @hjh0119 in #2992
- update R1 dataset by @Jintao-Huang in #3005
- fix qwen2.5-vl grounding (refactor) by @Jintao-Huang in #2979
- fix deploy by @Jintao-Huang in #3007
- support infer metric: acc/rouge or bleu by @Jintao-Huang in #3008
- support deepseek janus pro by @Jintao-Huang in #3009
- update readme by @Jintao-Huang in #3011
- fix parse_dict by @Jintao-Huang in #3012
- update docs by @Jintao-Huang in #3015
- Fix readme & update docs by @Jintao-Huang in #3018
- fix push to hub by @tastelikefeet in #3024
- Fix bugs by @Jintao-Huang in #3025
- fix bugs by @Jintao-Huang in #3026
- Fix qwen tool template to official format by @Leoyzen in #2988
- fix message merging strategy when multi-turn tool calling. by @Leoyzen in #2986
New Contributors
Full Changelog: v3.0.3...v3.1.0