Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Sampler #2905

Merged
merged 64 commits into from
Jan 13, 2025
Merged

Sampler #2905

merged 64 commits into from
Jan 13, 2025

Conversation

tastelikefeet
Copy link
Collaborator

PR type

  • Bug Fix
  • New Feature
  • Document Updates
  • More Models or Datasets Support

PR information

Write the detail information belongs to this PR.

Experiment results

Paste your experiment result here(if needed).

* main:
  Support LoRA-GA (modelscope#2650)
  fix swift/Infinity-Instruct (modelscope#2651)
  update truncation_strategy (modelscope#2647)
  fix bugs (modelscope#2645)
  Fix post encode (modelscope#2643)
  fix app-ui (modelscope#2641)
  fix bugs & support openbuddy llama3.3 & update docs (modelscope#2638)
  fix dataset (modelscope#2636)
  fix add_default_tag (modelscope#2631)
  support reward model (modelscope#2628)
* commit '07f10d2a94e7342413fa7762b6ce6b101b93d130': (86 commits)
  Move optimizer to create_optimizer (modelscope#2851)
  support reward_model (modelscope#2849)
  1. fix hub ignore-pattern (modelscope#2848)
  Fix bugs (modelscope#2838)
  Update base_to_chat shell (modelscope#2833)
  Update padding side (modelscope#2832)
  Fix glm4v suffix (modelscope#2829)
  add 'right' option for 'truncation_strategy' (modelscope#2754)
  update docs (specific model arguments) (modelscope#2822)
  fix enable_cache (modelscope#2813)
  fix citest (modelscope#2812)
  support ZhipuAI/cogagent-9b-20241220 (modelscope#2810)
  fix swift deploy log error (repeat log) (modelscope#2808)
  fix glm4v (modelscope#2806)
  update base_model deploy example (modelscope#2803)
  fix world_size (modelscope#2801)
  fix (modelscope#2800)
  support swift app (modelscope#2792)
  fix some web-ui bugs (modelscope#2794)
  fix stream infer (modelscope#2793)
  ...

# Conflicts:
#	examples/train/multi-gpu/ddp/train.sh
#	swift/llm/__init__.py
#	swift/llm/argument/rlhf_args.py
#	swift/llm/template/base.py
#	swift/llm/template/template_inputs.py
#	swift/llm/template/utils.py
#	swift/llm/train/tuner.py
#	swift/trainers/mixin.py
tastelikefeet and others added 28 commits January 6, 2025 14:24
* commit 'a0d0351400d522392fb4535567bab83d8b9d45b2':
  Support infer n parameter (modelscope#2893)
  support multi round dpo (modelscope#2884)
  fix docs (modelscope#2882)
  update qlora shell (modelscope#2880)
  fix bugs (modelscope#2876)
  fix citest (modelscope#2873)
  Support ppo (modelscope#2783)
  fix bugs (modelscope#2869)
  Update agent demo (modelscope#2867)
  support mps (modelscope#2866)
  fix vllm video (modelscope#2864)
  support reward model train (modelscope#2862)
  fix jsonl writer (modelscope#2860)
  Support quant bert reward (modelscope#2859)

# Conflicts:
#	examples/train/rlhf/ppo.sh
#	swift/trainers/__init__.py
#	swift/trainers/mixin.py
#	swift/trainers/rlhf_trainer/ppo_trainer.py
* commit '65e4b26cc433878dbb4d67b0d1ae97287814bfa4':
  fix link & bug (modelscope#2902)
  Add phi4 (modelscope#2895)
  fix infer engine (modelscope#2898)
  Fix qwen vl eval (modelscope#2892)
@tastelikefeet tastelikefeet merged commit e9f4f9f into modelscope:main Jan 13, 2025
1 of 2 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants