-
Notifications
You must be signed in to change notification settings - Fork 3.4k
add function call parser for DeepSeek V3 #5054
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
|
great work,let me check it,please wait for a moment :-) |
thanks for replying, actually I found some bugs here. I will update this PR later. |
@finger92 I found the deepseek.jinja and function_call_parser.py has many chinese character, such as <|tool▁calls▁begin|>, <|tool▁sep|> etc. |
you are right, you can also see them in the tokenizor.json for deepseek model |
Co-authored-by: 晟海 <[email protected]> Co-authored-by: laixinn <[email protected]>
…l-project#5050) Co-authored-by: Qingquan Song <[email protected]> Co-authored-by: Chunan Zeng <[email protected]>
Co-authored-by: wunhuang <[email protected]>
Co-authored-by: laixinn <[email protected]> Co-authored-by: sleepcoo <[email protected]> Co-authored-by: zhyncs <[email protected]>
Signed-off-by: Shangming Cai <[email protected]> Co-authored-by: Shangming Cai <[email protected]> Co-authored-by: Xuchun Shang <[email protected]> Co-authored-by: shangmingc <[email protected]>
|
@finger92 Why did you turn this pr off? |
Motivation
support function call for deepseek model issued by #4379
Modifications
add deepseek v3 function call parser
Checklist
validation steps
environment:
H20 * 8
deepseek.jinja
Attention
and I cant find token with id "129279" in deepseek model's tokenizor.json. but this work fine with "outlines", so I add "--grammar-backend outlines" to launch cmd.
3. use a smaller temperature. I found that the default temperature will cause model output unstable for function calling