Add NVIDIA Nemotron Nano tool call support #83

aldehir · 2025-08-31T21:29:06Z

Add support for nvidia/NVIDIA-Nemotron-Nano-9B-v2.

The template strips the last message if role == "assistant", causing it to fail the tool call checks. I added tool responses to make it happy, but I understand this somewhat overlaps with the tool call response check. I'm happy to change the implementation if there is a better alternative.

ref: ggml-org/llama.cpp#15676 (comment)

pwilkin · 2025-08-31T21:35:01Z

One obvious solution would be to pad the tool call testing message with another assistant message at the end.

Edit: nvm, didn't look at the commits, I see you already did that :)

aldehir added 3 commits August 31, 2025 15:42

add nvidia nemotron template and fix tool call check

282bd11

add tool response to parallel tool calling check

94ad946

comment out requires_non_null_content expect

6ae38c1

fix possible false positive in parallel tool call check

31af2a3

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add NVIDIA Nemotron Nano tool call support #83

Add NVIDIA Nemotron Nano tool call support #83

Uh oh!

aldehir commented Aug 31, 2025

Uh oh!

pwilkin commented Aug 31, 2025 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Add NVIDIA Nemotron Nano tool call support #83

Are you sure you want to change the base?

Add NVIDIA Nemotron Nano tool call support #83

Uh oh!

Conversation

aldehir commented Aug 31, 2025

Uh oh!

pwilkin commented Aug 31, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

pwilkin commented Aug 31, 2025 •

edited

Loading