[Mistral Grammar] Fix tool and reasoning parsing.#2
juliendenize wants to merge 16 commits into `improve_mistral_parsing`
Conversation
juliendenize
left a comment
Adding a bunch of comments for context
```python
@model_validator(mode="before")
@classmethod
def set_include_reasoning_for_none_effort(cls, data: Any) -> Any:
    if data.get("reasoning_effort") == "none":
        data["include_reasoning"] = False
    return data
```
This was introduced by me when adding `reasoning_effort="none"`. It was a bad idea: sometimes the model would fail to skip reasoning anyway, and `include_reasoning` can have side effects on the parsers.
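For context, the removed validator's effect can be sketched with a plain function (a hypothetical stand-in; the real code lives in a pydantic request model):

```python
# Hypothetical sketch of the removed behavior: forcing include_reasoning
# whenever reasoning_effort == "none", silently overriding the caller.
def set_include_reasoning_for_none_effort(data: dict) -> dict:
    if data.get("reasoning_effort") == "none":
        # Side effect: overwrites an explicit include_reasoning=True,
        # which can confuse downstream reasoning parsers.
        data["include_reasoning"] = False
    return data

data = set_include_reasoning_for_none_effort(
    {"reasoning_effort": "none", "include_reasoning": True}
)
# The caller asked for include_reasoning=True but ends up with False.
```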
```python
# Override to ensure both image_processor and tokenizer are used.
# The base get_attributes() introspects __init__ parameters and
# misses image_processor since it is created internally rather
# than passed as an __init__ argument.
```
This is due to a regression that was recently introduced. I'd need to check on Monday whether this fix is sound, and if so I'll open another PR just for that.
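The failure mode can be illustrated with a minimal sketch (all names here are hypothetical stand-ins, not vLLM's actual classes): signature introspection only sees what is passed to `__init__`, not what `__init__` builds internally.

```python
# Sketch of why __init__ introspection misses internally-created
# attributes such as image_processor.
import inspect

class Processor:  # hypothetical stand-in
    def __init__(self, tokenizer):
        self.tokenizer = tokenizer
        # Built internally, so it never appears in the __init__
        # signature that a get_attributes()-style helper inspects.
        self.image_processor = object()

def get_attributes(cls):
    # Stand-in for the base implementation: list __init__ parameters.
    sig = inspect.signature(cls.__init__)
    return [name for name in sig.parameters if name != "self"]

print(get_attributes(Processor))  # image_processor is missed
```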
```python
_from_tool_parser: bool = field(default=False, init=False)
"""CAUTION: Should only be set by ToolParser.adjust_request"""
```
Needed to know without ambiguity whether the grammar is active.
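The intended contract can be sketched as follows (hypothetical stand-in classes; the real `adjust_request` and structured-outputs params live in vLLM): only the tool parser flips the flag, so later code can tell the grammar came from it.

```python
# Sketch of the _from_tool_parser contract: only adjust_request sets it.
from dataclasses import dataclass, field
from typing import Optional

@dataclass
class StructuredOutputs:  # hypothetical stand-in
    grammar: Optional[str] = None
    _from_tool_parser: bool = field(default=False, init=False)
    """CAUTION: Should only be set by ToolParser.adjust_request"""

class ToolParser:  # hypothetical stand-in
    def adjust_request(self, so: StructuredOutputs) -> StructuredOutputs:
        so.grammar = "<lark grammar from mistral-common>"
        so._from_tool_parser = True  # unambiguous: grammar is active
        return so

so = ToolParser().adjust_request(StructuredOutputs())
```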
```python
_is_mistral_tool_parser = self.tool_parser is not None and issubclass(
    self.tool_parser, MistralToolParser
)
if _is_mistral_tool_parser and self.reasoning_parser_cls is not None:
    MistralToolParser.model_can_reason = True
```
This is maybe not the cleanest, but it is the best non-intrusive way I found to let the grammar know whether reasoning should be expected.
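The wiring can be sketched like this (hypothetical stand-ins for the serving layer; only the class-attribute idea matches the diff above): when both a Mistral tool parser and a reasoning parser are configured, a class attribute signals that the grammar should expect reasoning output.

```python
# Sketch of the class-attribute signal set at serving-layer init time.
class MistralToolParser:  # stand-in for the real parser class
    model_can_reason = False

class Serving:  # hypothetical stand-in for the serving layer
    def __init__(self, tool_parser, reasoning_parser_cls):
        self.tool_parser = tool_parser
        self.reasoning_parser_cls = reasoning_parser_cls
        _is_mistral_tool_parser = self.tool_parser is not None and issubclass(
            self.tool_parser, MistralToolParser
        )
        if _is_mistral_tool_parser and self.reasoning_parser_cls is not None:
            # Non-intrusive signal read later by the grammar factory.
            MistralToolParser.model_can_reason = True

Serving(MistralToolParser, reasoning_parser_cls=object)
```

One trade-off of this design is that the flag is global mutable class state, which is presumably why the author calls it "not the cleanest".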
Signed-off-by: juliendenize <julien.denize@mistral.ai>
Purpose
When Mistral models are served with `--tool-call-parser mistral` and a `mistral_common`-compatible tokenizer (tekken/v11+), since this PR, tool parsing takes a grammar-based approach: `adjust_request` injects a Lark grammar from `mistral-common`'s grammar factory into `structured_outputs`. This grammar constrains the model output to follow valid Mistral tool-call formatting at the decoding level.

However, that PR only handled the grammar injection, which broke vLLM's tool parsing. This PR makes the serving layer actually use the Mistral tool parser and reasoning parser to parse the grammar-constrained output, instead of falling through to the generic vLLM tool-call parsing paths, which do not understand Mistral's format.
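The routing this PR fixes can be sketched as follows (a hypothetical simplification; the parser names and the exact output format are illustrative, though `[TOOL_CALLS]` is Mistral's actual tool-call marker): grammar-constrained output must reach the Mistral parser rather than the generic path.

```python
# Sketch: route grammar-constrained output to the Mistral parser
# instead of vLLM's generic tool-call parsing.
def parse_output(text: str, use_mistral_parser: bool) -> dict:
    if use_mistral_parser:
        marker = "[TOOL_CALLS]"
        if text.startswith(marker):
            return {"tool_calls": text[len(marker):], "content": None}
    # Generic fallback: everything is treated as plain content, which is
    # what happened before this PR and broke Mistral tool calls.
    return {"tool_calls": None, "content": text}

out = parse_output('[TOOL_CALLS][{"name": "get_weather"}]',
                   use_mistral_parser=True)
```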
Test Plan
The branch adds:
Test Result
The tests pass.
Essential Elements of an Effective PR Description Checklist
`supported_models.md` and `examples` for a new model.