[Feature][Response API] Support `num_cached_tokens` and `num_reasoning_tokens` in ResponseUsage

### 🚀 The feature, motivation and pitch

These attributes are important for monitoring model behavior. But both variables are set to 0 now.
https://github.com/vllm-project/vllm/blob/8a19303173881e4197c7656727c6f2b296faa7fc/vllm/entrypoints/context.py#L70-L74

https://github.com/vllm-project/vllm/pull/22667 implements `num_prompt_tokens` and `num_output_tokens`, but we still need help for `num_cached_tokens` and `num_reasoning_tokens`.

When implementing this, please note that
1. gpt-oss has built-in tool calls so one request can trigger multiple rounds of generation https://github.com/vllm-project/vllm/blob/8a19303173881e4197c7656727c6f2b296faa7fc/vllm/entrypoints/openai/serving_engine.py#L954
2. For non_streaming case, Context.append_output is called for each round of generation, while for streaming case, append_output is called for each output token. 

### Alternatives

_No response_

### Additional context

_No response_

### Before submitting a new issue...

- [x] Make sure you already searched for relevant issues, and asked the chatbot living at the bottom right corner of the [documentation page](https://docs.vllm.ai/en/latest/), which can answer lots of frequently asked questions.

	# TODO(woosuk): Implement the following fields.
	self.num_prompt_tokens = 0
	self.num_cached_tokens = 0
	self.num_output_tokens = 0
	self.num_reasoning_tokens = 0

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

[Feature][Response API] Support `num_cached_tokens` and `num_reasoning_tokens` in ResponseUsage #23363

🚀 The feature, motivation and pitch

Alternatives

Additional context

Before submitting a new issue...

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Uh oh!

[Feature][Response API] Support num_cached_tokens and num_reasoning_tokens in ResponseUsage #23363

Description

🚀 The feature, motivation and pitch

Alternatives

Additional context

Before submitting a new issue...

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions

[Feature][Response API] Support `num_cached_tokens` and `num_reasoning_tokens` in ResponseUsage #23363