
Structure completion request to maximize Prompt Caching #281

@brandonh-msft

Description


Describe the feature or improvement you are requesting

Today, the flow of a request to an OpenAI service relies on simple JSON serialization of the options model: the model is encoded to BinaryData and sent through the pipeline as-is.

This does not maximize Prompt Caching, which matches requests on the longest common prefix of the prompt. To benefit, the completion request should serialize tools first, then conversation history, then the new content, in that order.
Additionally, the tools and history must appear in the same order on every request (suggestion: alphabetical order by tool name). See the caller-side sketch after the sources below.

Sources:
https://learn.microsoft.com/en-us/azure/ai-services/openai/how-to/prompt-caching
https://openai.com/index/api-prompt-caching/
https://learn.microsoft.com/en-us/azure/ai-services/openai/how-to/prompt-caching#what-is-cached
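
As a caller-side illustration of that ordering, here is a minimal sketch against the OpenAI .NET `ChatClient` surface; `chatClient`, `availableTools`, `history`, and `newUserInput` are assumed placeholders:

```csharp
using System;
using System.Collections.Generic;
using System.Linq;
using OpenAI.Chat;

ChatCompletionOptions options = new();

// Sort tools alphabetically by function name so they serialize in the
// same order on every request, keeping the prompt prefix byte-identical.
foreach (ChatTool tool in availableTools.OrderBy(t => t.FunctionName, StringComparer.Ordinal))
{
    options.Tools.Add(tool);
}

// Stable history first, the new (variable) turn last.
List<ChatMessage> messages = history.ToList();
messages.Add(new UserChatMessage(newUserInput));

ChatCompletion completion = chatClient.CompleteChat(messages, options);
```

Even with careful ordering at the call site, though, the property order of the serialized body is fixed by the SDK's `Write` implementation shown below, which is what this issue asks to change.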

Asks for BinaryData from the options:

```csharp
using BinaryContent content = options.ToBinaryContent();
```
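
For context, that `BinaryContent` becomes the request body verbatim; nothing downstream reorders it. A simplified `System.ClientModel` sketch (not the exact generated code; `pipeline` is an assumed `ClientPipeline`):

```csharp
using PipelineMessage message = pipeline.CreateMessage();
message.Request.Method = "POST";
message.Request.Content = content; // the serialized bytes go on the wire as-is
pipeline.Send(message);
```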

Writes the JSON doc in non-optimal order:

```csharp
void IJsonModel<ChatCompletionOptions>.Write(Utf8JsonWriter writer, ModelReaderWriterOptions options)
```

Uses the non-optimal serialization when constructing the BinaryData for the options:

```csharp
internal virtual BinaryContent ToBinaryContent()
{
    return BinaryContent.Create(this, ModelSerializationExtensions.WireOptions);
}
```
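
A possible shape for the fix, as a standalone sketch with hypothetical stand-in types (the real SDK models are richer, and the ordering would live in the `IJsonModel<ChatCompletionOptions>.Write` implementation above): tools first, alphabetized, then messages with the new content last, then everything else.

```csharp
using System;
using System.Collections.Generic;
using System.IO;
using System.Linq;
using System.Text.Json;

// Hypothetical stand-in types for illustration only.
record ToolDef(string Name, string Description);
record Message(string Role, string Content);

static class CacheOptimalSerializer
{
    // Writes the body in the order prompt caching rewards:
    // tools (alphabetized) -> history -> new content -> remaining options.
    public static byte[] Serialize(
        IEnumerable<ToolDef> tools,
        IReadOnlyList<Message> history,
        Message newMessage,
        string model)
    {
        using var stream = new MemoryStream();
        using var writer = new Utf8JsonWriter(stream);

        writer.WriteStartObject();

        // 1. Tools first, sorted by name, so the byte sequence is identical
        //    across requests and forms the longest cacheable prefix.
        writer.WriteStartArray("tools");
        foreach (ToolDef tool in tools.OrderBy(t => t.Name, StringComparer.Ordinal))
        {
            writer.WriteStartObject();
            writer.WriteString("name", tool.Name);
            writer.WriteString("description", tool.Description);
            writer.WriteEndObject();
        }
        writer.WriteEndArray();

        // 2. Conversation history next, in its original (stable) order,
        //    with the new content appended at the very end.
        writer.WriteStartArray("messages");
        foreach (Message m in history.Append(newMessage))
        {
            writer.WriteStartObject();
            writer.WriteString("role", m.Role);
            writer.WriteString("content", m.Content);
            writer.WriteEndObject();
        }
        writer.WriteEndArray();

        // 3. Everything after the cacheable prefix.
        writer.WriteString("model", model);

        writer.WriteEndObject();
        writer.Flush();
        return stream.ToArray();
    }
}
```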

Additional context

microsoft/semantic-kernel#9444
