From e8164ae52136cf3c5810ff31d37e49a01cb4006b Mon Sep 17 00:00:00 2001 From: Matthew Isabel Date: Fri, 20 Sep 2024 18:23:05 -0400 Subject: [PATCH] Update rate limits for o1 models (#52377) Co-authored-by: Will Perry Co-authored-by: skedwards88 --- .../prototyping-with-ai-models.md | 52 ++++++++++++++++++- 1 file changed, 51 insertions(+), 1 deletion(-) diff --git a/content/github-models/prototyping-with-ai-models.md b/content/github-models/prototyping-with-ai-models.md index 4a341cf4194f..3ba7b9f51069 100644 --- a/content/github-models/prototyping-with-ai-models.md +++ b/content/github-models/prototyping-with-ai-models.md @@ -126,7 +126,7 @@ Low, high, and embedding models have different rate limits. To see which type of 4 - Embedding + Embedding Requests per minute 15 15 @@ -150,6 +150,56 @@ Low, high, and embedding models have different rate limits. To see which type of 5 8 + + Azure OpenAI o1-preview + Requests per minute + 1 + 2 + 2 + + + Requests per day + 8 + 10 + 12 + + + Tokens per request + 4000 in, 4000 out + 4000 in, 4000 out + 4000 in, 8000 out + + + Concurrent requests + 1 + 1 + 1 + + + Azure OpenAI o1-mini + Requests per minute + 2 + 3 + 3 + + + Requests per day + 12 + 15 + 20 + + + Tokens per request + 4000 in, 4000 out + 4000 in, 4000 out + 4000 in, 4000 out + + + Concurrent requests + 1 + 1 + 1 + These limits are subject to change without notice.