feat(gcpvertexai): do HTTP 429 like retries for Anthropic API HTTP 529 overloaded status code #2280

uddhav · 2025-04-20T05:35:16Z

GCP Vertex AI: Add retry handling for Anthropic API 529 overloaded status code

Adds support for handling Anthropic's HTTP 529 "API Overloaded" status code in the GCP Vertex AI provider. This status code indicates temporary backend capacity issues rather than quota exhaustion.

Changes

Added detection and retry logic for 529 "API overloaded" responses
Applied the same backoff strategy used for 429 rate limit errors
Enhanced error messages to distinguish between rate limits and overloaded states
Added a unit test to verify correct 529 status code handling
chore: cleaned up the deprecated GCP Vertex AI model ID for Gemini 2.0 Pro Experimental
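
For readers unfamiliar with the pattern, the retry behaviour described in the changes above can be sketched roughly as follows. This is an illustrative sketch only, not the provider's actual code: the function names, the retry count, and the backoff constants are all assumptions, and the `send` callable stands in for whatever performs the real HTTP request.

```python
import time
from typing import Callable, Tuple

# Hypothetical names and values throughout; not taken from the PR itself.
STATUS_RATE_LIMITED = 429  # quota or rate limit exceeded
STATUS_OVERLOADED = 529    # Anthropic "API overloaded"
RETRYABLE = {STATUS_RATE_LIMITED, STATUS_OVERLOADED}


def describe(status: int) -> str:
    """Return a message that tells the two retryable cases apart."""
    if status == STATUS_RATE_LIMITED:
        return "rate limit exceeded (HTTP 429)"
    if status == STATUS_OVERLOADED:
        return "API overloaded (HTTP 529)"
    return f"HTTP {status}"


def backoff_delay(attempt: int, base: float = 1.0, cap: float = 30.0) -> float:
    """Capped exponential backoff; attempt is 0-based. Constants are assumed."""
    return min(cap, base * (2 ** attempt))


def send_with_retry(
    send: Callable[[], Tuple[int, str]],
    max_retries: int = 3,
    sleep: Callable[[float], None] = time.sleep,
) -> Tuple[int, str]:
    """Call send() and retry on 429 or 529 using the same backoff for both."""
    for attempt in range(max_retries + 1):
        status, body = send()
        if status not in RETRYABLE:
            return status, body
        if attempt == max_retries:
            raise RuntimeError(
                f"{describe(status)}: gave up after {max_retries} retries"
            )
        sleep(backoff_delay(attempt))
    raise AssertionError("unreachable")
```

Both status codes share one backoff path here, and only the error message differs, which mirrors the intent stated in the list above. Passing `sleep` as a parameter is a convenience for testing without real delays.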

This improves reliability when interacting with Anthropic models through the Vertex AI provider during high-traffic periods.


baxen · 2025-06-16T23:31:29Z

We're doing a cleanup run on providers for #2953. Since this is now a bit out of date with other changes, I suggest we close this PR and include this in the refactor. Thank you for the contrib!

uddhav marked this pull request as ready for review April 20, 2025 05:37

uddhav force-pushed the gcp-vertex-ai-overloaded-retry branch from d88ab0f to ab94cfc Compare April 21, 2025 22:42

feat(gcpvertexai): do HTTP 429 like retries for Anthropic API HTTP 52…

dfbb7e4

…9 overloaded status code

uddhav force-pushed the gcp-vertex-ai-overloaded-retry branch from ab94cfc to dfbb7e4 Compare May 2, 2025 01:51

baxen closed this Jun 16, 2025

uddhav mentioned this pull request Jun 22, 2025

feat(gcpvertexai): do HTTP 429 like retries for Anthropic API HTTP 529 overloaded status code #3026

Merged
