Skip to content

Conversation

heiruwu
Copy link
Contributor

@heiruwu heiruwu commented Mar 24, 2025

Because

  • current config cannot handle concurrent requests > 100

This commit

  • enlarge max queue size

@heiruwu heiruwu merged commit 1086e6d into main Mar 24, 2025
7 checks passed
@heiruwu heiruwu deleted the update-scale-config branch March 24, 2025 20:23
pinglin pushed a commit that referenced this pull request Apr 18, 2025
🤖 I have created a release *beep* *boop*
---


##
[0.17.0](v0.16.2...v0.17.0)
(2025-04-15)


### Features

* **cli:** enhance CLI functionality and add Docker support
([#268](#268))
([8fa8fed](8fa8fed))
* **ray:** add high scale config
([#261](#261))
([ccf24b2](ccf24b2))


### Bug Fixes

* **client, const:** add secure argument to latest SDK client
([#271](#271))
([1355086](1355086))
* **ray:** align autoscale config
([#263](#263))
([c07b787](c07b787))
* **ray:** fix config not applied
([0bc15de](0bc15de))
* **ray:** override max replica
([7e456ca](7e456ca))
* **ray:** update high scale model config
([#264](#264))
([1086e6d](1086e6d))

---
This PR was generated with [Release
Please](https://github.com/googleapis/release-please). See
[documentation](https://github.com/googleapis/release-please#release-please).
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
No open projects
Status: No status
Development

Successfully merging this pull request may close these issues.

2 participants