[App] Resolve some bugs from the Training Studio scaling #16114
⚡ Required checks status: All passing 🟢
Groups summary:
🟢 lightning_app: Tests workflow
🟢 lightning_app: Examples
🟢 lightning_app: Azure
🟢 lightning_app: Docs
🟢 mypy
🟢 install
Thank you for your contribution! 💜
LGTM 😃
I'm uncomfortable with us looping a request towards a server without pausing
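To make the concern concrete, here is a minimal sketch of a polling loop that pauses between requests and backs off exponentially. It is not the code from this PR: `poll_until`, its parameters, and the injected `fetch`, `is_done`, and `sleep` callables are hypothetical names chosen for illustration.

```python
import time
from typing import Callable, TypeVar

T = TypeVar("T")


def poll_until(
    fetch: Callable[[], T],
    is_done: Callable[[T], bool],
    interval: float = 1.0,
    max_interval: float = 30.0,
    timeout: float = 300.0,
    sleep: Callable[[float], None] = time.sleep,
) -> T:
    """Call `fetch` repeatedly until `is_done(result)` is true.

    Hypothetical helper: pauses between attempts, doubles the pause after
    each unsuccessful attempt (capped at `max_interval`), and gives up once
    `timeout` seconds have elapsed.
    """
    deadline = time.monotonic() + timeout
    delay = interval
    while True:
        result = fetch()
        if is_done(result):
            return result
        # Stop before sleeping past the deadline.
        if time.monotonic() + delay > deadline:
            raise TimeoutError("condition not met before the timeout")
        sleep(delay)  # the pause that keeps the loop from hammering the server
        delay = min(delay * 2, max_interval)


if __name__ == "__main__":
    # Stand-in for a server that reports "ready" on the third request.
    responses = iter(["pending", "pending", "ready"])
    status = poll_until(lambda: next(responses), lambda r: r == "ready", interval=0.1)
    print(status)
```

Passing `sleep` as a parameter keeps the loop testable without real waiting; whether exponential backoff or a fixed interval suits this client better is a judgment call for the authors.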
Co-authored-by: thomas <[email protected]>
Co-authored-by: thomas <[email protected]> (cherry picked from commit 51ec949)
* chlog continued
* [App] PoC: Add support for Request (#16047) (cherry picked from commit 592b126)
* Have checkgroup pull the latest runs (#16033) (cherry picked from commit 4c91dec)
* Update Multinode Warning (#16091) (cherry picked from commit 3d509f6)
* [App] Serve datatypes with better client code (#16018) (cherry picked from commit 23013be)
* docs: add PT version (#16010)
* docs: add PT version
* stable Co-authored-by: Adrian Wälchli <[email protected]> Co-authored-by: Adrian Wälchli <[email protected]> (cherry picked from commit c8b4bce)
* add 1.13.1 to adjust versions (#16099) (cherry picked from commit acd48d3)
* [App] Add display name property to the work (#16095) Co-authored-by: thomas <[email protected]> (cherry picked from commit 22b254f)
* Fix detection of whether app is running in cloud (#16045) (cherry picked from commit e54f4f5)
* [App] Add work.delete (#16103) Co-authored-by: thomas <[email protected]> (cherry picked from commit 80b7faf)
* [App] Improve the autoscaler UI (#16063) [App] Improve the autoscaler UI (#16063) (cherry picked from commit 39d27f6)
* [App] Min replica=0 would break autoscaler component (#16092)
* fixing the bug where num_replica=0 would fail
* changelog (cherry picked from commit aba5f12)
* [App] Scale out/in interval for autoscaler (#16093)
* Adding arguments for scale out/in interval
* Tests (cherry picked from commit 0fd3d54)
* Set the default work start method to spawn on MacOS (#16089) (cherry picked from commit f3157f3)
* [App] Add status endpoint, enable `ready` (#16075) Co-authored-by: thomas chaton <[email protected]> (cherry picked from commit 2a85d9b)
* Clarify `work.stop()` limitation (#16073) (cherry picked from commit 0abd686)
* Update torchvision requirement from <=0.14.0,>=0.11.1 to >=0.11.1,<0.15.0 in /requirements (#16108) Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> Co-authored-by: Jirka <[email protected]> (cherry picked from commit e8ac9e9)
* CI: settle file names (#16098)
* CI: settle file names
* rename
* [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> (cherry picked from commit a203b04)
* [App] Resolve some bugs from the Training Studio scaling (#16114) Co-authored-by: thomas <[email protected]> (cherry picked from commit 51ec949)
* Update scikit-learn requirement from <1.1.3,>0.22.1 to >0.22.1,<1.2.1 in /requirements (#16107) Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> (cherry picked from commit d070e72)
* [App] update app testing (#16000) Co-authored-by: thomas <[email protected]> (cherry picked from commit 8b68e20)
* [App] Change overwrite to True (#16009) Co-authored-by: thomas <[email protected]> (cherry picked from commit 085c6b3)
* [App] Improve PythonServer info message on startup (#15989)
* change msg
* update chgl
* show the user's class name (cherry picked from commit 848ea56)
* [App] Cold start proxy in autoscaler (#16094)
* cold start proxy
* Update src/lightning_app/components/serve/auto_scaler.py
* changelog
* better-doc Co-authored-by: Akihiro Nitta <[email protected]> Co-authored-by: thomas chaton <[email protected]> Co-authored-by: Jirka Borovec <[email protected]> (cherry picked from commit 7cbdc68)
* Fix endpoint information tab not showing up in AutoScaler UI (#16128)
* .
* why
* Revert "why" This reverts commit 375d3e8.
* tried api access with fixed values
* Revert "tried api access with fixed values" This reverts commit f1720f6.
* Fix typo 🎉
* update chglog
* revert removing lines in chlog
* update chglog Co-authored-by: Akihiro Nitta <[email protected]> (cherry picked from commit 7b275bc)
* [App] Implement `ready` for components (#16129) (cherry picked from commit 711aec5)
* [App] Fix e2e tests (#16146) (cherry picked from commit c98b0c4)
* [App] Remove outdated warning from cloud requirements (#16140) (cherry picked from commit bf5fa35)
* [App] Fix support for streamlit > 1.14 (#16139) (cherry picked from commit ec336bc)
* Fix broken link in "Build a Model" section of docs (#16025) fix build model link (cherry picked from commit 1c02288)
* drop colossalai from testing as no stable release yet (#16122) (cherry picked from commit 1e16836)
* [App] Simplify messaging in cloud dispatch (#16160) (cherry picked from commit 9657671)
* [App] Add annotations endpoint (#16159) (cherry picked from commit 0630444)
* Small fix in test_cli.py to avoid failure with future version of jsonargparse (#16156) (cherry picked from commit 50331e0)
* Fix test failing on master due to bad auto-merge (#16118) (cherry picked from commit 07b775f)
* ver1.8.6 Co-authored-by: Sherin Thomas <[email protected]> Co-authored-by: thomas chaton <[email protected]> Co-authored-by: Carlos Mocholí <[email protected]> Co-authored-by: Justus Schock <[email protected]> Co-authored-by: Adrian Wälchli <[email protected]> Co-authored-by: Akihiro Nitta <[email protected]> Co-authored-by: Ethan Harris <[email protected]> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> Co-authored-by: Mauricio Villegas <[email protected]>
What does this PR do?
This PR resolves a couple of issues found during Training Studio development:
This PR can be split into smaller pieces if reviewers request it.
Fixes #<issue_number>
Does your PR introduce any breaking changes? If yes, please list them.
Before submitting
PR review
Anyone in the community is welcome to review the PR.
Before you start reviewing, make sure you have read the review guidelines. In short, see the following bullet-list:
Did you have fun?
Make sure you had fun coding 🙃
cc @Borda