
Handle recovery from resize failure #73036

Open · gnufied opened this issue Jan 17, 2019 · 10 comments

Labels: kind/bug · lifecycle/frozen · sig/storage

Comments

@gnufied (Member) commented Jan 17, 2019

Currently, when resizing a volume fails, the resize keeps retrying indefinitely. This leads to two problems:

  1. Unnecessary API usage for an action that is not going to succeed (for example, when you are out of quota on the cloud provider or out of bricks on GlusterFS).

  2. Sometimes a user may want to retry volume expansion with a lower value. For example: my current PVC is 12GB and I tried to expand it to 40GB, but that failed. Now I want to retry the expansion with 20GB, except I can't (a concrete sketch of this attempt follows below).

/sig storage

cc @bswartz @saad-ali
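
For concreteness, a minimal sketch (not part of the original report) of what "retry with a smaller value" looks like against the API using a recent client-go; the namespace `default`, the PVC name `my-pvc`, and the sizes are placeholders. With the current apiserver validation, any decrease of `spec.resources.requests.storage` is rejected, so this patch fails, which is exactly the limitation described in point 2:

```go
package main

import (
	"context"
	"fmt"

	metav1 "k8s.io/apimachinery/pkg/apis/meta/v1"
	"k8s.io/apimachinery/pkg/types"
	"k8s.io/client-go/kubernetes"
	"k8s.io/client-go/tools/clientcmd"
)

func main() {
	// Build a client from the default kubeconfig (~/.kube/config).
	config, err := clientcmd.BuildConfigFromFlags("", clientcmd.RecommendedHomeFile)
	if err != nil {
		panic(err)
	}
	client, err := kubernetes.NewForConfig(config)
	if err != nil {
		panic(err)
	}

	// The PVC was 12Gi, an expansion to 40Gi failed, and we now try to lower
	// the request to 20Gi. "default" and "my-pvc" are hypothetical names.
	patch := []byte(`{"spec":{"resources":{"requests":{"storage":"20Gi"}}}}`)
	_, err = client.CoreV1().PersistentVolumeClaims("default").Patch(
		context.TODO(), "my-pvc", types.StrategicMergePatchType, patch, metav1.PatchOptions{})
	if err != nil {
		// Expected with current validation: the apiserver forbids lowering
		// the requested storage below its previous value.
		fmt.Println("patch rejected:", err)
	}
}
```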

@gnufied added the kind/bug label Jan 17, 2019
@k8s-ci-robot added the sig/storage label Jan 17, 2019
@bswartz (Contributor) commented Jan 17, 2019

I honestly don't see (1) as a problem. This is how everything in Kubernetes works.

(2) is the case that interests me. As long as the actual size doesn't change, it should be legal to "cancel" the resize request by changing the spec size back to the old value, or, as you say, to retry the resize with a smaller increment.

I think we also want to think about future compatibility with a volume-shrink feature. It's inevitable that users will ask for it, and it's inevitable that at least a subset of implementers will want to implement it. Assuming we allow that feature to happen, we'll want the interface to work straightforwardly.
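
A minimal sketch of the relaxed rule suggested above (my reading of the comment, not project code; `decreaseAllowed` and its parameters are hypothetical names): lowering the requested size would be accepted as long as it stays at or above the size the volume actually has, so it only ever cancels a pending expansion and never implies a shrink.

```go
package validation

import "k8s.io/apimachinery/pkg/api/resource"

// decreaseAllowed reports whether changing a PVC's requested storage from
// oldSpec to newSpec is acceptable, given the size the volume actually has
// (actualCapacity, i.e. status.capacity). Hypothetical helper, not the real
// apiserver validation.
func decreaseAllowed(oldSpec, newSpec, actualCapacity resource.Quantity) bool {
	if newSpec.Cmp(oldSpec) >= 0 {
		// Growing (or keeping) the request is always allowed.
		return true
	}
	// Shrinking the request only cancels a pending expansion as long as it
	// does not drop below what the volume already is.
	return newSpec.Cmp(actualCapacity) >= 0
}
```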

@fejta-bot

Issues go stale after 90d of inactivity.
Mark the issue as fresh with /remove-lifecycle stale.
Stale issues rot after an additional 30d of inactivity and eventually close.

If this issue is safe to close now please do so with /close.

Send feedback to sig-testing, kubernetes/test-infra and/or fejta.
/lifecycle stale

@k8s-ci-robot added the lifecycle/stale label Apr 17, 2019
@fejta-bot

Stale issues rot after 30d of inactivity.
Mark the issue as fresh with /remove-lifecycle rotten.
Rotten issues close after an additional 30d of inactivity.

If this issue is safe to close now please do so with /close.

Send feedback to sig-testing, kubernetes/test-infra and/or fejta.
/lifecycle rotten

@k8s-ci-robot added the lifecycle/rotten label and removed the lifecycle/stale label May 17, 2019
@fejta-bot

Rotten issues close after 30d of inactivity.
Reopen the issue with /reopen.
Mark the issue as fresh with /remove-lifecycle rotten.

Send feedback to sig-testing, kubernetes/test-infra and/or fejta.
/close

@k8s-ci-robot (Contributor)

@fejta-bot: Closing this issue.

In response to this:

Rotten issues close after 30d of inactivity.
Reopen the issue with /reopen.
Mark the issue as fresh with /remove-lifecycle rotten.

Send feedback to sig-testing, kubernetes/test-infra and/or fejta.
/close

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

@gnufied (Member, Author) commented Jul 18, 2019

/reopen
/remove-lifecycle-rotten
/lifecycle frozen

@k8s-ci-robot (Contributor)

@gnufied: Reopened this issue.

In response to this:

/reopen
/remove-lifecycle-rotten
/lifecycle frozen

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

@k8s-ci-robot reopened this Jul 18, 2019
@k8s-ci-robot added the lifecycle/frozen label and removed the lifecycle/rotten label Jul 18, 2019
@andyzhangx (Member)

@gnufied do you know how to cancel a volume resize? E.g. the current size is 4TB and a resize to 6TB failed; how can I change it back to 4TB?

@FengJunLiu

@gnufied https://github.com/kubernetes/enhancements/tree/master/keps/sig-storage/1790-recover-resize-failure#recovery-from-volume-expansion-failure
This KEP describes the drawback of not being able to restore a PVC to its original capacity when expansion fails. Is there a plan to address this drawback in the future? Thanks.

@gnufied moved this to In progress in Volume expansion GA Jun 3, 2024