-
Notifications
You must be signed in to change notification settings - Fork 787
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Failure to AttachVolume, RequestCanceled: context deadline exceeded #795
Comments
Judging from the information given by the AWS support tech, it seems possible that the controller may be somehow causing Ubuntu's block device driver to release the device name improperly when a volume is detached, or that a device name that is already in use is being chosen. We do not see this issue when using the k8s-provided EBS CSI integration, but that controller does not provide support for gp3 volumes.
|
Updated with a more specific set of repro steps that should hopefully allow others to observe this behavior. |
Thank you @wmgroot we will look into this issue |
Awesome, appreciate the update. |
Issues go stale after 90d of inactivity. If this issue is safe to close now please do so with Send feedback to sig-contributor-experience at kubernetes/community. |
Stale issues rot after 30d of inactivity. If this issue is safe to close now please do so with Send feedback to sig-contributor-experience at kubernetes/community. |
The Kubernetes project currently lacks enough active contributors to adequately respond to all issues and PRs. This bot triages issues and PRs according to the following rules:
You can:
Please send feedback to sig-contributor-experience at kubernetes/community. /close |
@k8s-triage-robot: Closing this issue. In response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. |
/kind bug
What happened?
What you expected to happen?
I expect EBS Volumes to be attached successfully to a node when a PVC is created.
How to reproduce it (as minimally and precisely as possible)?
This is occurring non-deterministically for us. It seems to happen more frequently with higher volumes of PVC requests.
We are running a few workloads with high Pod/PVC turnover, and we occasionally run into large spikes of this Volume Attachment error.
I was able to reproduce this issue on a fresh node with the following StatefulSet.
Pod Events
StorageClass
Anything else we need to know?:
I had thought this might be an AWS API Request Rate Limit issue, but the errors appear as ClientError metrics, not RequestLimitExceeded in Cloudwatch.
If I examine the Volume in the AWS Console, it shows as "in-use", but will eventually display a message claiming "Volume stuck in attaching since 1 hours and 38 minutes ago ago."
This could be an issue entirely on AWS's side of the house.
Additional information from the AWS support technician who was helping us troubleshoot this issue.
Environment
kubectl version
):The text was updated successfully, but these errors were encountered: