-
Notifications
You must be signed in to change notification settings - Fork 4.8k
Bug 1823406: Upstream: 89160: Remove potentially unhealthy symlink only for dead containers #24926
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
…ontainers Signed-off-by: Ted Yu <zyu@redhat.com>
|
/cc @rphillips @sjenning |
|
/lgtm /assign @sjenning |
|
Upstream ref |
|
/approve |
|
[APPROVALNOTIFIER] This PR is APPROVED This pull-request has been approved by: rphillips, sjenning, tedyu The full list of commands accepted by this bot can be found here. The pull request process is described here DetailsNeeds approval from an approver in each of these files:
Approvers can indicate their approval by writing |
|
/retitle Bug 1823406: Remove potentially unhealthy symlink only for dead containers |
|
/retitle Bug 1823406: Upstream: 89160: Remove potentially unhealthy symlink only for dead containers |
|
@tedyu: This pull request references Bugzilla bug 1823406, which is valid. The bug has been moved to the POST state. The bug has been updated to refer to the pull request using the external bug tracker. 3 validation(s) were run on this bug
DetailsIn response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. |
|
/retest |
|
/retest Please review the full test history for this PR and help us cut down flakes. |
3 similar comments
|
/retest Please review the full test history for this PR and help us cut down flakes. |
|
/retest Please review the full test history for this PR and help us cut down flakes. |
|
/retest Please review the full test history for this PR and help us cut down flakes. |
|
@tedyu: All pull requests linked via external trackers have merged: openshift/origin#24926. Bugzilla bug 1823406 has been moved to the MODIFIED state. DetailsIn response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. |
As the discussion over #52172 showed, there is race condition between the container log rotation and the kubelet GC which may result in the loss of symlink.
Here is how container log rotation works (see containerLogManager#rotateLatestLog):
There is small but indeterministic amount of time during which log file doesn't exist (between steps #1 and #2, between #1 and #3). Hence the symlink may be deemed unhealthy during that period.
This PR resorts to runtimeService.ContainerStatus() to check whether the container corresponding to the potentially unhealthy symlink is alive or not. The symlink would only be removed for dead containers.