
createpod doesn't delete dead pods, spawned by the old createpod instance #165

Closed
d-uzlov opened this issue May 27, 2021 · 2 comments · Fixed by #356
Labels
bug Something isn't working

Comments

@d-uzlov
Contributor

d-uzlov commented May 27, 2021

Expected Behavior

createpod monitors the state of all its pods. If a pod dies, the createpod element should delete it to prevent pod list pollution.

Current Behavior

createpod only monitors the pods that it has created during the current session. If a server with createpod dies and respawns, then all of the pods that were spawned by the old server remain in the pod list forever, until they are removed manually.

Steps to Reproduce

  1. Create a deployment with createpod element.
  2. Use the server with createpod to spawn a new pod.
  3. Restart the deployment.
  4. Wait until the spawned server dies.
    The pod it was running in will not be removed from the list by the server with the createpod element.

Solution

  1. Add a label to each of the pods that createpod spawns.
  2. Watch all pods with this label, regardless of whether they were spawned by this server or not, as shown in the sketch below.
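
For illustration, here is a minimal sketch of this approach using client-go. The label key, label value, and function names are hypothetical assumptions for this sketch, not the actual sdk-k8s implementation:

```go
// Package createpodsketch is a hypothetical illustration, not the sdk-k8s code.
package createpodsketch

import (
	"context"
	"log"

	corev1 "k8s.io/api/core/v1"
	metav1 "k8s.io/apimachinery/pkg/apis/meta/v1"
	"k8s.io/client-go/kubernetes"
)

// createdByLabel is a hypothetical label key; the real fix may use a different one.
const createdByLabel = "networkservicemesh.io/created-by"

// CreatePod spawns a pod tagged with the label so that any createpod instance,
// including a respawned one, can find it later.
func CreatePod(ctx context.Context, client kubernetes.Interface, namespace string, pod *corev1.Pod) (*corev1.Pod, error) {
	if pod.Labels == nil {
		pod.Labels = map[string]string{}
	}
	pod.Labels[createdByLabel] = "createpod"
	return client.CoreV1().Pods(namespace).Create(ctx, pod, metav1.CreateOptions{})
}

// CleanupDeadPods watches every pod carrying the label, regardless of which
// server created it, and deletes pods that have terminated.
func CleanupDeadPods(ctx context.Context, client kubernetes.Interface, namespace string) error {
	watcher, err := client.CoreV1().Pods(namespace).Watch(ctx, metav1.ListOptions{
		LabelSelector: createdByLabel + "=createpod",
	})
	if err != nil {
		return err
	}
	defer watcher.Stop()
	for event := range watcher.ResultChan() {
		pod, ok := event.Object.(*corev1.Pod)
		if !ok {
			continue
		}
		// A pod in the Succeeded or Failed phase is dead and will never run again.
		if pod.Status.Phase == corev1.PodSucceeded || pod.Status.Phase == corev1.PodFailed {
			log.Printf("deleting dead pod %s", pod.Name)
			if err := client.CoreV1().Pods(namespace).Delete(ctx, pod.Name, metav1.DeleteOptions{}); err != nil {
				log.Printf("failed to delete pod %s: %v", pod.Name, err)
			}
		}
	}
	return nil
}
```

With such a label, a respawned server watches the same set of pods as its predecessor, so pods created before the restart are cleaned up too.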
@d-uzlov d-uzlov changed the title Make createpod watch for pod events by label createpod doesn't delete dead pods, spawned by the old createpod instance May 27, 2021
@d-uzlov
Contributor Author

d-uzlov commented May 27, 2021

Other issues caused by createpod having transient state:

Issue 1: If a server with createpod creates a pod, then dies and respawns, and the new server gets a request, a new pod is created even though the old pod already exists.

Issue 2: If we accidentally or deliberately run several servers with createpod that create the same pod, then each of these servers will create one pod on a given node, so one node can end up with as many pods as there are servers spawning them.

Both of these issues can actually be solved in exactly the same way as the issue with dead pods remaining in the list forever.
We could restore the list of created pods on startup using the label, and then keep the list up to date via pod events, in order to synchronize with other servers that may exist and to minimize the chance of having several pods on one node (see the sketch below).
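
A minimal sketch of this recovery step, under the same assumptions (and the same hypothetical label key and imports) as the sketch above:

```go
// RestorePodList rebuilds the in-memory state on startup by listing all pods
// that carry the label, including pods created by a previous server instance
// or by other createpod servers.
func RestorePodList(ctx context.Context, client kubernetes.Interface, namespace string) (map[string]string, error) {
	list, err := client.CoreV1().Pods(namespace).List(ctx, metav1.ListOptions{
		LabelSelector: createdByLabel + "=createpod",
	})
	if err != nil {
		return nil, err
	}
	// Map node name -> pod name, so a respawned server knows which nodes
	// already host a pod and doesn't create a duplicate there.
	podsByNode := make(map[string]string, len(list.Items))
	for i := range list.Items {
		pod := &list.Items[i]
		podsByNode[pod.Spec.NodeName] = pod.Name
	}
	return podsByNode, nil
}
```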

@denis-tingaikin do you approve fixing this, or is the current behaviour desired?

@denis-tingaikin denis-tingaikin added the bug Something isn't working label May 27, 2021
@denis-tingaikin
Member

It is a good catch. Within this issue we would need to consider a scenario with a few suppliers on the cluster, so it is currently out of scope, and the issue doesn't break the core scenario (scale from zero).
Thus the issue will be considered after the release.

denis-tingaikin added a commit to denis-tingaikin/sdk-k8s that referenced this issue May 12, 2022
nsmbot pushed a commit that referenced this issue Aug 28, 2023