HDDS-5274. Revert "HDDS-5153. Decommissioning a dead node should complete immediately (#2190)" #2282

sodonnel · 2021-05-25T16:46:44Z

What changes were proposed in this pull request?

After some discussion with István Fajth and Siddharth Wagle we believe that the change in HDDS-5153 should be reverted.

If a DN starts decommissioning or maintenance, but goes dead before it completes the process, then the decommission monitor moves the node back to a state of IN_SERVICE and DEAD when it notices the node has become dead. This is because decommission should gracefully remove the node, but if the node goes dead first, we may not be able to replicate its containers. In this case decommission effectively fails.

In HDDS-5153, we decided that if a node is already dead and you decommission it, it should immediately move to DECOMMISSIONED. However, that is not consistent with the behaviour described above.

Also, there is no real value in decommissioning a dead node - it does not do anything except adjust its state in SCM.

To keep things consistent, I propose we revert HDDS-5153 so starting decommission on a dead node will work the same as when a node goes dead part way through decommission. In both cases the node will end up as IN_SERVICE + DEAD.
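
The intended behaviour after the revert can be sketched as a small state model. This is a minimal illustration only, not the actual SCM code: the class `DecommissionSketch`, the methods `startDecommission` and `monitorTick`, the `containersReplicated` flag, and the reduced set of states are all hypothetical names chosen for this example, and maintenance states are left out for brevity. It assumes the decommission monitor is what notices a dead node and aborts the workflow.

```java
// Hypothetical sketch of the post-revert behaviour; these are NOT the real SCM classes.
final class DecommissionSketch {
    // Reduced, illustrative state sets (maintenance states omitted).
    enum OpState { IN_SERVICE, DECOMMISSIONING, DECOMMISSIONED }
    enum Health { HEALTHY, DEAD }

    /** Admin request: begin decommissioning a node that is currently in service. */
    static OpState startDecommission(OpState op) {
        return op == OpState.IN_SERVICE ? OpState.DECOMMISSIONING : op;
    }

    /** One pass of an assumed decommission monitor over a single node. */
    static OpState monitorTick(OpState op, Health health, boolean containersReplicated) {
        if (op != OpState.DECOMMISSIONING) {
            return op; // no decommission in progress, nothing to do
        }
        if (health == Health.DEAD) {
            // The node can no longer be removed gracefully, so the workflow
            // is aborted and the node ends up as IN_SERVICE + DEAD.
            return OpState.IN_SERVICE;
        }
        return containersReplicated ? OpState.DECOMMISSIONED : OpState.DECOMMISSIONING;
    }

    public static void main(String[] args) {
        // Case 1: the node goes dead part way through decommission.
        OpState a = startDecommission(OpState.IN_SERVICE);
        a = monitorTick(a, Health.HEALTHY, false); // still DECOMMISSIONING
        a = monitorTick(a, Health.DEAD, false);    // aborted

        // Case 2: the node is already dead when decommission is requested.
        OpState b = startDecommission(OpState.IN_SERVICE);
        b = monitorTick(b, Health.DEAD, false);    // aborted the same way

        System.out.println(a + " " + b); // prints "IN_SERVICE IN_SERVICE"
    }
}
```

In this model both paths go through the same dead-node branch, which is the consistency this PR is aiming for. Under HDDS-5153 the second case would instead have ended in DECOMMISSIONED.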

What is the link to the Apache JIRA

https://issues.apache.org/jira/browse/HDDS-5274

How was this patch tested?

Existing tests

fapifta

LGTM +1

HDDS-5274. Revert "HDDS-5153. Decommissioning a dead node should comp…

dd0f61a

…lete immediately (apache#2190)" This reverts commit a920f25.

sodonnel requested a review from fapifta May 25, 2021 16:47

fapifta approved these changes Jun 3, 2021

sodonnel merged commit 8c1de61 into apache:master Jun 3, 2021
