HDDS-11618. Enable HA mode for OM and SCM #20

Merged
Tejaskriya merged 19 commits into apache:main from Tejaskriya:HDDS-11618 on Jan 21, 2026
@Tejaskriya (Contributor) commented Sep 16, 2025

What changes were proposed in this pull request?

Co-authored by: @pyttel

HA for OM and SCM is driven by the replica count. In helpers.tpl, if the replica count is greater than 1, the configs required to enable HA are set.
In addition, the ozone.om.bootstrap.nodes and ozone.om.decommissioned.nodes lists are maintained to keep track of which OMs are bootstrapped and decommissioned, respectively.
The required pods are exposed for each service.
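
To make the replica-count rule concrete, here is a minimal Python sketch of the idea. It is not the chart's code (the actual logic lives in Helm templates such as helpers.tpl). The pod naming scheme, the headless service DNS names, the `cluster1` service ID, the `ozone` release name, and both function names are assumptions made for illustration. The property keys follow the standard Ozone HA pattern (`ozone.om.service.ids`, `ozone.om.nodes.<serviceId>`, `ozone.om.address.<serviceId>.<nodeId>`, and the `scm` equivalents).

```python
def ha_config(component, replicas, service_id="cluster1", release="ozone"):
    """Return HA properties for 'om' or 'scm'; empty when HA is not needed."""
    if replicas <= 1:
        # A single replica runs in non-HA mode, so no HA properties are set.
        return {}
    # Assumed StatefulSet-style pod names: <release>-<component>-<ordinal>
    nodes = [f"{release}-{component}-{i}" for i in range(replicas)]
    props = {
        f"ozone.{component}.service.ids": service_id,
        f"ozone.{component}.nodes.{service_id}": ",".join(nodes),
    }
    for node in nodes:
        # Assumed headless-service DNS name for each pod.
        key = f"ozone.{component}.address.{service_id}.{node}"
        props[key] = f"{node}.{release}-{component}-headless"
    return props


def om_scale_diff(old_replicas, new_replicas, release="ozone"):
    """Split OM nodes into (to_bootstrap, decommissioned) after a scale change."""
    def name(i):
        return f"{release}-om-{i}"
    # Ordinals added by scaling up need to be bootstrapped.
    to_bootstrap = [name(i) for i in range(old_replicas, new_replicas)]
    # Ordinals removed by scaling down are treated as decommissioned.
    decommissioned = [name(i) for i in range(new_replicas, old_replicas)]
    return to_bootstrap, decommissioned


if __name__ == "__main__":
    for key, value in ha_config("om", 3).items():
        print(f"{key}={value}")
    print(om_scale_diff(3, 5))
```

Whether the chart derives the bootstrap and decommissioned lists from ordinals in exactly this way is an assumption; the sketch only shows the kind of bookkeeping the description refers to.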

Ref: the original stale PR, #10

What is the link to the Apache JIRA?

https://issues.apache.org/jira/browse/HDDS-11618

How was this patch tested?

Green CI run; also tested manually with leader transfer and basic Ozone commands.

@Tejaskriya Tejaskriya marked this pull request as ready for review September 22, 2025 09:20
@ptlrs ptlrs left a comment

Thank you for the PR @Tejaskriya and @pyttel.
The implementation looks good to me.
I mainly have some questions around the upgrade flow.
Some of the other suggestions can be done in followup PRs.

@Tejaskriya Tejaskriya requested a review from ptlrs October 28, 2025 19:20
@ptlrs ptlrs left a comment

Thanks @Tejaskriya for the updates to the PR.

I have posted my concerns as discussed about the two jobs. We can address those in future PRs.

Rest of the changes LGTM.

@Tejaskriya
Contributor Author

@ptlrs thanks for the feedback. The suggestions make sense, and as you have mentioned, I'd like to take care of these in followup tasks. Thanks for your approval on the PR!

@kerneltime would you like to take a look?

@rakeshadr rakeshadr left a comment

@Tejaskriya thanks for the effort in testing and maintaining this PR. I have added a few comments; please go through them.

@Tejaskriya Tejaskriya requested a review from rakeshadr January 9, 2026 13:02
@rakeshadr

@Tejaskriya Thanks for putting up the changes. I can see a few open comments, but I don't think they are blockers. Please create a follow-up task under HDDS-14382 to address them, then proceed.

+1 LGTM

@Tejaskriya Tejaskriya merged commit bd435a3 into apache:main Jan 21, 2026
1 check passed
@Tejaskriya
Contributor Author

Thanks for the reviews @rakeshadr @ptlrs and the co-contribution @pyttel

4 participants