Umbrella issue for HA and HA upgrades #1733

chrislovecnm · 2016-11-19T01:11:02Z

Making one home to review:
#1531 and #1530

cc: @roberthbailey @gmarek @thockin @kris-nova @justinsb @brandoncole @hongchaodeng

Great meeting you guys face to face in Seattle. The conversations about HA were really enjoyable, and I realized that I am missing some key points of exactly how a master runs in HA.

I would like to update the k8s website documentation for HA. After that, I will be writing base documentation for HA Upgrades. Do we have documentation on how each component in the Master handle leader election, and how each component is HA? I am guessing not in detail, so are the components that I am documenting, I have open questions, and really could use a second set of eyes.

HA

These components are well documented:

Etcd - https://coreos.com/etcd/docs/latest/clustering.html ~ Well documented and recommending DNS lookup because everyone pretty much has DNS
- There can be workers as well - https://coreos.com/os/docs/latest/cluster-architectures.html ~ But that is kinda overkill for k8s.
API Server – HTTP load balanced across multiple machines – stateless
Kubelet – node independent not really a consideration to HA

Here are the open questions:

Pod Master – does this still exist?? ~ http://kubernetes.io/docs/admin/high-availability/?? I am thinking no …
Scheduler – leader elections? What pattern for this is used?
Contoller Manager – again leader elections?
Federation?

Also found https://github.com/kubernetes/kubernetes/blob/master/docs/design/control-plane-resilience.md

Controller and Scheduler - Multiple self-hosted, self-healing warm standby stateless controller managers and schedulers with leader election and automatic failover of API server clients.

Upgrades

Nodes are pretty simple … kubectl drain, create new node, wash, rinse repeat. Do we upgrade the masters and then the nodes? That is the pattern that GKE follows.

In regards to each component ‘where be dragons’ with the masters in regards to upgrades? I am thinking that kubectl drain is going to handle all of the components expect for etcd. What about federation?

Etdc upgrades are documented here: https://coreos.com/etcd/docs/2.3.7/upgrade_2_3.html

Any ideas with the fact that a 2.3->3.0 etcd cannot be rolled back? Any patterns that we could use? Backup, but lions, tigers, and bears oh my. I am wondering if bumping up the masters from 3 -> 6 and only upgrade three of them. Verify that the three new nodes are running correctly, and then remove the old nodes. From the etcd docs:

“If all members have been upgraded to v3.0, the cluster will be upgraded to v3.0, and downgrade from this completed state is not possible. If any single member is still v2.3, however, the cluster and its operations remains “v2.3”, and it is possible from this mixed cluster state to return to using a v2.3 etcd binary on all members”. Does this same pattern work with the controller manager, api, and scheduler?

Or we could just upgrade two masters, let it sit, and then upgrade the third? I am wondering if this is another reason to move etcd to its own server. We do immutable servers, so upgrading a single component does not work.

Thanks in advance!

The text was updated successfully, but these errors were encountered:

chrislovecnm · 2016-11-19T02:03:44Z

Actually I should just have added everyone to the party @kubernetes/sig-cluster-lifecycle

hongchaodeng · 2016-11-19T04:37:29Z

Any ideas with the fact that a 2.3->3.0 etcd cannot be rolled back?

This is generic across many database systems. New version system could be writing data format that the old version couldn't understand as new features getting added.

See etcd upgrade slides

patterns that we could use?

A common pattern is to periodically backup the data, not only limited to upgrade actually. In case of any disaster failure, you can restore etcd cluster from previous backup.

Another common pattern is that database systems usually get upgraded pretty often in open source version, while most production env still keep using the old version. As a result, it lets broader community test the new version and get bugs fixed before it finally rolls out in production env.

brandoncole · 2016-11-19T14:21:24Z

@chrislovecnm Awesome summary, and thanks for taking the lead on getting the issues raised, documented and vetted. Definitely want to try the upgrade process out on some clusters.

I just noticed @hongchaodeng's slides and I thought something was interesting in there I want to learn about regarding mixed clusters on Slide 3:

GOOD: 3.1.1, 3.2.3, 3.2.3
BAD: 3.0.2, 3.2.3, 3.2.3

There is probably some etcd documentation that talks about how mixed clusters that jump more than a minor version are bad but this is good to know.

gmarek · 2016-11-20T11:23:25Z

cc @fgrzadkowski @jszczepkowski

chrislovecnm · 2016-11-20T17:41:37Z

@kubernetes/docs can someone assign this to me?

chrislovecnm · 2016-11-20T17:42:01Z

@gmarek you cannot cc people and not comment.... SGTM

chrislovecnm · 2016-11-20T18:14:56Z

Thanks @dims

chrislovecnm · 2016-11-29T21:14:12Z

@roberthbailey @gmarek @thockin ~ still have a bunch of open questions about HA specifics within k8s. Who can assist?

jszczepkowski · 2016-11-30T10:46:40Z

PR with user doc for HA master can be find here: #1810
I hope it answers some questions

chrislovecnm · 2016-11-30T17:18:47Z

Thanks!

I am mostly looking for details on how specific k8s components run ha, as I mentioned above.

chrislovecnm · 2016-12-07T04:33:49Z

Bump - I know everyone is swamped with the release

chrislovecnm · 2016-12-15T00:47:14Z

@smarterclayton / @bprashanth ~ @jbeda mentioned that you may be able to assist with some of these questions. Specifically basic HA with the open questions that I have. I know that a couple of folks in the HA team are on leave, and I wanted to reach out to you.

Thanks

Chris

fgrzadkowski · 2016-12-15T19:11:03Z

@chrislovecnm Can you please say which information in Implementation notes in #1810 is missing?

fgrzadkowski · 2016-12-15T19:14:19Z

Also, there's a design doc which covers this with details: https://github.com/kubernetes/community/blob/master/contributors/design-proposals/ha_master.md

chrislovecnm · 2016-12-15T23:00:33Z

@fabianofranz

So those docs are awesome. That is how you deploy HA in GCE. Not really how HA works. Meaning https://raw.githubusercontent.com/kubernetes/kubernetes.github.io/3b572b567f655b771e4ff8a03a4f8baf9e094511/docs/admin/ha-master-gce.png <- how those components actually HA?

Here are my questions from above:

Here are the open questions:

Pod Master – does this still exist?? ~ http://kubernetes.io/docs/admin/high-availability/?? I am thinking no …
Scheduler – leader elections? What pattern for this is used?
Contoller Manager – again leader elections?
Federation?

^ how do all of those components implement HA?? What happens in a failure? What is recovery? Based on that ... how do I upgrade? Do I drain the master?

Think of it from a users perspective. Do I need two Controller or do I need three? Well I need 3 etcd for sure...

Thanks

Chris

fgrzadkowski · 2016-12-16T12:16:52Z

@chrislovecnm Answers to all of those questions ARE in the docs I've sent you. Only the implementation is GCE specific, but the design doc doesn't talk about GCE (except for few places where say which option we will choose for GCE, e.g. loadbalancing).

Pod master - there's not component now called pod master; none of the docs talk about it
scheduler - leader election (see here
controller manager - leader election (see here
federation control plane does not support HA yet
upgrade is described here

"Think of it from a users perspective":

from user perspective nothing changes, they just have endpoint to talk to
from cluster admin perspective obviously we need more than 1 replica of each component; etcd is special and it's advised to have 3 or 5 replicas.

Does that answer your questions?

I still don't understand what else is missing? Should the doc be restructured somehow to make it easier to find the answers?

chrislovecnm · 2016-12-16T16:37:03Z

@fgrzadkowski let me review again. I am reading this from a different perspective. I am thinking about an upgrade as well. Let me re-read again.

chrislovecnm · 2016-12-18T01:09:43Z

@fgrzadkowski thanks btw... I appreciate you patience and help with this.

chrislovecnm · 2016-12-23T02:07:18Z

A note to myself. Add a section about how the active component is found by looking at the endpoints via kubectl on kube-system.

jaredbhatti · 2017-01-12T20:31:56Z

Chris, is this an issue you're tracking on the site? What are the next steps for you?

The SIG-Docs team is going through all of the docs issues and I want to make sure this is actually being worked on, otherwise I'm going to close it.

Thanks!

jszczepkowski · 2017-03-23T08:05:24Z

I've rewrote a doc about setting-up HA Clusters in general: #2941

roberthbailey · 2017-05-23T21:42:34Z

A couple of other docs that have been recently discussed in sig cluster lifecycle that are relevant here:

fejta-bot · 2017-12-25T02:40:38Z

Issues go stale after 90d of inactivity.
Mark the issue as fresh with /remove-lifecycle stale.
Stale issues rot after an additional 30d of inactivity and eventually close.

Prevent issues from auto-closing with an /lifecycle frozen comment.

If this issue is safe to close now please do so with /close.

Send feedback to sig-testing, kubernetes/test-infra and/or @fejta.
/lifecycle stale

luxas · 2017-12-25T17:33:15Z

ping @jszczepkowski @fgrzadkowski @chrislovecnm @kubernetes/sig-cluster-lifecycle-pr-reviews for a status on this
cc @errordeveloper

errordeveloper · 2018-01-15T11:36:31Z

/lifecycle frozen
/remove-lifecycle stale

neolit123 · 2018-07-18T20:33:22Z

the HA and HA-ETC documentation received an overhaul in 1.11:
https://kubernetes.io/docs/setup/independent/high-availability/
https://kubernetes.io/docs/tasks/administer-cluster/setup-ha-etcd-with-kubeadm/

all the new edits were the result of combined SIG efforts and based on existing proposals for improvements.

closing until further notice.
/close

dims assigned chrislovecnm Nov 20, 2016

chrislovecnm mentioned this issue Dec 15, 2016

We should have a doc for HA kubernetes/kops#1163

Closed

chenopis added the P2 label Jan 12, 2017

chenopis assigned jaredbhatti Jan 12, 2017

chenopis removed the P2 label Jan 12, 2017

This was referenced Jan 12, 2017

HA Upgrades /admin/high-availability/upgrades.md #1531

Closed

HA edit /admin/high-availability/ #1530

Closed

jaredbhatti added the P2 label Jan 12, 2017

k8s-ci-robot added the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Dec 25, 2017

k8s-ci-robot added the sig/cluster-lifecycle Categorizes an issue or PR as relevant to SIG Cluster Lifecycle. label Dec 25, 2017

k8s-ci-robot added the lifecycle/frozen Indicates that an issue or PR should not be auto-closed due to staleness. label Jan 15, 2018

k8s-ci-robot removed the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Jan 22, 2018

justaugustus mentioned this issue Feb 20, 2018

Improve documentation around production Kubernetes considerations #7463

Closed

3 tasks

k8s-ci-robot assigned neolit123 Jul 18, 2018

k8s-ci-robot closed this as completed Jul 18, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Umbrella issue for HA and HA upgrades #1733

Umbrella issue for HA and HA upgrades #1733

chrislovecnm commented Nov 19, 2016

chrislovecnm commented Nov 19, 2016

hongchaodeng commented Nov 19, 2016

brandoncole commented Nov 19, 2016

gmarek commented Nov 20, 2016

chrislovecnm commented Nov 20, 2016

chrislovecnm commented Nov 20, 2016

chrislovecnm commented Nov 20, 2016

chrislovecnm commented Nov 29, 2016

jszczepkowski commented Nov 30, 2016

chrislovecnm commented Nov 30, 2016

chrislovecnm commented Dec 7, 2016

chrislovecnm commented Dec 15, 2016

fgrzadkowski commented Dec 15, 2016

fgrzadkowski commented Dec 15, 2016

chrislovecnm commented Dec 15, 2016 •

edited

Loading

fgrzadkowski commented Dec 16, 2016

chrislovecnm commented Dec 16, 2016

chrislovecnm commented Dec 18, 2016

chrislovecnm commented Dec 23, 2016

jaredbhatti commented Jan 12, 2017

jszczepkowski commented Mar 23, 2017

roberthbailey commented May 23, 2017

fejta-bot commented Dec 25, 2017

luxas commented Dec 25, 2017

errordeveloper commented Jan 15, 2018 •

edited

Loading

neolit123 commented Jul 18, 2018

Umbrella issue for HA and HA upgrades #1733

Umbrella issue for HA and HA upgrades #1733

Comments

chrislovecnm commented Nov 19, 2016

chrislovecnm commented Nov 19, 2016

hongchaodeng commented Nov 19, 2016

brandoncole commented Nov 19, 2016

gmarek commented Nov 20, 2016

chrislovecnm commented Nov 20, 2016

chrislovecnm commented Nov 20, 2016

chrislovecnm commented Nov 20, 2016

chrislovecnm commented Nov 29, 2016

jszczepkowski commented Nov 30, 2016

chrislovecnm commented Nov 30, 2016

chrislovecnm commented Dec 7, 2016

chrislovecnm commented Dec 15, 2016

fgrzadkowski commented Dec 15, 2016

fgrzadkowski commented Dec 15, 2016

chrislovecnm commented Dec 15, 2016 • edited Loading

fgrzadkowski commented Dec 16, 2016

chrislovecnm commented Dec 16, 2016

chrislovecnm commented Dec 18, 2016

chrislovecnm commented Dec 23, 2016

jaredbhatti commented Jan 12, 2017

jszczepkowski commented Mar 23, 2017

roberthbailey commented May 23, 2017

fejta-bot commented Dec 25, 2017

luxas commented Dec 25, 2017

errordeveloper commented Jan 15, 2018 • edited Loading

neolit123 commented Jul 18, 2018

chrislovecnm commented Dec 15, 2016 •

edited

Loading

errordeveloper commented Jan 15, 2018 •

edited

Loading