support seamless rollback 1 minor revision #7308

mml · 2017-02-11T00:29:17Z

Motivation

This is a feature request to make rolling out upgrades safer, specifically for kubernetes cluster admins, but this could easily apply to others. It's often the case that a user may upgrade k8s from N-1 to N. Let's say that we'd like N to also include a change of the bundled etcd from M-1 to M.

k8s vN may include bugs that aren't acceptable. The easiest resolution is usually to roll back to N-1, and to avoid having to test N-1 against two versions of etcd (M-1 and M), we'd prefer that etcd be rolled back to M-1. Another reason we may wish to go back to M-1 is that the source of the bug is etcd itself.

Request

Ideally, the storage format changes between adjacent revisions are such that whatever M is writing to disk is by-design readable by M-1. It may contain new features, but they will be harmlessly ignored in the event of a rollback. One example of an encoding that works like this is proto2, which ignores unknown fields. (This isn't to advocate proto2, just to give an example.)

That said, even if etcd stabilizes on an encoding with these properties, it might want to change that encoding scheme in the future. In that case it makes sense to treat both upgrade and downgrade equally, authoring and testing tools that go in both directions, and testing round-tripping in both directions.

@xiang90 @hongchaodeng @heyitsanthony @wojtek-t

mml · 2017-02-11T00:35:12Z

@xiang90 asked some questions in another thread.

what kind of rollback (online or offline) do you expect?

If i understand the question correctly, online is best. It's ideal if the changes made to the data on disk don't render that data unreadable by the previous version.

what do you want to preserve after a rollback? discard or preserve data used by any new feature (so that upgrade again the data of the new feature will be available again )?

What's an example of a feature where this question would come up? I think the answer will be to think about the safety and semantics users will expect around rollback of the specific feature, but maybe we can come up with a principle.

what happens if only part of the cluster is rolled back?

What happens today if only a part of the cluster is upgraded? One idea is that nothing happens unless a quorum can be found all sharing the same version.

xiang90 · 2017-02-12T23:44:10Z

@mml

Ideally, the storage format changes between adjacent revisions are such that whatever M is writing to disk is by-design readable by M-1.

We already ensure this today. The on-disk entry format is protobuf. Reading an entry with previous pb will result in some unknown values, but will not panic anything.

but they will be harmlessly ignored in the event of a rollback.

What does this mean exactly? Say in version M+1, we introduce a new filed called "Dog". Once we rollback to version M, the "Dog" field will magically disappear. This is probably OK for non-clustered systems. But for etcd, this means you can get "Dog" field from a node running version M+1, but not a node rollbacked to M. What is more magic is that, if you upgrade the node again "Dog" will appear again. Client will see inconsistent state based on which node they talk to.

That said, even if etcd stabilizes on an encoding with these properties, it might want to change that encoding scheme in the future.

I am not super worry about this. Rewriting the WAL or even DB is not hard.

xiang90 · 2017-02-12T23:47:02Z

What happens today if only a part of the cluster is upgraded?

this is very different. the cluster cannot use ANY new feature until ALL members are upgraded.

Here is an example

[3.0, 3.0, 3.0] -> only write 3.0 entry and only 3.0 feature
[3.1, 3.1, 3.0] -> still one node on 3.1. so only write 3.0 entry and only 3.0 feature
[3.1, 3.1, 3.1] -> 3.1 is enabled!

If you only downgrade one member, it breaks the rule and client will see inconsistent state "unavoidably".

T1 [3.1, 3.1, 3.1] -> 3.1 is enabled!
T2 [3.1, 3.1, 3.0] -> 3.1 is disabled since we detect a 3.0 back online.

However, T2 becomes a magic timestamp... Clients will get confused... Why a filed is cleared?!

I would love to hear how similar systems work for downgrade path. Any open source clustered system or google internal examples would be super helpful.

mml · 2017-02-14T00:03:04Z

What are some concrete examples of user-visible fields that etcd has introduced or plans to introduce?

xiang90 · 2017-02-14T00:11:02Z

What are some concrete examples of user-visible fields that etcd has introduced or plans to introduce?

For example, we introduced PrevKV field to return the previous value of a key when modifying a key, which can save one round trip time. We introduce some fields in the range request for querying revision ranges. There are more fields we might introduce in the future. Also there might be new APIs, like native locking API.

xiang90 · 2017-02-27T19:55:47Z

@mml Any update on this issue?

xiang90 · 2017-03-22T20:00:05Z

@mml

I do not think this can happen in 3.2 timeframe. But I REALLY want to sort this out to make Kubernetes users happy. Please ping us when you guys have time.

I am looping some other people in since I feel they are interested in this as well.

/cc @justinsb @wojtek-t @timothysc

xiang90 · 2017-10-04T22:19:52Z

move this to 3.4

gyuho · 2018-09-25T20:56:00Z

Updates: @wenjiaswe is working on this. We are reviewing the design doc.

philips · 2018-10-23T21:28:35Z

Where is the doc?

wenjiaswe · 2018-10-23T21:40:08Z

@philips
Here is the etcd downgrade design doc.
Here is the tracking issue on etcd: Improve etcd upgrade/downgrade policy and tests.
And here is the link to the etcd-dev topic: etcd downgrade design document ready for review.

Any suggestion or comments are welcome and appreciated!

wenjiaswe · 2019-10-10T17:33:40Z

assign @YoyinZyc

stale · 2020-04-06T20:56:37Z

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.

zaisongz · 2020-09-02T03:09:37Z

@wenjiaswe i could not open the downgrade documents, did you have a new link?

wenjiaswe · 2020-09-02T05:50:06Z

@zaisongz I just checked the design doc linked here: https://docs.google.com/document/d/1mSihXRJz8ROhXf4r5WrBGc8aka-b8HKjo2VDllOc6ac/edit#heading=h.e4jdx621yd8s. It is shared with Anyone on the internet with this link. Could you check again and let me know if you can't open it?

zaisongz · 2020-09-02T08:17:54Z

@wenjiaswe just realized it is casued by the my proxy setting, everthing is good now. Thanks a lot for your quick response.

xiang90 self-assigned this Feb 12, 2017

heyitsanthony added this to the v3.2.0 milestone Feb 14, 2017

xiang90 mentioned this issue Mar 15, 2017

Rewrite the etcd doc to be about upgrading etcd. kubernetes/website#2767

Merged

xiang90 modified the milestones: v3.3.0, v3.2.0 Mar 22, 2017

jamiehannaford mentioned this issue May 24, 2017

Finding a solution for etcd kubernetes/kubeadm#277

Closed

xiang90 modified the milestones: v3.3.0, v3.4.0 Oct 4, 2017

xiang90 assigned jpbetz Oct 5, 2017

gyuho mentioned this issue Nov 3, 2017

release, documentation, tools: Expand patch management support to the previous two minor versions #8805

Merged

gyuho mentioned this issue Feb 8, 2018

Improve etcd upgrade/downgrade policy and tests #9306

Closed

gyuho added the type/feature label Feb 25, 2018

gyuho unassigned xiang90 Feb 25, 2018

gyuho assigned wenjiaswe Sep 25, 2018

jpbetz mentioned this issue Oct 9, 2018

Remove etcd2 images kubernetes/kubernetes#69577

Closed

gyuho mentioned this issue Oct 22, 2018

Add KEP for etcdadm kubernetes/community#2835

Merged

gyuho mentioned this issue Nov 14, 2018

etcdserver/*: add "etcd_cluster_version" metric #10257

Merged

hexfusion mentioned this issue Feb 15, 2019

Documentation: downgrade clarification #10461

Closed

gyuho removed this from the etcd-v3.4 milestone Aug 5, 2019

gyuho added this to the etcd-v3.5 milestone Aug 5, 2019

wenjiaswe assigned wenjiaswe and unassigned jpbetz and wenjiaswe Oct 22, 2019

YoyinZyc mentioned this issue Nov 14, 2019

Etcd Cluster Downgrade #11362

Closed

nolouch mentioned this issue Nov 27, 2019

Revert "*: upgrade etcd to v3.4.3 (#1907)" tikv/pd#1974

Merged

stale bot added the stale label Apr 6, 2020

stale bot closed this as completed Apr 28, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

support seamless rollback 1 minor revision #7308

support seamless rollback 1 minor revision #7308

mml commented Feb 11, 2017

mml commented Feb 11, 2017 •

edited

Loading

xiang90 commented Feb 12, 2017

xiang90 commented Feb 12, 2017 •

edited

Loading

mml commented Feb 14, 2017

xiang90 commented Feb 14, 2017

xiang90 commented Feb 27, 2017

xiang90 commented Mar 22, 2017

xiang90 commented Oct 4, 2017

gyuho commented Sep 25, 2018

philips commented Oct 23, 2018

wenjiaswe commented Oct 23, 2018

wenjiaswe commented Oct 10, 2019 •

edited

Loading

stale bot commented Apr 6, 2020

zaisongz commented Sep 2, 2020 •

edited

Loading

wenjiaswe commented Sep 2, 2020

zaisongz commented Sep 2, 2020

support seamless rollback 1 minor revision #7308

support seamless rollback 1 minor revision #7308

Comments

mml commented Feb 11, 2017

Motivation

Request

mml commented Feb 11, 2017 • edited Loading

xiang90 commented Feb 12, 2017

xiang90 commented Feb 12, 2017 • edited Loading

mml commented Feb 14, 2017

xiang90 commented Feb 14, 2017

xiang90 commented Feb 27, 2017

xiang90 commented Mar 22, 2017

xiang90 commented Oct 4, 2017

gyuho commented Sep 25, 2018

philips commented Oct 23, 2018

wenjiaswe commented Oct 23, 2018

wenjiaswe commented Oct 10, 2019 • edited Loading

stale bot commented Apr 6, 2020

zaisongz commented Sep 2, 2020 • edited Loading

wenjiaswe commented Sep 2, 2020

zaisongz commented Sep 2, 2020

mml commented Feb 11, 2017 •

edited

Loading

xiang90 commented Feb 12, 2017 •

edited

Loading

wenjiaswe commented Oct 10, 2019 •

edited

Loading

zaisongz commented Sep 2, 2020 •

edited

Loading