Create per-object sequence number and report last value seen in status of each object #7328

bgrant0607 · 2015-04-25T05:03:39Z

It's hard to write race-free clients without knowing whether controllers have observed mutations. Controllers should report the most recent resourceVersion seen in status when they post status. Just returning it in responses (#1184) is not sufficient.

An example problem:
https://github.com/GoogleCloudPlatform/kubernetes/pull/7321/files#r29097446

derekwaynecarr · 2015-04-25T12:39:54Z

Is reporting sufficient, or do you need to infer order between versions? I think that to do what you want, you need to be able to sort resource versions to know whether you have seen a more recent value, which we have been hesitant to do since they are supposed to be opaque.

bgrant0607 · 2015-04-25T16:47:36Z

Good point. It may be time to introduce a per-object sequence number.

pmorie · 2015-04-25T17:48:09Z

@bgrant0607 @derekwaynecarr If we add a per-object sequence number, will we use that or resourceVersion to get a prior value for an object?
bgrant0607 · 2015-04-28T01:44:58Z

@pmorie We need to tease apart the different uses:

- concurrency control (read-modify-write transactions)
- RAW and WAW consistency
- watch

bprashanth · 2015-06-18T00:33:41Z

> Is reporting sufficient, or do you need to infer order between versions?

For this to be useful I think we need order.

> I think that to do what you want, you need to be able to sort resource versions to know whether you have seen a more recent value, which we have been hesitant to do since they are supposed to be opaque.

Resource version is generated by etcd, so we can't rely on different members/shards of the cluster giving you an ever-increasing resource version (and non-etcd datastores won't have it). My understanding of sequence numbers is that, as long as the database is consistent, the number is part of the data, so it should always be whatever we wrote, across all members.

bgrant0607 · 2015-06-18T02:00:31Z

Also worth noting the proposed v3 etcd API, which distinguishes "index" from per-object "version": https://github.com/coreos/etcd/blob/master/Documentation/rfc/v3api.proto#L172

bgrant0607 · 2015-06-18T02:45:42Z

Responding to some questions from #9739 in this more general issue.

With respect to the name of the sequence field, etcd v3 uses "version", and internally we typically use "version" or "generation" (in various APIs -- Borg and Chubby use the latter). I'm somewhat partial to "generation", since it's what I'm used to and since "version" is already fairly overloaded.

Re. metadata vs. spec:

The sequence number implemented here and discussed in #7328 should not be incremented when updating the status. Furthermore, as discussed in #2726 and #8625, sometime soon we're going to need to move status to a separate key in etcd. metadata should remain with spec, since at least namespace/name, labels, and deletion-related fields affect the desired state and annotations typically reflect additional information about the desired state, and the sequence number should be incremented upon changes to those fields, as well.
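As an illustration of the rule just described, here is a minimal Go sketch that increments a per-object sequence number when spec or metadata changes and leaves it alone on status-only updates. The `Meta`, `Spec`, `Status`, and `Object` types and the `nextGeneration` helper are hypothetical simplifications invented for this example; they are not the Kubernetes API types or the code from the referenced PR.

```go
package main

import (
	"fmt"
	"reflect"
)

// Meta, Spec, Status and Object are hypothetical, simplified types used only
// for illustration; they are not the real Kubernetes API types.
type Meta struct {
	Name       string
	Labels     map[string]string
	Generation int64 `json:"generation,omitempty"`
}

type Spec struct {
	Replicas int
}

type Status struct {
	ReadyReplicas int
}

type Object struct {
	Meta   Meta
	Spec   Spec
	Status Status
}

// nextGeneration returns the generation to store for updated, given the
// previously stored object old. It increments only when spec or metadata
// (other than the generation itself) differs; status-only updates keep it.
func nextGeneration(old, updated Object) int64 {
	oldMeta, newMeta := old.Meta, updated.Meta
	// Ignore the generation field itself when comparing metadata.
	oldMeta.Generation, newMeta.Generation = 0, 0
	if !reflect.DeepEqual(old.Spec, updated.Spec) || !reflect.DeepEqual(oldMeta, newMeta) {
		return old.Meta.Generation + 1
	}
	return old.Meta.Generation
}

func main() {
	stored := Object{Meta: Meta{Name: "rc", Generation: 1}, Spec: Spec{Replicas: 2}}

	statusOnly := stored
	statusOnly.Status.ReadyReplicas = 2
	fmt.Println(nextGeneration(stored, statusOnly)) // status-only change: stays 1

	scaled := stored
	scaled.Spec.Replicas = 5
	fmt.Println(nextGeneration(stored, scaled)) // spec change: becomes 2
}
```

In this sketch the server, not the user, owns the counter, which is why the comparison ignores whatever generation value the client happened to send.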

In general, such a sequence number needs to exist for each sub-part of the object that we'd like to update independently. For instance, Chubby has separate generation numbers for the payload, ACLs, and lock: http://static.googleusercontent.com/media/research.google.com/en/us/archive/chubby-osdi06.pdf. I could imagine that controllers and the client cache might want a sequence number on status, in addition to the one covering spec and metadata, but that's much less critical and could be added later by prefixing the new field's name with a qualifier (e.g., "statusGeneration"), so I'm inclined to ignore that for now.
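To make the idea of one sequence number per independently updated sub-part concrete, the following Go sketch keeps two counters side by side. The `PartGenerations` type and `afterWrite` helper are hypothetical names for this example, and `StatusGeneration` is only the possible later addition mentioned above, not an existing field.

```go
package main

import "fmt"

// PartGenerations is a hypothetical holder for independent sequence numbers,
// one per sub-part of an object that can be updated on its own.
type PartGenerations struct {
	Generation       int64 // covers spec and metadata
	StatusGeneration int64 // hypothetical later addition covering status only
}

// afterWrite returns the counters after a write. Each counter advances only
// when its own sub-part changed, so the two never interfere.
func afterWrite(g PartGenerations, desiredChanged, statusChanged bool) PartGenerations {
	if desiredChanged {
		g.Generation++
	}
	if statusChanged {
		g.StatusGeneration++
	}
	return g
}

func main() {
	g := PartGenerations{Generation: 1, StatusGeneration: 1}

	// A controller posts status: only the status counter moves.
	g = afterWrite(g, false, true)
	fmt.Printf("%+v\n", g)

	// A user edits the spec: only the desired-state counter moves.
	g = afterWrite(g, true, false)
	fmt.Printf("%+v\n", g)
}
```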

Putting the field in metadata makes the most sense to me because:

- it is a generic pattern
- it's metadata about the desired state rather than part of the desired state
- the user won't ever specify it (which, btw, means that it must be optional, so it should be tagged with omitempty)

The name of the corresponding field in status should clarify that it's the most recent generation that has been observed by the responsible controller, such as observedGeneration or enactingGeneration.
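A client could use the pair of fields roughly as follows. This Go sketch assumes integer-valued generation and observedGeneration fields; the `Snapshot` type and `statusIsCurrent` helper are hypothetical names for illustration, not part of any client library.

```go
package main

import "fmt"

// Snapshot is a hypothetical, simplified view of an object as read by a client.
type Snapshot struct {
	Generation         int64 // sequence number of the desired state
	ObservedGeneration int64 // most recent generation seen by the controller
	ReadyReplicas      int
}

// statusIsCurrent reports whether the status in s was written by a controller
// that had already seen generation wantGeneration of the desired state.
// Because generations are ordered integers, ">=" is meaningful here, which
// it would not be for an opaque resourceVersion.
func statusIsCurrent(s Snapshot, wantGeneration int64) bool {
	return s.ObservedGeneration >= wantGeneration
}

func main() {
	// Suppose the client updated the spec and the server returned generation 4.
	const written = 4

	stale := Snapshot{Generation: 4, ObservedGeneration: 3, ReadyReplicas: 3}
	fresh := Snapshot{Generation: 4, ObservedGeneration: 4, ReadyReplicas: 0}

	fmt.Println(statusIsCurrent(stale, written)) // false: status predates the update
	fmt.Println(statusIsCurrent(fresh, written)) // true: controller has seen the update
}
```

A client that only trusts status once this check passes avoids acting on, say, a replica count that was reported before its own mutation was observed.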

Re. multiple entities updating status: There should be a single component that is primarily responsible for reifying the desired state. Only that component should update the observedGeneration. In cases, like the node controller, where another component fills in part of the status when the primary component is unresponsive, that backup component should leave the observedGeneration unchanged. If there were really independently updated sub-parts of status reflecting the degree to which the desired state had been acknowledged, those sub-parts should each have their own observedGeneration fields. This applies to components with internal concurrent processes, as well. An update to observedGeneration should imply that the responsible component should no longer be working towards previously specified desired states.
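The division of responsibility between a primary and a backup status writer can be sketched in Go as below. The `Status` type and the `primaryUpdate` and `backupUpdate` helpers are hypothetical names for this example, assuming a single integer observedGeneration per status.

```go
package main

import "fmt"

// Status is a hypothetical, simplified status block for illustration.
type Status struct {
	ObservedGeneration int64
	Ready              bool
	Message            string
}

// primaryUpdate is what the responsible controller would post: it records the
// generation it has acted on, never moving ObservedGeneration backwards.
func primaryUpdate(prev Status, seenGeneration int64, ready bool) Status {
	next := prev
	if seenGeneration > next.ObservedGeneration {
		next.ObservedGeneration = seenGeneration
	}
	next.Ready = ready
	next.Message = ""
	return next
}

// backupUpdate is what a backup component would post when the primary is
// unresponsive: it fills in part of the status but leaves ObservedGeneration
// unchanged, since it has not enacted the desired state.
func backupUpdate(prev Status, message string) Status {
	next := prev
	next.Ready = false
	next.Message = message
	return next
}

func main() {
	s := primaryUpdate(Status{}, 3, true)
	fmt.Printf("%+v\n", s)

	s = backupUpdate(s, "primary component unresponsive")
	fmt.Printf("%+v\n", s)
}
```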

ash2k · 2017-03-20T06:02:23Z

What about TPR? I have a controller which needs this mechanism so that external clients can tell if it has seen the spec update and the current status reflects the updated spec.

deads2k · 2017-06-02T12:29:06Z

> What about TPR? I have a controller which needs this mechanism so that external clients can tell if it has seen the spec update and the current status reflects the updated spec.

This is out for CRD in 1.7. I recommend a specific issue to add it to make it easier to schedule.

@Kargakis specific types should have specific issues. It's special for each one at the moment, so I think it's a type owner activity, not general machinery.

Removing milestone.

0xmichalis · 2017-06-02T12:56:12Z

> @Kargakis specific types should have specific issues. It's special for each one at the moment, so I think it's a type owner activity, not general machinery.

The reality is that it's the same for all controller-type objects (or at least all handled cases are doing it in the same way) but we are left with handling it on a case-by-case basis. One issue we should solve with ObservedGeneration that warrants case-by-case handling is #25170 (but still I suspect the solution to it will need to be applied holistically because every core controller will likely end up reusing it).

deads2k · 2017-06-02T13:04:51Z

> One issue we should solve with ObservedGeneration that warrants case-by-case handling is #25170 (but still I suspect the solution to it will need to be applied holistically because every core controller will likely end up reusing it).

It seems like each one ends up as a snowflake: "bump my generation when spec changes unless it's one of these fields". It ends up looking like a strategy, and that ends up back where we are today.

0xmichalis · 2017-06-02T13:07:49Z

For special spec fields, we need special Observed* status fields. ObservedGeneration is meant to cover the whole Spec AFAIK.

fejta-bot · 2017-12-26T02:03:38Z

Issues go stale after 90d of inactivity.
Mark the issue as fresh with /remove-lifecycle stale.
Stale issues rot after an additional 30d of inactivity and eventually close.

Prevent issues from auto-closing with an /lifecycle frozen comment.

If this issue is safe to close now please do so with /close.

Send feedback to sig-testing, kubernetes/test-infra and/or @fejta.
/lifecycle stale

bgrant0607 · 2018-01-22T21:28:06Z

/remove-lifecycle stale
/lifecycle frozen

I would still like generation/observedGeneration to be implemented for all relevant resources

nikhita · 2018-05-27T19:55:01Z

> I would still like generation/observedGeneration to be implemented for all relevant resources

Custom resources support observedGeneration, and validation for it was added in #64379.

Do we still want it for CRDs as well?

nikhita · 2018-05-27T19:55:19Z

/cc @sttts

bgrant0607 added area/api Indicates an issue on api area. priority/awaiting-more-evidence Lowest priority. Possibly useful, but not yet enough support to actually get it done. team/master labels Apr 25, 2015

bgrant0607 changed the title ~~Report last resourceVersion seen in status of each object~~ Report last resource version seen in status of each object Apr 25, 2015

bgrant0607 mentioned this issue May 13, 2015

Create the v1 API #7018

Closed

bgrant0607 added the sig/api-machinery Categorizes an issue or PR as relevant to SIG API Machinery. label May 14, 2015

bgrant0607 mentioned this issue May 22, 2015

API: frequent ResourceVersion changes (from status updates) cause false conflicts #8625

Closed

bgrant0607 mentioned this issue Jun 2, 2015

Kubectl delete should handle stopping an rc while it's starting replicas #9147

Closed

bgrant0607 changed the title ~~Report last resource version seen in status of each object~~ Create per-object sequence number and report last value seen in status of each object Jun 3, 2015

bgrant0607 mentioned this issue Jun 18, 2015

Fix kubectl stop rc with sequence numbers #9739

Merged

bgrant0607 mentioned this issue Jun 19, 2015

"v2" API (API/client redesign umbrella issue) #8190

Closed

davidopp removed the team/master label Aug 22, 2015

bgrant0607 added this to the v1.2-candidate milestone Sep 12, 2015

bgrant0607 mentioned this issue Oct 8, 2015

Write proposal for controller pod management: adoption, orphaning, ownership, etc. (aka controllers v2) #14961

Open

bgrant0607 modified the milestones: next-candidate, v1.2-candidate Jan 29, 2016

bgrant0607 removed this from the next-candidate milestone Jul 31, 2016

bgrant0607 added the sig/apps Categorizes an issue or PR as relevant to SIG Apps. label Jul 31, 2016

bgrant0607 mentioned this issue Mar 21, 2017

Workload API v1 requirements umbrella issue #42752

Closed

This was referenced Apr 13, 2017

[Federation] Federation should fully support clusters with a previous API version. #44160

Closed

Requirements for ThirdPartyResource to graduate to beta #22768

Closed

0xmichalis removed the team/api (deprecated - do not use) label Apr 25, 2017

ledzep2 mentioned this issue May 10, 2017

Initial commit kube-node/nodeset#1

Merged

0xmichalis mentioned this issue May 21, 2017

CustomResourceDefinitions kubernetes/enhancements#95

Closed

deads2k modified the milestones: next-candidate, v1.7 Jun 2, 2017

ash2k mentioned this issue Jul 13, 2017

Support Generation/ObservedGeneration atlassian/smith#102

Closed

k8s-ci-robot added the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Dec 26, 2017

k8s-ci-robot added lifecycle/frozen Indicates that an issue or PR should not be auto-closed due to staleness. and removed lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. labels Jan 22, 2018

bgrant0607 mentioned this issue Jan 25, 2018

Initial commit of Kubernetes Application proposal KEP kubernetes/community#1629

Closed

irfanurrehman mentioned this issue Feb 2, 2018

[Federation] Federation should fully support clusters with a previous API version. kubernetes-retired/federation#209

Closed

nikhita mentioned this issue May 27, 2018

apiextensions: validate status.observedGeneration for custom resources #64379

Closed

KnVerey mentioned this issue Aug 8, 2018

Compare generations when available Shopify/krane#325

Merged

This was referenced Sep 22, 2018

Increment metadata.generation on spec updates for all resources #68978

Closed

The metadata.generation of a Custom Resource is always incremented #69059

Merged

shomron mentioned this issue Jan 30, 2020

Report Resource Generation on byPod Status of Constraints/Templates open-policy-agent/gatekeeper#444

Closed

smarterclayton mentioned this issue Sep 29, 2022

RV vs object-wide logical clock #112684

Open
