
Support specifying kube-proxy tolerations #699

Closed
discordianfish opened this issue Feb 12, 2018 · 14 comments · Fixed by kubernetes/kubernetes#62390 or kubernetes/kubernetes#65931
Labels: kind/bug · priority/important-soon · sig/cluster-lifecycle
Milestone: v1.11

Comments

@discordianfish

FEATURE REQUEST

kubeadm deploys kube-proxy with these tolerations:

      tolerations:
      - effect: NoSchedule
        key: node-role.kubernetes.io/master
      - effect: NoSchedule
        key: node.cloudprovider.kubernetes.io/uninitialized
        value: "true"

This allows kube-proxy to run on masters but not on any other tainted hosts. In my case, I want to taint a group of hosts to reserve them for special workloads, but they still need to run kube-proxy. I could just edit the DaemonSet directly, but I'd prefer a declarative way (e.g. via the kubeadm config), and I'm worried that kubeadm might revert the change on upgrades or when replacing masters.
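For illustration, the kind of taint described here might be applied like this; the node name and the `dedicated=special` key/value are made up for the example. None of kube-proxy's default tolerations above match such a taint, so new kube-proxy pods will not be scheduled onto those nodes (an already running pod is not evicted, since the effect is NoSchedule rather than NoExecute):

```sh
# Hypothetical example: reserve a node for special workloads.
# kube-proxy's default tolerations do not match this taint,
# so kube-proxy will no longer be scheduled onto this node.
kubectl taint nodes worker-1 dedicated=special:NoSchedule
```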

Alternatively the toleration could also just be made to apply to all NoSchedule taints which would solve my problem without introducing a new flag/config.
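As a rough sketch, such a blanket toleration would be a single entry that matches any NoSchedule taint regardless of key; this is an illustration, not the exact manifest kubeadm renders:

```yaml
      tolerations:
      # An Exists toleration with no key matches every taint
      # that has the NoSchedule effect, whatever its key/value.
      - effect: NoSchedule
        operator: Exists
```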

@discordianfish
Author

I can confirm: kubeadm init seems to revert the tolerations, so this is required for prod operations.
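One way to check this after `kubeadm init` or an upgrade is to dump the DaemonSet's tolerations and compare them against what was added by hand, e.g.:

```sh
# Inspect the tolerations currently set on the kube-proxy DaemonSet.
kubectl -n kube-system get daemonset kube-proxy \
  -o jsonpath='{.spec.template.spec.tolerations}'
```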

@discordianfish
Author

Related #609
A simple stop-gap would be to not touch the kube-proxy DaemonSet, if possible.

@luxas These two issues are currently the biggest problems we have using kubeadm in a prod/HA cluster. I can help fix them if you can tell me how you'd like to approach this. I can also bring this up somewhere more suitable for discussion; I just don't know where.

@timothysc
Member

/assign @chuckha @detiber

@timothysc added the kind/feature and priority/important-soon labels on Apr 6, 2018
@timothysc added this to the v1.11 milestone on Apr 6, 2018
@discordianfish
Author

Any ideas yet on how to approach this? This has hit us multiple times now, and I'd be happy to help fix it.

For something as fundamental as kube-proxy, maybe just having a generic NoSchedule toleration would be fine?

@timothysc
Member

/assign @fabriziopandini

@timothysc
Member

@discordianfish - we're going to address this in the 1.11 cycle, but the backlog is pretty Y'uge atm.

@discordianfish
Author

@timothysc That's why I'm offering my help :)
So if there is an actionable plan or route to get there, I can look into it.

@discordianfish
Author

So I think there are two options:

  1. Tolerate all taints; that's trivial to change.

  2. Tolerate only specific taints, but make it possible to use taints for special nodes (e.g. GPU workers like in my case).

Option 2 would be very involved, since it would require a mechanism to update the kube-proxy manifest at runtime. The best way I can imagine to solve that would be changing taints so they can name pods to which they do not apply; that way I could taint my workers for all pods except kube-proxy, or maybe everything in kube-system.

So I'd just go with option 1, at least for now. I think it's very rare that somebody wants to prevent kube-proxy from running on their nodes; it's arguably much more likely that somebody tainting their nodes wants kube-proxy on them than not. Since it's so trivial, I'll just submit a PR for this where the approach can be discussed.
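For reference, option 1 comes down to a tolerate-everything entry in the kube-proxy DaemonSet template, roughly like the sketch below; the final wording in the PR may differ:

```yaml
      tolerations:
      # An Exists toleration with no key and no effect tolerates every taint.
      - operator: Exists
```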

discordianfish added a commit to discordianfish/kubernetes that referenced this issue Apr 13, 2018
As an essential core component, kube-proxy should generally run on all
nodes even if the cluster operator taints nodes for special purposes.

This fixes kubernetes/kubeadm#699
@timothysc added the lifecycle/active label on Apr 24, 2018
@mxey

mxey commented Jun 28, 2018

This change was reverted/broken by the next commit to the manifest, kubernetes/kubernetes@d194926#diff-e3ad35b550d4fcbf99d00903a91c787e
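The thread earlier mentions editing the DaemonSet directly as a workaround; until the fix is restored, one way to do that by hand is a JSON patch like the one below, with the caveat that kubeadm may overwrite it again on the next init/upgrade. The exact command is an illustration, not something taken from this thread:

```sh
# Manual stop-gap: append a tolerate-everything entry to kube-proxy's tolerations.
kubectl -n kube-system patch daemonset kube-proxy --type='json' \
  -p='[{"op": "add", "path": "/spec/template/spec/tolerations/-", "value": {"operator": "Exists"}}]'
```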

@neolit123
Member

^ @dixudx WDYT?

@luxas
Member

luxas commented Jun 28, 2018

@mxey Yeah 😕
Let's fix this in v1.11.1

@luxas reopened this on Jun 28, 2018
@luxas added the priority/critical-urgent and kind/bug labels and removed the lifecycle/active and priority/important-soon labels on Jun 28, 2018
@luxas added the cherrypick-candidate label and removed the kind/feature label on Jun 28, 2018
@timothysc
Member

@kubernetes/sig-cluster-lifecycle-bugs - who is actively working on re-fixing this one?

@k8s-ci-robot added the sig/cluster-lifecycle label on Jul 3, 2018
@neolit123
Member

I pinged @dixudx earlier because he made the last change in this space.
@dixudx, do you think you have time to fix this?

@timothysc self-assigned this on Jul 3, 2018
@luxas
Member

luxas commented Jul 6, 2018

/assign @neolit123
for fixing this on master & cherrypicking

@timothysc added the priority/important-soon label and removed the priority/critical-urgent and cherrypick-candidate labels on Jul 9, 2018
cdkbot-zz pushed a commit to juju-solutions/kubernetes that referenced this issue Jul 10, 2018
Automatic merge from submit-queue (batch tested with PRs 65931, 65705, 66033). If you want to cherry-pick this change to another branch, please follow the instructions here: https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md

kubeadm: run kube-proxy on non-master tainted nodes

**What this PR does / why we need it**:
kube-proxy should be able to run on all nodes, independent of the taints on such nodes.

This restriction was previously removed in bb28449 but
then was brought back in d194926.

/cc @kubernetes/sig-cluster-lifecycle-pr-reviews 
/cc @luxas @detiber @dixudx @discordianfish @mxey 
/kind bug
/area kube-proxy
/area kubeadm

**Which issue(s) this PR fixes** *(optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close the issue(s) when PR gets merged)*:
Fixes kubernetes/kubeadm#699

**Special notes for your reviewer**:
We are removing the restriction again, but please have a look at all the implications here.
Hopefully we won't have to bring it back.

**Release note**:

```release-note
kubeadm: run kube-proxy on non-master tainted nodes
```