Why kube-proxy add external-lb's address to node local iptables rule? #66607

BSWANG · 2018-07-25T11:41:53Z

/kind friction

What happened:
I have a LoadBalancer type service A of address 1.1.1.1. The external loadbalancer of service A is a TLS decoder, it will convert https requests to http hostport and endpoint. But since the kube-proxy add the external-lb's address to local iptables rule. Requests of https//1.1.1.1 will hijack to local http endpoints. Then https request failed.

What you expected to happen:
Kube-proxy don't add external-lb's address to local iptables. And the request will go through external-lb.

Environment:

Kubernetes version (use kubectl version):
1.10.4
Cloud provider or hardware configuration:
Alibaba Cloud
OS (e.g. from /etc/os-release):
Centos 7.4
Kernel (e.g. uname -a):
3.10.0-693
Install tools:
kubeadm

The text was updated successfully, but these errors were encountered:

BSWANG · 2018-07-25T11:43:33Z

/sig network

Lion-Wei · 2018-07-26T07:04:53Z

I think the reason is, traffic from in cluster have no need to go out side(lb) then come back.
And probably lb should not do TLS decoder work.

BSWANG · 2018-07-27T02:54:17Z

@Lion-Wei Will it be an option to config this behavior？Some functions like TLS,Monitor,Logging doing on lb are more reasonable and high performance.
If kube-proxy add this option, I would like to commit a PR to it.

fejta-bot · 2018-11-22T00:20:03Z

Issues go stale after 90d of inactivity.
Mark the issue as fresh with /remove-lifecycle stale.
Stale issues rot after an additional 30d of inactivity and eventually close.

If this issue is safe to close now please do so with /close.

Send feedback to sig-testing, kubernetes/test-infra and/or fejta.
/lifecycle stale

BSWANG · 2018-11-28T08:59:21Z

/remove-lifecycle stale

fejta-bot · 2019-02-26T09:35:11Z

Issues go stale after 90d of inactivity.
Mark the issue as fresh with /remove-lifecycle stale.
Stale issues rot after an additional 30d of inactivity and eventually close.

If this issue is safe to close now please do so with /close.

Send feedback to sig-testing, kubernetes/test-infra and/or fejta.
/lifecycle stale

BSWANG · 2019-03-04T06:38:50Z

/remove-lifecycle stale

jcodybaker · 2019-03-19T18:25:57Z

We're also seeing this as an issue at DigitalOcean. It's a concern not just for load-balancer TLS termination, but also for supporting proxy protocol as encouraged in ( https://kubernetes.io/docs/tutorials/services/source-ip/ ). Traffic addressed to the LB ip from within the cluster never reaches the load-balancer, and the required proxy header isn't applied, causing a protocol violation.

The in-tree AWS service type loadbalancer supports proxy protocol and TLS termination, but because they populate status.loadbalancer.ingress.hostname rather than .ip they avoid this bug/optimization.

We're willing to put together a PR to address this there's interest from sig-network to accept it. We've considered a kube-proxy flag to disable the optimization, or the more complex option of extending v1.LoadBalancerIngress to include feedback from the cloud provider.

bowei · 2019-03-25T22:27:33Z

It seems like what is being implemented here is not modeled all that well in the current kube-proxy construct; your LB is not transparent from an L3/4 perspective and lives somewhere other than the node (it is terminating TLS, adding a proxy protocol field). It seems possible for nothing to break if we remove the external LB IPs from the node as a provider-specific setting, but it will require some thought.

snuxoll · 2019-04-19T23:40:38Z

If I’m being quite honest this behavior goes against the principle of least surprise. If I want kube-proxy to route my traffic I would use a ClusterIP service or similar and configure the application to use it, if I am hitting a LoadBalancer provided by a cloud provider there is likely to be a specific reason for that and kube-proxy shouldn’t try to second guess it.

johnl · 2019-04-23T17:03:02Z

We hit this same problem at Brightbox with our cloud controller manager. Our Load Balancers can terminate SSL and support the PROXY protocol. Confirmed with both ipvs and iptables kube-proxy modes. Any IPs we provide as LoadBalancerIngress are used to redirect outgoing connections and keep them within the cluster, breaking things.

To put it more clearly (imo), the problem is that kube-proxy assumes all load balancer services are straight TCP/UDP and that it can optimize them internally but that is not the case.

So either the assumption is wrong and kube-proxy needs fixing, or the assumption is right and external load balancers should not be doing these fancy things. I obviously think kube-proxy is wrong here :)

As @jcodybaker says above, AWS avoids this problem simply but not listing IP addresses as LoadBalancerIngress, only hostnames, which kube-proxy doesn't resolve.

We've changed our cloud controller to do the same, listing only hostnames in there, and it's fixed our problem too.

This feels a bit like a hack, but maybe it could be argued that's the way for a cloud controller to enable/disable the optimisation - a bit misguiding though.

We were also exposing the IPv6 addresses of our load balancers too, which actually didn't look properly supported further down the stack at all, so things do seem to be a bit of a mess here generally.

So a kube-proxy flag to disable the optimization would be a great start, but I think this all needs a closer look!

Plus, it's worth noting that in ipvs mode, the IP address is actually added locally to each node so it's not possible to selectively support this per-port. As soon as a load balancer lists the IP, it's completely unreachable directly.

whereisaaron · 2019-05-05T08:53:43Z

If I create an external loadbalancer I don't expect kube-proxy to hijack traffic addressed to that external loadbalancer.

That completely disregards any functionality the load balancer provides, including TLS termination, proxy protocol, logging, metrics, DPI firewalling, and... the actual the load balancing algorithm!

I'm perplexed that this is default behavior? It only seems useful if your external load balancer doesn't actually do anything useful 😄

As @jcodybaker says above, AWS avoids this problem simply but not listing IP addresses as LoadBalancerIngress, only hostnames, which kube-proxy doesn't resolve. I'd like to be able to lock this out on kube-proxy in general for cluster deployments. This behavior is only going to break things.

Thanks for raising this @BSWANG. This submerged iceberg could have spoiled my day one day, but now I know to watch out for this 👀

BSWANG · 2021-04-28T03:45:59Z

reopen due this pr #96454 reverted

aojea · 2021-04-30T10:47:16Z

reopen due this pr #96454 reverted

the revert was done because that PR merged accidentally, to avoid misunderstandings, is not that the project doesn't want to implement, is that there are some process to add new features and that PR didn't follow them.
The KEP to implement the change has been approved https://github.com/kubernetes/enhancements/tree/master/keps/sig-network/1860-kube-proxy-IP-node-binding

@Sh4d1 are you still working on this #97681? if you want to include this in 1.22, the process has changed, please see https://groups.google.com/g/kubernetes-sig-network/c/3NYVEgvE1Uc/m/o68JAIt0DAAJ for more details

Sh4d1 · 2021-04-30T10:52:15Z

@aojea yep! Layed the groundwork on #101027 but I was waiting on Tim's review, tough I understood he's been a bit busy! ( You're welcome if you want to review!).
Once this one is merged, I can rebase #97681 and it should be good.

Thanks, I'll read that 😄

Bugfix for OAuth redirect_url not being http. This is due to an upstream proxy issue with kubernetes/kubernetes#66607

ryanlelek · 2021-05-23T20:33:23Z

Commenting for support/+1 that self-referencing the cluster from within would be useful

@mkreidenweis-schulmngr 's hair-pin solution (kind of) worked but didn't pick up the correct name of the pod(s) to reroute to and would have worked with further digging.
Setting up a separate cluster solved the issue in the meantime.

Thanks for your work on this

thockin · 2021-07-08T21:33:56Z

#101027 was not assigned to me, so I missed it.

thockin · 2021-08-05T20:47:51Z

I'm going to close this as a dup, since we do have the KEP open

liudongbo123 · 2021-11-12T14:01:51Z

how is the progress ？

dcardellino · 2022-10-25T13:50:59Z

Is there any solution for this?

thockin · 2022-10-25T14:40:12Z

Unfortunately the work to fix this sort of stalled for lack of someone to drive it to completion. Help wanted?

…

On Tue, Oct 25, 2022 at 8:51 AM Dominic Cardellino ***@***.***> wrote: Is there any solution for this? — Reply to this email directly, view it on GitHub <#66607 (comment)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/ABKWAVAQTN5RNEIQRTB7IQLWE7QVJANCNFSM4FL7W5EQ> . You are receiving this because you modified the open/close state.Message ID: ***@***.***>

timoreimann · 2022-10-26T04:47:56Z

Hey @thockin 👋

I'd be happy to help pushing this over the finishing line. Is there a place to refresh my memory on where we stand right now and what remains to be done?

I vaguely remember that time was spent on a KEP but don't recall the details.

Appreciate any starting pointers.

Sh4d1 · 2022-11-02T10:42:36Z

@timoreimann 👋 you can read kubernetes/enhancements#1860 (comment)
and Tim opened #106242 instead of #101027 .

So if nothing really changed, my last message still stands I guess!

This fixes problems with kube-proxy in ipvs mode considering the lb IP as local to the node See kubernetes/kubernetes#66607 This can also be used to access PROXY proto service from the inside

k8s-ci-robot added needs-sig Indicates an issue or PR lacks a `sig/foo` label and requires one. kind/bug Categorizes issue or PR as related to a bug. labels Jul 25, 2018

k8s-ci-robot added sig/network Categorizes an issue or PR as relevant to SIG Network. kind/friction and removed needs-sig Indicates an issue or PR lacks a `sig/foo` label and requires one. labels Jul 25, 2018

BSWANG changed the title ~~[Question]Why kube-proxy add external-lb's address to node local iptables rule?~~ Why kube-proxy add external-lb's address to node local iptables rule? Jul 25, 2018

k8s-ci-robot added kind/cleanup Categorizes issue or PR as related to cleaning up code, process, or technical debt. and removed kind/friction labels Aug 23, 2018

k8s-ci-robot added the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Nov 22, 2018

k8s-ci-robot removed the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Nov 28, 2018

k8s-ci-robot added the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Feb 26, 2019

k8s-ci-robot removed the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Mar 4, 2019

thockin added the triage/unresolved Indicates an issue that can not or will not be resolved. label Mar 8, 2019

jcodybaker mentioned this issue Apr 3, 2019

workaround: cluster->lb traffic w/ TLS/ProxyProto digitalocean/digitalocean-cloud-controller-manager#206

Closed

snuxoll mentioned this issue Apr 19, 2019

Preserve client IP address on both HTTP and HTTPS digitalocean/digitalocean-cloud-controller-manager#144

Closed

johnl mentioned this issue May 2, 2019

proxy_protocol mode breaks HTTP01 challenge Check stage cert-manager/cert-manager#466

Closed

freehan added kind/feature Categorizes issue or PR as related to a new feature. and removed triage/unresolved Indicates an issue that can not or will not be resolved. labels May 2, 2019

chnyda mentioned this issue Apr 29, 2021

Add external_openstack_enable_ingress_hostname option for external-openstack-cloud-controller-manager kubernetes-sigs/kubespray#7572

Merged

Miniland1333 added a commit to kreut/libretext that referenced this issue May 2, 2021

Forces HTTPS if APP_URL is https

ca0cc4d

Bugfix for OAuth redirect_url not being http. This is due to an upstream proxy issue with kubernetes/kubernetes#66607

process0 mentioned this issue Jun 22, 2021

Loadbalancer healthchecks report error despite node ports connectable hetznercloud/hcloud-cloud-controller-manager#212

Closed

thockin closed this as completed Aug 5, 2021

dakraus mentioned this issue Dec 7, 2021

NetworkPolicies still block egress traffic to OIDC provider kubermatic/kubermatic#8407

Closed

nabokihms mentioned this issue Feb 16, 2022

cert-manager does not work with Ingress Nginx's proxy-protocol mode deckhouse/deckhouse#841

Closed

2 tasks

rastislavs mentioned this issue Sep 26, 2022

IP allowlist setup for user clusters kubermatic/kubermatic#10407

Closed

chess-knight mentioned this issue Jan 19, 2023

Research: Investigate octavia w/ proxy protocol options SovereignCloudStack/issues#250

Closed

splattner mentioned this issue Jan 27, 2023

Try to fix proxy-protocol and http01 challenge acend/infrastructure#38

Closed

kayrus mentioned this issue Mar 6, 2023

Add support for "expect-proxy level4" for a list of CIDRs jcmoraisjr/haproxy-ingress#996

Open

hrak mentioned this issue May 5, 2023

Add hostname annotation to set loadbalancer ingress hostname apache/cloudstack-kubernetes-provider#48

Merged

UltraInstinct14 mentioned this issue Sep 18, 2023

[Request]: Static IP LoadBalancer loxilb-io/kube-loxilb#42

Closed

simon-wessel mentioned this issue Mar 11, 2024

Allow avoiding cluster-internal rerouting for LoadBalancer services by only setting hostname in status field netscaler/netscaler-k8s-ingress-controller#641

Open

charlesg99 mentioned this issue Mar 14, 2024

Confusion with do-loadbalancer-hostname digitalocean/digitalocean-cloud-controller-manager#698

Closed

brandond mentioned this issue Jul 11, 2024

Agent loadbalancer may deadlock when servers are removed rancher/rke2#6208

Closed

megian mentioned this issue Aug 20, 2024

Exposed load balancer IP causes cluster internal traffic fail together with the proxy protocol cloudscale-ch/cloudscale-cloud-controller-manager#15

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Why kube-proxy add external-lb's address to node local iptables rule? #66607

Why kube-proxy add external-lb's address to node local iptables rule? #66607

BSWANG commented Jul 25, 2018 •

edited

Loading

BSWANG commented Jul 25, 2018

Lion-Wei commented Jul 26, 2018

BSWANG commented Jul 27, 2018 •

edited

Loading

fejta-bot commented Nov 22, 2018

BSWANG commented Nov 28, 2018

fejta-bot commented Feb 26, 2019

BSWANG commented Mar 4, 2019

jcodybaker commented Mar 19, 2019

bowei commented Mar 25, 2019

snuxoll commented Apr 19, 2019

johnl commented Apr 23, 2019

whereisaaron commented May 5, 2019

BSWANG commented Apr 28, 2021

aojea commented Apr 30, 2021 •

edited

Loading

Sh4d1 commented Apr 30, 2021

ryanlelek commented May 23, 2021

thockin commented Jul 8, 2021

thockin commented Aug 5, 2021

liudongbo123 commented Nov 12, 2021

dcardellino commented Oct 25, 2022

thockin commented Oct 25, 2022 via email

timoreimann commented Oct 26, 2022

Sh4d1 commented Nov 2, 2022

Why kube-proxy add external-lb's address to node local iptables rule? #66607

Why kube-proxy add external-lb's address to node local iptables rule? #66607

Comments

BSWANG commented Jul 25, 2018 • edited Loading

BSWANG commented Jul 25, 2018

Lion-Wei commented Jul 26, 2018

BSWANG commented Jul 27, 2018 • edited Loading

fejta-bot commented Nov 22, 2018

BSWANG commented Nov 28, 2018

fejta-bot commented Feb 26, 2019

BSWANG commented Mar 4, 2019

jcodybaker commented Mar 19, 2019

bowei commented Mar 25, 2019

snuxoll commented Apr 19, 2019

johnl commented Apr 23, 2019

whereisaaron commented May 5, 2019

BSWANG commented Apr 28, 2021

aojea commented Apr 30, 2021 • edited Loading

Sh4d1 commented Apr 30, 2021

ryanlelek commented May 23, 2021

thockin commented Jul 8, 2021

thockin commented Aug 5, 2021

liudongbo123 commented Nov 12, 2021

dcardellino commented Oct 25, 2022

thockin commented Oct 25, 2022 via email

timoreimann commented Oct 26, 2022

Sh4d1 commented Nov 2, 2022

BSWANG commented Jul 25, 2018 •

edited

Loading

BSWANG commented Jul 27, 2018 •

edited

Loading

aojea commented Apr 30, 2021 •

edited

Loading