
listener: listen socket factory can be cloned from listener draining filter chain #18686

Merged: mattklein123 merged 2 commits into envoyproxy:main from lambdai:fixsetnewordraining on Oct 20, 2021

Conversation

@lambdai (Contributor) commented Oct 19, 2021

Commit Message: listener: listen socket factory can be cloned from listener draining filter chain

Currently the in-place updated old listener holds the listen fd during the drain timeout.

If the corresponding active listener is removed and then added back, the added-back listener creates fresh passive sockets with new inodes. The kernel spreads accepted requests across the old listen fd and the new listen fd.

Sockets accepted on the old listen fd will be lost. Even if the kernel/eBPF somehow requeues these sockets, they will only be handled by Envoy after the drain timeout is triggered.

This PR duplicates the listen fd from the old listener. The new listen fd shares the same inode, so the newly created listener will consume the accepted sockets, if any.

Signed-off-by: Yuchen Dai silentdai@gmail.com

Additional Description:
Risk Level:
Testing: Unittest and a bunch of istio e2e tests
Docs Changes:
Release Notes:
Platform Specific Features:
[Optional Runtime guard:]
Fix #18616
[Optional Fixes commit #PR or SHA]
[Optional Deprecated:]
[Optional API Considerations:]

@lambdai (Contributor, Author) commented Oct 19, 2021

/assign @mattklein123

@lambdai (Contributor, Author) commented Oct 19, 2021

I would argue that #18677 is also needed.

In the test case, we add the listener 0.0.0.0:8080. Happily, the requests that arrived prior to the add are queued.

However, the timing of the add-back is pretty much unpredictable. I would expect stopping the listener 0.0.0.0:8080 to mean that Envoy rejects new requests ASAP.

@lambdai (Contributor, Author) commented Oct 20, 2021

/retest

@repokitteh-read-only (bot) commented:

Retrying Azure Pipelines:
Retried failed jobs in: envoy-presubmit

🐱

Caused by: a #18686 (comment) was created by @lambdai.

@mattklein123 (Member) commented:

> I would argue that #18677 is also needed.

Can you explain this more please? From our lengthy offline discussion I now don't understand how #18677 helps. As long as the sockets are duplicated/cloned, there should be no issue?

@mattklein123 (Member) left a review comment:

Thanks, makes sense to me with a small comment.

/wait

@mattklein123 (Member) left a review comment:

Thanks! We should backport this, obviously.

@lambdai (Contributor, Author) commented Oct 20, 2021

> > I would argue that #18677 is also needed.
>
> Can you explain this more please? From our lengthy offline discussion I now don't understand how #18677 helps. As long as the sockets are duplicated/cloned, there should be no issue?

There is no issue if we add a fresh listener after we stop the listener on that address.

Think about why the added-back fresh listener has to duplicate from the draining-filter-chain listener: it is because the old in-place updated listener doesn't close the listen socket during the drain_timeout after the listener is told to be removed.

The state of the system is described here:
https://github.com/envoyproxy/envoy/pull/18686/files#diff-f8e00b71778057a3747a30c4fc3f3ad13e551cadebaf0eca9bdb476b42cadbcaR5112-R5117

@lambdai (Contributor, Author) commented Oct 20, 2021

> Thanks! We should backport this, obviously.

Thank you for the offline guidance and the quick approval!

@lambdai (Contributor, Author) commented Oct 20, 2021

/backport

We only need to backport to 1.20.

@repokitteh-read-only (bot) added the backport/review (Request to backport to stable releases) label on Oct 20, 2021
@mattklein123 (Member) commented:

> Think about why the added-back fresh listener has to duplicate from the draining-filter-chain listener: it is because the old in-place updated listener doesn't close the listen socket during the drain_timeout after the listener is told to be removed.

Sorry, I still don't understand. If you want to continue the discussion on the other PR we can talk there, but I don't see any scenario in which that PR will help. The real fix is cloning if we want to reuse a socket and not drop anything.


Labels: backport/approved (Approved backports to stable releases)

Linked issue (may be closed by this PR): Envoy v1.20.0 slower to re-initialize dynamic listeners

3 participants