test: more fast-timeouts. by alyssawilk · Pull Request #15959 · envoyproxy/envoy

alyssawilk · 2021-04-13T21:25:02Z

H2 waitForReset also picks up end stream, while H3 doesn't, so a bunch of QUIC tests
were actually spinning on waitForReset until disconnect happened.

Fixing that, and fixing it forward by adding a faster timeout that catches cases where we're waiting for the wrong thing.

Risk Level: n/a
Testing: passes locally at least =P
Docs Changes: n/a
Release Notes: n/a

Signed-off-by: Alyssa Wilk <alyssar@chromium.org>

dmitri-d · 2021-04-13T22:38:41Z

test/integration/quic_http_integration_test.cc

  ASSERT_TRUE(response->waitForEndStream());
  // The delayed close timeout should trigger since client is not closing the connection.
-  EXPECT_TRUE(codec_client_->waitForDisconnect(std::chrono::milliseconds(5000)));
+  EXPECT_TRUE(codec_client_->waitForDisconnect(std::chrono::milliseconds(10000)));


Why did you decide to increase the timeout here?

dmitri-d · 2021-04-13T22:39:23Z

lgtm

snowp · 2021-04-13T23:03:56Z

test/integration/integration_stream_decoder.cc

 }

-void IntegrationStreamDecoder::waitForReset() {
+AssertionResult IntegrationStreamDecoder::waitForReset(std::chrono::milliseconds timeout) {


must use result?

Did you mean ABSL_MUST_USE_RESULT?

Yep, seems like we want callers to check this?

Ahh, maybe I can on this one. I couldn't on the other one I ported without changing the signature, updating the Envoy filter examples, and then adding MUST_USE, but I think the Envoy filter examples don't use this function.
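
For readers following the thread, here is a minimal sketch of why a must-use annotation helps once a wait function returns a result instead of `void`. It is not the PR's code: `WaitResult` and `waitForResetSketch` are hypothetical names, and standard C++17 `[[nodiscard]]` stands in for `ABSL_MUST_USE_RESULT` so the example has no dependencies.

```cpp
#include <cassert>
#include <chrono>
#include <string>

// Hypothetical stand-in for an assertion-result type: a success flag plus a message.
struct WaitResult {
  bool ok;
  std::string message;
  explicit operator bool() const { return ok; }
};

// [[nodiscard]] asks the compiler to warn when a caller drops the return value,
// which is the role a must-use macro would play on the real function.
[[nodiscard]] WaitResult waitForResetSketch(bool saw_reset, std::chrono::milliseconds timeout) {
  if (saw_reset) {
    return {true, ""};
  }
  return {false, "timed out after " + std::to_string(timeout.count()) + "ms waiting for reset"};
}
```

With this annotation, a bare `waitForResetSketch(false, std::chrono::milliseconds(5000));` statement draws a compiler warning, so a timed-out wait is harder to ignore silently.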

snowp · 2021-04-13T23:07:16Z

test/integration/integration_stream_decoder.cc

-void IntegrationStreamDecoder::waitForReset() {
+AssertionResult IntegrationStreamDecoder::waitForReset(std::chrono::milliseconds timeout) {
  if (!saw_reset_) {
+    Event::TimerPtr timer(dispatcher_.createTimer([this]() -> void { dispatcher_.exit(); }));


Maybe add a comment on how this works (or this is a common pattern?), it took me a minute. If it is a common pattern, can we extract this to some helper?

Also might want to note somewhere that this leaves the dispatcher in a bad state so you really want ASSERT vs EXPECT

I don't think it leaves the dispatcher in a bad state, what do you think the problem is?

I'm probably misunderstanding how this works, but seems to me like this stops the dispatcher after the timer expires? It would then mean that it's no longer running after this? But on the other hand we don't disable this so this will fire on its own at some point anyways, so I guess it can't be messing up the dispatcher. So what exactly does calling dispatcher_.exit() do?

Ah, the dispatcher below is run in blocking mode, so it will loop forever unless something tells it to exit.
Previously, it would exit only if a reset (or disconnect) was received; now it also gives up after 5s.
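
The pattern described above can be sketched with a toy event loop. This is an illustration under assumptions, not Envoy's implementation: `ToyDispatcher` and `ToyStreamDecoder` are hypothetical classes, and only the idea (a timer whose callback exits a blocking run loop) mirrors the diff.

```cpp
#include <algorithm>
#include <cassert>
#include <chrono>
#include <cstddef>
#include <functional>
#include <thread>
#include <utility>
#include <vector>

// Hypothetical single-threaded dispatcher; not Envoy's Event::Dispatcher.
class ToyDispatcher {
public:
  using Clock = std::chrono::steady_clock;

  // Schedule cb to run once `delay` has elapsed; returns an id usable with cancel().
  int runAfter(std::chrono::milliseconds delay, std::function<void()> cb) {
    timers_.push_back({next_id_, Clock::now() + delay, std::move(cb)});
    return next_id_++;
  }

  // Drop a pending timer; a no-op if it has already fired.
  void cancel(int id) {
    timers_.erase(std::remove_if(timers_.begin(), timers_.end(),
                                 [id](const Timer& t) { return t.id == id; }),
                  timers_.end());
  }

  // Ask the blocking run loop to return.
  void exit() { exit_requested_ = true; }

  // Block until exit() is called, firing timers as they come due.
  void runBlocking() {
    exit_requested_ = false;
    while (!exit_requested_) {
      const auto now = Clock::now();
      for (std::size_t i = 0; i < timers_.size();) {
        if (timers_[i].deadline <= now) {
          auto cb = std::move(timers_[i].cb);
          timers_.erase(timers_.begin() + static_cast<std::ptrdiff_t>(i));
          cb();
        } else {
          ++i;
        }
      }
      std::this_thread::sleep_for(std::chrono::milliseconds(1));
    }
  }

private:
  struct Timer {
    int id;
    Clock::time_point deadline;
    std::function<void()> cb;
  };
  std::vector<Timer> timers_;
  int next_id_ = 0;
  bool exit_requested_ = false;
};

// Hypothetical decoder showing the wait-with-timeout shape.
class ToyStreamDecoder {
public:
  explicit ToyStreamDecoder(ToyDispatcher& dispatcher) : dispatcher_(dispatcher) {}

  // Called by the "codec" when a reset arrives; wakes up any blocked waiter.
  void onReset() {
    saw_reset_ = true;
    dispatcher_.exit();
  }

  // Returns true if a reset was seen, false if the timeout fired first.
  bool waitForReset(std::chrono::milliseconds timeout) {
    if (!saw_reset_) {
      // The timeout timer's only job is to break out of the blocking loop.
      const int timer = dispatcher_.runAfter(timeout, [this] { dispatcher_.exit(); });
      dispatcher_.runBlocking();
      dispatcher_.cancel(timer); // Avoid a stale timer ending a later run early.
    }
    return saw_reset_;
  }

private:
  ToyDispatcher& dispatcher_;
  bool saw_reset_ = false;
};
```

Without the timeout timer, `runBlocking()` in this sketch never returns when no reset arrives, which is the spinning behaviour the PR description mentions. The explicit `cancel` plays the part that destroying the timer would play in a design where the timer object owns its registration.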

yanavlasov · 2021-04-14T02:11:46Z

It is unexpected that waitForReset is triggered when stream ends normally for H2. I wonder if we should make it trigger on resets only or if it could break too many tests.

yanavlasov · 2021-04-14T14:07:01Z

/wait

snowp

Thanks!

alyssawilk · 2021-04-14T18:15:46Z

Agree, Yan, it would be good to fix H2 so your waitForX didn't finish if X didn't happen, but I'm going to put off shaving that yak for another day :-)

test: more fast-timeouts.

e9d8f48

Signed-off-by: Alyssa Wilk <alyssar@chromium.org>

alyssawilk requested a review from snowp as a code owner April 13, 2021 21:25

htuch assigned snowp Apr 13, 2021

dmitri-d reviewed Apr 13, 2021

snowp suggested changes Apr 13, 2021

yanavlasov self-assigned this Apr 14, 2021

repokitteh-read-only bot added the waiting label Apr 14, 2021

comments

38f4da1

Signed-off-by: Alyssa Wilk <alyssar@chromium.org>

repokitteh-read-only bot removed the waiting label Apr 14, 2021

snowp approved these changes Apr 14, 2021

alyssawilk merged commit 96573e2 into envoyproxy:main Apr 14, 2021

alyssawilk deleted the timeout2 branch February 28, 2022 21:25
