[Hotfix][Testing] Wait for RPCServer to be established #9150

junrushao · 2021-09-29T18:43:29Z

In unittests, we establish a "faked" RPC tracker/runner locally, but we forgot to wait until the server process is set up, which causes flakiness on mainline.

https://ci.tlcpack.ai/blue/organizations/jenkins/tvm/detail/main/1815/pipeline

Thanks @vinx13 for reporting! CC @zxybazh

shingjan

overall LGTM. Would 0.5s be enough?

junrushao · 2021-09-29T18:45:57Z

@shingjan Yes it's far more than enough as I experimented with @zxybazh weeks ago, but I did some refactoring when upstreaming the codebase, which accidentally dropped this line...

zxybazh · 2021-09-29T18:47:25Z

LGTM. Thanks for the fix.

tqchen · 2021-09-29T19:59:05Z

Interesting, under popen impl, we should have wait until we get the related fields(where socket already binds) https://github.com/apache/tvm/blob/main/python/tvm/rpc/tracker.py#L450

Same thing for server, so i wonder why wait is still needed

tqchen · 2021-09-29T20:00:33Z

OK, answering my own Q, this might be needed for server to connect to tracker.

junrushao · 2021-09-29T20:31:42Z

@tqchen right, the server needs some time to talk to the tracker

* main: (80 commits) Introduce centralised name transformation functions (apache#9088) [OpenCL] Add vectorization to cuda conv2d_nhwc schedule (apache#8636) [6/6] Arm(R) Ethos(TM)-U NPU codegen integration with `tvmc` (apache#8854) [microTVM] Add wrapper for creating project using a MLF (apache#9090) Fix typo (apache#9156) [Hotfix][Testing] Wait for RPCServer to be established (apache#9150) Update find cublas so it search default path if needed. (apache#9149) [TIR][LowerMatchBuffer] Fix lowering strides when source region has higher dimension than the buffer (apache#9145) Fix flaky NMS test by making sure scores are unique (apache#9140) [Relay] Merge analysis/context_analysis.cc and transforms/device_annotation.cc (apache#9038) [LLVM] Make changes needed for opaque pointers (apache#9138) Arm(R) Ethos(TM)-U NPU codegen integration (apache#8849) [CI] Split Integration tests out of first phase of pipeline (apache#9128) [Meta Schedule][M3b] Runner (apache#9111) Fix Google Mock differences between Ubuntu 18.04 and 16.04 (apache#9141) [TIR] add loop partition hint pragma (apache#9121) fix things (apache#9146) [Meta Schedule][M3a] SearchStrategy (apache#9132) [Frontend][PyTorch] support for quantized conv_transpose2d op (apache#9133) [UnitTest] Parametrized test_conv2d_int8_intrinsics (apache#9143) ...

areusch · 2021-10-04T18:00:45Z

it would be great for these fixes if, in the future, they could come with a comment explaining why we're adding sleep :).

[Hotfix][Testing] Wait for RPCServer to be established

3b8ac4c

junrushao requested review from comaniac, jroesch, tqchen, yzhliu and zhiics as code owners September 29, 2021 18:43

vinx13 approved these changes Sep 29, 2021

View reviewed changes

shingjan approved these changes Sep 29, 2021

View reviewed changes

shingjan mentioned this pull request Sep 29, 2021

[MetaSchedule][M4a] Local runner #9153

Merged

junrushao merged commit 677f2d4 into apache:main Sep 30, 2021

ylc pushed a commit to ylc/tvm that referenced this pull request Jan 7, 2022

[Hotfix][Testing] Wait for RPCServer to be established (apache#9150)

4eeef61

ylc pushed a commit to ylc/tvm that referenced this pull request Jan 13, 2022

[Hotfix][Testing] Wait for RPCServer to be established (apache#9150)

390ac0f

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Hotfix][Testing] Wait for RPCServer to be established #9150

[Hotfix][Testing] Wait for RPCServer to be established #9150

junrushao commented Sep 29, 2021 •

edited

Loading

shingjan left a comment

junrushao commented Sep 29, 2021

zxybazh commented Sep 29, 2021

tqchen commented Sep 29, 2021

tqchen commented Sep 29, 2021

junrushao commented Sep 29, 2021

areusch commented Oct 4, 2021

[Hotfix][Testing] Wait for RPCServer to be established #9150

[Hotfix][Testing] Wait for RPCServer to be established #9150

Conversation

junrushao commented Sep 29, 2021 • edited Loading

shingjan left a comment

Choose a reason for hiding this comment

junrushao commented Sep 29, 2021

zxybazh commented Sep 29, 2021

tqchen commented Sep 29, 2021

tqchen commented Sep 29, 2021

junrushao commented Sep 29, 2021

areusch commented Oct 4, 2021

junrushao commented Sep 29, 2021 •

edited

Loading