flaky onlineddl tests: reduce -online_ddl_check_interval #7847
shlomi-noach merged 31 commits into vitessio:master
Conversation
Signed-off-by: Shlomi Noach <2607934+shlomi-noach@users.noreply.github.com>
Hmmm, actually the problem is more specific to …
Ugh, now failing: https://github.com/vitessio/vitess/pull/7847/checks?check_run_id=2331837611 . I'll keep looking into it.
The nature of this error is unclear: …
Now, it fails on a nonsensical issue: the very first …
This does not reproduce locally, ever. All …
Also, once in a while, another …
This feels like resource exhaustion. I have no other explanation for this behavior.
bc1ffd6 to d7f0e8d
Rebased and removed the OS tune-up commit, since we have a different fix for CI now.
@shlomi-noach: tests are passing. I see you moved it to Draft: is it ready to be merged?
I moved it to …
…s into flaky-tests-online-ddl Signed-off-by: Shlomi Noach <2607934+shlomi-noach@users.noreply.github.com>
Some update on where … In …
I invested time to verify/deny that the problem is with … Reiterating that all these tests pass just fine on multiple environments, just not in GitHub CI. BTW, the test passes in #7850. So I'm still looking for a platform problem: sockets, files, or other resources.
The … Running even more experiments to double-validate this.
If I remove hooks, then the next thing … It's all about ports.
deepthi left a comment
- Was it intentional to check in a new gh-ost binary?
- When we merge this, we should squash-merge, since so many of the commits were experimental.
    with:
      go-version: 1.15

    - name: Tune the OS
Do we still need the port range fix? Or is the /etc/hosts fix sufficient?
This had immediate positive effect last week on local_example tests. I'll experiment without it.
Removed, and tests are green!
No! It was experimental. Reverting.
Cool
Removing the port range patch, 8f4444c, seems to look good. Tests are still passing without the patch.
Squash commit: 2179d0c
Description
Solves #7834
Not sure how this happened (I will look into git blame and history later), but in the test suite, vtctld should run with a very short -online_ddl_check_interval. For some reason the flag didn't appear in any of the online DDL tests. It must have changed somehow over time; looking into it. Anyway, this should resolve the current CI failures seen mostly in onlineddl_ghost but also in other onlineddl_* tests.