[Test] Connection timeout in RecoveryIT#testRecoverSyncedFlushIndex on rolling-upgrade #32027

@cbuescher

This happened on 6.x: https://elasticsearch-ci.elastic.co/job/elastic+elasticsearch+6.x+default-distro-bwc-tests/149/console

07:32:46 ERROR   70.7s | RecoveryIT.testRecoverSyncedFlushIndex <<< FAILURES!
07:32:46    > Throwable #1: org.elasticsearch.client.ResponseException: method [GET], host [http://[::1]:42775], URI [/_cluster/health/recover_synced_flush_index?wait_for_no_relocating_shards=true&level=shards&timeout=70s&wait_for_status=green], status line [HTTP/1.1 408 Request Timeout]
07:32:46   2> NOTE: leaving temporary files on disk at: /var/lib/jenkins/workspace/elastic+elasticsearch+6.x+default-distro-bwc-tests/qa/rolling-upgrade/build/testrun/v6.3.2-SNAPSHOT#oneThirdUpgradedTestRunner/J0/temp/org.elasticsearch.upgrades.RecoveryIT_6C47D2BFDE00AF80-001
07:32:46    > {"cluster_name":"rolling-upgrade","status":"yellow","timed_out":true,"number_of_nodes":3,"number_of_data_nodes":3,"active_primary_shards":1,"active_shards":1,"relocating_shards":0,"initializing_shards":0,"unassigned_shards":1,"delayed_unassigned_shards":0,"number_of_pending_tasks":0,"number_of_in_flight_fetch":0,"task_max_waiting_in_queue_millis":0,"active_shards_percent_as_number":96.84210526315789,"indices":{"recover_synced_flush_index":{"status":"yellow","number_of_shards":1,"number_of_replicas":1,"active_primary_shards":1,"active_shards":1,"relocating_shards":0,"initializing_shards":0,"unassigned_shards":1,"shards":{"0":{"status":"yellow","primary_active":true,"active_shards":1,"relocating_shards":0,"initializing_shards":0,"unassigned_shards":1}}}}}
07:32:46    > 	at __randomizedtesting.SeedInfo.seed([6C47D2BFDE00AF80:47760F687957DAC9]:0)
07:32:46    > 	at org.elasticsearch.client.RestClient$SyncResponseListener.get(RestClient.java:920)
07:32:46   2> NOTE: test params are: codec=Asserting(Lucene70): {}, docValues:{}, maxPointsInLeafNode=597, maxMBSortInHeap=6.887940697412328, sim=RandomSimilarity(queryNorm=true): {}, locale=es-BO, timezone=Asia/Irkutsk
07:32:46   2> NOTE: Linux 4.16.8-300.fc28.x86_64 amd64/Oracle Corporation 1.8.0_172 (64-bit)/cpus=16,threads=1,free=442107176,total=514850816
07:32:46   2> NOTE: All tests run in this JVM: [RecoveryIT]
07:32:46    > 	at org.elasticsearch.client.RestClient.performRequest(RestClient.java:227)
07:32:46    > 	at org.elasticsearch.test.rest.ESRestTestCase.ensureGreen(ESRestTestCase.java:513)
07:32:46    > 	at org.elasticsearch.upgrades.RecoveryIT.testRecoverSyncedFlushIndex(RecoveryIT.java:318)
07:32:46    > 	at java.lang.Thread.run(Thread.java:748)
07:32:46    > Caused by: org.elasticsearch.client.ResponseException: method [GET], host [http://[::1]:42775], URI [/_cluster/health/recover_synced_flush_index?wait_for_no_relocating_shards=true&level=shards&timeout=70s&wait_for_status=green], status line [HTTP/1.1 408 Request Timeout]
07:32:46    > {"cluster_name":"rolling-upgrade","status":"yellow","timed_out":true,"number_of_nodes":3,"number_of_data_nodes":3,"active_primary_shards":1,"active_shards":1,"relocating_shards":0,"initializing_shards":0,"unassigned_shards":1,"delayed_unassigned_shards":0,"number_of_pending_tasks":0,"number_of_in_flight_fetch":0,"task_max_waiting_in_queue_millis":0,"active_shards_percent_as_number":96.84210526315789,"indices":{"recover_synced_flush_index":{"status":"yellow","number_of_shards":1,"number_of_replicas":1,"active_primary_shards":1,"active_shards":1,"relocating_shards":0,"initializing_shards":0,"unassigned_shards":1,"shards":{"0":{"status":"yellow","primary_active":true,"active_shards":1,"relocating_shards":0,"initializing_shards":0,"unassigned_shards":1}}}}}
07:32:46    > 	at org.elasticsearch.client.RestClient$1.completed(RestClient.java:540)
07:32:46    > 	at org.elasticsearch.client.RestClient$1.completed(RestClient.java:529)
07:32:46    > 	at org.apache.http.concurrent.BasicFuture.completed(BasicFuture.java:119)
07:32:46    > 	at org.apache.http.impl.nio.client.DefaultClientExchangeHandlerImpl.responseCompleted(DefaultClientExchangeHandlerImpl.java:177)
07:32:46    > 	at org.apache.http.nio.protocol.HttpAsyncRequestExecutor.processResponse(HttpAsyncRequestExecutor.java:436)
07:32:46    > 	at org.apache.http.nio.protocol.HttpAsyncRequestExecutor.inputReady(HttpAsyncRequestExecutor.java:326)
07:32:46    > 	at org.apache.http.impl.nio.DefaultNHttpClientConnection.consumeInput(DefaultNHttpClientConnection.java:265)
07:32:46    > 	at org.apache.http.impl.nio.client.InternalIODispatch.onInputReady(InternalIODispatch.java:81)
07:32:46    > 	at org.apache.http.impl.nio.client.InternalIODispatch.onInputReady(InternalIODispatch.java:39)
07:32:46    > 	at org.apache.http.impl.nio.reactor.AbstractIODispatch.inputReady(AbstractIODispatch.java:114)
07:32:46    > 	at org.apache.http.impl.nio.reactor.BaseIOReactor.readable(BaseIOReactor.java:162)
07:32:46    > 	at org.apache.http.impl.nio.reactor.AbstractIOReactor.processEvent(AbstractIOReactor.java:337)
07:32:46    > 	at org.apache.http.impl.nio.reactor.AbstractIOReactor.processEvents(AbstractIOReactor.java:315)
07:32:46    > 	at org.apache.http.impl.nio.reactor.AbstractIOReactor.execute(AbstractIOReactor.java:276)
07:32:46    > 	at org.apache.http.impl.nio.reactor.BaseIOReactor.execute(BaseIOReactor.java:104)
07:32:46    > 	at org.apache.http.impl.nio.reactor.AbstractMultiworkerIOReactor$Worker.run(AbstractMultiworkerIOReactor.java:588)
07:32:46    > 	... 1 more

This sadly doesn't reproduce locally:

./gradlew :qa:rolling-upgrade:v6.3.2-SNAPSHOT#oneThirdUpgradedTestRunner -Dtests.seed=6C47D2BFDE00AF80 -Dtests.class=org.elasticsearch.upgrades.RecoveryIT -Dtests.method="testRecoverSyncedFlushIndex" -Dtests.security.manager=true -Dtests.locale=es-BO -Dtests.timezone=Asia/Irkutsk -Dtests.distribution=zip
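
For triage, the body of the 408 response already says which shard copy kept the index from going green. The sketch below parses a trimmed copy of that body (only the fields it reads are kept; the values are taken from the log above) and lists every shard group that is not green. The function name `summarize_health` is made up for illustration and is not part of the test framework.

```python
import json

# Trimmed copy of the cluster health body from the CI log above.
# Only the fields read by this sketch are kept; values are unchanged.
HEALTH_BODY = """
{
  "cluster_name": "rolling-upgrade",
  "status": "yellow",
  "timed_out": true,
  "number_of_nodes": 3,
  "indices": {
    "recover_synced_flush_index": {
      "status": "yellow",
      "number_of_shards": 1,
      "number_of_replicas": 1,
      "shards": {
        "0": {
          "status": "yellow",
          "primary_active": true,
          "active_shards": 1,
          "relocating_shards": 0,
          "initializing_shards": 0,
          "unassigned_shards": 1
        }
      }
    }
  }
}
"""


def summarize_health(body):
    """Return (cluster status, timed_out flag, list of non-green shard groups)."""
    health = json.loads(body)
    problems = []
    for index_name, index in health.get("indices", {}).items():
        for shard_id, shard in index.get("shards", {}).items():
            if shard["status"] != "green":
                problems.append(
                    (
                        index_name,
                        shard_id,
                        shard["status"],
                        shard["unassigned_shards"],
                        shard["initializing_shards"],
                    )
                )
    return health["status"], health["timed_out"], problems


print(summarize_health(HEALTH_BODY))
# ('yellow', True, [('recover_synced_flush_index', '0', 'yellow', 1, 0)])
```

Shard 0 has one unassigned copy and none initializing while the primary is active, so when the 70s timeout expired the replica was still unassigned rather than in the middle of a recovery. That is only a reading of the response body, not a confirmed root cause.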

Labels

:Distributed Indexing/Recovery (Anything around constructing a new shard, either from a local or a remote source.)
>test-failure (Triaged test failures from CI)
v6.5.0