Closed
Labels
:Distributed Coordination/Network, Meta, Team:Distributed (Obsolete), team-discuss
Description
When using Cross Cluster Search (CCS), if a remote cluster becomes unreachable due to network issues, the CCS node takes a while to detect it. This is particularly bad when a firewall in between silently drops connections: CCS searches hang, even though new TCP connections can still be initiated from the CCS node to the remote cluster nodes on port 9300.
This has been reported on our forum and also in #30247.
The following are changes that we could make to improve this:
- allow enabling scheduled pings directed at remote clusters without affecting intra-cluster communication (see also "Schedule transport ping interval" #10189; "Allow to enable pings for specific remote clusters" #34753)
- possibly adapt the transport ping so that it comes back with a response ("Make keepalive pings bidirectional and optimizable" #35441)
- add support for a timeout to the transport ping? (users should do this at the TCP level; "Suggest reducing tcp_retries2" #59222 documents how)
- consider enabling transport pings by default for remote clusters, with a sensible default interval (see also "Update documentation to prevent dropping tcp connections" #5067; some discussion happened in "Allow to enable pings for specific remote clusters" #34753)
- given that CCS searches are timing out in the initial search_shards phase, we may also want to apply a sensible timeout and/or make the timeout configurable (see "CCS: Should `timeout` parameter be honored?" #32678)
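To make the ping-related proposals above more concrete, here is a minimal sketch of how a scheduled ping that expects a response, combined with a timeout, could flag a silently dropped connection. This is not how the transport layer is implemented; the `KeepaliveMonitor` class, its method names, and the interval and timeout values are all hypothetical and exist only for illustration.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class KeepaliveMonitor:
    """Hypothetical per-connection tracker for scheduled pings with responses."""

    ping_interval: float  # seconds between scheduled pings (assumed value)
    ping_timeout: float   # seconds to wait for a response (assumed value)
    last_ping_sent: Optional[float] = None
    pending_since: Optional[float] = None  # time of the oldest unanswered ping

    def should_ping(self, now: float) -> bool:
        # Ping if we have never pinged, or if the interval has elapsed.
        return (
            self.last_ping_sent is None
            or now - self.last_ping_sent >= self.ping_interval
        )

    def on_ping_sent(self, now: float) -> None:
        self.last_ping_sent = now
        # Remember only the oldest unanswered ping, so that sending newer
        # pings does not keep pushing the timeout deadline forward.
        if self.pending_since is None:
            self.pending_since = now

    def on_pong_received(self) -> None:
        # Any response proves the peer is reachable; clear the pending state.
        self.pending_since = None

    def is_unreachable(self, now: float) -> bool:
        # Unreachable once the oldest unanswered ping exceeds the timeout.
        return (
            self.pending_since is not None
            and now - self.pending_since >= self.ping_timeout
        )


monitor = KeepaliveMonitor(ping_interval=5.0, ping_timeout=10.0)
monitor.on_ping_sent(0.0)
monitor.on_ping_sent(5.0)            # still no response to the first ping
print(monitor.is_unreachable(9.0))   # timeout not yet reached
print(monitor.is_unreachable(10.0))  # first ping unanswered for 10 seconds
```

With a one-way ping, a firewall that drops packets silently leaves the sender with nothing to time out on short of the TCP retransmission limits. A ping that expects a response gives the sender a deadline it can enforce itself.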