Do not attempt to re-establish connections when applying a cluster state update #29025

DaveCTurner · 2018-03-13T17:20:41Z

Today, when a new cluster state is applied, we attempt to connect to all nodes in the new cluster state:

elasticsearch/server/src/main/java/org/elasticsearch/cluster/service/ClusterApplierService.java

Line 467 in 5904d93

nodeConnectionsService.connectToNodes(newClusterState.nodes());

This set of nodes may include some nodes that were previously connected but which have failed, but whose failure is to be processed in a later cluster state update. Attempts to reconnect to these nodes will also fail, but may do so very slowly if they are unresponsive. On the other hand, the NodesConnectionService takes responsibility for periodically re-establishing connections that have dropped.

This means that there's no real need to attempt to re-establish these connections while applying a cluster state update, and #28920 is an example of a situation where it's undesirable to do so. Therefore it seems sensible to skip these nodes.

NB it's important that the nodes in the cluster state align with the nodes known to the NodeConnectionsService, so they can't simply be omitted in the call to connectToNodes() above.

The text was updated successfully, but these errors were encountered:

elasticmachine · 2018-03-13T17:20:42Z

Pinging @elastic/es-distributed

Today, when a new cluster state is committed we attempt to connect to all of its nodes as part of the application process. This is the right thing to do with new nodes, and is a no-op on any already-connected nodes, but is questionable on known nodes from which we are currently disconnected: there is a risk that we are partitioned from these nodes so that any attempt to connect to them will hang until it times out. This can dramatically slow down the application of new cluster states which hinders the recovery of the cluster during certain kinds of partition. If nodes are disconnected from the master then it is likely that they are to be removed as part of a subsequent cluster state update, so there's no need to try and reconnect to them like this. Moreover there is no need to attempt to reconnect to disconnected nodes as part of the cluster state application process, because we periodically try and reconnect to any disconnected nodes, and handle their disconnectedness gracefully in the meantime. This commit alters this behaviour to avoid reconnecting to known nodes during cluster state application. Resolves elastic#29025.

Today, when applying new cluster state we attempt to connect to all of its nodes as a blocking part of the application process. This is the right thing to do with new nodes, and is a no-op on any already-connected nodes, but is questionable on known nodes from which we are currently disconnected: there is a risk that we are partitioned from these nodes so that any attempt to connect to them will hang until it times out. This can dramatically slow down the application of new cluster states which hinders the recovery of the cluster during certain kinds of partition. If nodes are disconnected from the master then it is likely that they are to be removed as part of a subsequent cluster state update, so there's no need to try and reconnect to them like this. Moreover there is no need to attempt to reconnect to disconnected nodes as part of the cluster state application process, because we periodically try and reconnect to any disconnected nodes, and handle their disconnectedness reasonably gracefully in the meantime. This commit alters this behaviour to avoid reconnecting to known nodes during cluster state application. Resolves elastic#29025. Supersedes elastic#31547.

Today, when applying new cluster state we attempt to connect to all of its nodes as a blocking part of the application process. This is the right thing to do with new nodes, and is a no-op on any already-connected nodes, but is questionable on known nodes from which we are currently disconnected: there is a risk that we are partitioned from these nodes so that any attempt to connect to them will hang until it times out. This can dramatically slow down the application of new cluster states which hinders the recovery of the cluster during certain kinds of partition. If nodes are disconnected from the master then it is likely that they are to be removed as part of a subsequent cluster state update, so there's no need to try and reconnect to them like this. Moreover there is no need to attempt to reconnect to disconnected nodes as part of the cluster state application process, because we periodically try and reconnect to any disconnected nodes, and handle their disconnectedness reasonably gracefully in the meantime. This commit alters this behaviour to avoid reconnecting to known nodes during cluster state application. Resolves #29025.

DaveCTurner added help wanted adoptme :Distributed/Recovery Anything around constructing a new shard, either from a local or a remote source. v7.0.0 v6.3.0 labels Mar 13, 2018

DaveCTurner mentioned this issue Mar 13, 2018

Slow recovery of write availability after partition of a large cluster #28920

Closed

colings86 added the >enhancement label Apr 24, 2018

bleskes added v6.3.1 v6.4.0 and removed v6.3.0 v6.3.1 labels Apr 26, 2018

DaveCTurner mentioned this issue Jun 24, 2018

Only connect to new nodes on new cluster state #31547

Closed

lcawl added v6.4.1 and removed v6.4.0 labels Aug 23, 2018

DaveCTurner mentioned this issue Dec 19, 2018

[Feature Request] Configuration to customize discovery/zen/fd/master_ping #36822

Closed

DaveCTurner added :Distributed/Cluster Coordination Cluster formation and cluster state publication, including cluster membership and fault detection. and removed :Distributed/Recovery Anything around constructing a new shard, either from a local or a remote source. labels Dec 19, 2018

andreykaipov mentioned this issue Feb 18, 2019

Slow re-election when elected master pod is deleted elastic/helm-charts#63

Closed

DaveCTurner mentioned this issue Mar 4, 2019

Only connect to new nodes on new cluster state #39629

Merged

DaveCTurner self-assigned this Mar 12, 2019

DaveCTurner removed the help wanted adoptme label Mar 12, 2019

DaveCTurner closed this as completed in #39629 Mar 12, 2019

michaelbaamonde added v7.0.0-rc1 and removed v7.0.0 v7.0.0-rc1 labels Mar 25, 2019

DaveCTurner mentioned this issue Jun 11, 2019

Long time for elect new master after existing leader unavailable #42983

Closed

jmlrt mentioned this issue May 12, 2021

[elasticsearch] remove masterTerminationFix elastic/helm-charts#1183

Merged

mark-vieira mentioned this issue May 27, 2021

[6.8] [elasticsearch] remove masterTerminationFix (#1183) elastic/helm-charts#1213

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Do not attempt to re-establish connections when applying a cluster state update #29025

Do not attempt to re-establish connections when applying a cluster state update #29025

DaveCTurner commented Mar 13, 2018

elasticmachine commented Mar 13, 2018

Do not attempt to re-establish connections when applying a cluster state update #29025

Do not attempt to re-establish connections when applying a cluster state update #29025

Comments

DaveCTurner commented Mar 13, 2018

elasticmachine commented Mar 13, 2018