
Improve master nodes deletion #668

Closed
barkbay opened this issue Apr 23, 2019 · 2 comments
Labels
discuss We need to figure this out

Comments

@barkbay
Contributor

barkbay commented Apr 23, 2019

When some master nodes need to be deleted, the following algorithm is applied (a sketch of the quorum arithmetic follows the list):

  1. calculateChanges calculates how many master nodes should be removed
  2. CalculatePerformableChanges returns how many master nodes can be removed

Then, for each master node to be removed:

  3. Compute and apply the new quorum size without the master node to be removed
  4. Schedule the master pod deletion
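For reference, a minimal sketch of the Zen1 quorum arithmetic this algorithm relies on, and of the order in which it is applied (the helper name is illustrative, not the operator's actual API):

```go
package main

import "fmt"

// zen1Quorum returns the Zen1 minimum_master_nodes value for a given number
// of master-eligible nodes: a strict majority, i.e. n/2 + 1.
func zen1Quorum(masterCount int) int {
	return masterCount/2 + 1
}

func main() {
	// Removing 2 masters from a 4-master cluster: the target quorum is
	// computed for the remaining masters before any pod is deleted.
	current, toRemove := 4, 2
	fmt.Println("current quorum:", zen1Quorum(current))         // 3
	fmt.Println("target quorum:", zen1Quorum(current-toRemove)) // 2
}
```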

In some rare cases it can lead to a split brain situation, for instance:

  1. Initial situation: 4 masters in a cluster, 2 masters need to be removed
  2. minimum_master_nodes is decreased from 3 to 2
  3. The 2 master pods are scheduled to be deleted at the K8S level

Since minimum_master_nodes is set to 2 while there are still 4 masters running, there is a small chance of a split brain situation between steps 2 and 3.
This situation mostly applies to Zen1; with Zen2, masters are excluded before being deleted.
The algorithm depicted above is the only way to move from two masters to one node; it is a special case which is inherently unsafe.
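To make that window concrete: once minimum_master_nodes is lowered to 2 while 4 masters are still running, two partitions of 2 nodes each can both satisfy the quorum and each elect a master. A minimal sketch of that check (illustrative only, not operator code):

```go
package main

import "fmt"

// splitBrainPossible reports whether two disjoint partitions of the running
// masters could both reach the configured minimum_master_nodes.
func splitBrainPossible(runningMasters, minimumMasterNodes int) bool {
	// The smaller half of an even split has runningMasters/2 nodes; if that
	// already satisfies the quorum setting, both sides can elect a master.
	return runningMasters/2 >= minimumMasterNodes
}

func main() {
	fmt.Println(splitBrainPossible(4, 3)) // false: quorum is still a strict majority
	fmt.Println(splitBrainPossible(4, 2)) // true: the window between steps 2 and 3
}
```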

Some improvements can be done here:

  1. We should never delete more than half of the masters (at least with Zen1, and with the exception of the special two-to-one master case)
  2. If there are dedicated masters, maybe we should treat them more carefully than the other nodes and not blindly apply the maxUnavailable setting
  3. We should never go down to 1 master (single point of failure)

The two latter points are also true for Zen2 and might have a higher priority since we are moving to ES 7.
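As a hedged illustration of points 1 and 3 (the function below is hypothetical, not ECK code), a guard could cap the number of master deletions performed in one step:

```go
package main

import "fmt"

// allowedMasterDeletions caps how many master nodes may be deleted at once,
// encoding points 1 and 3 above.
func allowedMasterDeletions(currentMasters, requested int) int {
	// Special case: 2 -> 1 master is inherently unsafe but is the only way
	// to reach a single-node cluster, so it stays allowed.
	if currentMasters == 2 && requested == 1 {
		return 1
	}
	maxByHalf := currentMasters / 2 // never delete more than half of the masters
	maxBySpof := currentMasters - 2 // never go down to a single master
	allowed := requested
	if allowed > maxByHalf {
		allowed = maxByHalf
	}
	if allowed > maxBySpof {
		allowed = maxBySpof
	}
	if allowed < 0 {
		allowed = 0
	}
	return allowed
}

func main() {
	fmt.Println(allowedMasterDeletions(4, 2)) // 2: at most half of the masters
	fmt.Println(allowedMasterDeletions(3, 2)) // 1: never drop below 2 masters
	fmt.Println(allowedMasterDeletions(2, 1)) // 1: explicit two-to-one special case
}
```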

@pebrc pebrc added this to the Beta milestone May 7, 2019
@pebrc pebrc added the discuss We need to figure this out label May 7, 2019
@pebrc pebrc removed this from the Beta milestone May 10, 2019
@sebgl
Contributor

sebgl commented Jul 18, 2019

Related discussion for handling zen1 correctly in the sset refactoring: #1281

@sebgl
Contributor

sebgl commented Sep 12, 2019

We now add and remove one master node at a time (work in progress for rolling upgrades; other issues have been opened) and wait for our cache of resources to match expectations, in order to properly handle Zen1 and Zen2 settings.
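A minimal sketch of that "one change at a time, gated on cached expectations" approach (types and names are illustrative, not ECK's actual implementation):

```go
package main

import "fmt"

// expectations is a stand-in for the operator's cached-resource expectations:
// record what was last changed and refuse further changes until the local
// cache has observed it.
type expectations struct {
	expectedGeneration int64
}

func (e *expectations) satisfied(cachedGeneration int64) bool {
	return cachedGeneration >= e.expectedGeneration
}

// reconcileMasters removes at most one master per reconciliation, and only
// when the cache has caught up with the previous change.
func reconcileMasters(e *expectations, cachedGeneration int64, pendingRemovals int) string {
	if !e.satisfied(cachedGeneration) {
		return "requeue: cache not up to date with the last change"
	}
	if pendingRemovals == 0 {
		return "nothing to do"
	}
	// Remove a single master, then record the new expectation so the next
	// reconciliation waits until the cache has observed this change.
	e.expectedGeneration++
	return "removed one master node"
}

func main() {
	e := &expectations{expectedGeneration: 1}
	fmt.Println(reconcileMasters(e, 1, 2)) // removed one master node
	fmt.Println(reconcileMasters(e, 1, 1)) // requeue: cache not up to date with the last change
	fmt.Println(reconcileMasters(e, 2, 1)) // removed one master node
}
```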
Closing this issue in favor of keeping open #1710, #1628, #1693.

@sebgl sebgl closed this as completed Sep 12, 2019