
Deal with Total Cluster Failure in diskless log scenario #75

Open
andrewjstone opened this issue Jan 13, 2017 · 0 comments

Comments

@andrewjstone
Contributor

andrewjstone commented Jan 13, 2017

The Viewstamped Replication Revisited protocol used by Haret differs from both Raft and Paxos in that it does not require syncing to disk in order to operate and tolerate a minority of replica failures. The trade-off is that if a majority of replicas fail at the same time, the system becomes unrecoverable.

By utilizing snapshots, Haret can minimize the amount of data lost in this scenario and allow failed replicas to be restarted. There also needs to be some way to rejoin them to the cluster manually via an admin disaster recovery protocol. etcd has good docs on this for their system.
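A rough sketch of what that admin recovery flow could look like is below. All of the type and function names here are hypothetical, not Haret APIs: the idea is just to pick the newest snapshot among whichever survivors the admin can reach, accept the loss of anything after it, reseed a single-member cluster from that snapshot, and then rejoin the restarted replicas through the normal reconfiguration path.

```rust
// Hypothetical admin disaster-recovery sketch for total cluster failure in the
// diskless-log configuration. None of these types exist in Haret today.

use std::collections::BTreeMap;

/// Snapshot of the replicated KV state plus the view/op it covers (hypothetical).
#[derive(Clone, Debug)]
struct Snapshot {
    view: u64,
    last_op: u64,
    data: BTreeMap<String, String>,
}

/// Replica identifier (hypothetical).
#[derive(Clone, Debug, PartialEq, Eq)]
struct ReplicaId(String);

/// Pick the most recent snapshot among reachable survivors. Everything after the
/// chosen snapshot's `last_op` is lost; the admin must accept that explicitly.
fn newest_snapshot(snapshots: &[(ReplicaId, Snapshot)]) -> Option<&(ReplicaId, Snapshot)> {
    snapshots.iter().max_by_key(|(_, s)| (s.view, s.last_op))
}

/// Reseed the cluster from the chosen snapshot: the survivor becomes a
/// single-member group, then restarted replicas are joined back one at a time
/// (each join would state-transfer the snapshot).
fn force_recover(survivor: ReplicaId, snapshot: Snapshot, rejoining: Vec<ReplicaId>) -> Vec<ReplicaId> {
    let mut members = vec![survivor];
    println!(
        "reseeding cluster at view {} / op {} with {} keys",
        snapshot.view, snapshot.last_op, snapshot.data.len()
    );
    for r in rejoining {
        println!("rejoining {:?}", r);
        members.push(r);
    }
    members
}

fn main() {
    let snaps = vec![
        (ReplicaId("r1".into()), Snapshot { view: 3, last_op: 120, data: BTreeMap::new() }),
        (ReplicaId("r2".into()), Snapshot { view: 3, last_op: 150, data: BTreeMap::new() }),
    ];
    if let Some((id, snap)) = newest_snapshot(&snaps) {
        let members = force_recover(
            id.clone(),
            snap.clone(),
            vec![ReplicaId("r3".into()), ReplicaId("r4".into())],
        );
        println!("recovered membership: {:?}", members);
    }
}
```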

@andrewjstone andrewjstone added this to the Diskless-log-KV-1.0 milestone Jun 22, 2017
@andrewjstone andrewjstone changed the title Deal with Total Cluster Failure Deal with Total Cluster Failure in diskless log scenario Jun 22, 2017