Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

etcd operator #222

Closed
35 of 37 tasks
hexfusion opened this issue Feb 20, 2020 · 6 comments
Closed
35 of 37 tasks

etcd operator #222

hexfusion opened this issue Feb 20, 2020 · 6 comments
Labels
lifecycle/rotten Denotes an issue or PR that has aged beyond stale and will be auto-closed. stage/stable
Milestone

Comments

@hexfusion
Copy link
Contributor

hexfusion commented Feb 20, 2020

https://github.com/openshift/enhancements/blob/master/enhancements/etcd/cluster-etcd-operator.md

Perf

Metal

  • ipv4 works on 3/4 (asked in slack channel)
  • ipv6 works on 3/4 (asked in slack channel)

IPv6

bootstrapping

shutdown clusters don't start back up

blocker bugs

Tests we think should work

Tests we think should fail

  • If one master is lost, instructions for how to.
    • create a new master that joins the cluster
    • removal of the old master from the cluster
  • Restoring etcd when no members start up correctly
    • Changes to the etcd-quorum recovery steps
    • restore etcd from backup @retroflexer
  • Make etcd consistent when one member needs to be removed. Ideally this becomes "remove etcd member, rollout". with Bug 1806751: Archive data-dir if target member is unstarted etcd#29 it may become that because we can auto-remove a bad data-dir. We may need to find a way to mark a node as "don't include in etcd"
  • restore etcd when 2 masters are replaced. @hexfusion
  • IP address change of all members
@deads2k deads2k changed the title cluster-etcd-operator: 4.4 outstanding tasks etcd operator Feb 20, 2020
@deads2k deads2k added this to the v4.4 milestone Feb 20, 2020
@retroflexer
Copy link
Contributor

I am working on:

  1. Restoring a cluster to previous state
  2. etcd-quorum recovery steps
  3. Removal of a member from etcd cluster
  4. Recovery of a member with a bad data-dir

@retroflexer
Copy link
Contributor

@deads2k I am not suresh gaikwad. My handle here is @retroflexer.

@openshift-bot
Copy link

Issues go stale after 90d of inactivity.

Mark the issue as fresh by commenting /remove-lifecycle stale.
Stale issues rot after an additional 30d of inactivity and eventually close.
Exclude this issue from closing by commenting /lifecycle frozen.

If this issue is safe to close now please do so with /close.

/lifecycle stale

@openshift-ci-robot openshift-ci-robot added the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Oct 4, 2020
@openshift-bot
Copy link

Stale issues rot after 30d of inactivity.

Mark the issue as fresh by commenting /remove-lifecycle rotten.
Rotten issues close after an additional 30d of inactivity.
Exclude this issue from closing by commenting /lifecycle frozen.

If this issue is safe to close now please do so with /close.

/lifecycle rotten
/remove-lifecycle stale

@openshift-ci-robot openshift-ci-robot added lifecycle/rotten Denotes an issue or PR that has aged beyond stale and will be auto-closed. and removed lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. labels Nov 3, 2020
@openshift-bot
Copy link

Rotten issues close after 30d of inactivity.

Reopen the issue by commenting /reopen.
Mark the issue as fresh by commenting /remove-lifecycle rotten.
Exclude this issue from closing again by commenting /lifecycle frozen.

/close

@openshift-ci-robot
Copy link

@openshift-bot: Closing this issue.

In response to this:

Rotten issues close after 30d of inactivity.

Reopen the issue by commenting /reopen.
Mark the issue as fresh by commenting /remove-lifecycle rotten.
Exclude this issue from closing again by commenting /lifecycle frozen.

/close

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
lifecycle/rotten Denotes an issue or PR that has aged beyond stale and will be auto-closed. stage/stable
Projects
None yet
Development

No branches or pull requests

5 participants