Fix scaling #5889

champtar · 2020-04-03T01:24:27Z

What type of PR is this?
/kind bug

/kind cleanup

What this PR does / why we need it:
Allow to scale from 1 nodes to 4 nodes (2 master 3 etcd all workers) by just running cluster.yml

Which issue(s) this PR fixes:
NONE

Special notes for your reviewer:
See commit messages

Does this PR introduce a user-facing change?:

Fix scaling etcd and master

k8s-ci-robot · 2020-04-03T01:24:36Z

Hi @champtar. Thanks for your PR.

I'm waiting for a kubernetes-sigs member to verify that this patch is reasonable to test. If it is, they should reply with /ok-to-test on its own line. Until that is done, I will not automatically test new commits in this PR, but the usual testing commands by org members will still work. Regular contributors should join the org to skip this step.

Once the patch is verified, the new status will be reflected by the ok-to-test label.

I understand the commands that are listed here.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

champtar · 2020-04-03T20:56:51Z

Update: I've also fixed master scaling

Signed-off-by: Etienne Champetier <champetier.etienne@gmail.com>

We want to wait for the full cluster to be healthy, so use all the cluster addresses Also we should be able to run the playbook when etcd[0] is down (not tested), so do not delegate to etcd[0] Signed-off-by: Etienne Champetier <champetier.etienne@gmail.com>

unhealthy cluster is expected on first run, so use failed_when instead of ignore_errors to remove scary red messages Also use run_once Signed-off-by: Etienne Champetier <champetier.etienne@gmail.com>

…ng /etc/hosts Signed-off-by: Etienne Champetier <champetier.etienne@gmail.com>

Signed-off-by: Etienne Champetier <champetier.etienne@gmail.com>

champtar · 2020-04-07T15:57:06Z

Can someone add tide/merge-method-rebase label ?

champtar · 2020-04-07T15:57:28Z

/assign @Miouge1

Miouge1 · 2020-04-08T08:26:23Z

I tested this locally works nicely. Thank you @champtar

/lgtm
/approve

k8s-ci-robot · 2020-04-08T08:26:35Z

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: champtar, Miouge1

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

~~OWNERS~~ [Miouge1]

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

* etcd: etcd-events doesn't depend on etcd_cluster_setup Signed-off-by: Etienne Champetier <champetier.etienne@gmail.com> * etcd: remove condition already present on include_tasks Signed-off-by: Etienne Champetier <champetier.etienne@gmail.com> * etcd: fix scaling up Signed-off-by: Etienne Champetier <champetier.etienne@gmail.com> * etcd: use *access_addresses, do not delegate to etcd[0] We want to wait for the full cluster to be healthy, so use all the cluster addresses Also we should be able to run the playbook when etcd[0] is down (not tested), so do not delegate to etcd[0] Signed-off-by: Etienne Champetier <champetier.etienne@gmail.com> * etcd: use failed_when for health check unhealthy cluster is expected on first run, so use failed_when instead of ignore_errors to remove scary red messages Also use run_once Signed-off-by: Etienne Champetier <champetier.etienne@gmail.com> * kubernetes/preinstall: ensure ansible_fqdn is up to date after changing /etc/hosts Signed-off-by: Etienne Champetier <champetier.etienne@gmail.com> * kubernetes/master: regenerate apiserver cert if needed Signed-off-by: Etienne Champetier <champetier.etienne@gmail.com> (cherry picked from commit a35b6dc)

* 'master' of https://github.com/kubernetes-sigs/kubespray: (21 commits) Remove hard-coded dependance to docker.service in kubelet.service file (kubernetes-sigs#5917) Update Calico to v3.13.2, Multus to v3.4.1. Add ConfigMap get permission to allow calico-node access to kubeadm config. (kubernetes-sigs#5912) Fix idempotence issue in bootstrap-os (kubernetes-sigs#5916) Terraform/OpenStack: Fix idempotency bug in module.network.openstack_networking_router_interface_v2.k8s[0] (kubernetes-sigs#5914) Add kubernetes 1.18.1 hashes (kubernetes-sigs#5915) Proxy fixes (kubernetes-sigs#5869) Remove 1.16.x flag for tf-ovh_coreos-calico (now 1.17 ready) (kubernetes-sigs#5853) Update docker RHEL/CentOS versions to the latest patch versions available. (kubernetes-sigs#5872) Fix conntrack for opensuse and docker support (kubernetes-sigs#5880) Add crictl 1.18.0 hashes for k8s 1.18 (kubernetes-sigs#5877) fix readonly flexvolume in fcos and coreos (kubernetes-sigs#5885) Fix scaling (kubernetes-sigs#5889) Fix chicken and egg problem with proxy_env not defined on the first … (kubernetes-sigs#5896) make explicit that doc is at kubespray.io (kubernetes-sigs#5878) add local-path-provosioner helper image def (kubernetes-sigs#5817) remove unused kubelet options (kubernetes-sigs#5903) Change docker.io repo to variable and upgrade alb image (kubernetes-sigs#5898) Replace latest tags for csi drivers (kubernetes-sigs#5899) CentOS 8 CI (kubernetes-sigs#5842) Bump requirements.txt versions / remove ansible_python_interpreter hack (kubernetes-sigs#5847) ...

* etcd: etcd-events doesn't depend on etcd_cluster_setup Signed-off-by: Etienne Champetier <champetier.etienne@gmail.com> * etcd: remove condition already present on include_tasks Signed-off-by: Etienne Champetier <champetier.etienne@gmail.com> * etcd: fix scaling up Signed-off-by: Etienne Champetier <champetier.etienne@gmail.com> * etcd: use *access_addresses, do not delegate to etcd[0] We want to wait for the full cluster to be healthy, so use all the cluster addresses Also we should be able to run the playbook when etcd[0] is down (not tested), so do not delegate to etcd[0] Signed-off-by: Etienne Champetier <champetier.etienne@gmail.com> * etcd: use failed_when for health check unhealthy cluster is expected on first run, so use failed_when instead of ignore_errors to remove scary red messages Also use run_once Signed-off-by: Etienne Champetier <champetier.etienne@gmail.com> * kubernetes/preinstall: ensure ansible_fqdn is up to date after changing /etc/hosts Signed-off-by: Etienne Champetier <champetier.etienne@gmail.com> * kubernetes/master: regenerate apiserver cert if needed Signed-off-by: Etienne Champetier <champetier.etienne@gmail.com>

* etcd: etcd-events doesn't depend on etcd_cluster_setup Signed-off-by: Etienne Champetier <champetier.etienne@gmail.com> * etcd: remove condition already present on include_tasks Signed-off-by: Etienne Champetier <champetier.etienne@gmail.com> * etcd: fix scaling up Signed-off-by: Etienne Champetier <champetier.etienne@gmail.com> * etcd: use *access_addresses, do not delegate to etcd[0] We want to wait for the full cluster to be healthy, so use all the cluster addresses Also we should be able to run the playbook when etcd[0] is down (not tested), so do not delegate to etcd[0] Signed-off-by: Etienne Champetier <champetier.etienne@gmail.com> * etcd: use failed_when for health check unhealthy cluster is expected on first run, so use failed_when instead of ignore_errors to remove scary red messages Also use run_once Signed-off-by: Etienne Champetier <champetier.etienne@gmail.com> * kubernetes/preinstall: ensure ansible_fqdn is up to date after changing /etc/hosts Signed-off-by: Etienne Champetier <champetier.etienne@gmail.com> * kubernetes/master: regenerate apiserver cert if needed Signed-off-by: Etienne Champetier <champetier.etienne@gmail.com> (cherry picked from commit a35b6dc)

k8s-ci-robot added kind/bug Categorizes issue or PR as related to a bug. cncf-cla: yes Indicates the PR's author has signed the CNCF CLA. needs-ok-to-test Indicates a PR that requires an org member to verify it is safe to test. labels Apr 3, 2020

k8s-ci-robot added the size/M Denotes a PR that changes 30-99 lines, ignoring generated files. label Apr 3, 2020

k8s-ci-robot requested review from bozzo and holmsten April 3, 2020 01:24

Miouge1 added this to the 2.13 milestone Apr 3, 2020

champtar changed the title ~~Fix etcd scaling~~ Fix scaling Apr 3, 2020

champtar force-pushed the scaleetcd branch 2 times, most recently from 0609c68 to 10a8b51 Compare April 3, 2020 22:42

champtar added 7 commits April 7, 2020 10:31

etcd: etcd-events doesn't depend on etcd_cluster_setup

1d38b92

Signed-off-by: Etienne Champetier <champetier.etienne@gmail.com>

etcd: remove condition already present on include_tasks

2e14881

Signed-off-by: Etienne Champetier <champetier.etienne@gmail.com>

etcd: fix scaling up

176e1a9

Signed-off-by: Etienne Champetier <champetier.etienne@gmail.com>

etcd: use failed_when for health check

2b65e29

unhealthy cluster is expected on first run, so use failed_when instead of ignore_errors to remove scary red messages Also use run_once Signed-off-by: Etienne Champetier <champetier.etienne@gmail.com>

kubernetes/preinstall: ensure ansible_fqdn is up to date after changi…

055acbb

…ng /etc/hosts Signed-off-by: Etienne Champetier <champetier.etienne@gmail.com>

kubernetes/master: regenerate apiserver cert if needed

a2cd302

Signed-off-by: Etienne Champetier <champetier.etienne@gmail.com>

champtar force-pushed the scaleetcd branch from 10a8b51 to a2cd302 Compare April 7, 2020 14:32

k8s-ci-robot assigned Miouge1 Apr 7, 2020

k8s-ci-robot added the lgtm "Looks good to me", indicates that a PR is ready to be merged. label Apr 8, 2020

k8s-ci-robot added the approved Indicates a PR has been approved by an approver from all required OWNERS files. label Apr 8, 2020

k8s-ci-robot merged commit a35b6dc into kubernetes-sigs:master Apr 8, 2020

champtar deleted the scaleetcd branch April 8, 2020 17:34

Miouge1 mentioned this pull request Apr 11, 2020

[2.12] Fix scaling etcd and master (#5889) #5911

Merged

Miouge1 mentioned this pull request Aug 1, 2020

Added option to force apiserver and respective client certificate to … #6403

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix scaling #5889

Fix scaling #5889

champtar commented Apr 3, 2020 •

edited

Loading

k8s-ci-robot commented Apr 3, 2020

champtar commented Apr 3, 2020

champtar commented Apr 7, 2020

champtar commented Apr 7, 2020

Miouge1 commented Apr 8, 2020

k8s-ci-robot commented Apr 8, 2020

Fix scaling #5889

Fix scaling #5889

Conversation

champtar commented Apr 3, 2020 • edited Loading

k8s-ci-robot commented Apr 3, 2020

champtar commented Apr 3, 2020

champtar commented Apr 7, 2020

champtar commented Apr 7, 2020

Miouge1 commented Apr 8, 2020

k8s-ci-robot commented Apr 8, 2020

champtar commented Apr 3, 2020 •

edited

Loading