`talosctl health` extra flags #7967

mircea-pavel-anton · 2023-11-20T10:20:30Z

Feature Request

Ability to specify whether or not to wait for nodes to be ready.

Description

When deloying Talos, I saw that a lot of people are disabling the CNI and opting to manually install one later on, mainly when doing gitops.

Currently, talosctl health checks on the health of the cluster end-to-end, i.e. both Talos and Kubernetes. I think there should be a flag, something like talosctl health --kubernetes=false which would validate the health up to and including the kubelet, so without checking if the nodes are in a Ready state, since without a CNI they will never reach that state.

This makes it a bit harder to automate installs like bootstrap -> wait -> apply CNI for example

The text was updated successfully, but these errors were encountered:

mircea-pavel-anton · 2023-11-20T11:18:48Z

For some context, I am currently using a bash script to wait until the kubelet becomes healthy on my nodes:

while true; do
    output=$(talosctl dmesg -n $NODE_IP 2>&1)

    if echo "$output" | grep -Fq "service[kubelet](Running): Health check successful"; then
        echo ""
        echo "Kubelet is Healthy on node $NODE_IP!"
        break
    else
        printf "."
        sleep 1
    fi
done

But I feel like there should be a more elegant way to handle this, since it's not an uncommon scenario to disable the CNI

mrclrchtr · 2024-03-18T12:12:58Z

That would be amazing. I have exactly the same problem with CNI. Especially in terraform, talos_cluster_health runs infinitely when no CNI is installed. This also makes a reapply on abort impossible because it wants to read first before you can apply.

…967 (comment)

github-actions · 2024-09-15T02:05:11Z

This issue is stale because it has been open 180 days with no activity. Remove stale label or comment or this will be closed in 7 days.

github-actions · 2024-09-21T01:59:45Z

This issue was closed because it has been stalled for 7 days with no activity.

mrclrchtr added a commit to hcloud-talos/terraform-hcloud-talos that referenced this issue Mar 18, 2024

fix: dont use the talos_cluster_health because of: siderolabs/talos#7…

c9165f3

…967 (comment)

mrclrchtr added a commit to hcloud-talos/terraform-hcloud-talos that referenced this issue Mar 18, 2024

fix: dont use the talos_cluster_health because of: siderolabs/talos#7…

239f3f2

…967 (comment)

github-actions bot added the Stale label Sep 15, 2024

github-actions bot closed this as not planned Won't fix, can't repro, duplicate, stale Sep 21, 2024

github-actions bot locked as resolved and limited conversation to collaborators Nov 20, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

`talosctl health` extra flags #7967

`talosctl health` extra flags #7967

mircea-pavel-anton commented Nov 20, 2023

mircea-pavel-anton commented Nov 20, 2023

mrclrchtr commented Mar 18, 2024

github-actions bot commented Sep 15, 2024

github-actions bot commented Sep 21, 2024

talosctl health extra flags #7967

talosctl health extra flags #7967

Comments

mircea-pavel-anton commented Nov 20, 2023

Feature Request

Description

mircea-pavel-anton commented Nov 20, 2023

mrclrchtr commented Mar 18, 2024

github-actions bot commented Sep 15, 2024

github-actions bot commented Sep 21, 2024

`talosctl health` extra flags #7967

`talosctl health` extra flags #7967