KubePrism generates a lot of DNS queries #7690

ruifung · 2023-08-31T08:28:50Z

Discussed in #7689

^{Originally posted by ruifung August 31, 2023}
Is it just me, or does it seem like talos 1.5.1 with kubeprism enabled seem to constantly do DNS queries for the controlplane endpoint?

I've noticed that since I've updated to 1.5.1, the top domain queried is my controlplane endpoint DNS at over 40k queries over the last hour from 6 nodes (3 control, 3 worker).

I just noticed this when looking at the stats on my local DNS server (Technitium DNS)

Addendum:

I just tested, it does seem like KubePrism is indeed what's causing what seems like (compared to everything else on my network) an excessive amount of queries for the controlplane DNS (i.e. controlplane.cluster.home.arpa) to the point that 6 nodes (3 control, 3 worker) generated in excess of 40k queries for that per hour. Disabling KubePrism seems to resolve it.

On average, it appears to be generating 2 queries per second per node.

Is something not respecting the TTL set on the DNS records?
I'll leave KubePrism disabled for now because it's been filling the query logs and query stats.

smira · 2023-08-31T10:31:02Z

Talos does health checks with KubePrism enabled on all controlplane endpoints. Talos doesn't use the local DNS cache, but I see the problem - the checks are run too aggressively (too fast), and that needs to be fixed

smira · 2023-08-31T20:19:05Z

The PR #7692 doesn't fully solve the issue, as it will make less DNS requests, but will not do proper caching still.

I created #7693 to track DNS cache.

The default timeouts are very aggressive, and we should use explicit timeouts so that healh checks don't run that often. Fixes siderolabs#7690 Signed-off-by: Andrey Smirnov <andrey.smirnov@siderolabs.com> (cherry picked from commit 79bbdf4)

ruifung changed the title ~~Talos 1.5.1 controlplane endpoint DNS query flood~~ KubePrism generates a lot of DNS queries Aug 31, 2023

smira self-assigned this Aug 31, 2023

smira mentioned this issue Aug 31, 2023

fix: set proper timeouts for KubePrism loadbalancer #7692

Merged

talos-bot closed this as completed in 79bbdf4 Sep 1, 2023

frezbo mentioned this issue Sep 6, 2023

Talos nodes bombarding DNS server with <control plane> hostname query with KubePrism enabled #7721

Closed

ruifung mentioned this issue Nov 28, 2023

ARP flooding on unrelated interface #7997

Closed

github-actions bot locked as resolved and limited conversation to collaborators Jun 11, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

KubePrism generates a lot of DNS queries #7690

KubePrism generates a lot of DNS queries #7690

ruifung commented Aug 31, 2023 •

edited

Loading

smira commented Aug 31, 2023

smira commented Aug 31, 2023

KubePrism generates a lot of DNS queries #7690

KubePrism generates a lot of DNS queries #7690

Comments

ruifung commented Aug 31, 2023 • edited Loading

Discussed in #7689

smira commented Aug 31, 2023

smira commented Aug 31, 2023

ruifung commented Aug 31, 2023 •

edited

Loading