Add support for running a nodelocal dns cache #3861

nysthee · 2018-12-07T23:15:58Z

After encountering dns issues in a cluster I was recently working on I
noticed Kubernetes 1.13 introduced support for running a nodelocal dns
cache.

I believe this can usefull for more people.

kubernetes/kubernetes@73b548d
https://github.com/kubernetes/enhancements/blob/master/keps/sig-network/0030-nodelocal-dns-cache.md

Feedback welcome!

nysthee · 2018-12-07T23:20:50Z

I would like a suggestion as well on where to put the documentation for this.

roles/kubernetes-apps/ansible/tasks/main.yml

After encountering dns issues in a cluster I was recently working on I noticed Kubernetes 1.13 introduced support for running a nodelocal dns cache. I believe this can usefull for more people. kubernetes/kubernetes@73b548d https://github.com/kubernetes/enhancements/blob/master/keps/sig-network/0030-nodelocal-dns-cache.md

woopstar

Can you describe what DNS issues you were encountering?
Can you please provide a doc in /docs folder and eventually update the DNS stack documentation

I guess to use the nodelocaldns cache, you'll have to use the defined local dns ip as resolver? 169.254.25.10 that is, currently ?

If you had issues with the conntract table being filled up with DNS entries, the you can avoid that by setting the following sysctl:

- name: 'net.netfilter.nf_conntrack_udp_timeout_stream'
  value: '10'
- name: 'net.netfilter.nf_conntrack_udp_timeout'
  value: '10'

inventory/sample/group_vars/k8s-cluster/k8s-cluster.yml

roles/kubernetes-apps/ansible/defaults/main.yml

roles/kubernetes-apps/ansible/tasks/cleanup_dns.yml

roles/kubernetes-apps/ansible/tasks/main.yml

roles/kubernetes-apps/ansible/tasks/nodelocaldns.yml

roles/kubernetes-apps/ansible/templates/nodelocaldns-deamonset.yml.j2

woopstar · 2018-12-10T20:12:06Z

Just ping me when you want me to review again :)

nysthee · 2018-12-10T20:12:31Z

Will do :)

roles/kubernetes-apps/ansible/templates/nodelocaldns-config.yml.j2

roles/kubernetes-apps/ansible/templates/nodelocaldns-deamonset.yml.j2

nysthee · 2018-12-10T20:22:31Z

The issues I was encountering were unexplainable DNS timeouts. Like every few requests.
I never noticed this before until I started running Kafka in my cluster and services was complaining service hostnames of depending services were not resolving.
Atm, the issue is mitigated with installing a nightly of Flannel but according to the release notes of 1.13, the implementation of the referenced KEP should solve this ass well.

References:

nysthee · 2018-12-10T20:37:43Z

@woopstar if you could review again please

roles/download/defaults/main.yml

roles/kubernetes-apps/ansible/tasks/nodelocaldns.yml

roles/kubernetes-apps/ansible/templates/nodelocaldns-config.yml.j2

roles/kubernetes-apps/ansible/templates/nodelocaldns-deamonset.yml.j2

woopstar · 2018-12-10T20:44:31Z

@woopstar if you could review again please

done

woopstar · 2018-12-10T20:44:40Z

ci check this

nysthee · 2018-12-10T20:52:01Z

Latest changes pushed as well.

nysthee · 2018-12-10T20:54:33Z

Maybe I should squash the commits?

woopstar · 2018-12-10T20:55:16Z

auto squash is enabled

nysthee · 2018-12-11T00:10:32Z

ci check this

ant31 · 2018-12-11T01:25:19Z

/lgtm
/approve

k8s-ci-robot · 2018-12-11T01:25:23Z

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: ant31, nysthee

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

~~OWNERS~~ [ant31]

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

woopstar · 2018-12-11T07:49:44Z

I still don't get how this works as a cache unless you use the nodelocaldns_ip as resolver?

woopstar · 2018-12-11T09:28:58Z

@nysthee

Seems I'm right. Looking here they set the nodelocal ip as the first cluster ip that gets populated into the pod.

You need to apply a PR fix asap where you apply the nodelocaldns_ip as the first ip in the list here

What this basically does is to simply just start a DNS pod on each node instead. Then you forwards requests from pods on a node to the local DNS pods running on the same node, which will prevent a DNAT. If that pod does not work, the clusterIP for the DNS plugin (kube-dns, coredns etc) is used. Here they just use CoreDNS as cache too. You can use Unbound, dnsmasq etc. too.

This should be enabled by default btw.

woopstar · 2018-12-11T09:32:55Z

@nysthee

Seems I'm right. Looking here they set the nodelocal ip as the first cluster ip that gets populated into the pod.

You need to apply a PR fix asap where you apply the nodelocaldns_ip as the first ip in the list here

What this basically does is to simply just start a DNS pod on each node instead. Then you forwards requests from pods on a node to the local DNS pods running on the same node, which will prevent a DNAT. If that pod does not work, the clusterIP for the DNS plugin (kube-dns, coredns etc) is used. Here they just use CoreDNS as cache too. You can use Unbound, dnsmasq etc. too.

This should be enabled by default btw.

Sorry. What you actually need to do is to overwrite the --cluster-dns to ONLY contain the nodelocaldns_ip if it's enabled (enable_nodelocaldns == true). As the local DNS cache pod will forward queries to kube-dns / CoreDNS.

nysthee · 2018-12-11T09:40:32Z

@woopstar
Is submitted a pr: #3879

* Add support for running a nodelocal dns cache After encountering dns issues in a cluster I was recently working on I noticed Kubernetes 1.13 introduced support for running a nodelocal dns cache. I believe this can usefull for more people. kubernetes/kubernetes@73b548d https://github.com/kubernetes/enhancements/blob/master/keps/sig-network/0030-nodelocal-dns-cache.md * Add requested changes * Add additional requested changes + documentation * Add requested changes after review * Replace incorrect variable

nvtkaszpir · 2022-02-07T10:48:00Z

Sorry for digging up the grave.
This change sets node-local-dns priorityClassName: system-cluster-critical - why is that?
Thouldn't it be priorityClassName: system-node-critical?

k8s-ci-robot requested review from chadswen and mirwan December 7, 2018 23:16

k8s-ci-robot added size/L Denotes a PR that changes 100-499 lines, ignoring generated files. cncf-cla: yes Indicates the PR's author has signed the CNCF CLA. labels Dec 7, 2018

nysthee force-pushed the feature/add-node-local-dns-cache branch from aa19156 to 4a31e48 Compare December 7, 2018 23:19

thojkooi reviewed Dec 7, 2018

View reviewed changes

roles/kubernetes-apps/ansible/tasks/main.yml Outdated Show resolved Hide resolved

nysthee force-pushed the feature/add-node-local-dns-cache branch 2 times, most recently from c803ca5 to 0a46af3 Compare December 7, 2018 23:41

nysthee force-pushed the feature/add-node-local-dns-cache branch from 0a46af3 to 8172b6b Compare December 7, 2018 23:41

woopstar suggested changes Dec 10, 2018

View reviewed changes

woopstar self-assigned this Dec 10, 2018

Add requested changes

30cb824