Ingress controller leaves replacement proxy containers without pushed configuration #2107

Closed · 1 task done
seh opened this issue Dec 16, 2021 · 7 comments · Fixed by Kong/kong#8214 or #2343
Labels: bug (Something isn't working), priority/high


seh commented Dec 16, 2021

Is there an existing issue for this?

  • I have searched the existing issues

Current Behavior

Please see Kong/kong#8152 for where this started, in particular starting with Kong/kong#8152 (comment). To summarize: if the ingress controller starts up successfully, pushes configuration to the proxy container at least once, and the proxy container then exits (perhaps killed by the kubelet after a failed liveness probe), the ingress controller will not push any configuration to the replacement proxy container that starts later until some observed cluster state changes materially. This only happens when the "reverse sync" feature is disabled, as it is by default.

The ingress controller decides against pushing the configuration to the proxy as an optimization: finding that the serialized configuration's message digest hasn't changed since the last successful push, there's seemingly no reason to push what would be the same configuration again. Unfortunately, the proxy container on the other side of that exchange is a different container from the one that last received the intended configuration.

Since the ingress controller's readiness gate is latching, once it has pushed one configuration successfully, it remains ready forevermore. Ideally we'd clear this flag each time we detect that the proxy container either has no configuration or has fallen behind by some number of attempted updates.
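For illustration, a minimal sketch of that skip-on-identical-digest behavior and the latching readiness flag; the names here (`syncer`, `lastAppliedHash`, `maybePush`) are invented for the sketch and are not the controller's actual identifiers:

```go
package sketch

import (
	"crypto/sha256"
	"encoding/hex"
)

// syncer caricatures the update loop described above: hash the serialized
// declarative configuration and skip the push when the digest matches the
// last successful one, even if the proxy on the other end has restarted.
type syncer struct {
	lastAppliedHash string
	ready           bool // latching readiness flag: never cleared once set
}

func (s *syncer) maybePush(config []byte, push func([]byte) error) error {
	sum := sha256.Sum256(config)
	digest := hex.EncodeToString(sum[:])
	if digest == s.lastAppliedHash {
		// The optimization at issue: identical digest, so no push happens,
		// and a freshly restarted proxy container stays unconfigured.
		return nil
	}
	if err := push(config); err != nil {
		return err
	}
	s.lastAppliedHash = digest
	s.ready = true // stays true forever after the first successful push
	return nil
}
```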

Expected Behavior

The ingress controller should have a way to detect that the current proxy container lacks the intended configuration. There are several potential approaches; assuming that we wish to retain the current optimization, all of these straw men are more complicated than what we have today:

  • Create an emptyDir volume in the pod, mount it into both the ingress controller and proxy containers, have the ingress controller create and listen on a domain socket, and have the proxy container connect to it. When the proxy container hangs up, the ingress controller should assume that the next proxy container needs fresh configuration.
  • Create an emptyDir volume, mount it into both the ingress controller and proxy containers, have the proxy container read its container ID from the /proc/self/cgroup file, and write it to a file in that shared volume. The ingress controller can watch that file's content and react to the container ID changing, assuming that a change means that the next proxy container needs fresh configuration.
  • Grant RBAC permission for Kong's ingress controller to watch pods. The ingress controller would obtain its own pod namespace and name via the Downward API, and it would open a watch on its own pod. From the updates, it can read the "status.containerStatuses[1].containerID" field to learn when the proxy container gets replaced (see the sketch after this list).
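A rough sketch of the third option, assuming the controller has RBAC permission to watch its own pod and receives POD_NAMESPACE and POD_NAME through the Downward API; the environment variable names, the `onReplaced` callback, and matching the proxy container by name rather than by index are choices made for this sketch only:

```go
package sketch

import (
	"context"
	"fmt"
	"os"

	corev1 "k8s.io/api/core/v1"
	metav1 "k8s.io/apimachinery/pkg/apis/meta/v1"
	"k8s.io/apimachinery/pkg/fields"
	"k8s.io/client-go/kubernetes"
	"k8s.io/client-go/rest"
)

// watchProxyContainerID watches the controller's own pod and invokes
// onReplaced whenever the proxy container's containerID changes, i.e. the
// kubelet has started a replacement container.
func watchProxyContainerID(ctx context.Context, proxyContainerName string, onReplaced func(newID string)) error {
	cfg, err := rest.InClusterConfig()
	if err != nil {
		return err
	}
	client, err := kubernetes.NewForConfig(cfg)
	if err != nil {
		return err
	}

	namespace := os.Getenv("POD_NAMESPACE") // injected via the Downward API
	podName := os.Getenv("POD_NAME")

	watcher, err := client.CoreV1().Pods(namespace).Watch(ctx, metav1.ListOptions{
		FieldSelector: fields.OneTermEqualSelector("metadata.name", podName).String(),
	})
	if err != nil {
		return err
	}
	defer watcher.Stop()

	var lastID string
	for event := range watcher.ResultChan() {
		pod, ok := event.Object.(*corev1.Pod)
		if !ok {
			continue
		}
		for _, status := range pod.Status.ContainerStatuses {
			// Match by name rather than index, since containerStatuses
			// ordering is not guaranteed.
			if status.Name != proxyContainerName {
				continue
			}
			if status.ContainerID != "" && status.ContainerID != lastID {
				if lastID != "" {
					onReplaced(status.ContainerID)
				}
				lastID = status.ContainerID
			}
		}
	}
	return fmt.Errorf("pod watch for %s/%s closed", namespace, podName)
}
```

A production version would more likely use a shared informer with backoff and re-watch handling rather than a raw watch, but the detection signal is the same.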

Steps To Reproduce

  1. Create a pod with both the ingress controller and proxy containers.
  2. Confirm that the ingress controller becomes ready.
  3. Use Kong's admin API to confirm that it has a route and service for at least one set of Kubernetes Ingress and Service objects.
  4. Ensure that the observable cluster state remains stable.
  5. Kill the proxy container.
  6. Confirm that a replacement proxy container starts.
  7. Wait at least three seconds.
  8. Use Kong's admin API to confirm that it has no routes or services.
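To make steps 3 and 8 concrete, a small helper that counts the routes reported by Kong's admin API, assuming the admin API is reachable (for example via a port-forward to the default admin port 8001):

```go
package sketch

import (
	"encoding/json"
	"fmt"
	"net/http"
)

// routeCount queries Kong's admin API and returns how many routes the proxy
// currently knows about.
func routeCount(adminURL string) (int, error) {
	resp, err := http.Get(adminURL + "/routes")
	if err != nil {
		return 0, err
	}
	defer resp.Body.Close()
	if resp.StatusCode != http.StatusOK {
		return 0, fmt.Errorf("unexpected status %d from admin API", resp.StatusCode)
	}
	var body struct {
		Data []json.RawMessage `json:"data"`
	}
	if err := json.NewDecoder(resp.Body).Decode(&body); err != nil {
		return 0, err
	}
	return len(body.Data), nil
}
```

Step 3 should report a non-zero count; after the proxy container is replaced (steps 5 through 7), the same call returning zero demonstrates the bug.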

Kong Ingress Controller version

v2.0.6

Kubernetes version

version.Info{Major:"1", Minor:"19", GitVersion:"v1.19.9", GitCommit:"9dd794e454ac32d97cde41ae10be801ae98f75df", GitTreeState:"clean", BuildDate:"2021-03-18T01:00:06Z", GoVersion:"go1.15.8", Compiler:"gc", Platform:"linux/amd64"}

rainest commented Dec 16, 2021

Tentative plans are to add a Kong admin API endpoint that returns status-code feedback on whether a config has been loaded, have the controller watch that, and use it as the Kong readiness endpoint instead of /status (which is better suited to liveness).

Lack of config is also relevant to non-controller-managed environments, so we want to add something that supports them as well.
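A sketch of how a consumer could use an endpoint of the kind proposed here; the `/status/ready` path is purely illustrative, since the actual endpoint did not exist at the time of this comment:

```go
package sketch

import "net/http"

// configLoaded probes a hypothetical admin API endpoint of the kind proposed
// above: a 2xx response means a configuration has been loaded, anything else
// means the proxy is still running empty. The path is an assumption.
func configLoaded(adminURL string) (bool, error) {
	resp, err := http.Get(adminURL + "/status/ready")
	if err != nil {
		return false, err
	}
	defer resp.Body.Close()
	return resp.StatusCode >= 200 && resp.StatusCode < 300, nil
}
```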

@mflendrich

Maybe a container hook notifying KIC when a proxy container restarts could be of use here.


mflendrich commented Jan 11, 2022

blocked on Kong/kong#8256

@mflendrich

Update: Kong/kong#8214 is an alternative solution recommended by @dndx over Kong/kong#8256.

Therefore, this is now blocked on reopening Kong/kong#8214 and getting it across the line.


jcam commented Feb 28, 2022

^ Kong/kong#8214 has been merged. Do you need contributors here?


rainest commented Feb 28, 2022

It has been merged but not released. We have code for this staged at https://github.com/Kong/kubernetes-ingress-controller/tree/feat/watch-config-hash, but we can't merge it into our repos until 2.8 is actually out, since nightly builds lack semver versions, and we need those to run the tests normally.
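In outline, the staged approach compares the hash the controller last pushed against a hash reported by the proxy itself. A sketch of that comparison, assuming the proxy's /status response exposes a configuration hash; the `configuration_hash` field name used below is an assumption for illustration, and the linked branch is the authoritative implementation:

```go
package sketch

import (
	"encoding/json"
	"fmt"
	"net/http"
)

// proxyNeedsPush reports whether the proxy's currently loaded configuration
// differs from the hash the controller last pushed, by reading a hash field
// from the proxy's /status response.
func proxyNeedsPush(statusURL, lastPushedHash string) (bool, error) {
	resp, err := http.Get(statusURL)
	if err != nil {
		return false, err
	}
	defer resp.Body.Close()
	if resp.StatusCode != http.StatusOK {
		return false, fmt.Errorf("unexpected status %d", resp.StatusCode)
	}
	var body struct {
		ConfigurationHash string `json:"configuration_hash"` // assumed field name
	}
	if err := json.NewDecoder(resp.Body).Decode(&body); err != nil {
		return false, err
	}
	// A restarted proxy comes up with no (or a default) hash, so it no longer
	// matches what the controller believes it already pushed.
	return body.ConfigurationHash != lastPushedHash, nil
}
```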


jcam commented Feb 28, 2022

Ah okay. Well glad to see you're already on it! I'll be taking this as soon as I can :)
