Use wait instead of `TaskExit`. #1133

Random-Liu · 2019-04-12T01:28:55Z

This PR:

Uses Wait instead of TaskExit event. This should fix issues like Containers can't be stopped because TaskExit event failed to be published for runc.v2. containerd#3177 and Cannot stop the container: stop timeout containerd#3125;
Removes big sandbox and container start lock, because with Wait we can control the timing of exit event generation to avoid race condition. This is very important, because sandbox/container start lock is the last potential long running lock we have today. And in GKE, we do see slow container start holds container lock and blocks docker ps for Docker.

Please note that for containers/sandboxes in unknown state, we still need to rely on containerd TaskExit event, because they were not successfully loaded, there are no corresponding exit monitors running.

Since we'll support containerd 1.2 for a long time, I'd like to cherry-pick this into 1.2.
/cc @crosbymichael

Signed-off-by: Lantao Liu lantaol@google.com

crosbymichael

LGTM

jterry75 · 2019-04-15T23:36:43Z

LGTM! This is huge!

Random-Liu · 2019-04-15T23:45:14Z

Actually I have a different idea about handling container/sandbox in unknown state (failed to be loaded at startup).

Based on the state machine https://github.com/containerd/cri/blob/master/pkg/store/sandbox/status.go#L24-L53 and https://github.com/containerd/cri/blob/master/pkg/store/container/status.go#L31-L61, the only allowed operation to container/sandbox in unknown state is STOP;
We've changed Kubernetes to always try stopping container/sandbox in unknown state before removing Stop container in unknown state before recreate or remove. kubernetes/kubernetes#73802

Actually we don't really care about whether a sandbox/container in unknown state exits itself or not, because Kubernetes will stop it right away anyway. So we don't need to constantly monitor it, we only need to start a synchronized Wait in stop for container/sandbox in unknown state, in this way we can completely get rid of the dependency on TaskExit.

I'll make the change.

mikebrow · 2019-04-16T18:53:13Z

flying back today... will review in the morning!

mikebrow

See comments

mikebrow · 2019-04-17T18:18:48Z

pkg/server/container_start.go

+				logrus.WithError(err).Errorf("failed to set start failure state for container %q", id)
+			}
+		}
+		// Reset starting if start failed.


this is not in the retErr != nil block.. so resets in this defer if setContainerStarting was successful...

Yeah, forgot to remove the comment. Done.

mikebrow · 2019-04-17T18:27:15Z

pkg/server/container_start.go

+
+// resetContainerStarting resets the container starting state on start failure. So
+// that we could remove the container later.
+func resetContainerStarting(container containerstore.Container) error {


see above comment... .. Do we want this function to be idempotent? Or should it fail if called twice...

Should be idempotent

fair enough the set was not idempotent.. so the unset being so was unsettling ;-) But ok by me either way.

mikebrow · 2019-04-17T19:16:44Z

pkg/store/sandbox/sandbox_test.go

@@ -125,21 +125,16 @@ func TestSandboxStore(t *testing.T) {
 		assert.Equal(sb, got)
 	}

-	t.Logf("should not be able to get unknown sandbox")
+	t.Logf("should be able to get sandbox with Get")


... in unknown state

mikebrow · 2019-04-17T19:17:34Z

pkg/store/sandbox/status.go

@@ -53,23 +53,17 @@ import (
 // +-------------> DELETED

 // State is the sandbox state we use in containerd/cri.
-// It includes init and unknown, which are internal states not defined in CRI.
+// It includes unknown, which are internal states not defined in CRI.


/s/are/is/

which is an internal state not defined in CRI.. (INIT state has been removed)

Signed-off-by: Lantao Liu <lantaol@google.com>

mikebrow

/LGTM

Cherrypick #1133 release 1.2

Random-Liu assigned mikebrow Apr 12, 2019

k8s-ci-robot added the size/L label Apr 12, 2019

Random-Liu added this to the v1.2 milestone Apr 12, 2019

Random-Liu assigned yujuhong Apr 12, 2019

Random-Liu force-pushed the use-wait branch from bef2433 to ec53d14 Compare April 12, 2019 07:38

k8s-ci-robot added size/XL and removed size/L labels Apr 12, 2019

Random-Liu force-pushed the use-wait branch from ec53d14 to 3b49d3f Compare April 12, 2019 22:09

bergwolf mentioned this pull request Apr 15, 2019

containerd+kata-shimv2: how to recover from inconsistent situation? kata-containers/runtime#1529

Closed

crosbymichael approved these changes Apr 15, 2019

View reviewed changes

mikebrow reviewed Apr 17, 2019

View reviewed changes

Use wait instead of TaskExit.

d1f9611

Signed-off-by: Lantao Liu <lantaol@google.com>

Random-Liu force-pushed the use-wait branch from 23d1ac1 to d1f9611 Compare April 18, 2019 07:18

k8s-ci-robot added the lgtm label Apr 18, 2019

mikebrow approved these changes Apr 18, 2019

View reviewed changes

Random-Liu merged commit a5c5d55 into containerd:master Apr 18, 2019

Random-Liu deleted the use-wait branch April 18, 2019 18:10

Random-Liu added cherrypick-needed cherrypicked labels Apr 18, 2019

This was referenced Apr 18, 2019

Cherrypick #1133 release 1.2 #1136

Merged

Cannot stop the container: stop timeout containerd/containerd#3125

Closed

Random-Liu added a commit that referenced this pull request Apr 26, 2019

Merge pull request #1136 from Random-Liu/cherrypick-#1133-release-1.2

cdbb238

Cherrypick #1133 release 1.2

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Use wait instead of `TaskExit`. #1133

Use wait instead of `TaskExit`. #1133

Random-Liu commented Apr 12, 2019 •

edited

Loading

crosbymichael left a comment

jterry75 commented Apr 15, 2019

Random-Liu commented Apr 15, 2019 •

edited

Loading

mikebrow commented Apr 16, 2019

mikebrow left a comment

mikebrow Apr 17, 2019

Random-Liu Apr 18, 2019

mikebrow Apr 17, 2019

Random-Liu Apr 18, 2019

mikebrow Apr 18, 2019

mikebrow Apr 17, 2019

Random-Liu Apr 18, 2019

mikebrow Apr 17, 2019

Random-Liu Apr 18, 2019

mikebrow left a comment

Use wait instead of TaskExit. #1133

Use wait instead of TaskExit. #1133

Conversation

Random-Liu commented Apr 12, 2019 • edited Loading

crosbymichael left a comment

Choose a reason for hiding this comment

jterry75 commented Apr 15, 2019

Random-Liu commented Apr 15, 2019 • edited Loading

mikebrow commented Apr 16, 2019

mikebrow left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

mikebrow left a comment

Choose a reason for hiding this comment

Use wait instead of `TaskExit`. #1133

Use wait instead of `TaskExit`. #1133

Random-Liu commented Apr 12, 2019 •

edited

Loading

Random-Liu commented Apr 15, 2019 •

edited

Loading