Prevent invalid errors from terminate #1295

stevenh · 2017-01-24T23:33:24Z

Both Process.Kill() and Process.Wait() can return errors that don't impact the correct behaviour of terminate.

Instead of letting these get returned and logged, which causes confusion, silently ignore them.

Currently the test needs to be a string test as the errors are private to the runtime packages, so it's our only option.

This can be seen if init fails during the setns.

Signed-off-by: Steven Hartland <steven.hartland@multiplay.co.uk>

cyphar · 2017-01-25T07:28:58Z

libcontainer/process_linux.go

+
+	// TODO(steve): Update these to none string checks if the runtime exports them.
+	switch err.Error() {
+	case "os: process already finished", "exec: Wait was already called":


NACK. That's just not a good idea. Can you explain why you need the errors to be filtered? "Just silently ignore them inside a library" smells bad to me.

stevenh · 2017-01-25

We've requested the process be terminated; reporting an error because the process had already gone is something the caller simply isn't interested in, as the desired effect has been achieved.

It's analogous to a method to remove a file reporting "file doesn't exist"; the caller simply doesn't care.

To bring in some context:

When the init process fails to exec, execSetns fails.

When execSetns fails, it calls p.cmd.Wait(), so the process has already gone before it returns.

newParentProcess returns the error.

The cleanup code in container start() ensures everything is cleaned up by calling terminate.

terminate returns the bogus error "os: process already finished", which is then logged.
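The sequence above can be sketched without spawning a real process. This is a minimal simulation under stated assumptions: `fakeProcess`, `startWithFailedInit`, and `errProcessDone` are hypothetical names invented for illustration, and the kill-then-wait shape of `terminate` is assumed from the PR description rather than copied from libcontainer.

```go
package main

import "errors"

// errProcessDone mirrors the message quoted in this thread; it is a local
// stand-in, not the os package's own error value.
var errProcessDone = errors.New("os: process already finished")

// fakeProcess is a hypothetical stand-in for the child process.
type fakeProcess struct {
	reaped bool // true once wait has collected the exit status
}

// wait reaps the process. For simplicity a repeated wait succeeds here.
func (p *fakeProcess) wait() error {
	p.reaped = true
	return nil
}

// kill fails once the process has been reaped, which is the behaviour
// described for Process.Kill() in the PR description.
func (p *fakeProcess) kill() error {
	if p.reaped {
		return errProcessDone
	}
	return nil
}

// terminate kills, then waits, and returns the first error seen (assumed shape).
func (p *fakeProcess) terminate() error {
	err := p.kill()
	if werr := p.wait(); err == nil {
		err = werr
	}
	return err
}

// startWithFailedInit models the steps listed above: the setns step fails and
// waits on the child, then the cleanup path calls terminate on a process that
// has already gone.
func startWithFailedInit() (startErr, cleanupErr error) {
	p := &fakeProcess{}
	p.wait() // the failing setns step reaps the child before returning
	startErr = errors.New("init failed during setns")
	cleanupErr = p.terminate()
	return startErr, cleanupErr
}
```

Calling `startWithFailedInit()` yields both the genuine start error and the secondary cleanup error, which is the pair that ends up in the log.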

cyphar · 2017-01-25

> We've requested the process be terminated; reporting an error because the process had already gone is something the caller simply isn't interested in, as the desired effect has been achieved.

That depends on your perspective, and is not logic that should exist in a library. If you've asked us to destroy a container, under the assumption that the container exists, then pretending that it did exist can cause logic bugs and other such issues.

> It's analogous to a method to remove a file reporting "file doesn't exist"; the caller simply doesn't care.

Some callers might care, and that's why unlink(2) (for example) does return -ENOENT in such cases. It's up to the user of the API to make a decision about what they care about -- not the library.
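The same split between library and caller is visible in Go's standard library: `os.Remove` reports a missing file, and a caller that does not care can filter that one condition itself. A minimal sketch, where `removeIfPresent` and `removeReportsMissing` are hypothetical helper names:

```go
package main

import (
	"errors"
	"os"
)

// removeReportsMissing shows the library side: os.Remove surfaces the
// "does not exist" condition instead of hiding it.
func removeReportsMissing(path string) bool {
	return errors.Is(os.Remove(path), os.ErrNotExist)
}

// removeIfPresent shows the caller side: this particular caller only wants
// the file to be absent, so it treats "already gone" as success and still
// returns every other failure.
func removeIfPresent(path string) error {
	err := os.Remove(path)
	if errors.Is(err, os.ErrNotExist) {
		return nil
	}
	return err
}
```

The decision to ignore the error lives in `removeIfPresent`, so other callers of `os.Remove` keep the full information.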

> The cleanup code in container start() ensures everything is cleaned up by calling terminate.
>
> terminate returns the bogus error "os: process already finished", which is then logged.

Right, so it looks like our code is incorrectly handling this case. The solution is to fix the calling code -- not to filter errors inside this function.

stevenh · 2017-01-25

If this were a public method I would totally agree, but as this is private and used solely for error recovery, this seemed good.

However, you are indeed correct that even though it's private, additional consumers of terminate could still be added where this behaviour is not desired, so I've moved the check to the consumer instead.
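Moving the check to the consumer could look roughly like the sketch below. It reuses the two strings from the diff quoted earlier in this thread; the names `ignoreBenignTerminateErrors`, `cleanupAfterFailedStart`, and `stubTerminate` are hypothetical and are not taken from the actual change.

```go
package main

import "errors"

// ignoreBenignTerminateErrors is a hypothetical caller-side filter. It drops
// the two errors that only mean "the process had already gone" and passes
// everything else through unchanged.
func ignoreBenignTerminateErrors(err error) error {
	if err == nil {
		return nil
	}
	switch err.Error() {
	case "os: process already finished", "exec: Wait was already called":
		return nil
	}
	return err
}

// cleanupAfterFailedStart stands in for the error-recovery path: terminate
// itself stays unfiltered, and only this caller decides what to ignore.
func cleanupAfterFailedStart(terminate func() error) error {
	return ignoreBenignTerminateErrors(terminate())
}

// stubTerminate builds a stand-in terminate function that fails with msg,
// or succeeds when msg is empty. It exists only to exercise the sketch.
func stubTerminate(msg string) func() error {
	return func() error {
		if msg == "" {
			return nil
		}
		return errors.New(msg)
	}
}
```

With this shape, any new consumer of terminate still sees every error unless it opts into the filter.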

stevenh · 2017-02-15T14:32:10Z

@cyphar does the move of the test satisfy your requirement?

stevenh · 2017-06-07T23:08:36Z

@cyphar any update on this?

cyphar · 2017-06-09T04:11:21Z

I think the point I was trying to make was that the solution was to fix when we call .terminate() internally, such that we won't call .terminate() when unnecessary. But looking at it again, I want to see what the other maintainers think. I'll be honest, my main concern is how heavily this depends on the string representation of errors.

/cc @crosbymichael @mrunalp
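For reference, the dependence on error strings can be partly avoided on newer toolchains. Go 1.16 and later, which postdate this thread, export `os.ErrProcessDone`, so the Kill case can be matched with `errors.Is`. The "exec: Wait was already called" error does not appear to have an exported equivalent, so this sketch covers only the Kill half. The helper names are hypothetical.

```go
package main

import (
	"errors"
	"fmt"
	"os"
)

// processAlreadyGone reports whether err is, or wraps, os.ErrProcessDone.
// Requires Go 1.16 or newer.
func processAlreadyGone(err error) bool {
	return errors.Is(err, os.ErrProcessDone)
}

// wrappedDoneError returns the sentinel wrapped with extra context, as a
// caller higher up the stack might see it.
func wrappedDoneError() error {
	return fmt.Errorf("terminate: %w", os.ErrProcessDone)
}

// lookalikeError has the same text as the sentinel but is a different value,
// so a sentinel check does not match it while a string comparison would.
func lookalikeError() error {
	return errors.New("os: process already finished")
}
```

The sentinel check survives wrapping and ignores errors that merely share the message, which are the two weaknesses of matching on `err.Error()`.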

cyphar reviewed Jan 25, 2017

stevenh force-pushed the terminate-errors branch from 82a96a0 to edc42e5 on January 25, 2017 13:12

crosbymichael mentioned this pull request Oct 10, 2017

libcontainer: handler errors from terminate #1607

Merged

hqhq closed this in #1607 Oct 20, 2017

stevenh deleted the terminate-errors branch October 20, 2017 07:55
