-
Notifications
You must be signed in to change notification settings - Fork 6
Docker test is flaky #187
Comments
Previous we were allowing the recvfd files to be closed by their finalizer and the GC. This sometimes led to premature closing of files leading to errors like: networkservicemesh/cmd-forwarder-vppagent#187 'Delete' case. This fixes that by closing the files only after the ctx has is Done. Signed-off-by: Ed Warnicke <hagbard@gmail.com>
I think this may fix the 'Delete' case (but need to test it further): |
Just did 150 test runs. The mentioned PR fixes the
|
It is my belief that when fully and properly implemented WaitDone will resolve the 'Unable to find' issue. |
This should help disambiguate the 'cannot support' issue: |
…548) Previous we were allowing the recvfd files to be closed by their finalizer and the GC. This sometimes led to premature closing of files leading to errors like: networkservicemesh/cmd-forwarder-vppagent#187 'Delete' case. This fixes that by closing the files only after the ctx has is Done. Signed-off-by: Ed Warnicke <hagbard@gmail.com>
…k@master networkservicemesh/sdk#548 networkservicemesh/sdk PR link: networkservicemesh/sdk#548 networkservicemesh/sdk commit message: commit 8b0bd573228d24958ad9e74f643d083709e81844 Author: Ed Warnicke <hagbard@gmail.com> Date: Tue Oct 20 16:03:21 2020 -0500 Explicitely close files for recvfd client *after* they don't matter. (#548) Previous we were allowing the recvfd files to be closed by their finalizer and the GC. This sometimes led to premature closing of files leading to errors like: networkservicemesh/cmd-forwarder-vppagent#187 'Delete' case. This fixes that by closing the files only after the ctx has is Done. Signed-off-by: Ed Warnicke <hagbard@gmail.com> Signed-off-by: NSMBot <nsmbot@networkservicmesh.io>
…k@master networkservicemesh/sdk#548 networkservicemesh/sdk PR link: networkservicemesh/sdk#548 networkservicemesh/sdk commit message: commit 8b0bd573228d24958ad9e74f643d083709e81844 Author: Ed Warnicke <hagbard@gmail.com> Date: Tue Oct 20 16:03:21 2020 -0500 Explicitely close files for recvfd client *after* they don't matter. (#548) Previous we were allowing the recvfd files to be closed by their finalizer and the GC. This sometimes led to premature closing of files leading to errors like: networkservicemesh/cmd-forwarder-vppagent#187 'Delete' case. This fixes that by closing the files only after the ctx has is Done. Signed-off-by: Ed Warnicke <hagbard@gmail.com> Signed-off-by: NSMBot <nsmbot@networkservicmesh.io>
…k@master networkservicemesh/sdk#548 networkservicemesh/sdk PR link: networkservicemesh/sdk#548 networkservicemesh/sdk commit message: commit 8b0bd573228d24958ad9e74f643d083709e81844 Author: Ed Warnicke <hagbard@gmail.com> Date: Tue Oct 20 16:03:21 2020 -0500 Explicitely close files for recvfd client *after* they don't matter. (#548) Previous we were allowing the recvfd files to be closed by their finalizer and the GC. This sometimes led to premature closing of files leading to errors like: networkservicemesh/cmd-forwarder-vppagent#187 'Delete' case. This fixes that by closing the files only after the ctx has is Done. Signed-off-by: Ed Warnicke <hagbard@gmail.com> Signed-off-by: NSMBot <nsmbot@networkservicmesh.io>
…k@master networkservicemesh/sdk#548 networkservicemesh/sdk PR link: networkservicemesh/sdk#548 networkservicemesh/sdk commit message: commit 8b0bd573228d24958ad9e74f643d083709e81844 Author: Ed Warnicke <hagbard@gmail.com> Date: Tue Oct 20 16:03:21 2020 -0500 Explicitely close files for recvfd client *after* they don't matter. (#548) Previous we were allowing the recvfd files to be closed by their finalizer and the GC. This sometimes led to premature closing of files leading to errors like: networkservicemesh/cmd-forwarder-vppagent#187 'Delete' case. This fixes that by closing the files only after the ctx has is Done. Signed-off-by: Ed Warnicke <hagbard@gmail.com> Signed-off-by: NSMBot <nsmbot@networkservicmesh.io>
…k@master networkservicemesh/sdk#548 networkservicemesh/sdk PR link: networkservicemesh/sdk#548 networkservicemesh/sdk commit message: commit 8b0bd573228d24958ad9e74f643d083709e81844 Author: Ed Warnicke <hagbard@gmail.com> Date: Tue Oct 20 16:03:21 2020 -0500 Explicitely close files for recvfd client *after* they don't matter. (#548) Previous we were allowing the recvfd files to be closed by their finalizer and the GC. This sometimes led to premature closing of files leading to errors like: networkservicemesh/cmd-forwarder-vppagent#187 'Delete' case. This fixes that by closing the files only after the ctx has is Done. Signed-off-by: Ed Warnicke <hagbard@gmail.com> Signed-off-by: NSMBot <nsmbot@networkservicmesh.io>
…k@master networkservicemesh/sdk#548 (#407) networkservicemesh/sdk PR link: networkservicemesh/sdk#548 networkservicemesh/sdk commit message: commit 8b0bd573228d24958ad9e74f643d083709e81844 Author: Ed Warnicke <hagbard@gmail.com> Date: Tue Oct 20 16:03:21 2020 -0500 Explicitely close files for recvfd client *after* they don't matter. (#548) Previous we were allowing the recvfd files to be closed by their finalizer and the GC. This sometimes led to premature closing of files leading to errors like: networkservicemesh/cmd-forwarder-vppagent#187 'Delete' case. This fixes that by closing the files only after the ctx has is Done. Signed-off-by: Ed Warnicke <hagbard@gmail.com> Signed-off-by: NSMBot <nsmbot@networkservicmesh.io> Co-authored-by: NSMBot <nsmbot@networkservicmesh.io>
The new combined error message:
The error occurs in the nested The outer The request is passed to the server The inner
Both on good and bad runs (on all requests except the one):
And on bad run only (on one request):
|
…etworkservicemesh#548) Previous we were allowing the recvfd files to be closed by their finalizer and the GC. This sometimes led to premature closing of files leading to errors like: networkservicemesh/cmd-forwarder-vppagent#187 'Delete' case. This fixes that by closing the files only after the ctx has is Done. Signed-off-by: Ed Warnicke <hagbard@gmail.com> Signed-off-by: Sergey Ershov <sergey.ershov@xored.com>
Failing CI jobs:
On my machine, the docker test failing with a ~30% chance, and there are at least three unique error messages. Logs from different failing runs on my machine:
Cannot support any of the requested Mechanisms
Unable to find interface with prefix "k2kns" in netns "k2k-client"
(DELETE): failed to switch namespace: failed to get or create namespace
The text was updated successfully, but these errors were encountered: