-
Notifications
You must be signed in to change notification settings - Fork 2.5k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[receiver/jaeger] Flaky TestJaegerAgentUDP_ThriftCompact #10368
Comments
Reopening, as this is still failing intermittently. Observed here: https://github.com/open-telemetry/opentelemetry-collector-contrib/runs/6863664900?check_suite_focus=true#step:7:4483 |
@frzifus, would you like to pick this one up? |
Sure :) |
For me it looks like there is something wrong in this area. Unfortunately ive no emulation or windows machine in place. @dgoscn would you like to give it a try? |
Yes @frzifus of course. Thanks |
Hi @djaglowski. How are you? I had some trouble to replicate this error that you mentioned here. But, I made some tries. I will try to replicate here what I did and please, if what I made makes no sense, just say 🦖 thank you So, what I did: In the function GetAvailableLocalNetworkAddress located at
And I added another Println just to get the result of the endpoint declared right below the code above, more precisely in line 60 :
I executed a So, let's go back to the I went to this path, and made a -failtest to the
I let the test running for some minutes in all of the tests, it pass! I decreased the number of -count to 1000 And after decresead to 100: I noticed that if I run the -count with the value 100 or a number less than 1000, I get success and for a higher than, I could get the timout printed above. So, when I run for example the --count for a higher number than 3000, sometimes I can get the error that you mentioned on the top of the issue. But it's not always that I got the error. I made some workarounds, killing some process using a port that was in use or something like that, allowing in firewall etc, but I think that is not the right answer. I was wondering to make a check if the port is already in use, if this returns true, we can skip the port or something like that... How we are getting some block binds to UDP ports, I found this piece of code that "maybe" we can "reuse", make some changes and resolve this definitively. The path for the code above: Do you think that makes sense? Thank you |
@dgoscn, thanks for looking into this. It's great that you've determined a way to reproduce the issue.
I'm not sure we can solve the problem by validating the port independently of the actual test, because some (tiny) amount of time will pass before we attempt to use it. But perhaps you are suggesting we just recover and retry the test? That may be reasonable if limited to a small number of retries. |
@djaglowski thanks for your answer. Hmmm. Yeah, I think that make sense what you said. About the reasonable time for the test, yes. I agree! from what I observed, keep a small number it's enough, because we are not trying to make some kind of brute force test. Maybe a limited number of retries, make sense. Thank you one more time. o/ |
Pinging code owners: @jpkrohling. See Adding Labels via Comments if you do not have permissions to add labels yourself. |
This issue has been inactive for 60 days. It will be closed in 60 days if there is no activity. To ping code owners by adding a component label, see Adding Labels via Comments, or if you are unsure of which component this issue relates to, please ping Pinging code owners:
See Adding Labels via Comments if you do not have permissions to add labels yourself. |
I don't remember seeing this issue happening anymore, but I also don't remember having this fixed. I'm leaving it open for some more time. |
great @jpkrohling |
This issue has been inactive for 60 days. It will be closed in 60 days if there is no activity. To ping code owners by adding a component label, see Adding Labels via Comments, or if you are unsure of which component this issue relates to, please ping Pinging code owners:
See Adding Labels via Comments if you do not have permissions to add labels yourself. |
Closing, as there haven't been any new reports about this. Feel free to reopen or comment if this happens again. |
https://github.com/open-telemetry/opentelemetry-collector-contrib/runs/6614143811?check_suite_focus=true#step:7:4448
The text was updated successfully, but these errors were encountered: