-
Notifications
You must be signed in to change notification settings - Fork 30.2k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Investigate flaky test-http-pipeline-regr-3332 on SmartOS #7649
Comments
I catch the same error on ubuntu server - my node.js app just crashed |
My apologies for being so long to take a look at this. @Trott Was a core file generated when this process aborted. If so is it possible to get access to that core file? |
@misterdjules This happened on CI, so if there was a core file, it would have been on the host used by that. So I'm guessing that by now, any core file is long gone. To be honest, I don't think I've seen this in over a month so I'm not sure it's going to be that easy to reproduce. I guess we could try running it 10000 times on a stress test job and see what happens? |
@Trott The core file could still be available, as by default core files generated by an application are stored in the global zone of the server running the virtual machine. I'll take a look on every server running the current build SmartOS VMs, and I'll report back on whether I found that core file. If we find a core file, we should be able to tell if the out of memory error was due to the process itself using too much memory, or to something else on that machine. My hunch is that it's a case of the latter, but it'd be nice if we could find some evidence of that. How many times did you see this test fail? |
It's entirely possible that this was the only time I'd seen it fail on SmartOS. If you can't find hard evidence that there isn't a problem with the test itself, and if a stress test comes up clean, then I'd be content to close this until either it recurs in CI or someone can provide a way to reproduce the issue with reasonable reliability. Stress test on 32-bit: https://ci.nodejs.org/job/node-stress-single-test/913/nodes=smartos14-32/console Stress test on 64-bit: https://ci.nodejs.org/job/node-stress-single-test/914/nodes=smartos14-64/console |
I won't be able to recover the core file from that crash. However the stress tests seem to have both passed, so I think we can consider this test as non-flaky for now, until we're proven otherwise in the future. What I'll do though is reach out to the build WG to setup the SmartOS test VMs in such a way that core files are kept at least for a reasonable amount of time that will allow collaborators to investigate future crashes if need be. Thank you for the help! |
https://ci.nodejs.org/job/node-test-commit-smartos/3215/nodes=smartos14-64/console
The text was updated successfully, but these errors were encountered: