[eventd]: Eventd failing to start due to rsyslogd on multi-asic #20775
Comments
@arlakshm for vis
Hi @wumiaont, what is the status of this? I see you have closed your PR without merging. Is there another fix in the works?
We believe #20248 is the culprit, and #20947 reverted its changes. We have not seen this issue since #20947, so my fix is not needed.
Ah excellent, thanks for the update!
202405 PR fix for this issue: #20946
Closing this issue. @liamkearney-msft, please re-open if the issue is still there. Thanks.
Description
On T2 / multi-asic chassis, eventd sometimes fails to start when coming out of reboot, leaving the system degraded according to systemd. The failure is attributed to rsyslogd failing on its initial start, which cascades into eventd failing to start. rsyslogd eventually restarts and comes up fine, but eventd does not have auto-restart configured, so it stays down. Manually restarting the service recovers the system.
rsyslogd fails its initial start with "network unreachable", likely because it comes up before the docker service is ready, i.e. it races with docker (on multi-asic chassis, rsyslogd attaches to the docker0 interface to pull the logs). Once rsyslogd is auto-restarted (which happens after the docker interface / service is up), it starts up fine.
See below for journalctl output for rsyslogd:
Steps to reproduce the issue:
Run the platform_tests/test_reload_config::test_reload_configuration_checks test, although it is flaky as it's a race condition.
Describe the results you received:
eventd fails to start due to rsyslogd failing on initial start
Describe the results you expected:
eventd either auto-restarts on failure, or services are sequenced such that rsyslogd starts on first invocation (dependencies can be tricky here as we don't want to drop logs on boot)
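Until the sequencing is sorted out, the manual recovery mentioned in the description could be scripted. The following is only a minimal sketch, not the fix adopted for this issue: it assumes the systemd unit is named `eventd`, and the helper name `restart_if_failed` and its injectable `run` parameter are hypothetical, introduced here for illustration. It relies on `systemctl is-failed`, which exits with status 0 when the unit is in the failed state.

```python
import subprocess


def restart_if_failed(unit, run=subprocess.run):
    """Restart `unit` only if systemd reports it as failed.

    Returns True if a restart was issued, False otherwise.
    `run` is injectable so the logic can be exercised without systemd.
    """
    # `systemctl is-failed` exits 0 when the unit is in the failed state.
    status = run(["systemctl", "is-failed", "--quiet", unit], check=False)
    if status.returncode != 0:
        return False
    # check=True raises CalledProcessError if the restart itself fails.
    run(["systemctl", "restart", unit], check=True)
    return True


if __name__ == "__main__":
    # "eventd" is an assumed unit name; adjust to the actual service.
    restarted = restart_if_failed("eventd")
    print("restarted" if restarted else "no restart needed")
```

This only papers over the symptom; it does not address the rsyslogd / docker race itself.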
Output of show version:
On Nokia 7250 chassis / 202405
Output of show techsupport:
Additional information you deem important (e.g. issue happens only occasionally):
This is probably related to issues #20544 and #20521