-
Notifications
You must be signed in to change notification settings - Fork 86
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Disable systemd auto-restart of dependent services for spineRouters #83
Disable systemd auto-restart of dependent services for spineRouters #83
Conversation
@mlok-nokia @anamehra for viz and review.. thanks! |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
as comments
@deepak-singhal0408 can't clean cherry pick to 202305. |
@yxieca , can you help review/approve for 202205? Thanks! |
Thanks @yxieca ! |
Thanks @yxieca for help merging into 202205.. |
Description:
Issue details and RCA here. sonic-net/sonic-buildimage#16991
MSFT ADO:
25422412
How I did it:
Currently featured script overrides the systemd service files of the features depending upon auto_restart enable/disable.
I am skipping dependent features(syncd, gbsyncd for now) to have "RESTART=Always"
for them to not start immidiately, and instead get started by SWSS through swss.sh script.
The issue of syncd double stop is also applicable to pizza box platforms, however no traffic impact is seen there, where as on VOQ chassis, we do see traffic impact due to early start of syncd service.
The change can be extended to pizza box platforms as well in future once it gets vetted on SpineRouters.
How to verify it
a. No traffic loss end to end
b. Dont see double stop/start of syncd service
Also Ran few more tests.
Tested on both single asic and multi-asic hw platforms.
Tested config-reload
sudo reboot
service restart
syncd process crash, orchagent process crash
Confirmed from logs, that in none of the scenario, syncd gets killed twice