Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Disable systemd auto-restart of dependent services for spineRouters #83

Conversation

deepak-singhal0408
Copy link
Contributor

@deepak-singhal0408 deepak-singhal0408 commented Nov 9, 2023

Description:
Issue details and RCA here. sonic-net/sonic-buildimage#16991

MSFT ADO:
25422412

How I did it:
Currently featured script overrides the systemd service files of the features depending upon auto_restart enable/disable.
I am skipping dependent features(syncd, gbsyncd for now) to have "RESTART=Always"
for them to not start immidiately, and instead get started by SWSS through swss.sh script.
The issue of syncd double stop is also applicable to pizza box platforms, however no traffic impact is seen there, where as on VOQ chassis, we do see traffic impact due to early start of syncd service.
The change can be extended to pizza box platforms as well in future once it gets vetted on SpineRouters.

How to verify it

  1. extended Unit tests to cover this scenario.
  2. followed same steps as mentioned in T2-VOQ: Remote Linecard Crash(syncd process): Traffic loss of ~9-10seconds observed sonic-buildimage#16991 and confirmed that
    a. No traffic loss end to end
    b. Dont see double stop/start of syncd service

Also Ran few more tests.

Tested on both single asic and multi-asic hw platforms.
Tested config-reload
sudo reboot
service restart
syncd process crash, orchagent process crash
Confirmed from logs, that in none of the scenario, syncd gets killed twice

@deepak-singhal0408
Copy link
Contributor Author

@mlok-nokia @anamehra for viz and review.. thanks!

Copy link
Contributor

@abdosi abdosi left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

Copy link
Contributor

@arlakshm arlakshm left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

as comments

scripts/featured Show resolved Hide resolved
scripts/featured Show resolved Hide resolved
@deepak-singhal0408 deepak-singhal0408 merged commit 586b1e9 into sonic-net:master Nov 14, 2023
4 checks passed
@StormLiangMS
Copy link

@deepak-singhal0408 can't clean cherry pick to 202305.

@gechiang
Copy link

@yxieca , can you help review/approve for 202205? Thanks!

@yxieca
Copy link
Contributor

yxieca commented Nov 20, 2023

@yxieca , can you help review/approve for 202205? Thanks!

@gechiang in 202205 branch, this is not a submodule. A backport will have to be an PR in sonic-buildimage repo for 202205 branch.

@gechiang
Copy link

Thanks @yxieca !
@deepak-singhal0408 , Please raise a PR for this change for 202205 under sonic-buildimage.

@deepak-singhal0408
Copy link
Contributor Author

Thanks @yxieca for help merging into 202205..
@gechiang here is the 202205 PR that's merged...sonic-net/sonic-buildimage#17203

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
Archived in project
Development

Successfully merging this pull request may close these issues.

7 participants