Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

VOQ: add dependent peer's stronger dependency with swss to avoid double stop/start and traffic loss #17041

Closed
wants to merge 1 commit into from
Closed

VOQ: add dependent peer's stronger dependency with swss to avoid double stop/start and traffic loss #17041

wants to merge 1 commit into from

Conversation

deepak-singhal0408
Copy link
Contributor

Why I did it

Issue details and RCA here. #16991
it fixes #16991

Work item tracking
  • Microsoft ADO 25422412:

How I did it

In SWSS script, I have added stronger dependency with dependent peer services using "Requires=" command in unit files of dependent peer services. This will ensure that, syncd/gbsyncd dont start immediately and instead wait till swss service starts its init phase.
I have made this change for VOQ chassis for now, however this is also applicable to pizza boxes running SONiC(there is no functionality impact seen on pizza box however).

How to verify it

followed same steps as mentioned in #16991 and confirmed that
a. No traffic loss end to end
b. Dont see double stop/start of syncd service

Also Ran few more tests.

  1. Tested on both single asic and multi-asic hw platforms.
  2. Tested config-reload
  3. sudo reboot
  4. service restart
  5. syncd process crash, orchagent process crash

Confirmed from logs, that in none of the scenario, syncd gets killed twice.

Which release branch to backport (provide reason below if selected)

  • 201811
  • 201911
  • 202006
  • 202012
  • 202106
  • 202111
  • 202205
  • 202211
  • 202305

Tested branch (Please provide the tested image version)

SONiC.20220532.45

Description for the changelog

Link to config_db schema for YANG module changes

A picture of a cute animal (not mandatory but encouraged)

@deepak-singhal0408
Copy link
Contributor Author

@mlok-nokia for viz..

@rlhui
Copy link
Contributor

rlhui commented Nov 1, 2023

@anamehra also help review

@deepak-singhal0408
Copy link
Contributor Author

Opened another PR sonic-net/sonic-host-services#83 to fix it via hosfcfgd/featured script to keep systemd service files override at one place.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
Archived in project
Development

Successfully merging this pull request may close these issues.

T2-VOQ: Remote Linecard Crash(syncd process): Traffic loss of ~9-10seconds observed
2 participants