-
Notifications
You must be signed in to change notification settings - Fork 1.4k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[warm-reboot] ERR swss#supervisor-proc-exit-listener: Process 'orchagent' is stuck in namespace 'host' (1.0 minutes). after performing warm-boot command #16686
Labels
Comments
@qiluo-msft @liuh-80 Can you please investigate and possibly stop watchdog during warmboot and fastboot commands? |
@vaibhavhd f.y.i |
I will investigation and create fix for this issue ASAP. |
7 tasks
11 tasks
This was referenced Nov 8, 2023
StormLiangMS
pushed a commit
to sonic-net/sonic-swss
that referenced
this issue
Nov 11, 2023
Orchangent send heartbeat during warm-reboot to prevent Orchagent stuck alert. Why I did it Orchangent will freese during warm-reboot, then supervisor-proc-exit-listener will generate false alert during warm reboot: sonic-net/sonic-buildimage#16686 Work item tracking Microsoft ADO: 25295846 How I did it Send heartbeat during warm-reboot freeze. How to verify it Pass all UT. Manually verify issue fixed by check syslog.
liuh-80
added a commit
to sonic-net/sonic-mgmt
that referenced
this issue
Nov 17, 2023
Add orchagent heartbeat during warm-reboot UT ### Description of PR Add orchagent heartbeat during warm-reboot UT ##### Work item tracking - Microsoft ADO: 25295846 ### Type of change <!-- - Fill x for your type of change. - e.g. - [x] Bug fix --> - [ ] Bug fix - [ ] Testbed and Framework(new/improvement) - [x] Test case(new/improvement) ### Back port request - [ ] 201911 - [ ] 202012 - [ ] 202205 ### Approach #### What is the motivation for this PR? Fix orchagent stuck error during warm-reboot: sonic-net/sonic-buildimage#16686 #### How did you do it? Add new UT, freeze orchanget for warm-reboot then check the process listener not send alert. #### How did you verify/test it? Pass all UT #### Any platform specific information? #### Supported testbed topology if it's a new test case? ### Documentation <!-- (If it's a new feature, new test case) Did you update documentation/Wiki relevant to your implementation? Link to the wiki page? -->
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Description
The following error message is seen when performing warmboot command. This is due to the recent watchdog introduced to monitor orchagent in #15429. However this should be disabled when executing warmboot or fastboot commands.
Steps to reproduce the issue:
Describe the results you received:
Error in logs
Describe the results you expected:
No error in logs.
Output of
show version
:Output of
show techsupport
:Additional information you deem important (e.g. issue happens only occasionally):
sonic_dump_r-anaconda-51_20230907_185038.tar.gz
The text was updated successfully, but these errors were encountered: