swss#supervisor-proc-exit-listener: Process 'orchagent' is stuck in namespace 'host' (1.0 minutes) is seen sometimes during switch initialization
#17530
Closed
dgsudharsan opened this issue
Dec 16, 2023
· 7 comments
The error log below sometimes occurs during switch initialization:
Dec 14 04:12:21.433385 arc-switch1025 ERR swss#supervisor-proc-exit-listener: Process 'orchagent' is stuck in namespace 'host' (1.0 minutes).
On analysis, the logs give no further detail on what orchagent was doing at the time. It may be that, because other processes were initializing at the same time, orchagent did not get enough CPU cycles to send its heartbeat. Here are the logs:
Dec 14 04:11:29.907268 arc-switch1025 NOTICE swss#orchagent: :- addNextHopGroup: Create next hop group fc00::22@PortChannel1014,fc00::2a@PortChannel1017
Dec 14 04:12:21.433385 arc-switch1025 ERR swss#supervisor-proc-exit-listener: Process 'orchagent' is stuck in namespace 'host' (1.0 minutes).
Dec 14 04:12:27.465717 arc-switch1025 NOTICE swss#orchagent: :- doTask: Get port state change notification id:1000000000022 status:1
Dec 14 04:12:27.465927 arc-switch1025 NOTICE swss#orchagent: :- updatePortOperStatus: Port Ethernet50 oper state set from down to up
From the sairedis rec, it appears that orchagent was busy for about 40 seconds performing route installation and setting values, but there are no logs for the last 20 seconds.
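To put a number on the silence in the syslog excerpt above, here is a minimal Python sketch that parses the four quoted lines and prints the gap between consecutive entries. The helper names (`parse_ts`, `gaps`) are made up for illustration, and the year 2023 is an assumption taken from the issue date, since syslog timestamps do not carry a year.

```python
from datetime import datetime

# The four syslog lines quoted in this issue, copied verbatim.
LOG_LINES = [
    "Dec 14 04:11:29.907268 arc-switch1025 NOTICE swss#orchagent: :- addNextHopGroup: Create next hop group fc00::22@PortChannel1014,fc00::2a@PortChannel1017",
    "Dec 14 04:12:21.433385 arc-switch1025 ERR swss#supervisor-proc-exit-listener: Process 'orchagent' is stuck in namespace 'host' (1.0 minutes).",
    "Dec 14 04:12:27.465717 arc-switch1025 NOTICE swss#orchagent: :- doTask: Get port state change notification id:1000000000022 status:1",
    "Dec 14 04:12:27.465927 arc-switch1025 NOTICE swss#orchagent: :- updatePortOperStatus: Port Ethernet50 oper state set from down to up",
]


def parse_ts(line, year=2023):
    """Parse the leading 'Mon DD HH:MM:SS.ffffff' fields of a syslog line.

    The year is an assumption (hypothetical default), not part of the log.
    """
    month, day, clock = line.split()[:3]
    return datetime.strptime(f"{year} {month} {day} {clock}", "%Y %b %d %H:%M:%S.%f")


def gaps(lines):
    """Return the seconds elapsed between each pair of consecutive lines."""
    stamps = [parse_ts(line) for line in lines]
    return [(later - earlier).total_seconds() for earlier, later in zip(stamps, stamps[1:])]


if __name__ == "__main__":
    for seconds in gaps(LOG_LINES):
        print(f"{seconds:.6f} s")
```

On these lines, the stuck alert is logged about 51.5 seconds after the last orchagent entry, and orchagent logs again about 6 seconds after the alert, so the total silence in syslog is a little under one minute.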
It is difficult to tell whether orchagent is stuck because it is busy or because a code issue has hung it, since in both cases orchagent does not send a heartbeat message within 1 minute.
For example, if a bug in the SAI API makes a route create request take more than 10 minutes, do we want the watchdog to report the issue or not?
I will create a PR to increase watchdog threshold for this issue.
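The trade-off can be illustrated with a minimal Python sketch of a heartbeat timeout check. This is not the actual supervisor-proc-exit-listener code; the class `HeartbeatWatchdog`, its methods, and the `threshold_s` parameter are hypothetical names used only for illustration. The point is that the monitor sees nothing but the time since the last heartbeat, so a busy process and a hung one look identical, and raising the threshold only changes how long the silence must last before it is reported.

```python
import time


class HeartbeatWatchdog:
    """Hypothetical heartbeat monitor (illustrative only, not the SONiC implementation)."""

    def __init__(self, threshold_s=60.0, clock=time.monotonic):
        self.threshold_s = threshold_s  # silence tolerated before reporting
        self._clock = clock             # injectable so the logic can be tested
        self._last_beat = {}            # process name -> time of last heartbeat

    def beat(self, process):
        """Record a heartbeat from the given process."""
        self._last_beat[process] = self._clock()

    def stuck(self):
        """Return {process: seconds_silent} for processes silent for at least threshold_s.

        Only elapsed time is available here, so this cannot tell a process
        that is busy from one that is hung.
        """
        now = self._clock()
        return {
            name: now - last
            for name, last in self._last_beat.items()
            if now - last >= self.threshold_s
        }
```

With `threshold_s=60.0`, a process that is silent for 60 seconds is reported whether it is installing routes or blocked in a buggy call; with a larger threshold the same silence is tolerated, at the cost of reporting a genuine hang later.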
Description
From swss rec
Steps to reproduce the issue:
Describe the results you received:
Error log seen
Describe the results you expected:
No error log should be seen
Output of show version:
Output of show techsupport:
Additional information you deem important (e.g. issue happens only occasionally):
sonic_dump_arc-switch1025_20231214_053630.tar.gz