Skip to content

Commit 009b0dd

Browse files
liuh-80mssonicbld
authored andcommitted
Change orchagent stuck message from ERR to WARNING (#17872)
Change orchagent stuck message from ERR to WARNING #### Why I did it During switch initialization, sometime Orchagent will busy for more than 40seconds and will trigger process stuck workdog error. To improve this issue, change watchdog error message to warning message. ##### Work item tracking - Microsoft ADO: 26517622 #### How I did it Change orchagent stuck message from ERR to WARNING. #### How to verify it Pass all UT. ### Description for the changelog Change orchagent stuck message from ERR to WARNING.
1 parent a467ff7 commit 009b0dd

File tree

1 file changed

+3
-3
lines changed

1 file changed

+3
-3
lines changed

files/scripts/supervisor-proc-exit-listener

+3-3
Original file line numberDiff line numberDiff line change
@@ -73,7 +73,7 @@ def get_group_and_process_list(process_file):
7373
return group_list, process_list
7474

7575

76-
def generate_alerting_message(process_name, status, dead_minutes):
76+
def generate_alerting_message(process_name, status, dead_minutes, priority=syslog.LOG_ERR):
7777
"""
7878
@summary: If a critical process was not running, this function will determine it resides in host
7979
or in a specific namespace. Then an alerting message will be written into syslog.
@@ -86,7 +86,7 @@ def generate_alerting_message(process_name, status, dead_minutes):
8686
else:
8787
namespace = namespace_prefix + namespace_id
8888

89-
syslog.syslog(syslog.LOG_ERR, "Process '{}' is {} in namespace '{}' ({} minutes)."
89+
syslog.syslog(priority, "Process '{}' is {} in namespace '{}' ({} minutes)."
9090
.format(process_name, status, namespace, dead_minutes))
9191

9292

@@ -213,7 +213,7 @@ def main(argv):
213213
elapsed_secs = epoch_time - process_heart_beat_info[process]["last_heart_beat"]
214214
if elapsed_secs >= ALERTING_INTERVAL_SECS:
215215
elapsed_mins = elapsed_secs // 60
216-
generate_alerting_message(process, "stuck", elapsed_mins)
216+
generate_alerting_message(process, "stuck", elapsed_mins, syslog.LOG_WARNING)
217217

218218
if __name__ == "__main__":
219219
main(sys.argv[1:])

0 commit comments

Comments
 (0)