You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
When thermalctld exits, object finalizers run that attempt to connect to chassisdb to clean up table entries. This causes the process to crash if the database-chassis container isn't reachable, which can happen under certain circumstances during chassis reboot.
Steps to reproduce the issue:
Boot up a chassis system.
Kill the database-chassis container on the supervisor (may also need to disable the systemd service so it isn't automatically restarted)
On a linecard, kill the thermalctld process.
Describe the results you received:
The thermalctld process exited with the following exception:
2024 Dec 5 21:01:54.131224 nfc420-7 INFO pmon#thermalctld: Stop thermal monitoring loop
2024 Dec 5 21:01:54.135852 nfc420-7 INFO pmon#supervisord: thermalctld Exception ignored in: <function TemperatureUpdater.__del__ at 0x7f33cceb0040>
2024 Dec 5 21:01:54.135927 nfc420-7 INFO pmon#supervisord: thermalctld Traceback (most recent call last):
2024 Dec 5 21:01:54.136005 nfc420-7 INFO pmon#supervisord: thermalctld File "/usr/local/bin/thermalctld", line 561, in __del__
2024 Dec 5 21:01:54.136051 nfc420-7 INFO pmon#supervisord: thermalctld self.chassis_table._del(tk)
2024 Dec 5 21:01:54.136105 nfc420-7 INFO pmon#supervisord: thermalctld File "/usr/lib/python3/dist-packages/swsscommon/swsscommon.py", line 2970, in _del
2024 Dec 5 21:01:54.136634 nfc420-7 INFO pmon#supervisord: thermalctld return self.delete(*args, **kwargs)
2024 Dec 5 21:01:54.136686 nfc420-7 INFO pmon#supervisord: thermalctld ^^^^^^^^^^^^^^^^^^^^^^^^^^^^
2024 Dec 5 21:01:54.136724 nfc420-7 INFO pmon#supervisord: thermalctld File "/usr/lib/python3/dist-packages/swsscommon/swsscommon.py", line 2964, in delete
2024 Dec 5 21:01:54.137163 nfc420-7 INFO pmon#supervisord: thermalctld return _swsscommon.Table_delete(self, *args, **kwargs)
2024 Dec 5 21:01:54.137163 nfc420-7 INFO pmon#supervisord: thermalctld ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
2024 Dec 5 21:01:54.137182 nfc420-7 INFO pmon#supervisord: thermalctld RuntimeError: RedisError: Failed to redisGetReply in RedisPipeline::pop, err=3: errstr=Server closed the connection
2024 Dec 5 21:01:54.137182 nfc420-7 INFO pmon#supervisord: thermalctld terminate called after throwing an instance of 'swss::RedisError'
2024 Dec 5 21:01:55.188986 nfc420-7 INFO pmon#supervisord: thermalctld what(): RedisError: Failed to redisGetReply in RedisPipeline::pop, err=3: errstr=Server closed the connection
2024 Dec 5 21:01:55.787212 nfc420-7 INFO pmon#supervisord 2024-12-05 21:01:55,786 WARN exited: thermalctld (terminated by SIGABRT (core dumped); not expected)
Description
When thermalctld exits, object finalizers run that attempt to connect to chassisdb to clean up table entries. This causes the process to crash if the database-chassis container isn't reachable, which can happen under certain circumstances during chassis reboot.
Steps to reproduce the issue:
Describe the results you received:
The thermalctld process exited with the following exception:
Describe the results you expected:
The thermalctld process exist cleanly.
Output of
show version
:Output of
show techsupport
:Additional information you deem important (e.g. issue happens only occasionally):
This is occassionally triggered by sonic-mgmt reboot or config reload tests.
The text was updated successfully, but these errors were encountered: