Details
-
Bug
-
Status: Complete
-
Medium
-
Resolution: Done
-
None
-
None
-
None
-
None
-
indy-node 1.2.50-stable
-
Sprint 18.03 Stability, DKMS, Sprint 18.04
Description
While testing pool upgrade was noted, that nodes which failed the upgrade procedure had broken indy-node processes. In systemd service we have option like 'Restart=on-failure' that meens that service must be restarted when running process will broke (by KeyError unhandled exception for example). All of nodes with faulty upgrade procedure had unhandled exception and systemctl did not restart indy-node service.
The main assumption is that main python process had unhandled exception, was stopped but child processes was not stopped and systemd not restart service.
Ways to investigate:
- There is several type of service - 'forked' and 'sample'. What of type do we use dy default?
- How many subprocesses we use for indy-node service
Also, set 'SIGTERM' into list of handled signal.
Attachments
Issue Links
- relates to
-
INDY-1150 Pool upgrade was scheduled but was not happened on part of nodes
-
- Complete
-