Process supervision in large SW systems.

Thomas Tornblom thomas at uppsala.telesoft.se
Thu Nov 1 00:59:08 AEST 1990


What method do people use to control/create/kill/supervise different
processes in large software systems?

People are asking me how to implement a supervision system. It should be
responsible for checking that processes are alive and are feeling well.

It should also be able to in some intelligent way restart a process
that has died or that is doing the wrong thing.

Problem areas are interprocess communication, how to detect status changes
(strict hierarchy of processes?, catching SIGCHILD?).

People must have done this before in systems that requires high reliability.

The system is going to used on fault tolerant hardware in the future so
we need fault tolerant software.

E-mail prefered.

Thanks 
Thomas
-- 
Real life:      Thomas Tornblom             Email:  thomas at uppsala.telesoft.se
Snail mail:     Telesoft Uppsala AB         Phone:  +46 18 189406
                Box 1218                    Fax:    +46 18 132039
                S - 751 42 Uppsala, Sweden



More information about the Comp.unix.misc mailing list