Steve Loughran created HADOOP-9086:
--------------------------------------

             Summary: Enforce process singleton rules through an exclusive 
write lock on a file, not a pid file +kill -0,
                 Key: HADOOP-9086
                 URL: https://issues.apache.org/jira/browse/HADOOP-9086
             Project: Hadoop Common
          Issue Type: Improvement
          Components: util
    Affects Versions: 1.1.1, 2.0.3-alpha
         Environment: Unix/Linux. 
            Reporter: Steve Loughran


the {{hadoop-daemon.sh}} script (and other liveness monitors) probe the 
existence of a daemon service by a {{kill -0}} of a process id picked up from a 
pid file. 
This is flawed
# pid file locations may change with installations.
# Linux and Unix recycle pids, leading to false positives -the scripts think 
the process is running, when another process is.
# doesn't work on windows.

Having the processes acquire an exclusive write-lock on a known file would 
delegate lock management and implicitly liveness to the OS itself. when the 
process dies, the lock is relased (on Unixes)

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Reply via email to