Greg Hogan created FLINK-4212: --------------------------------- Summary: Lock on pid file when starting daemons Key: FLINK-4212 URL: https://issues.apache.org/jira/browse/FLINK-4212 Project: Flink Issue Type: Improvement Components: Startup Shell Scripts Affects Versions: 1.1.0 Reporter: Greg Hogan Assignee: Greg Hogan
As noted on the mailing list (0), when multiple TaskManagers are started in parallel (using pdsh) there is a race condition on updating the pid: 1) the pid file is first read to parse the process' index, 2) the process is started, and 3) on success the daemon pid is appended to the pid file. We could use a tool such as {{flock}} to lock on the pid file while starting the Flink daemon. 0: http://mail-archives.apache.org/mod_mbox/flink-user/201607.mbox/%3CCA%2BssbKXw954Bz_sBRwP6db0FntWyGWzTyP7wJZ5nhOeQnof3kg%40mail.gmail.com%3E -- This message was sent by Atlassian JIRA (v6.3.4#6332)