Hello, We are experiencing an issue where multiple workers are being assigned to the same port, causing heartbeat timeout. Here is the gist showing the supervisor and worker log in time order so you can see the supervisor launching the worker and waiting, the worker jvm fails due to bind exception, supervisor gives up due to timeout.
https://gist.github.com/anonymous/9835036 It should be noted that we are seeing odd scheduling behavior where one machine is getting overloaded with workers. the logs in this gist are from the machine that gets overloaded. It typically gets 8 workers where other machines only get 2 workers. I am happy to provide more details. Thanks, Luke Forehand | Networked Insights | Software Engineer
