Hello,

We are experiencing an issue where multiple workers are assigned to the
same port, causing heartbeat timeouts.  The gist below shows the supervisor
and worker logs in time order: the supervisor launches the worker and
waits, the worker JVM fails with a bind exception, and the supervisor
gives up after the timeout.

https://gist.github.com/anonymous/9835036

We are also seeing odd scheduling behavior where one machine is getting
overloaded with workers.  The logs in this gist are from that machine.  It
typically gets 8 workers, while the other machines get only 2 workers.  I am
happy to provide more details.

Thanks,
Luke Forehand |  Networked Insights  |  Software Engineer
