Possibly one of your executors is in the middle of a large stop-the-world
GC and doesn't respond to network traffic during that period?  If you
shared some information about how each node in your cluster is set up (heap
size, memory, CPU, etc) that might help with debugging.

Andrew


On Mon, Mar 24, 2014 at 9:13 PM, Nathan Kronenfeld <
nkronenf...@oculusinfo.com> wrote:

> What does this error mean:
>
> @hadoop-s2.oculus.local:45186]: Error [Association failed with
> [akka.tcp://spark@hadoop-s2.oculus.local:45186]] [
> akka.remote.EndpointAssociationException: Association failed with
> [akka.tcp://spark@hadoop-s2.oculus.local:45186]
> Caused by:
> akka.remote.transport.netty.NettyTransport$$anonfun$associate$1$$anon$2:
> Connection refused: hadoop-s2.oculus.loca\
> l/192.168.0.47:45186
> ]
>
> ?
>
> --
> Nathan Kronenfeld
> Senior Visualization Developer
> Oculus Info Inc
> 2 Berkeley Street, Suite 600,
> Toronto, Ontario M5A 4J5
> Phone:  +1-416-203-3003 x 238
> Email:  nkronenf...@oculusinfo.com
>

Reply via email to