Hi,

I'm using Spark on an 8-node (+ headnode) cluster in standalone mode.
The headnode runs the master instance and the compute nodes are supposed to
run the workers.
No worker runs on the headnode.

However, as soon as I connect more than one worker, I get strange behaviour:
some applications run, some don't.
I have two test cases: one maps and then reduces a small parallelized list,
the other creates an rMAT graph with GraphX and prints the number of
vertices and edges.
The parallelized list kernel runs perfectly fine, while the graph kernel
freezes after creating the rMAT graph and before printing the number of
vertices/edges.

The problem could be related to this one:
http://apache-spark-user-list.1001560.n3.nabble.com/Problem-in-running-a-job-on-more-than-one-workers-tp12361.html
The difference is that my applications don't get removed by the master but
just freeze and I get a lot of EndpointAssociationExceptions in my worker
logs.

I double-checked interconnectivity, and all nodes seem able to reach each other.

Does anyone have an idea what might be causing the problem?

Here is the data I collected:

example code:
==================================================================
import org.apache.spark.SparkContext
import org.apache.spark.SparkContext._
import org.apache.spark.SparkConf
import org.apache.spark._
import org.apache.spark.graphx._
import org.apache.spark.graphx.util._
import org.apache.spark.rdd.RDD
import java.io._

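// Output stream that discards everything written to it; used below to
// silence console output while the graph is generated.
class NullOutStream extends OutputStream {
    // write(Int) is the only abstract method of java.io.OutputStream.
    override def write(i: Int): Unit = {}
}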

object GraphMinimal {
    def main(args: Array[String]): Unit = {
        try {
            val conf = new SparkConf().setAppName("spark-graphx-minimal")
            val sc = new SparkContext(conf)

            val s = if (args.length >= 2) args(0).toInt else 10
            val ef = if (args.length >= 2) args(1).toInt else 16
                
            val n = math.pow(2, s).toInt
            val m = n * ef
            print(s"Generating graph with ${n} vertices and ${m} edges...")
            val g1 = scala.Console.withOut(new NullOutStream()) {
                GraphGenerators.rmatGraph(sc, n, m)
            }
            println("done")
            println(s"Vertices: ${g1.numVertices}, edges: ${g1.numEdges}")
        } catch {
            case e: Exception => e.printStackTrace()
        }
    }
}

object ListMinimal {
    def main(args: Array[String]): Unit = {
        try {
            val conf = new SparkConf().setAppName("spark-list-minimal")
            val sc = new SparkContext(conf)

            val max = if (args.length >= 1) args(0).toInt else 10000

            val numElems = sc.parallelize(0 to max)
                                .map(x => 1).reduce(_ + _)
            println(numElems)

        } catch {
            case e: Exception => e.printStackTrace()
        }
    }
}


Test Application output with one worker:
==================================================================
list:
----------------------------------------------------------------------------------
Spark assembly has been built with Hive, including Datanucleus jars on
classpath
14/08/21 07:51:00 INFO Server: jetty-8.y.z-SNAPSHOT
14/08/21 07:51:00 INFO AbstractConnector: Started
SocketConnector@0.0.0.0:47461
14/08/21 07:51:00 INFO Server: jetty-8.y.z-SNAPSHOT
14/08/21 07:51:00 INFO AbstractConnector: Started
SocketConnector@0.0.0.0:58979
14/08/21 07:51:00 INFO Server: jetty-8.y.z-SNAPSHOT
14/08/21 07:51:00 INFO AbstractConnector: Started
SelectChannelConnector@0.0.0.0:4040
14/08/21 07:51:00 WARN NativeCodeLoader: Unable to load native-hadoop
library for your platform... using builtin-java classes where applicable
10001

(10001 shows that the application finished)

graph:
----------------------------------------------------------------------------------
Spark assembly has been built with Hive, including Datanucleus jars on
classpath
14/08/21 07:51:08 INFO Server: jetty-8.y.z-SNAPSHOT
14/08/21 07:51:08 INFO AbstractConnector: Started
SocketConnector@0.0.0.0:49574
14/08/21 07:51:08 INFO Server: jetty-8.y.z-SNAPSHOT
14/08/21 07:51:08 INFO AbstractConnector: Started
SocketConnector@0.0.0.0:55816
14/08/21 07:51:09 INFO Server: jetty-8.y.z-SNAPSHOT
14/08/21 07:51:09 INFO AbstractConnector: Started
SelectChannelConnector@0.0.0.0:4040
14/08/21 07:51:09 WARN NativeCodeLoader: Unable to load native-hadoop
library for your platform... using builtin-java classes where applicable
Generating graph with 1024 vertices and 16384 edges...done
Vertices: 512, edges: 16384

(Vertices: 512, edges: 16384 indicates finished application)

Test Application output with two workers:
==================================================================
list:
----------------------------------------------------------------------------------
Spark assembly has been built with Hive, including Datanucleus jars on
classpath
14/08/21 07:52:18 INFO Server: jetty-8.y.z-SNAPSHOT
14/08/21 07:52:18 INFO AbstractConnector: Started
SocketConnector@0.0.0.0:55800
14/08/21 07:52:18 INFO Server: jetty-8.y.z-SNAPSHOT
14/08/21 07:52:18 INFO AbstractConnector: Started
SocketConnector@0.0.0.0:35448
14/08/21 07:52:19 INFO Server: jetty-8.y.z-SNAPSHOT
14/08/21 07:52:19 INFO AbstractConnector: Started
SelectChannelConnector@0.0.0.0:4040
14/08/21 07:52:19 WARN NativeCodeLoader: Unable to load native-hadoop
library for your platform... using builtin-java classes where applicable
10001

(10001 shows that the application finished with two workers, too)

graph:
----------------------------------------------------------------------------------
Spark assembly has been built with Hive, including Datanucleus jars on
classpath
14/08/21 07:52:27 INFO Server: jetty-8.y.z-SNAPSHOT
14/08/21 07:52:27 INFO AbstractConnector: Started
SocketConnector@0.0.0.0:41677
14/08/21 07:52:27 INFO Server: jetty-8.y.z-SNAPSHOT
14/08/21 07:52:27 INFO AbstractConnector: Started
SocketConnector@0.0.0.0:39059
14/08/21 07:52:27 INFO Server: jetty-8.y.z-SNAPSHOT
14/08/21 07:52:27 INFO AbstractConnector: Started
SelectChannelConnector@0.0.0.0:4040
14/08/21 07:52:28 WARN NativeCodeLoader: Unable to load native-hadoop
library for your platform... using builtin-java classes where applicable
Generating graph with 1024 vertices and 16384 edges...done

(No vertex/edge counts were printed; the application was killed by a manual
timeout after 60 seconds)

Master Log:
==================================================================
Spark assembly has been built with Hive, including Datanucleus jars on
classpath
Spark Command: java -cp
::/opt/spark-1.0.2/conf:/opt/spark-1.0.2/lib/spark-assembly-1.0.2-hadoop2.2.0.jar:/opt/spark-1.0.2/lib/datanucleus-core-3.2.2.jar:/opt/spark-1.0.2/lib/datanucleus-rdbms-3.2.1.jar:/opt/spark-1.0.2/lib/datanucleus-api-jdo-3.2.1.jar
-XX:MaxPermSize=128m -Dspark.akka.logLifecycleEvents=true -Xms512m -Xmx512m
org.apache.spark.deploy.master.Master --ip 192.168.23.1 --port 7077
--webui-port 8080
========================================

14/08/21 07:49:57 INFO Server: jetty-8.y.z-SNAPSHOT
14/08/21 07:49:57 INFO AbstractConnector: Started
SelectChannelConnector@0.0.0.0:8080
14/08/21 07:51:05 ERROR EndpointWriter: AssociationError
[akka.tcp://sparkMaster@192.168.23.1:7077] ->
[akka.tcp://spark@192.168.23.1:58887]: Error [Association failed with
[akka.tcp://spark@192.168.23.1:58887]] [
akka.remote.EndpointAssociationException: Association failed with
[akka.tcp://spark@192.168.23.1:58887]
Caused by:
akka.remote.transport.netty.NettyTransport$$anonfun$associate$1$$anon$2:
Connection refused: /192.168.23.1:58887
]
14/08/21 07:51:05 ERROR EndpointWriter: AssociationError
[akka.tcp://sparkMaster@192.168.23.1:7077] ->
[akka.tcp://spark@192.168.23.1:58887]: Error [Association failed with
[akka.tcp://spark@192.168.23.1:58887]] [
akka.remote.EndpointAssociationException: Association failed with
[akka.tcp://spark@192.168.23.1:58887]
Caused by:
akka.remote.transport.netty.NettyTransport$$anonfun$associate$1$$anon$2:
Connection refused: /192.168.23.1:58887
]
14/08/21 07:51:05 ERROR EndpointWriter: AssociationError
[akka.tcp://sparkMaster@192.168.23.1:7077] ->
[akka.tcp://spark@192.168.23.1:58887]: Error [Association failed with
[akka.tcp://spark@192.168.23.1:58887]] [
akka.remote.EndpointAssociationException: Association failed with
[akka.tcp://spark@192.168.23.1:58887]
Caused by:
akka.remote.transport.netty.NettyTransport$$anonfun$associate$1$$anon$2:
Connection refused: /192.168.23.1:58887
]
14/08/21 07:51:06 WARN Master: Got status update for unknown executor
app-20140821075101-0000/0
14/08/21 07:51:15 ERROR EndpointWriter: AssociationError
[akka.tcp://sparkMaster@192.168.23.1:7077] ->
[akka.tcp://spark@192.168.23.1:45365]: Error [Association failed with
[akka.tcp://spark@192.168.23.1:45365]] [
akka.remote.EndpointAssociationException: Association failed with
[akka.tcp://spark@192.168.23.1:45365]
Caused by:
akka.remote.transport.netty.NettyTransport$$anonfun$associate$1$$anon$2:
Connection refused: /192.168.23.1:45365
]
14/08/21 07:51:15 ERROR EndpointWriter: AssociationError
[akka.tcp://sparkMaster@192.168.23.1:7077] ->
[akka.tcp://spark@192.168.23.1:45365]: Error [Association failed with
[akka.tcp://spark@192.168.23.1:45365]] [
akka.remote.EndpointAssociationException: Association failed with
[akka.tcp://spark@192.168.23.1:45365]
Caused by:
akka.remote.transport.netty.NettyTransport$$anonfun$associate$1$$anon$2:
Connection refused: /192.168.23.1:45365
]
14/08/21 07:51:15 ERROR EndpointWriter: AssociationError
[akka.tcp://sparkMaster@192.168.23.1:7077] ->
[akka.tcp://spark@192.168.23.1:45365]: Error [Association failed with
[akka.tcp://spark@192.168.23.1:45365]] [
akka.remote.EndpointAssociationException: Association failed with
[akka.tcp://spark@192.168.23.1:45365]
Caused by:
akka.remote.transport.netty.NettyTransport$$anonfun$associate$1$$anon$2:
Connection refused: /192.168.23.1:45365
]
14/08/21 07:51:15 WARN Master: Got status update for unknown executor
app-20140821075110-0001/0
14/08/21 07:52:24 ERROR EndpointWriter: AssociationError
[akka.tcp://sparkMaster@192.168.23.1:7077] ->
[akka.tcp://spark@192.168.23.1:34793]: Error [Association failed with
[akka.tcp://spark@192.168.23.1:34793]] [
akka.remote.EndpointAssociationException: Association failed with
[akka.tcp://spark@192.168.23.1:34793]
Caused by:
akka.remote.transport.netty.NettyTransport$$anonfun$associate$1$$anon$2:
Connection refused: /192.168.23.1:34793
]
14/08/21 07:52:24 ERROR EndpointWriter: AssociationError
[akka.tcp://sparkMaster@192.168.23.1:7077] ->
[akka.tcp://spark@192.168.23.1:34793]: Error [Association failed with
[akka.tcp://spark@192.168.23.1:34793]] [
akka.remote.EndpointAssociationException: Association failed with
[akka.tcp://spark@192.168.23.1:34793]
Caused by:
akka.remote.transport.netty.NettyTransport$$anonfun$associate$1$$anon$2:
Connection refused: /192.168.23.1:34793
]
14/08/21 07:52:24 ERROR EndpointWriter: AssociationError
[akka.tcp://sparkMaster@192.168.23.1:7077] ->
[akka.tcp://spark@192.168.23.1:34793]: Error [Association failed with
[akka.tcp://spark@192.168.23.1:34793]] [
akka.remote.EndpointAssociationException: Association failed with
[akka.tcp://spark@192.168.23.1:34793]
Caused by:
akka.remote.transport.netty.NettyTransport$$anonfun$associate$1$$anon$2:
Connection refused: /192.168.23.1:34793
]
14/08/21 07:52:24 WARN Master: Got status update for unknown executor
app-20140821075220-0002/1
14/08/21 07:52:24 WARN Master: Got status update for unknown executor
app-20140821075220-0002/0
14/08/21 07:53:24 ERROR EndpointWriter: AssociationError
[akka.tcp://sparkMaster@192.168.23.1:7077] ->
[akka.tcp://spark@192.168.23.1:43727]: Error [Association failed with
[akka.tcp://spark@192.168.23.1:43727]] [
akka.remote.EndpointAssociationException: Association failed with
[akka.tcp://spark@192.168.23.1:43727]
Caused by:
akka.remote.transport.netty.NettyTransport$$anonfun$associate$1$$anon$2:
Connection refused: /192.168.23.1:43727
]
14/08/21 07:53:24 ERROR EndpointWriter: AssociationError
[akka.tcp://sparkMaster@192.168.23.1:7077] ->
[akka.tcp://spark@192.168.23.1:43727]: Error [Association failed with
[akka.tcp://spark@192.168.23.1:43727]] [
akka.remote.EndpointAssociationException: Association failed with
[akka.tcp://spark@192.168.23.1:43727]
Caused by:
akka.remote.transport.netty.NettyTransport$$anonfun$associate$1$$anon$2:
Connection refused: /192.168.23.1:43727
]
14/08/21 07:53:24 ERROR EndpointWriter: AssociationError
[akka.tcp://sparkMaster@192.168.23.1:7077] ->
[akka.tcp://spark@192.168.23.1:43727]: Error [Association failed with
[akka.tcp://spark@192.168.23.1:43727]] [
akka.remote.EndpointAssociationException: Association failed with
[akka.tcp://spark@192.168.23.1:43727]
Caused by:
akka.remote.transport.netty.NettyTransport$$anonfun$associate$1$$anon$2:
Connection refused: /192.168.23.1:43727
]
14/08/21 07:53:25 WARN Master: Got status update for unknown executor
app-20140821075228-0003/1
14/08/21 07:53:25 WARN Master: Got status update for unknown executor
app-20140821075228-0003/0


Worker 1 Log:
==================================================================
Spark assembly has been built with Hive, including Datanucleus jars on
classpath
14/08/21 12:01:53 INFO Server: jetty-8.y.z-SNAPSHOT
14/08/21 12:01:53 INFO AbstractConnector: Started
SelectChannelConnector@0.0.0.0:8081
Spark assembly has been built with Hive, including Datanucleus jars on
classpath
14/08/21 12:02:30 ERROR EndpointWriter: AssociationError
[akka.tcp://sparkwor...@octane001.ziti.uni-heidelberg.de:43823] ->
[akka.tcp://sparkexecu...@octane001.ziti.uni-heidelberg.de:48301]: Error
[Association failed with
[akka.tcp://sparkexecu...@octane001.ziti.uni-heidelberg.de:48301]] [
akka.remote.EndpointAssociationException: Association failed with
[akka.tcp://sparkexecu...@octane001.ziti.uni-heidelberg.de:48301]
Caused by:
akka.remote.transport.netty.NettyTransport$$anonfun$associate$1$$anon$2:
Connection refused: octane001.ziti.uni-heidelberg.de/127.0.1.1:48301
]
14/08/21 12:02:30 ERROR EndpointWriter: AssociationError
[akka.tcp://sparkwor...@octane001.ziti.uni-heidelberg.de:43823] ->
[akka.tcp://sparkexecu...@octane001.ziti.uni-heidelberg.de:48301]: Error
[Association failed with
[akka.tcp://sparkexecu...@octane001.ziti.uni-heidelberg.de:48301]] [
akka.remote.EndpointAssociationException: Association failed with
[akka.tcp://sparkexecu...@octane001.ziti.uni-heidelberg.de:48301]
Caused by:
akka.remote.transport.netty.NettyTransport$$anonfun$associate$1$$anon$2:
Connection refused: octane001.ziti.uni-heidelberg.de/127.0.1.1:48301
]
14/08/21 12:02:30 ERROR EndpointWriter: AssociationError
[akka.tcp://sparkwor...@octane001.ziti.uni-heidelberg.de:43823] ->
[akka.tcp://sparkexecu...@octane001.ziti.uni-heidelberg.de:48301]: Error
[Association failed with
[akka.tcp://sparkexecu...@octane001.ziti.uni-heidelberg.de:48301]] [
akka.remote.EndpointAssociationException: Association failed with
[akka.tcp://sparkexecu...@octane001.ziti.uni-heidelberg.de:48301]
Caused by:
akka.remote.transport.netty.NettyTransport$$anonfun$associate$1$$anon$2:
Connection refused: octane001.ziti.uni-heidelberg.de/127.0.1.1:48301
]
Spark assembly has been built with Hive, including Datanucleus jars on
classpath
14/08/21 12:02:39 ERROR EndpointWriter: AssociationError
[akka.tcp://sparkwor...@octane001.ziti.uni-heidelberg.de:43823] ->
[akka.tcp://sparkexecu...@octane001.ziti.uni-heidelberg.de:59128]: Error
[Association failed with
[akka.tcp://sparkexecu...@octane001.ziti.uni-heidelberg.de:59128]] [
akka.remote.EndpointAssociationException: Association failed with
[akka.tcp://sparkexecu...@octane001.ziti.uni-heidelberg.de:59128]
Caused by:
akka.remote.transport.netty.NettyTransport$$anonfun$associate$1$$anon$2:
Connection refused: octane001.ziti.uni-heidelberg.de/127.0.1.1:59128
]
14/08/21 12:02:39 ERROR EndpointWriter: AssociationError
[akka.tcp://sparkwor...@octane001.ziti.uni-heidelberg.de:43823] ->
[akka.tcp://sparkexecu...@octane001.ziti.uni-heidelberg.de:59128]: Error
[Association failed with
[akka.tcp://sparkexecu...@octane001.ziti.uni-heidelberg.de:59128]] [
akka.remote.EndpointAssociationException: Association failed with
[akka.tcp://sparkexecu...@octane001.ziti.uni-heidelberg.de:59128]
Caused by:
akka.remote.transport.netty.NettyTransport$$anonfun$associate$1$$anon$2:
Connection refused: octane001.ziti.uni-heidelberg.de/127.0.1.1:59128
]
14/08/21 12:02:39 ERROR EndpointWriter: AssociationError
[akka.tcp://sparkwor...@octane001.ziti.uni-heidelberg.de:43823] ->
[akka.tcp://sparkexecu...@octane001.ziti.uni-heidelberg.de:59128]: Error
[Association failed with
[akka.tcp://sparkexecu...@octane001.ziti.uni-heidelberg.de:59128]] [
akka.remote.EndpointAssociationException: Association failed with
[akka.tcp://sparkexecu...@octane001.ziti.uni-heidelberg.de:59128]
Caused by:
akka.remote.transport.netty.NettyTransport$$anonfun$associate$1$$anon$2:
Connection refused: octane001.ziti.uni-heidelberg.de/127.0.1.1:59128
]
Spark assembly has been built with Hive, including Datanucleus jars on
classpath
14/08/21 12:03:48 ERROR EndpointWriter: AssociationError
[akka.tcp://sparkwor...@octane001.ziti.uni-heidelberg.de:43823] ->
[akka.tcp://sparkexecu...@octane001.ziti.uni-heidelberg.de:50671]: Error
[Association failed with
[akka.tcp://sparkexecu...@octane001.ziti.uni-heidelberg.de:50671]] [
akka.remote.EndpointAssociationException: Association failed with
[akka.tcp://sparkexecu...@octane001.ziti.uni-heidelberg.de:50671]
Caused by:
akka.remote.transport.netty.NettyTransport$$anonfun$associate$1$$anon$2:
Connection refused: octane001.ziti.uni-heidelberg.de/127.0.1.1:50671
]
14/08/21 12:03:48 ERROR EndpointWriter: AssociationError
[akka.tcp://sparkwor...@octane001.ziti.uni-heidelberg.de:43823] ->
[akka.tcp://sparkexecu...@octane001.ziti.uni-heidelberg.de:50671]: Error
[Association failed with
[akka.tcp://sparkexecu...@octane001.ziti.uni-heidelberg.de:50671]] [
akka.remote.EndpointAssociationException: Association failed with
[akka.tcp://sparkexecu...@octane001.ziti.uni-heidelberg.de:50671]
Caused by:
akka.remote.transport.netty.NettyTransport$$anonfun$associate$1$$anon$2:
Connection refused: octane001.ziti.uni-heidelberg.de/127.0.1.1:50671
]
14/08/21 12:03:48 ERROR EndpointWriter: AssociationError
[akka.tcp://sparkwor...@octane001.ziti.uni-heidelberg.de:43823] ->
[akka.tcp://sparkexecu...@octane001.ziti.uni-heidelberg.de:50671]: Error
[Association failed with
[akka.tcp://sparkexecu...@octane001.ziti.uni-heidelberg.de:50671]] [
akka.remote.EndpointAssociationException: Association failed with
[akka.tcp://sparkexecu...@octane001.ziti.uni-heidelberg.de:50671]
Caused by:
akka.remote.transport.netty.NettyTransport$$anonfun$associate$1$$anon$2:
Connection refused: octane001.ziti.uni-heidelberg.de/127.0.1.1:50671
]
Spark assembly has been built with Hive, including Datanucleus jars on
classpath
14/08/21 12:04:49 ERROR EndpointWriter: AssociationError
[akka.tcp://sparkwor...@octane001.ziti.uni-heidelberg.de:43823] ->
[akka.tcp://sparkexecu...@octane001.ziti.uni-heidelberg.de:51517]: Error
[Association failed with
[akka.tcp://sparkexecu...@octane001.ziti.uni-heidelberg.de:51517]] [
akka.remote.EndpointAssociationException: Association failed with
[akka.tcp://sparkexecu...@octane001.ziti.uni-heidelberg.de:51517]
Caused by:
akka.remote.transport.netty.NettyTransport$$anonfun$associate$1$$anon$2:
Connection refused: octane001.ziti.uni-heidelberg.de/127.0.1.1:51517
]
14/08/21 12:04:49 ERROR EndpointWriter: AssociationError
[akka.tcp://sparkwor...@octane001.ziti.uni-heidelberg.de:43823] ->
[akka.tcp://sparkexecu...@octane001.ziti.uni-heidelberg.de:51517]: Error
[Association failed with
[akka.tcp://sparkexecu...@octane001.ziti.uni-heidelberg.de:51517]] [
akka.remote.EndpointAssociationException: Association failed with
[akka.tcp://sparkexecu...@octane001.ziti.uni-heidelberg.de:51517]
Caused by:
akka.remote.transport.netty.NettyTransport$$anonfun$associate$1$$anon$2:
Connection refused: octane001.ziti.uni-heidelberg.de/127.0.1.1:51517
]
14/08/21 12:04:49 ERROR EndpointWriter: AssociationError
[akka.tcp://sparkwor...@octane001.ziti.uni-heidelberg.de:43823] ->
[akka.tcp://sparkexecu...@octane001.ziti.uni-heidelberg.de:51517]: Error
[Association failed with
[akka.tcp://sparkexecu...@octane001.ziti.uni-heidelberg.de:51517]] [
akka.remote.EndpointAssociationException: Association failed with
[akka.tcp://sparkexecu...@octane001.ziti.uni-heidelberg.de:51517]
Caused by:
akka.remote.transport.netty.NettyTransport$$anonfun$associate$1$$anon$2:
Connection refused: octane001.ziti.uni-heidelberg.de/127.0.1.1:51517
]


Worker 2 Log:
==================================================================
Spark assembly has been built with Hive, including Datanucleus jars on
classpath
14/08/21 12:03:38 INFO Server: jetty-8.y.z-SNAPSHOT
14/08/21 12:03:38 INFO AbstractConnector: Started
SelectChannelConnector@0.0.0.0:8081
Spark assembly has been built with Hive, including Datanucleus jars on
classpath
14/08/21 12:04:15 ERROR EndpointWriter: AssociationError
[akka.tcp://sparkwor...@octane002.ziti.uni-heidelberg.de:39978] ->
[akka.tcp://sparkexecu...@octane002.ziti.uni-heidelberg.de:44308]: Error
[Association failed with
[akka.tcp://sparkexecu...@octane002.ziti.uni-heidelberg.de:44308]] [
akka.remote.EndpointAssociationException: Association failed with
[akka.tcp://sparkexecu...@octane002.ziti.uni-heidelberg.de:44308]
Caused by:
akka.remote.transport.netty.NettyTransport$$anonfun$associate$1$$anon$2:
Connection refused: octane002.ziti.uni-heidelberg.de/127.0.1.1:44308
]
14/08/21 12:04:15 ERROR EndpointWriter: AssociationError
[akka.tcp://sparkwor...@octane002.ziti.uni-heidelberg.de:39978] ->
[akka.tcp://sparkexecu...@octane002.ziti.uni-heidelberg.de:44308]: Error
[Association failed with
[akka.tcp://sparkexecu...@octane002.ziti.uni-heidelberg.de:44308]] [
akka.remote.EndpointAssociationException: Association failed with
[akka.tcp://sparkexecu...@octane002.ziti.uni-heidelberg.de:44308]
Caused by:
akka.remote.transport.netty.NettyTransport$$anonfun$associate$1$$anon$2:
Connection refused: octane002.ziti.uni-heidelberg.de/127.0.1.1:44308
]
14/08/21 12:04:15 ERROR EndpointWriter: AssociationError
[akka.tcp://sparkwor...@octane002.ziti.uni-heidelberg.de:39978] ->
[akka.tcp://sparkexecu...@octane002.ziti.uni-heidelberg.de:44308]: Error
[Association failed with
[akka.tcp://sparkexecu...@octane002.ziti.uni-heidelberg.de:44308]] [
akka.remote.EndpointAssociationException: Association failed with
[akka.tcp://sparkexecu...@octane002.ziti.uni-heidelberg.de:44308]
Caused by:
akka.remote.transport.netty.NettyTransport$$anonfun$associate$1$$anon$2:
Connection refused: octane002.ziti.uni-heidelberg.de/127.0.1.1:44308
]
Spark assembly has been built with Hive, including Datanucleus jars on
classpath
14/08/21 12:05:16 ERROR EndpointWriter: AssociationError
[akka.tcp://sparkwor...@octane002.ziti.uni-heidelberg.de:39978] ->
[akka.tcp://sparkexecu...@octane002.ziti.uni-heidelberg.de:52528]: Error
[Association failed with
[akka.tcp://sparkexecu...@octane002.ziti.uni-heidelberg.de:52528]] [
akka.remote.EndpointAssociationException: Association failed with
[akka.tcp://sparkexecu...@octane002.ziti.uni-heidelberg.de:52528]
Caused by:
akka.remote.transport.netty.NettyTransport$$anonfun$associate$1$$anon$2:
Connection refused: octane002.ziti.uni-heidelberg.de/127.0.1.1:52528
]
14/08/21 12:05:16 ERROR EndpointWriter: AssociationError
[akka.tcp://sparkwor...@octane002.ziti.uni-heidelberg.de:39978] ->
[akka.tcp://sparkexecu...@octane002.ziti.uni-heidelberg.de:52528]: Error
[Association failed with
[akka.tcp://sparkexecu...@octane002.ziti.uni-heidelberg.de:52528]] [
akka.remote.EndpointAssociationException: Association failed with
[akka.tcp://sparkexecu...@octane002.ziti.uni-heidelberg.de:52528]
Caused by:
akka.remote.transport.netty.NettyTransport$$anonfun$associate$1$$anon$2:
Connection refused: octane002.ziti.uni-heidelberg.de/127.0.1.1:52528
]
14/08/21 12:05:16 ERROR EndpointWriter: AssociationError
[akka.tcp://sparkwor...@octane002.ziti.uni-heidelberg.de:39978] ->
[akka.tcp://sparkexecu...@octane002.ziti.uni-heidelberg.de:52528]: Error
[Association failed with
[akka.tcp://sparkexecu...@octane002.ziti.uni-heidelberg.de:52528]] [
akka.remote.EndpointAssociationException: Association failed with
[akka.tcp://sparkexecu...@octane002.ziti.uni-heidelberg.de:52528]
Caused by:
akka.remote.transport.netty.NettyTransport$$anonfun$associate$1$$anon$2:
Connection refused: octane002.ziti.uni-heidelberg.de/127.0.1.1:52528
]
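
One detail visible in the worker logs above is that each worker's own
hostname (e.g. octane001.ziti.uni-heidelberg.de) resolves to 127.0.1.1, a
loopback address. Whether this is related to the freeze is not established
here. The following is only a minimal sketch of how that resolution could be
checked from the JVM on each node; the object name HostnameCheck and the
helper isLoopback are made up for illustration and are not part of Spark.

```scala
import java.net.InetAddress

// Hypothetical helper (not part of Spark): reports how the local hostname
// resolves from the JVM's point of view.
object HostnameCheck {
  // True if the given literal IP address is a loopback address
  // (for IPv4, anything in 127.0.0.0/8).
  def isLoopback(ip: String): Boolean =
    InetAddress.getByName(ip).isLoopbackAddress

  def main(args: Array[String]): Unit = {
    // The name and address this JVM associates with the local host.
    val local = InetAddress.getLocalHost
    val ip = local.getHostAddress
    println(s"${local.getHostName} -> $ip (loopback: ${isLoopback(ip)})")
  }
}
```

Running this on the headnode and on each compute node shows whether a node
maps its own hostname to a loopback address rather than to an address on the
cluster network.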

Thanks in advance for your help and best regards,
Alexander


