So this is pretty weird, and my debugging hasn't made much progress, so I
thought I'd ask for help.

I have a medium size dataset, but I am developing my code against a much
smaller version since it is much smaller and faster to work with. I have
deployed a cluster on EC2 using the default scripts and the AMI and loaded
my simple dataset into the ephermeral HDFS.

When I ssh into the master node and run my application using "local",
everything works. However, if I just change to using distributed spark
(spark://...:7077) then everything starts up, but the shuffle operations
start failing with the error:
3/12/02 06:47:33 INFO spark.MapOutputTrackerActor: Asked to send map output
locations for shuffle 0 to ip-10-38-11-59.ec2.internal:39992
...
13/12/02 06:49:19 WARN storage.BlockManagerMasterActor: Removing
BlockManager BlockManagerId(0, ip-10-38-11-59.ec2.internal, 46556, 0) with
no recent heart beats: 51656ms exceeds 45000ms

I looked around the prior documentation and put the extra options into
SPARK_JAVA_OPTS, and increased them to 90000 from the 30000 in the example
but I get the same error.
-Dspark.worker.timeout=90000 -Dspark.akka.timeout=90000
-Dspark.storage.blockManagerHeartBeatMs=90000
-Dspark.akka.retry.wait=90000 -Dspark.akka.frameSize=30000
-Dsun.rmi.dgc.server.gcInterval=3600000

I don't think that this is a memory issue because the dataset fits on the
master alone and I am successfully able to run my program with "local". I
checked the stdout and stderr on the worker, and one of the few times that
there was a stacktrace, it was from here:

148   private def askDriverWithReply[T](message: Any): T = {
149     // TODO: Consider removing multiple attempts
150     if (driverActor == null) {
151       throw new SparkException("Error sending message to BlockManager
as driverActor is null " +
152         "[message = " + message + "]")
153     }
154     var attempts = 0
155     var lastException: Exception = null
156     while (attempts < AKKA_RETRY_ATTEMPTS) {
157       attempts += 1
158       try {
159         val future = driverActor.ask(message)(timeout)
160         val result = Await.result(future, timeout)
161         if (result == null) {
162           throw new SparkException("BlockManagerMaster returned null")
163         }
164         return result.asInstanceOf[T]
165       } catch {
166         case ie: InterruptedException => throw ie
167         case e: Exception =>
168           lastException = e
169           logWarning("Error sending message to BlockManagerMaster in "
+ attempts + "attempts", e)
170       }
171       Thread.sleep(AKKA_RETRY_INTERVAL_MS)
172     }
173
174     throw new SparkException(
175       "Error sending message to BlockManagerMaster [message = " +
message + "]", lastException)
176   }

Most of the time, though, the worker just hangs. I can't run anything else
against that master because there are no resources available. When I try to
stop the workers using stop-slaves.sh, they don't stop. The only way to
recover the cluster is to use "kill -9 <pid>" on the worker processes,
which does work.

So I took a quick look at one of the hung worker processes using jstack.
The output is attached. As you can see, I have tried to shut down the
process multiple times, and the SIGTERM handlers are stuck waiting. I think
that this may be the reason that the workers are not responsive for block
handling as well.

If I read this correctly, the handler is waiting on 0x00000007e025d468
which is locked by spark.deploy.worker.ExecutorRunner$$anon$2. And that is
waiting for some unixprocess?

Thanks,
Shankari
2013-12-0 23:55:46
Full thread dump OpenJDK 64-Bit Server VM (24.45-b08 mixed mode):

"Attach Listener" daemon prio=10 tid=0x00007f5980009000 nid=0x405d waiting on 
condition [0x0000000000000000]
   java.lang.Thread.State: RUNNABLE

"SIGTERM handler" daemon prio=10 tid=0x00007f5980001800 nid=0x4049 waiting for 
monitor entry [0x00007f5998a62000]
   java.lang.Thread.State: BLOCKED (on object monitor)
        at java.lang.Shutdown.exit(Shutdown.java:212)
        - waiting to lock <0x00000007e02720f0> (a java.lang.Class for 
java.lang.Shutdown)
        at java.lang.Terminator$1.handle(Terminator.java:52)
        at sun.misc.Signal$1.run(Signal.java:212)
        at java.lang.Thread.run(Thread.java:744)

"SIGTERM handler" daemon prio=10 tid=0x00007f5980003000 nid=0x402d waiting for 
monitor entry [0x00007f599865e000]
   java.lang.Thread.State: BLOCKED (on object monitor)
        at java.lang.Shutdown.exit(Shutdown.java:212)
        - waiting to lock <0x00000007e02720f0> (a java.lang.Class for 
java.lang.Shutdown)
        at java.lang.Terminator$1.handle(Terminator.java:52)
        at sun.misc.Signal$1.run(Signal.java:212)
        at java.lang.Thread.run(Thread.java:744)

"sparkWorker-akka.actor.default-dispatcher-10" daemon prio=10 
tid=0x00007f596802e000 nid=0x3d0c waiting on condition [0x00007f599a24d000]
   java.lang.Thread.State: WAITING (parking)
        at sun.misc.Unsafe.park(Native Method)
        - parking to wait for  <0x00000007e0286028> (a 
akka.dispatch.ForkJoinExecutorConfigurator$AkkaForkJoinPool)
        at akka.jsr166y.ForkJoinPool.scan(ForkJoinPool.java:1594)
        at akka.jsr166y.ForkJoinPool.runWorker(ForkJoinPool.java:1479)
        at akka.jsr166y.ForkJoinWorkerThread.run(ForkJoinWorkerThread.java:104)

"SIGTERM handler" daemon prio=10 tid=0x00007f5980007000 nid=0x3cea waiting for 
monitor entry [0x00007f5998860000]
   java.lang.Thread.State: BLOCKED (on object monitor)
        at java.lang.Shutdown.exit(Shutdown.java:212)
        - waiting to lock <0x00000007e02720f0> (a java.lang.Class for 
java.lang.Shutdown)
        at java.lang.Terminator$1.handle(Terminator.java:52)
        at sun.misc.Signal$1.run(Signal.java:212)
        at java.lang.Thread.run(Thread.java:744)

"SIGTERM handler" daemon prio=10 tid=0x00007f5980005000 nid=0x3cce waiting for 
monitor entry [0x00007f5998961000]
   java.lang.Thread.State: BLOCKED (on object monitor)
        at java.lang.Shutdown.exit(Shutdown.java:212)
        - waiting to lock <0x00000007e02720f0> (a java.lang.Class for 
java.lang.Shutdown)
        at java.lang.Terminator$1.handle(Terminator.java:52)
        at sun.misc.Signal$1.run(Signal.java:212)
        at java.lang.Thread.run(Thread.java:744)

"Thread-9" daemon prio=10 tid=0x00007f596c063000 nid=0x3cb0 in Object.wait() 
[0x00007f599b35e000]
   java.lang.Thread.State: WAITING (on object monitor)
        at java.lang.Object.wait(Native Method)
        - waiting on <0x00000007e025e150> (a java.lang.UNIXProcess)
        at java.lang.Object.wait(Object.java:503)
        at java.lang.UNIXProcess.waitFor(UNIXProcess.java:210)
        - locked <0x00000007e025e150> (a java.lang.UNIXProcess)
        at 
org.apache.spark.deploy.worker.ExecutorRunner$$anon$2.run(ExecutorRunner.scala:69)

"SIGTERM handler" daemon prio=10 tid=0x00007f5980002800 nid=0x3caf in 
Object.wait() [0x00007f599a550000]
   java.lang.Thread.State: WAITING (on object monitor)
        at java.lang.Object.wait(Native Method)
        - waiting on <0x00000007e025d468> (a 
org.apache.spark.deploy.worker.ExecutorRunner$$anon$2)
        at java.lang.Thread.join(Thread.java:1280)
        - locked <0x00000007e025d468> (a 
org.apache.spark.deploy.worker.ExecutorRunner$$anon$2)
        at java.lang.Thread.join(Thread.java:1354)
        at 
java.lang.ApplicationShutdownHooks.runHooks(ApplicationShutdownHooks.java:106)
        at 
java.lang.ApplicationShutdownHooks$1.run(ApplicationShutdownHooks.java:46)
        at java.lang.Shutdown.runHooks(Shutdown.java:123)
        at java.lang.Shutdown.sequence(Shutdown.java:167)
        at java.lang.Shutdown.exit(Shutdown.java:212)
        - locked <0x00000007e02720f0> (a java.lang.Class for java.lang.Shutdown)
        at java.lang.Terminator$1.handle(Terminator.java:52)
        at sun.misc.Signal$1.run(Signal.java:212)
        at java.lang.Thread.run(Thread.java:744)

"sparkWorker-akka.actor.default-dispatcher-9" daemon prio=10 
tid=0x00007f5964002000 nid=0x3739 waiting on condition [0x00007f599a14c000]
   java.lang.Thread.State: WAITING (parking)
        at sun.misc.Unsafe.park(Native Method)
        - parking to wait for  <0x00000007e0286028> (a 
akka.dispatch.ForkJoinExecutorConfigurator$AkkaForkJoinPool)
        at akka.jsr166y.ForkJoinPool.scan(ForkJoinPool.java:1594)
        at akka.jsr166y.ForkJoinPool.runWorker(ForkJoinPool.java:1479)
        at akka.jsr166y.ForkJoinWorkerThread.run(ForkJoinWorkerThread.java:104)

"sparkWorker-akka.actor.default-dispatcher-6" daemon prio=10 
tid=0x00007f596803a800 nid=0x30e5 in Object.wait() [0x00007f599a650000]
   java.lang.Thread.State: WAITING (on object monitor)
        at java.lang.Object.wait(Native Method)
        - waiting on <0x00000007e025e150> (a java.lang.UNIXProcess)
        at java.lang.Object.wait(Object.java:503)
        at java.lang.UNIXProcess.waitFor(UNIXProcess.java:210)
        - locked <0x00000007e025e150> (a java.lang.UNIXProcess)
        at 
org.apache.spark.deploy.worker.ExecutorRunner.kill(ExecutorRunner.scala:84)
        at 
org.apache.spark.deploy.worker.Worker$$anonfun$receive$1.apply(Worker.scala:161)
        at 
org.apache.spark.deploy.worker.Worker$$anonfun$receive$1.apply(Worker.scala:120)
        at akka.actor.Actor$class.apply(Actor.scala:318)
        at org.apache.spark.deploy.worker.Worker.apply(Worker.scala:39)
        at akka.actor.ActorCell.invoke(ActorCell.scala:626)
        at akka.dispatch.Mailbox.processMailbox(Mailbox.scala:197)
        at akka.dispatch.Mailbox.run(Mailbox.scala:179)
        at 
akka.dispatch.ForkJoinExecutorConfigurator$MailboxExecutionTask.exec(AbstractDispatcher.scala:516)
        at akka.jsr166y.ForkJoinTask.doExec(ForkJoinTask.java:259)
        at akka.jsr166y.ForkJoinPool$WorkQueue.runTask(ForkJoinPool.java:975)
        at akka.jsr166y.ForkJoinPool.runWorker(ForkJoinPool.java:1479)
        at akka.jsr166y.ForkJoinWorkerThread.run(ForkJoinWorkerThread.java:104)

"redirect output to /root/spark/work/app-20131202064720-0000/0/stderr" daemon 
prio=10 tid=0x00007f596c062800 nid=0x30a1 runnable [0x00007f5998b63000]
   java.lang.Thread.State: RUNNABLE
        at java.io.FileInputStream.readBytes(Native Method)
        at java.io.FileInputStream.read(FileInputStream.java:272)
        at java.io.BufferedInputStream.read1(BufferedInputStream.java:273)
        at java.io.BufferedInputStream.read(BufferedInputStream.java:334)
        - locked <0x00000007e0272058> (a 
java.lang.UNIXProcess$ProcessPipeInputStream)
        at java.io.FilterInputStream.read(FilterInputStream.java:107)
        at org.apache.spark.util.Utils$.copyStream(Utils.scala:211)
        at 
org.apache.spark.deploy.worker.ExecutorRunner$$anon$3.run(ExecutorRunner.scala:135)

"redirect output to /root/spark/work/app-20131202064720-0000/0/stdout" daemon 
prio=10 tid=0x00007f596c057000 nid=0x309f runnable [0x00007f599b25d000]
   java.lang.Thread.State: RUNNABLE
        at java.io.FileInputStream.readBytes(Native Method)
        at java.io.FileInputStream.read(FileInputStream.java:272)
        at java.io.BufferedInputStream.read1(BufferedInputStream.java:273)
        at java.io.BufferedInputStream.read(BufferedInputStream.java:334)
        - locked <0x00000007e0272030> (a 
java.lang.UNIXProcess$ProcessPipeInputStream)
        at java.io.FilterInputStream.read(FilterInputStream.java:107)
        at org.apache.spark.util.Utils$.copyStream(Utils.scala:211)
        at 
org.apache.spark.deploy.worker.ExecutorRunner$$anon$3.run(ExecutorRunner.scala:135)

"process reaper" daemon prio=10 tid=0x00007f596c04a000 nid=0x3093 runnable 
[0x00007f59bc040000]
   java.lang.Thread.State: RUNNABLE
        at java.lang.UNIXProcess.waitForProcessExit(Native Method)
        at java.lang.UNIXProcess.access$200(UNIXProcess.java:54)
        at java.lang.UNIXProcess$3.run(UNIXProcess.java:174)
        at 
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
        at 
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
        at java.lang.Thread.run(Thread.java:744)

"Hashed wheel timer #1" daemon prio=10 tid=0x00007f5960118000 nid=0x308a 
waiting on condition [0x00007f599a752000]
   java.lang.Thread.State: TIMED_WAITING (sleeping)
        at java.lang.Thread.sleep(Native Method)
        at 
org.jboss.netty.util.HashedWheelTimer$Worker.waitForNextTick(HashedWheelTimer.java:455)
        at 
org.jboss.netty.util.HashedWheelTimer$Worker.run(HashedWheelTimer.java:372)
        at 
org.jboss.netty.util.ThreadRenamingRunnable.run(ThreadRenamingRunnable.java:102)
        at java.lang.Thread.run(Thread.java:744)

"qtp436478131-22" daemon prio=10 tid=0x00007f596010d000 nid=0x3089 waiting on 
condition [0x00007f599a853000]
   java.lang.Thread.State: TIMED_WAITING (parking)
        at sun.misc.Unsafe.park(Native Method)
        - parking to wait for  <0x00000007e02e26b8> (a 
java.util.concurrent.locks.AbstractQueuedSynchronizer$ConditionObject)
        at 
java.util.concurrent.locks.LockSupport.parkNanos(LockSupport.java:226)
        at 
java.util.concurrent.locks.AbstractQueuedSynchronizer$ConditionObject.awaitNanos(AbstractQueuedSynchronizer.java:2082)
        at 
org.eclipse.jetty.util.BlockingArrayQueue.poll(BlockingArrayQueue.java:342)
        at 
org.eclipse.jetty.util.thread.QueuedThreadPool.idleJobPoll(QueuedThreadPool.java:526)
        at 
org.eclipse.jetty.util.thread.QueuedThreadPool.access$600(QueuedThreadPool.java:44)
        at 
org.eclipse.jetty.util.thread.QueuedThreadPool$3.run(QueuedThreadPool.java:572)
        at java.lang.Thread.run(Thread.java:744)

"qtp436478131-21" daemon prio=10 tid=0x00007f596010a800 nid=0x3088 waiting on 
condition [0x00007f599a954000]
   java.lang.Thread.State: TIMED_WAITING (parking)
        at sun.misc.Unsafe.park(Native Method)
        - parking to wait for  <0x00000007e02e26b8> (a 
java.util.concurrent.locks.AbstractQueuedSynchronizer$ConditionObject)
        at 
java.util.concurrent.locks.LockSupport.parkNanos(LockSupport.java:226)
        at 
java.util.concurrent.locks.AbstractQueuedSynchronizer$ConditionObject.awaitNanos(AbstractQueuedSynchronizer.java:2082)
        at 
org.eclipse.jetty.util.BlockingArrayQueue.poll(BlockingArrayQueue.java:342)
        at 
org.eclipse.jetty.util.thread.QueuedThreadPool.idleJobPoll(QueuedThreadPool.java:526)
        at 
org.eclipse.jetty.util.thread.QueuedThreadPool.access$600(QueuedThreadPool.java:44)
        at 
org.eclipse.jetty.util.thread.QueuedThreadPool$3.run(QueuedThreadPool.java:572)
        at java.lang.Thread.run(Thread.java:744)

"qtp436478131-20" daemon prio=10 tid=0x00007f5960108800 nid=0x3087 waiting on 
condition [0x00007f599aa55000]
   java.lang.Thread.State: TIMED_WAITING (parking)
        at sun.misc.Unsafe.park(Native Method)
        - parking to wait for  <0x00000007e02e26b8> (a 
java.util.concurrent.locks.AbstractQueuedSynchronizer$ConditionObject)
        at 
java.util.concurrent.locks.LockSupport.parkNanos(LockSupport.java:226)
        at 
java.util.concurrent.locks.AbstractQueuedSynchronizer$ConditionObject.awaitNanos(AbstractQueuedSynchronizer.java:2082)
        at 
org.eclipse.jetty.util.BlockingArrayQueue.poll(BlockingArrayQueue.java:342)
        at 
org.eclipse.jetty.util.thread.QueuedThreadPool.idleJobPoll(QueuedThreadPool.java:526)
        at 
org.eclipse.jetty.util.thread.QueuedThreadPool.access$600(QueuedThreadPool.java:44)
        at 
org.eclipse.jetty.util.thread.QueuedThreadPool$3.run(QueuedThreadPool.java:572)
        at java.lang.Thread.run(Thread.java:744)

"qtp436478131-19" daemon prio=10 tid=0x00007f5960107000 nid=0x3086 waiting on 
condition [0x00007f599ab56000]
   java.lang.Thread.State: TIMED_WAITING (parking)
        at sun.misc.Unsafe.park(Native Method)
        - parking to wait for  <0x00000007e02e26b8> (a 
java.util.concurrent.locks.AbstractQueuedSynchronizer$ConditionObject)
        at 
java.util.concurrent.locks.LockSupport.parkNanos(LockSupport.java:226)
        at 
java.util.concurrent.locks.AbstractQueuedSynchronizer$ConditionObject.awaitNanos(AbstractQueuedSynchronizer.java:2082)
        at 
org.eclipse.jetty.util.BlockingArrayQueue.poll(BlockingArrayQueue.java:342)
        at 
org.eclipse.jetty.util.thread.QueuedThreadPool.idleJobPoll(QueuedThreadPool.java:526)
        at 
org.eclipse.jetty.util.thread.QueuedThreadPool.access$600(QueuedThreadPool.java:44)
        at 
org.eclipse.jetty.util.thread.QueuedThreadPool$3.run(QueuedThreadPool.java:572)
        at java.lang.Thread.run(Thread.java:744)

"qtp436478131-18" daemon prio=10 tid=0x00007f5960106000 nid=0x3085 waiting on 
condition [0x00007f599ac57000]
   java.lang.Thread.State: TIMED_WAITING (parking)
        at sun.misc.Unsafe.park(Native Method)
        - parking to wait for  <0x00000007e02e26b8> (a 
java.util.concurrent.locks.AbstractQueuedSynchronizer$ConditionObject)
        at 
java.util.concurrent.locks.LockSupport.parkNanos(LockSupport.java:226)
        at 
java.util.concurrent.locks.AbstractQueuedSynchronizer$ConditionObject.awaitNanos(AbstractQueuedSynchronizer.java:2082)
        at 
org.eclipse.jetty.util.BlockingArrayQueue.poll(BlockingArrayQueue.java:342)
        at 
org.eclipse.jetty.util.thread.QueuedThreadPool.idleJobPoll(QueuedThreadPool.java:526)
        at 
org.eclipse.jetty.util.thread.QueuedThreadPool.access$600(QueuedThreadPool.java:44)
        at 
org.eclipse.jetty.util.thread.QueuedThreadPool$3.run(QueuedThreadPool.java:572)
        at java.lang.Thread.run(Thread.java:744)

"qtp436478131-17" daemon prio=10 tid=0x00007f5960101800 nid=0x3084 waiting on 
condition [0x00007f599ad58000]
   java.lang.Thread.State: TIMED_WAITING (parking)
        at sun.misc.Unsafe.park(Native Method)
        - parking to wait for  <0x00000007e02e26b8> (a 
java.util.concurrent.locks.AbstractQueuedSynchronizer$ConditionObject)
        at 
java.util.concurrent.locks.LockSupport.parkNanos(LockSupport.java:226)
        at 
java.util.concurrent.locks.AbstractQueuedSynchronizer$ConditionObject.awaitNanos(AbstractQueuedSynchronizer.java:2082)
        at 
org.eclipse.jetty.util.BlockingArrayQueue.poll(BlockingArrayQueue.java:342)
        at 
org.eclipse.jetty.util.thread.QueuedThreadPool.idleJobPoll(QueuedThreadPool.java:526)
        at 
org.eclipse.jetty.util.thread.QueuedThreadPool.access$600(QueuedThreadPool.java:44)
        at 
org.eclipse.jetty.util.thread.QueuedThreadPool$3.run(QueuedThreadPool.java:572)
        at java.lang.Thread.run(Thread.java:744)

"qtp436478131-16 Acceptor0 [email protected]:8081" daemon prio=10 
tid=0x00007f59600ff800 nid=0x3083 runnable [0x00007f599ae59000]
   java.lang.Thread.State: RUNNABLE
        at sun.nio.ch.ServerSocketChannelImpl.accept0(Native Method)
        at 
sun.nio.ch.ServerSocketChannelImpl.accept(ServerSocketChannelImpl.java:241)
        - locked <0x00000007e02e1208> (a java.lang.Object)
        at 
org.eclipse.jetty.server.nio.SelectChannelConnector.accept(SelectChannelConnector.java:109)
        at 
org.eclipse.jetty.server.AbstractConnector$Acceptor.run(AbstractConnector.java:938)
        at 
org.eclipse.jetty.util.thread.QueuedThreadPool.runJob(QueuedThreadPool.java:608)
        at 
org.eclipse.jetty.util.thread.QueuedThreadPool$3.run(QueuedThreadPool.java:543)
        at java.lang.Thread.run(Thread.java:744)

"qtp436478131-15 Selector0" daemon prio=10 tid=0x00007f59600d4800 nid=0x3082 
runnable [0x00007f599af5a000]
   java.lang.Thread.State: RUNNABLE
        at sun.nio.ch.EPollArrayWrapper.epollWait(Native Method)
        at sun.nio.ch.EPollArrayWrapper.poll(EPollArrayWrapper.java:269)
        at sun.nio.ch.EPollSelectorImpl.doSelect(EPollSelectorImpl.java:79)
        at sun.nio.ch.SelectorImpl.lockAndDoSelect(SelectorImpl.java:87)
        - locked <0x00000007e02e0d98> (a sun.nio.ch.Util$2)
        - locked <0x00000007e02e0d88> (a java.util.Collections$UnmodifiableSet)
        - locked <0x00000007e02e0b60> (a sun.nio.ch.EPollSelectorImpl)
        at sun.nio.ch.SelectorImpl.select(SelectorImpl.java:98)
        at 
org.eclipse.jetty.io.nio.SelectorManager$SelectSet.doSelect(SelectorManager.java:569)
        at 
org.eclipse.jetty.io.nio.SelectorManager$1.run(SelectorManager.java:290)
        at 
org.eclipse.jetty.util.thread.QueuedThreadPool.runJob(QueuedThreadPool.java:608)
        at 
org.eclipse.jetty.util.thread.QueuedThreadPool$3.run(QueuedThreadPool.java:543)
        at java.lang.Thread.run(Thread.java:744)

"sparkWorker-2" daemon prio=10 tid=0x00007f59b83f5800 nid=0x3081 runnable 
[0x00007f599b05b000]
   java.lang.Thread.State: RUNNABLE
        at sun.nio.ch.EPollArrayWrapper.epollWait(Native Method)
        at sun.nio.ch.EPollArrayWrapper.poll(EPollArrayWrapper.java:269)
        at sun.nio.ch.EPollSelectorImpl.doSelect(EPollSelectorImpl.java:79)
        at sun.nio.ch.SelectorImpl.lockAndDoSelect(SelectorImpl.java:87)
        - locked <0x00000007e0198138> (a sun.nio.ch.Util$2)
        - locked <0x00000007e0198128> (a java.util.Collections$UnmodifiableSet)
        - locked <0x00000007e0197d30> (a sun.nio.ch.EPollSelectorImpl)
        at sun.nio.ch.SelectorImpl.select(SelectorImpl.java:98)
        at 
org.jboss.netty.channel.socket.nio.NioServerSocketPipelineSink$Boss.run(NioServerSocketPipelineSink.java:219)
        at 
org.jboss.netty.util.internal.DeadLockProofWorker$1.run(DeadLockProofWorker.java:42)
        at 
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
        at 
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
        at java.lang.Thread.run(Thread.java:744)

"sparkWorker-scheduler-2" daemon prio=10 tid=0x00007f59b8311000 nid=0x3080 
waiting on condition [0x00007f599b15c000]
   java.lang.Thread.State: TIMED_WAITING (sleeping)
        at java.lang.Thread.sleep(Native Method)
        at 
org.jboss.netty.akka.util.HashedWheelTimer$Worker.waitForNextTick(HashedWheelTimer.java:418)
        at 
org.jboss.netty.akka.util.HashedWheelTimer$Worker.run(HashedWheelTimer.java:318)
        at java.lang.Thread.run(Thread.java:744)

"Service Thread" daemon prio=10 tid=0x00007f59b80a1000 nid=0x307b runnable 
[0x0000000000000000]
   java.lang.Thread.State: RUNNABLE

"C2 CompilerThread1" daemon prio=10 tid=0x00007f59b809e800 nid=0x307a waiting 
on condition [0x0000000000000000]
   java.lang.Thread.State: RUNNABLE

"C2 CompilerThread0" daemon prio=10 tid=0x00007f59b809b800 nid=0x3079 waiting 
on condition [0x0000000000000000]
   java.lang.Thread.State: RUNNABLE

"Signal Dispatcher" daemon prio=10 tid=0x00007f59b8099800 nid=0x3078 runnable 
[0x0000000000000000]
   java.lang.Thread.State: RUNNABLE

"Finalizer" daemon prio=10 tid=0x00007f59b8077000 nid=0x3077 in Object.wait() 
[0x00007f599b979000]
   java.lang.Thread.State: WAITING (on object monitor)
        at java.lang.Object.wait(Native Method)
        - waiting on <0x00000007e0008db0> (a java.lang.ref.ReferenceQueue$Lock)
        at java.lang.ref.ReferenceQueue.remove(ReferenceQueue.java:135)
        - locked <0x00000007e0008db0> (a java.lang.ref.ReferenceQueue$Lock)
        at java.lang.ref.ReferenceQueue.remove(ReferenceQueue.java:151)
        at java.lang.ref.Finalizer$FinalizerThread.run(Finalizer.java:189)

"Reference Handler" daemon prio=10 tid=0x00007f59b8075000 nid=0x3076 in 
Object.wait() [0x00007f599ba7a000]
   java.lang.Thread.State: WAITING (on object monitor)
        at java.lang.Object.wait(Native Method)
        - waiting on <0x00000007e0008770> (a java.lang.ref.Reference$Lock)
        at java.lang.Object.wait(Object.java:503)
        at java.lang.ref.Reference$ReferenceHandler.run(Reference.java:133)
        - locked <0x00000007e0008770> (a java.lang.ref.Reference$Lock)

"main" prio=10 tid=0x00007f59b8009800 nid=0x3072 waiting on condition 
[0x00007f59c0db4000]
   java.lang.Thread.State: WAITING (parking)
        at sun.misc.Unsafe.park(Native Method)
        - parking to wait for  <0x00000007e0309760> (a 
java.util.concurrent.CountDownLatch$Sync)
        at java.util.concurrent.locks.LockSupport.park(LockSupport.java:186)
        at 
java.util.concurrent.locks.AbstractQueuedSynchronizer.parkAndCheckInterrupt(AbstractQueuedSynchronizer.java:834)
        at 
java.util.concurrent.locks.AbstractQueuedSynchronizer.doAcquireSharedInterruptibly(AbstractQueuedSynchronizer.java:994)
        at 
java.util.concurrent.locks.AbstractQueuedSynchronizer.acquireSharedInterruptibly(AbstractQueuedSynchronizer.java:1303)
        at java.util.concurrent.CountDownLatch.await(CountDownLatch.java:236)
        at 
akka.actor.ActorSystemImpl$TerminationCallbacks.ready(ActorSystem.scala:736)
        at 
akka.actor.ActorSystemImpl$TerminationCallbacks.ready(ActorSystem.scala:705)
        at akka.dispatch.Await$.ready(Future.scala:64)
        at akka.actor.ActorSystemImpl.awaitTermination(ActorSystem.scala:604)
        at akka.actor.ActorSystemImpl.awaitTermination(ActorSystem.scala:605)
        at org.apache.spark.deploy.worker.Worker$.main(Worker.scala:200)
        at org.apache.spark.deploy.worker.Worker.main(Worker.scala)

"VM Thread" prio=10 tid=0x00007f59b8070800 nid=0x3075 runnable 

"GC task thread#0 (ParallelGC)" prio=10 tid=0x00007f59b801f800 nid=0x3073 
runnable 

"GC task thread#1 (ParallelGC)" prio=10 tid=0x00007f59b8021000 nid=0x3074 
runnable 

"VM Periodic Task Thread" prio=10 tid=0x00007f59b80ab800 nid=0x307c waiting on 
condition 

JNI global references: 252

Reply via email to