Have you tried to increase the dirver memory?
On Thu, Jul 31, 2014 at 3:54 PM, Bin <wubin_phi...@126.com> wrote: > Hi All, > > The data size of my task is about 30mb. It runs smoothly in local mode. > However, when I submit it to the cluster, it throws the titled error > (Please see below for the complete output). > > Actually, my output is almost the same with > http://stackoverflow.com/questions/24080891/spark-program-hangs-at-job-finished-toarray-workers-throw-java-util-concurren. > I > also toArray my data, which was the reason of his case. > > However, how come it runs OK in local but not in the cluster? The memory > of each worker is over 60g, and my run command is: > > "$SPARK_HOME/bin/spark-class org.apache.spark.deploy.Client launch spark:// > 10.196.135.101:7077 $jar_path $programname -Dspark.master=spark:// > 10.196.135.101:7077 -Dspark.cores.max=300 -Dspark.executor.memory=20g > -spark.jars=$jar_path -Dspark.default.parallelism=100 > -Dspark.hadoop.hadoop.job.ugi=$username,$groupname -Dspark.app.name=$appname > $in_path $scala_out_path" > > Looking for help and thanks a lot! > > Below please find the complete output: > > 14/07/31 15:06:53 WARN Configuration: DEPRECATED: hadoop-site.xml found in > the classpath. Usage of hadoop-site.xml is deprecated. Instead use > core-site.xml, mapred-site.xml and hdfs-site.xml to override properties of > core-default.xml, mapred-default.xml and hdfs-default.xml respectively > 14/07/31 15:06:53 INFO SecurityManager: Changing view acls to: spark > 14/07/31 15:06:53 INFO SecurityManager: SecurityManager: authentication > disabled; ui acls disabled; users with view permissions: Set(spark) > 14/07/31 15:06:53 INFO Slf4jLogger: Slf4jLogger started > 14/07/31 15:06:53 INFO Remoting: Starting remoting > 14/07/31 15:06:54 INFO Remoting: Remoting started; listening on addresses > :[akka.tcp://sparkExecutor@tdw-10-215-140-22:39446] > 14/07/31 15:06:54 INFO Remoting: Remoting now listens on addresses: > [akka.tcp://sparkExecutor@tdw-10-215-140-22:39446] > 14/07/31 15:06:54 INFO CoarseGrainedExecutorBackend: Connecting to driver: > akka.tcp://spark@tdw-10-196-135-106:38502/user/CoarseGrainedScheduler > 14/07/31 15:06:54 INFO WorkerWatcher: Connecting to worker > akka.tcp://sparkWorker@tdw-10-215-140-22:34755/user/Worker > 14/07/31 15:06:54 INFO WorkerWatcher: Successfully connected to > akka.tcp://sparkWorker@tdw-10-215-140-22:34755/user/Worker > 14/07/31 15:06:56 INFO CoarseGrainedExecutorBackend: Successfully registered > with driver > 14/07/31 15:06:56 INFO SecurityManager: Changing view acls to: spark > 14/07/31 15:06:56 INFO SecurityManager: SecurityManager: authentication > disabled; ui acls disabled; users with view permissions: Set(spark) > 14/07/31 15:06:56 INFO Slf4jLogger: Slf4jLogger started > 14/07/31 15:06:56 INFO Remoting: Starting remoting > 14/07/31 15:06:56 INFO Remoting: Remoting started; listening on addresses > :[akka.tcp://spark@tdw-10-215-140-22:56708] > 14/07/31 15:06:56 INFO Remoting: Remoting now listens on addresses: > [akka.tcp://spark@tdw-10-215-140-22:56708] > 14/07/31 15:06:56 INFO SparkEnv: Connecting to MapOutputTracker: > akka.tcp://spark@tdw-10-196-135-106:38502/user/MapOutputTracker > 14/07/31 15:06:58 INFO SparkEnv: Connecting to BlockManagerMaster: > akka.tcp://spark@tdw-10-196-135-106:38502/user/BlockManagerMaster > 14/07/31 15:06:59 INFO DiskBlockManager: Created local directory at > /data1/sparkenv/local/spark-local-20140731150659-3f12 > 14/07/31 15:06:59 INFO DiskBlockManager: Created local directory at > /data2/sparkenv/local/spark-local-20140731150659-1602 > 14/07/31 15:06:59 INFO DiskBlockManager: Created local directory at > /data3/sparkenv/local/spark-local-20140731150659-d213 > 14/07/31 15:06:59 INFO DiskBlockManager: Created local directory at > /data4/sparkenv/local/spark-local-20140731150659-f42e > 14/07/31 15:06:59 INFO DiskBlockManager: Created local directory at > /data5/sparkenv/local/spark-local-20140731150659-63d0 > 14/07/31 15:06:59 INFO DiskBlockManager: Created local directory at > /data6/sparkenv/local/spark-local-20140731150659-9003 > 14/07/31 15:06:59 INFO DiskBlockManager: Created local directory at > /data7/sparkenv/local/spark-local-20140731150659-f260 > 14/07/31 15:06:59 INFO DiskBlockManager: Created local directory at > /data8/sparkenv/local/spark-local-20140731150659-6334 > 14/07/31 15:06:59 INFO DiskBlockManager: Created local directory at > /data9/sparkenv/local/spark-local-20140731150659-3af4 > 14/07/31 15:06:59 INFO DiskBlockManager: Created local directory at > /data10/sparkenv/local/spark-local-20140731150659-133d > 14/07/31 15:06:59 INFO DiskBlockManager: Created local directory at > /data11/sparkenv/local/spark-local-20140731150659-ed08 > 14/07/31 15:06:59 INFO MemoryStore: MemoryStore started with capacity 11.5 GB. > 14/07/31 15:06:59 INFO ConnectionManager: Bound socket to port 35127 with id > = ConnectionManagerId(tdw-10-215-140-22,35127) > 14/07/31 15:06:59 INFO BlockManagerMaster: Trying to register BlockManager > 14/07/31 15:07:00 INFO BlockManagerMaster: Registered BlockManager > 14/07/31 15:07:00 INFO HttpFileServer: HTTP File server directory is > /tmp/spark-0914d215-dd22-4d5e-9ec0-724937dbfd8b > 14/07/31 15:07:00 INFO HttpServer: Starting HTTP Server > 14/07/31 15:07:26 INFO CoarseGrainedExecutorBackend: Got assigned task 12 > 14/07/31 15:07:26 INFO CoarseGrainedExecutorBackend: Got assigned task 25 > 14/07/31 15:07:26 INFO Executor: Running task ID 25 > 14/07/31 15:07:26 INFO Executor: Running task ID 12 > 14/07/31 15:07:26 INFO Executor: Fetching > hdfs://tdw-10-196-135-101:54310/user/teg/gdt/tj/test/adsorption_2.10-1.0.jar > with timestamp 1406790410442 > 14/07/31 15:07:26 INFO Executor: Adding > file:/data/home/spark/spark-1.0.0-bin-0.20.2-cdh3u3/work/app-20140731150652-4911/8/./adsorption_2.10-1.0.jar > to class loader > 14/07/31 15:07:26 INFO HttpBroadcast: Started reading broadcast variable 0 > 14/07/31 15:07:28 INFO MemoryStore: ensureFreeSpace(102650) called with > curMem=0, maxMem=12348240691 > 14/07/31 15:07:28 INFO MemoryStore: Block broadcast_0 stored as values to > memory (estimated size 100.2 KB, free 11.5 GB) > 14/07/31 15:07:28 INFO HttpBroadcast: Reading broadcast variable 0 took > 1.702291406 s > 14/07/31 15:07:28 INFO BlockManager: Found block broadcast_0 locally > 14/07/31 15:07:28 INFO HadoopRDD: Input split: > hdfs://tdw-10-196-135-101:54310/user/teg/gdt/tj/test/pre/scala_out/part-00072:0+197955 > 14/07/31 15:07:28 INFO HadoopRDD: Input split: > hdfs://tdw-10-196-135-101:54310/user/teg/gdt/tj/test/pre/scala_out/part-00062:0+218630 > 14/07/31 15:07:28 WARN NativeCodeLoader: Unable to load native-hadoop library > for your platform... using builtin-java classes where applicable > 14/07/31 15:07:28 WARN LoadSnappy: Snappy native library not loaded > 14/07/31 15:07:29 INFO Executor: Serialized size of result for 12 is 887 > 14/07/31 15:07:29 INFO Executor: Serialized size of result for 25 is 887 > 14/07/31 15:07:29 INFO Executor: Sending result for 25 directly to driver > 14/07/31 15:07:29 INFO Executor: Sending result for 12 directly to driver > 14/07/31 15:07:29 INFO Executor: Finished task ID 12 > 14/07/31 15:07:29 INFO Executor: Finished task ID 25 > 14/07/31 15:07:30 INFO CoarseGrainedExecutorBackend: Got assigned task 30 > 14/07/31 15:07:30 INFO Executor: Running task ID 30 > 14/07/31 15:07:30 INFO BlockManager: Found block broadcast_0 locally > 14/07/31 15:07:30 INFO HadoopRDD: Input split: > hdfs://tdw-10-196-135-101:54310/user/teg/gdt/tj/test/pre/scala_out/part-00084:0+196433 > 14/07/31 15:07:30 INFO Executor: Serialized size of result for 30 is 887 > 14/07/31 15:07:30 INFO Executor: Sending result for 30 directly to driver > 14/07/31 15:07:30 INFO Executor: Finished task ID 30 > 14/07/31 15:07:30 INFO CoarseGrainedExecutorBackend: Got assigned task 31 > 14/07/31 15:07:30 INFO Executor: Running task ID 31 > 14/07/31 15:07:30 INFO BlockManager: Found block broadcast_0 locally > 14/07/31 15:07:30 INFO HadoopRDD: Input split: > hdfs://tdw-10-196-135-101:54310/user/teg/gdt/tj/test/pre/scala_out/part-00089:0+190194 > 14/07/31 15:07:30 INFO Executor: Serialized size of result for 31 is 887 > 14/07/31 15:07:30 INFO Executor: Sending result for 31 directly to driver > 14/07/31 15:07:30 INFO Executor: Finished task ID 31 > 14/07/31 15:07:31 INFO CoarseGrainedExecutorBackend: Got assigned task 54 > 14/07/31 15:07:31 INFO Executor: Running task ID 54 > 14/07/31 15:07:31 INFO BlockManager: Found block broadcast_0 locally > 14/07/31 15:07:31 INFO HadoopRDD: Input split: > hdfs://tdw-10-196-135-101:54310/user/teg/gdt/tj/test/pre/scala_out/part-00096:0+153443 > 14/07/31 15:07:31 INFO CoarseGrainedExecutorBackend: Got assigned task 55 > 14/07/31 15:07:31 INFO Executor: Running task ID 55 > 14/07/31 15:07:31 INFO BlockManager: Found block broadcast_0 locally > 14/07/31 15:07:31 INFO HadoopRDD: Input split: > hdfs://tdw-10-196-135-101:54310/user/teg/gdt/tj/test/pre/scala_out/part-00139:0+174726 > 14/07/31 15:07:31 INFO Executor: Serialized size of result for 54 is 887 > 14/07/31 15:07:31 INFO Executor: Sending result for 54 directly to driver > 14/07/31 15:07:31 INFO Executor: Finished task ID 54 > 14/07/31 15:07:31 INFO Executor: Serialized size of result for 55 is 887 > 14/07/31 15:07:31 INFO Executor: Sending result for 55 directly to driver > 14/07/31 15:07:31 INFO Executor: Finished task ID 55 > 14/07/31 15:07:32 INFO CoarseGrainedExecutorBackend: Got assigned task 76 > 14/07/31 15:07:32 INFO Executor: Running task ID 76 > 14/07/31 15:07:32 INFO BlockManager: Found block broadcast_0 locally > 14/07/31 15:07:32 INFO CoarseGrainedExecutorBackend: Got assigned task 79 > 14/07/31 15:07:32 INFO Executor: Running task ID 79 > 14/07/31 15:07:32 INFO BlockManager: Found block broadcast_0 locally > 14/07/31 15:07:32 INFO HadoopRDD: Input split: > hdfs://tdw-10-196-135-101:54310/user/teg/gdt/tj/test/pre/scala_out/part-00149:0+134758 > 14/07/31 15:07:32 INFO HadoopRDD: Input split: > hdfs://tdw-10-196-135-101:54310/user/teg/gdt/tj/test/pre/scala_out/part-00157:0+103176 > 14/07/31 15:07:32 INFO Executor: Serialized size of result for 79 is 887 > 14/07/31 15:07:32 INFO Executor: Sending result for 79 directly to driver > 14/07/31 15:07:32 INFO Executor: Finished task ID 79 > 14/07/31 15:07:32 INFO Executor: Serialized size of result for 76 is 887 > 14/07/31 15:07:32 INFO Executor: Sending result for 76 directly to driver > 14/07/31 15:07:32 INFO Executor: Finished task ID 76 > 14/07/31 15:07:34 INFO CoarseGrainedExecutorBackend: Got assigned task 99 > 14/07/31 15:07:34 INFO Executor: Running task ID 99 > 14/07/31 15:07:34 INFO BlockManager: Found block broadcast_0 locally > 14/07/31 15:07:34 INFO HadoopRDD: Input split: > hdfs://tdw-10-196-135-101:54310/user/teg/gdt/tj/test/pre/scala_out/part-00167:0+51005 > 14/07/31 15:07:34 INFO Executor: Serialized size of result for 99 is 887 > 14/07/31 15:07:34 INFO Executor: Sending result for 99 directly to driver > 14/07/31 15:07:34 INFO Executor: Finished task ID 99 > 14/07/31 15:07:39 INFO CoarseGrainedExecutorBackend: Got assigned task 181 > 14/07/31 15:07:39 INFO Executor: Running task ID 181 > 14/07/31 15:07:39 INFO CoarseGrainedExecutorBackend: Got assigned task 196 > 14/07/31 15:07:39 INFO Executor: Running task ID 196 > 14/07/31 15:07:39 INFO BlockManager: Found block broadcast_0 locally > 14/07/31 15:07:39 INFO BlockManager: Found block broadcast_0 locally > 14/07/31 15:07:39 INFO MapOutputTrackerWorker: Updating epoch to 1 and > clearing cache > 14/07/31 15:07:39 INFO ConnectionManager: Accepted connection from > [tdw-10-215-140-24/10.215.140.24] > 14/07/31 15:07:39 INFO ConnectionManager: Accepted connection from > [tdw-10-196-135-105/10.196.135.105] > 14/07/31 15:07:39 INFO ConnectionManager: Accepted connection from > [tdw-10-215-140-21/10.215.140.21] > 14/07/31 15:07:39 INFO ConnectionManager: Accepted connection from > [tdw-10-215-140-12/10.215.140.12] > 46.947: [GC 10486272K->32824K(40196096K), 0.0347340 secs] > 14/07/31 15:07:39 INFO ConnectionManager: Accepted connection from > [tdw-10-215-140-13/10.215.140.13] > 14/07/31 15:07:39 INFO ConnectionManager: Accepted connection from > [tdw-10-215-140-23/10.215.140.23] > 14/07/31 15:07:39 INFO SendingConnection: Initiating connection to > [tdw-10-215-140-13/10.215.140.13:58657] > 14/07/31 15:07:39 INFO SendingConnection: Initiating connection to > [tdw-10-215-140-23/10.215.140.23:39188] > 14/07/31 15:07:39 INFO SendingConnection: Initiating connection to > [tdw-10-215-140-21/10.215.140.21:36128] > 14/07/31 15:07:39 INFO SendingConnection: Initiating connection to > [tdw-10-215-140-12/10.215.140.12:33380] > 14/07/31 15:07:39 INFO SendingConnection: Initiating connection to > [tdw-10-196-135-105/10.196.135.105:36859] > 14/07/31 15:07:39 INFO SendingConnection: Initiating connection to > [tdw-10-215-140-24/10.215.140.24:49100] > 14/07/31 15:07:39 INFO SendingConnection: Connected to > [tdw-10-215-140-23/10.215.140.23:39188], 2 messages pending > 14/07/31 15:07:39 INFO SendingConnection: Connected to > [tdw-10-215-140-13/10.215.140.13:58657], 2 messages pending > 14/07/31 15:07:39 INFO SendingConnection: Connected to > [tdw-10-196-135-105/10.196.135.105:36859], 2 messages pending > 14/07/31 15:07:39 INFO SendingConnection: Connected to > [tdw-10-215-140-12/10.215.140.12:33380], 1 messages pending > 14/07/31 15:07:39 INFO SendingConnection: Connected to > [tdw-10-215-140-21/10.215.140.21:36128], 1 messages pending > 14/07/31 15:07:39 INFO SendingConnection: Connected to > [tdw-10-215-140-24/10.215.140.24:49100], 2 messages pending > 14/07/31 15:07:39 INFO CacheManager: Partition rdd_6_10 not found, computing > it > 14/07/31 15:07:39 INFO MapOutputTrackerWorker: Don't have map outputs for > shuffle 0, fetching them > 14/07/31 15:07:39 INFO MapOutputTrackerWorker: Doing the fetch; tracker actor > = > Actor[akka.tcp://spark@tdw-10-196-135-106:38502/user/MapOutputTracker#-128956169] > 14/07/31 15:07:39 INFO CacheManager: Partition rdd_6_25 not found, computing > it > 14/07/31 15:07:39 INFO MapOutputTrackerWorker: Don't have map outputs for > shuffle 0, fetching them > 14/07/31 15:07:39 INFO ConnectionManager: Accepted connection from > [tdw-10-215-140-17/10.215.140.17] > 14/07/31 15:07:39 INFO ConnectionManager: Accepted connection from > [tdw-10-215-140-25/10.215.140.25] > 14/07/31 15:07:39 INFO SendingConnection: Initiating connection to > [tdw-10-215-140-17/10.215.140.17:35040] > 14/07/31 15:07:39 INFO SendingConnection: Connected to > [tdw-10-215-140-17/10.215.140.17:35040], 1 messages pending > 14/07/31 15:07:39 INFO ConnectionManager: Accepted connection from > [tdw-10-215-140-18/10.215.140.18] > 14/07/31 15:07:39 INFO SendingConnection: Initiating connection to > [tdw-10-215-140-25/10.215.140.25:50298] > 14/07/31 15:07:39 INFO SendingConnection: Connected to > [tdw-10-215-140-25/10.215.140.25:50298], 2 messages pending > 14/07/31 15:07:39 INFO SendingConnection: Initiating connection to > [tdw-10-215-140-18/10.215.140.18:33575] > 14/07/31 15:07:39 INFO SendingConnection: Connected to > [tdw-10-215-140-18/10.215.140.18:33575], 1 messages pending > 14/07/31 15:07:39 INFO ConnectionManager: Accepted connection from > [tdw-10-215-140-14/10.215.140.14] > 14/07/31 15:07:39 INFO SendingConnection: Initiating connection to > [tdw-10-215-140-14/10.215.140.14:37220] > 14/07/31 15:07:39 INFO SendingConnection: Connected to > [tdw-10-215-140-14/10.215.140.14:37220], 1 messages pending > 14/07/31 15:07:39 INFO MapOutputTrackerWorker: Got the output locations > 14/07/31 15:07:39 INFO BlockFetcherIterator$BasicBlockFetcherIterator: > maxBytesInFlight: 50331648, targetRequestSize: 10066329 > 14/07/31 15:07:39 INFO BlockFetcherIterator$BasicBlockFetcherIterator: > maxBytesInFlight: 50331648, targetRequestSize: 10066329 > 14/07/31 15:07:39 INFO BlockFetcherIterator$BasicBlockFetcherIterator: > Getting 171 non-empty blocks out of 171 blocks > 14/07/31 15:07:39 INFO BlockFetcherIterator$BasicBlockFetcherIterator: > Getting 171 non-empty blocks out of 171 blocks > 14/07/31 15:07:39 INFO SendingConnection: Initiating connection to > [tdw-10-196-135-107/10.196.135.107:59290] > 14/07/31 15:07:39 INFO SendingConnection: Initiating connection to > [tdw-10-215-140-11/10.215.140.11:50302] > 14/07/31 15:07:39 INFO SendingConnection: Connected to > [tdw-10-215-140-11/10.215.140.11:50302], 1 messages pending > 14/07/31 15:07:39 INFO SendingConnection: Initiating connection to > [tdw-10-196-135-106/10.196.135.106:34128] > 14/07/31 15:07:39 INFO ConnectionManager: Accepted connection from > [tdw-10-215-140-11/10.215.140.11] > 14/07/31 15:07:39 INFO SendingConnection: Connected to > [tdw-10-196-135-107/10.196.135.107:59290], 2 messages pending > 14/07/31 15:07:39 INFO SendingConnection: Connected to > [tdw-10-196-135-106/10.196.135.106:34128], 1 messages pending > 14/07/31 15:07:39 INFO SendingConnection: Initiating connection to > [tdw-10-215-140-15/10.215.140.15:50069] > 14/07/31 15:07:39 INFO SendingConnection: Connected to > [tdw-10-215-140-15/10.215.140.15:50069], 1 messages pending > 14/07/31 15:07:40 INFO BlockFetcherIterator$BasicBlockFetcherIterator: > Started 14 remote fetches in 48 ms > 14/07/31 15:07:40 INFO ConnectionManager: Accepted connection from > [tdw-10-215-140-15/10.215.140.15] > 14/07/31 15:07:40 INFO BlockFetcherIterator$BasicBlockFetcherIterator: > Started 14 remote fetches in 49 ms > 14/07/31 15:07:40 INFO ConnectionManager: Accepted connection from > [tdw-10-215-140-16/10.215.140.16] > 14/07/31 15:07:40 INFO ConnectionManager: Accepted connection from > [tdw-10-196-135-102/10.196.135.102] > 14/07/31 15:07:40 INFO SendingConnection: Initiating connection to > [tdw-10-215-140-16/10.215.140.16:58648] > 14/07/31 15:07:40 INFO SendingConnection: Connected to > [tdw-10-215-140-16/10.215.140.16:58648], 1 messages pending > 14/07/31 15:07:40 INFO SendingConnection: Initiating connection to > [tdw-10-196-135-102/10.196.135.102:45729] > 14/07/31 15:07:40 INFO SendingConnection: Connected to > [tdw-10-196-135-102/10.196.135.102:45729], 1 messages pending > 14/07/31 15:07:42 INFO ConnectionManager: Accepted connection from > [tdw-10-196-135-106/10.196.135.106] > 14/07/31 15:07:44 INFO ConnectionManager: Accepted connection from > [tdw-10-196-135-107/10.196.135.107] > 14/07/31 15:07:45 INFO MemoryStore: ensureFreeSpace(1922882) called with > curMem=102650, maxMem=12348240691 > 14/07/31 15:07:45 INFO MemoryStore: Block rdd_6_25 stored as values to memory > (estimated size 1877.8 KB, free 11.5 GB) > 14/07/31 15:07:46 INFO MemoryStore: ensureFreeSpace(1912396) called with > curMem=2025532, maxMem=12348240691 > 14/07/31 15:07:46 INFO MemoryStore: Block rdd_6_10 stored as values to memory > (estimated size 1867.6 KB, free 11.5 GB) > 14/07/31 15:07:46 INFO BlockManagerMaster: Updated info of block rdd_6_10 > 14/07/31 15:07:46 INFO BlockManagerMaster: Updated info of block rdd_6_25 > 14/07/31 15:07:46 INFO Executor: Serialized size of result for 181 is 421363 > 14/07/31 15:07:46 INFO Executor: Serialized size of result for 196 is 421522 > 14/07/31 15:07:46 INFO Executor: Sending result for 181 directly to driver > 14/07/31 15:07:46 INFO Executor: Sending result for 196 directly to driver > 14/07/31 15:07:46 INFO Executor: Finished task ID 181 > 14/07/31 15:07:46 INFO Executor: Finished task ID 196 > 14/07/31 15:07:50 INFO CoarseGrainedExecutorBackend: Got assigned task 219 > 14/07/31 15:07:50 INFO Executor: Running task ID 219 > 14/07/31 15:07:50 INFO BlockManager: Found block broadcast_0 locally > 14/07/31 15:07:50 INFO CacheManager: Partition rdd_6_48 not found, computing > it > 14/07/31 15:07:50 INFO BlockFetcherIterator$BasicBlockFetcherIterator: > maxBytesInFlight: 50331648, targetRequestSize: 10066329 > 14/07/31 15:07:50 INFO BlockFetcherIterator$BasicBlockFetcherIterator: > Getting 171 non-empty blocks out of 171 blocks > 14/07/31 15:07:50 INFO BlockFetcherIterator$BasicBlockFetcherIterator: > Started 14 remote fetches in 19 ms > 14/07/31 15:07:50 INFO CoarseGrainedExecutorBackend: Got assigned task 225 > 14/07/31 15:07:50 INFO Executor: Running task ID 225 > 14/07/31 15:07:50 INFO BlockManager: Found block broadcast_0 locally > 14/07/31 15:07:50 INFO CacheManager: Partition rdd_6_54 not found, computing > it > 14/07/31 15:07:50 INFO BlockFetcherIterator$BasicBlockFetcherIterator: > maxBytesInFlight: 50331648, targetRequestSize: 10066329 > 14/07/31 15:07:50 INFO BlockFetcherIterator$BasicBlockFetcherIterator: > Getting 171 non-empty blocks out of 171 blocks > 14/07/31 15:07:50 INFO BlockFetcherIterator$BasicBlockFetcherIterator: > Started 14 remote fetches in 15 ms > 14/07/31 15:07:51 INFO MemoryStore: ensureFreeSpace(1927469) called with > curMem=3937928, maxMem=12348240691 > 14/07/31 15:07:51 INFO MemoryStore: Block rdd_6_48 stored as values to memory > (estimated size 1882.3 KB, free 11.5 GB) > 14/07/31 15:07:51 INFO BlockManagerMaster: Updated info of block rdd_6_48 > 14/07/31 15:07:51 INFO Executor: Serialized size of result for 219 is 424342 > 14/07/31 15:07:51 INFO Executor: Sending result for 219 directly to driver > 14/07/31 15:07:51 INFO Executor: Finished task ID 219 > 14/07/31 15:07:51 INFO MemoryStore: ensureFreeSpace(1909775) called with > curMem=5865397, maxMem=12348240691 > 14/07/31 15:07:51 INFO MemoryStore: Block rdd_6_54 stored as values to memory > (estimated size 1865.0 KB, free 11.5 GB) > 14/07/31 15:07:51 INFO BlockManagerMaster: Updated info of block rdd_6_54 > 14/07/31 15:07:51 INFO Executor: Serialized size of result for 225 is 421546 > 14/07/31 15:07:51 INFO Executor: Sending result for 225 directly to driver > 14/07/31 15:07:51 INFO Executor: Finished task ID 225 > 14/07/31 15:07:53 INFO CoarseGrainedExecutorBackend: Got assigned task 251 > 14/07/31 15:07:53 INFO Executor: Running task ID 251 > 14/07/31 15:07:53 INFO BlockManager: Found block broadcast_0 locally > 14/07/31 15:07:53 INFO CacheManager: Partition rdd_6_80 not found, computing > it > 14/07/31 15:07:53 INFO BlockFetcherIterator$BasicBlockFetcherIterator: > maxBytesInFlight: 50331648, targetRequestSize: 10066329 > 14/07/31 15:07:53 INFO BlockFetcherIterator$BasicBlockFetcherIterator: > Getting 171 non-empty blocks out of 171 blocks > 14/07/31 15:07:53 INFO BlockFetcherIterator$BasicBlockFetcherIterator: > Started 14 remote fetches in 15 ms > 14/07/31 15:07:53 INFO MemoryStore: ensureFreeSpace(1927469) called with > curMem=7775172, maxMem=12348240691 > 14/07/31 15:07:53 INFO MemoryStore: Block rdd_6_80 stored as values to memory > (estimated size 1882.3 KB, free 11.5 GB) > 14/07/31 15:07:53 INFO BlockManagerMaster: Updated info of block rdd_6_80 > 14/07/31 15:07:53 INFO Executor: Serialized size of result for 251 is 424634 > 14/07/31 15:07:53 INFO Executor: Sending result for 251 directly to driver > 14/07/31 15:07:53 INFO Executor: Finished task ID 251 > 14/07/31 15:07:54 INFO CoarseGrainedExecutorBackend: Got assigned task 259 > 14/07/31 15:07:54 INFO Executor: Running task ID 259 > 14/07/31 15:07:54 INFO BlockManager: Found block broadcast_0 locally > 14/07/31 15:07:54 INFO CacheManager: Partition rdd_6_88 not found, computing > it > 14/07/31 15:07:54 INFO BlockFetcherIterator$BasicBlockFetcherIterator: > maxBytesInFlight: 50331648, targetRequestSize: 10066329 > 14/07/31 15:07:54 INFO BlockFetcherIterator$BasicBlockFetcherIterator: > Getting 171 non-empty blocks out of 171 blocks > 14/07/31 15:07:54 INFO BlockFetcherIterator$BasicBlockFetcherIterator: > Started 14 remote fetches in 13 ms > 14/07/31 15:07:54 INFO MemoryStore: ensureFreeSpace(1921571) called with > curMem=9702641, maxMem=12348240691 > 14/07/31 15:07:54 INFO MemoryStore: Block rdd_6_88 stored as values to memory > (estimated size 1876.5 KB, free 11.5 GB) > 14/07/31 15:07:54 INFO BlockManagerMaster: Updated info of block rdd_6_88 > 14/07/31 15:07:54 INFO Executor: Serialized size of result for 259 is 418167 > 14/07/31 15:07:54 INFO Executor: Sending result for 259 directly to driver > 14/07/31 15:07:54 INFO Executor: Finished task ID 259 > 14/07/31 15:07:56 INFO CoarseGrainedExecutorBackend: Got assigned task 273 > 14/07/31 15:07:56 INFO Executor: Running task ID 273 > 14/07/31 15:07:56 INFO CoarseGrainedExecutorBackend: Got assigned task 290 > 14/07/31 15:07:56 INFO Executor: Running task ID 290 > 14/07/31 15:07:56 INFO BlockManager: Found block broadcast_0 locally > 14/07/31 15:07:56 INFO BlockManager: Found block broadcast_0 locally > 14/07/31 15:07:56 INFO BlockManager: Found block rdd_6_10 locally > 14/07/31 15:07:56 INFO BlockManager: Found block rdd_6_25 locally > 14/07/31 15:07:56 INFO Executor: Serialized size of result for 273 is 887 > 14/07/31 15:07:56 INFO Executor: Sending result for 273 directly to driver > 14/07/31 15:07:56 INFO Executor: Finished task ID 273 > 14/07/31 15:07:56 INFO Executor: Serialized size of result for 290 is 887 > 14/07/31 15:07:56 INFO Executor: Sending result for 290 directly to driver > 14/07/31 15:07:56 INFO Executor: Finished task ID 290 > 14/07/31 15:07:57 INFO CoarseGrainedExecutorBackend: Got assigned task 308 > 14/07/31 15:07:57 INFO Executor: Running task ID 308 > 14/07/31 15:07:57 INFO BlockManager: Found block broadcast_0 locally > 14/07/31 15:07:57 INFO BlockManager: Found block rdd_6_48 locally > 14/07/31 15:07:57 INFO CoarseGrainedExecutorBackend: Got assigned task 311 > 14/07/31 15:07:57 INFO Executor: Running task ID 311 > 14/07/31 15:07:57 INFO BlockManager: Found block broadcast_0 locally > 14/07/31 15:07:57 INFO BlockManager: Found block rdd_6_54 locally > 14/07/31 15:07:57 INFO Executor: Serialized size of result for 308 is 887 > 14/07/31 15:07:57 INFO Executor: Sending result for 308 directly to driver > 14/07/31 15:07:57 INFO Executor: Finished task ID 308 > 14/07/31 15:07:57 INFO Executor: Serialized size of result for 311 is 887 > 14/07/31 15:07:57 INFO Executor: Sending result for 311 directly to driver > 14/07/31 15:07:57 INFO Executor: Finished task ID 311 > 14/07/31 15:07:58 INFO CoarseGrainedExecutorBackend: Got assigned task 339 > 14/07/31 15:07:58 INFO Executor: Running task ID 339 > 14/07/31 15:07:58 INFO BlockManager: Found block broadcast_0 locally > 14/07/31 15:07:58 INFO BlockManager: Found block rdd_6_80 locally > 14/07/31 15:07:58 INFO CoarseGrainedExecutorBackend: Got assigned task 341 > 14/07/31 15:07:58 INFO Executor: Running task ID 341 > 14/07/31 15:07:58 INFO BlockManager: Found block broadcast_0 locally > 14/07/31 15:07:58 INFO BlockManager: Found block rdd_6_88 locally > 14/07/31 15:07:58 INFO Executor: Serialized size of result for 339 is 887 > 14/07/31 15:07:58 INFO Executor: Sending result for 339 directly to driver > 14/07/31 15:07:58 INFO Executor: Finished task ID 339 > 14/07/31 15:07:58 INFO Executor: Serialized size of result for 341 is 887 > 14/07/31 15:07:58 INFO Executor: Sending result for 341 directly to driver > 14/07/31 15:07:58 INFO Executor: Finished task ID 341 > 14/07/31 15:07:59 INFO CoarseGrainedExecutorBackend: Got assigned task 377 > 14/07/31 15:07:59 INFO Executor: Running task ID 377 > 14/07/31 15:07:59 INFO BlockManager: Found block broadcast_0 locally > 14/07/31 15:07:59 INFO MapOutputTrackerWorker: Updating epoch to 2 and > clearing cache > 14/07/31 15:07:59 INFO MapOutputTrackerWorker: Don't have map outputs for > shuffle 1, fetching them > 14/07/31 15:07:59 INFO MapOutputTrackerWorker: Doing the fetch; tracker actor > = > Actor[akka.tcp://spark@tdw-10-196-135-106:38502/user/MapOutputTracker#-128956169] > 14/07/31 15:07:59 INFO MapOutputTrackerWorker: Got the output locations > 14/07/31 15:07:59 INFO BlockFetcherIterator$BasicBlockFetcherIterator: > maxBytesInFlight: 50331648, targetRequestSize: 10066329 > 14/07/31 15:07:59 INFO BlockFetcherIterator$BasicBlockFetcherIterator: > Getting 100 non-empty blocks out of 100 blocks > 14/07/31 15:07:59 INFO BlockFetcherIterator$BasicBlockFetcherIterator: > Started 16 remote fetches in 9 ms > 14/07/31 15:08:00 INFO CoarseGrainedExecutorBackend: Got assigned task 393 > 14/07/31 15:08:00 INFO Executor: Running task ID 393 > 14/07/31 15:08:00 INFO BlockManager: Found block broadcast_0 locally > 14/07/31 15:08:00 INFO BlockFetcherIterator$BasicBlockFetcherIterator: > maxBytesInFlight: 50331648, targetRequestSize: 10066329 > 14/07/31 15:08:00 INFO BlockFetcherIterator$BasicBlockFetcherIterator: > Getting 100 non-empty blocks out of 100 blocks > 14/07/31 15:08:00 INFO BlockFetcherIterator$BasicBlockFetcherIterator: > Started 16 remote fetches in 8 ms > 14/07/31 15:08:00 INFO Executor: Serialized size of result for 377 is 303256 > 14/07/31 15:08:00 INFO Executor: Sending result for 377 directly to driver > 14/07/31 15:08:00 INFO Executor: Finished task ID 377 > 14/07/31 15:08:00 INFO Executor: Serialized size of result for 393 is 310660 > 14/07/31 15:08:00 INFO Executor: Sending result for 393 directly to driver > 14/07/31 15:08:00 INFO Executor: Finished task ID 393 > 14/07/31 15:08:01 INFO CoarseGrainedExecutorBackend: Got assigned task 403 > 14/07/31 15:08:01 INFO Executor: Running task ID 403 > 14/07/31 15:08:01 INFO BlockManager: Found block broadcast_0 locally > 14/07/31 15:08:01 INFO BlockFetcherIterator$BasicBlockFetcherIterator: > maxBytesInFlight: 50331648, targetRequestSize: 10066329 > 14/07/31 15:08:01 INFO BlockFetcherIterator$BasicBlockFetcherIterator: > Getting 100 non-empty blocks out of 100 blocks > 14/07/31 15:08:01 INFO BlockFetcherIterator$BasicBlockFetcherIterator: > Started 16 remote fetches in 7 ms > 14/07/31 15:08:01 INFO Executor: Serialized size of result for 403 is 299667 > 14/07/31 15:08:01 INFO Executor: Sending result for 403 directly to driver > 14/07/31 15:08:01 INFO Executor: Finished task ID 403 > 14/07/31 15:08:02 INFO CoarseGrainedExecutorBackend: Got assigned task 412 > 14/07/31 15:08:02 INFO Executor: Running task ID 412 > 14/07/31 15:08:02 INFO BlockManager: Found block broadcast_0 locally > 14/07/31 15:08:02 INFO BlockFetcherIterator$BasicBlockFetcherIterator: > maxBytesInFlight: 50331648, targetRequestSize: 10066329 > 14/07/31 15:08:02 INFO BlockFetcherIterator$BasicBlockFetcherIterator: > Getting 100 non-empty blocks out of 100 blocks > 14/07/31 15:08:02 INFO BlockFetcherIterator$BasicBlockFetcherIterator: > Started 16 remote fetches in 6 ms > 14/07/31 15:08:02 INFO Executor: Serialized size of result for 412 is 301593 > 14/07/31 15:08:02 INFO Executor: Sending result for 412 directly to driver > 14/07/31 15:08:02 INFO Executor: Finished task ID 412 > 14/07/31 15:08:04 INFO CoarseGrainedExecutorBackend: Got assigned task 437 > 14/07/31 15:08:04 INFO Executor: Running task ID 437 > 14/07/31 15:08:04 INFO BlockManager: Found block broadcast_0 locally > 14/07/31 15:08:04 INFO BlockFetcherIterator$BasicBlockFetcherIterator: > maxBytesInFlight: 50331648, targetRequestSize: 10066329 > 14/07/31 15:08:04 INFO BlockFetcherIterator$BasicBlockFetcherIterator: > Getting 100 non-empty blocks out of 100 blocks > 14/07/31 15:08:04 INFO BlockFetcherIterator$BasicBlockFetcherIterator: > Started 16 remote fetches in 6 ms > 14/07/31 15:08:04 INFO Executor: Serialized size of result for 437 is 312543 > 14/07/31 15:08:04 INFO Executor: Sending result for 437 directly to driver > 14/07/31 15:08:04 INFO Executor: Finished task ID 437 > 14/07/31 15:08:04 INFO CoarseGrainedExecutorBackend: Got assigned task 445 > 14/07/31 15:08:04 INFO Executor: Running task ID 445 > 14/07/31 15:08:04 INFO BlockManager: Found block broadcast_0 locally > 14/07/31 15:08:04 INFO BlockFetcherIterator$BasicBlockFetcherIterator: > maxBytesInFlight: 50331648, targetRequestSize: 10066329 > 14/07/31 15:08:04 INFO BlockFetcherIterator$BasicBlockFetcherIterator: > Getting 100 non-empty blocks out of 100 blocks > 14/07/31 15:08:04 INFO BlockFetcherIterator$BasicBlockFetcherIterator: > Started 16 remote fetches in 6 ms > 14/07/31 15:08:04 INFO Executor: Serialized size of result for 445 is 307049 > 14/07/31 15:08:04 INFO Executor: Sending result for 445 directly to driver > 14/07/31 15:08:04 INFO Executor: Finished task ID 445 > 14/07/31 15:08:06 INFO CoarseGrainedExecutorBackend: Got assigned task 467 > 14/07/31 15:08:06 INFO Executor: Running task ID 467 > 14/07/31 15:08:06 INFO BlockManager: Found block broadcast_0 locally > 14/07/31 15:08:06 INFO BlockFetcherIterator$BasicBlockFetcherIterator: > maxBytesInFlight: 50331648, targetRequestSize: 10066329 > 14/07/31 15:08:06 INFO BlockFetcherIterator$BasicBlockFetcherIterator: > Getting 100 non-empty blocks out of 100 blocks > 14/07/31 15:08:06 INFO BlockFetcherIterator$BasicBlockFetcherIterator: > Started 16 remote fetches in 6 ms > 14/07/31 15:08:07 INFO Executor: Serialized size of result for 467 is 301177 > 14/07/31 15:08:07 INFO Executor: Sending result for 467 directly to driver > 14/07/31 15:08:07 INFO Executor: Finished task ID 467 > 14/07/31 15:08:18 INFO ShuffleBlockManager: Deleted all files for shuffle 1 > 14/07/31 15:09:00 WARN BlockManagerMaster: Error sending message to > BlockManagerMaster in 1 attempts*java.util.concurrent.TimeoutException: > Futures timed out after [30 seconds] > at scala.concurrent.impl.Promise$DefaultPromise.ready(Promise.scala:219) > at > scala.concurrent.impl.Promise$DefaultPromise.result(Promise.scala:223) > at scala.concurrent.Await$$anonfun$result$1.apply(package.scala:107) > at > scala.concurrent.BlockContext$DefaultBlockContext$.blockOn(BlockContext.scala:53) > at scala.concurrent.Await$.result(package.scala:107) > at > org.apache.spark.storage.BlockManagerMaster.askDriverWithReply(BlockManagerMaster.scala:237) > at > org.apache.spark.storage.BlockManagerMaster.sendHeartBeat(BlockManagerMaster.scala:51) > at org.apache.spark.storage.BlockManager.org > <http://org.apache.spark.storage.BlockManager.org>$apache$spark$storage$BlockManager$$heartBeat(BlockManager.scala:113) > at > org.apache.spark.storage.BlockManager$$anonfun$initialize$1$$anonfun$apply$mcV$sp$1.apply$mcV$sp(BlockManager.scala:158) > at org.apache.spark.util.Utils$.tryOrExit(Utils.scala:790) > at > org.apache.spark.storage.BlockManager$$anonfun$initialize$1.apply$mcV$sp(BlockManager.scala:158) > at akka.actor.Scheduler$$anon$9.run(Scheduler.scala:80) > at > akka.actor.LightArrayRevolverScheduler$$anon$3$$anon$2.run(Scheduler.scala:241) > at > java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145) > at > java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615) > at java.lang.Thread.run(Thread.java:744)* > 14/07/31 15:09:12 INFO ShuffleBlockManager: Deleted all files for shuffle 0 > 14/07/31 15:09:12 INFO BlockManager: Removing RDD 6 > 14/07/31 15:09:12 INFO BlockManager: Removing block rdd_6_88 > 14/07/31 15:09:12 INFO MemoryStore: Block rdd_6_88 of size 1921571 dropped > from memory (free 12338538050) > 14/07/31 15:09:12 INFO BlockManager: Removing block rdd_6_25 > 14/07/31 15:09:12 INFO MemoryStore: Block rdd_6_25 of size 1922882 dropped > from memory (free 12340460932) > 14/07/31 15:09:12 INFO BlockManager: Removing block rdd_6_10 > 14/07/31 15:09:12 INFO MemoryStore: Block rdd_6_10 of size 1912396 dropped > from memory (free 12342373328) > 14/07/31 15:09:12 INFO BlockManager: Removing block rdd_6_54 > 14/07/31 15:09:12 INFO MemoryStore: Block rdd_6_54 of size 1909775 dropped > from memory (free 12344283103) > 14/07/31 15:09:12 INFO BlockManager: Removing block rdd_6_80 > 14/07/31 15:09:12 INFO MemoryStore: Block rdd_6_80 of size 1927469 dropped > from memory (free 12346210572) > 14/07/31 15:09:12 INFO BlockManager: Removing block rdd_6_48 > 14/07/31 15:09:12 INFO MemoryStore: Block rdd_6_48 of size 1927469 dropped > from memory (free 12348138041) > > > >