I'm interested in this issue as well.  I have Spark Streaming jobs that
seem to run well for a while, but slowly degrade and don't recover.


On Wed, Jun 11, 2014 at 11:08 PM, Boduo Li <onpo...@gmail.com> wrote:

> It seems that the slow "reduce" tasks are caused by slow shuffling. Here are
> the logs for one slow "reduce" task:
>
> 14/06/11 23:42:45 INFO BlockFetcherIterator$BasicBlockFetcherIterator: Got
> remote block shuffle_69_88_18 after  5029 ms
> 14/06/11 23:42:45 INFO BlockFetcherIterator$BasicBlockFetcherIterator: Got
> remote block shuffle_69_89_18 after  5029 ms
> 14/06/11 23:42:45 INFO BlockFetcherIterator$BasicBlockFetcherIterator: Got
> remote block shuffle_69_90_18 after  5029 ms
> 14/06/11 23:42:45 INFO BlockFetcherIterator$BasicBlockFetcherIterator: Got
> remote block shuffle_69_91_18 after  5029 ms
> 14/06/11 23:42:45 INFO BlockFetcherIterator$BasicBlockFetcherIterator: Got
> remote block shuffle_69_92_18 after  5029 ms
> 14/06/11 23:42:45 INFO BlockFetcherIterator$BasicBlockFetcherIterator: Got
> remote block shuffle_69_93_18 after  5029 ms
> 14/06/11 23:42:45 INFO BlockFetcherIterator$BasicBlockFetcherIterator: Got
> remote block shuffle_69_94_18 after  5029 ms
> 14/06/11 23:42:45 INFO BlockFetcherIterator$BasicBlockFetcherIterator: Got
> remote block shuffle_69_95_18 after  5029 ms
> 14/06/11 23:42:45 INFO BlockFetcherIterator$BasicBlockFetcherIterator: Got
> remote block shuffle_69_96_18 after  5029 ms
> 14/06/11 23:42:45 INFO BlockFetcherIterator$BasicBlockFetcherIterator: Got
> remote block shuffle_69_97_18 after  5029 ms
> 14/06/11 23:42:45 INFO BlockFetcherIterator$BasicBlockFetcherIterator: Got
> remote block shuffle_69_188_18 after  5029 ms
> 14/06/11 23:42:45 INFO BlockFetcherIterator$BasicBlockFetcherIterator: Got
> remote block shuffle_69_189_18 after  5029 ms
> 14/06/11 23:42:45 INFO BlockFetcherIterator$BasicBlockFetcherIterator: Got
> remote block shuffle_69_190_18 after  5029 ms
> 14/06/11 23:42:45 INFO BlockFetcherIterator$BasicBlockFetcherIterator: Got
> remote block shuffle_69_191_18 after  5029 ms
> 14/06/11 23:42:45 INFO BlockFetcherIterator$BasicBlockFetcherIterator: Got
> remote block shuffle_69_192_18 after  5029 ms
> 14/06/11 23:42:45 INFO BlockFetcherIterator$BasicBlockFetcherIterator: Got
> remote block shuffle_69_193_18 after  5029 ms
> 14/06/11 23:42:45 INFO BlockFetcherIterator$BasicBlockFetcherIterator: Got
> remote block shuffle_69_194_18 after  5029 ms
> 14/06/11 23:42:45 INFO BlockFetcherIterator$BasicBlockFetcherIterator: Got
> remote block shuffle_69_195_18 after  5029 ms
> 14/06/11 23:42:45 INFO BlockFetcherIterator$BasicBlockFetcherIterator: Got
> remote block shuffle_69_196_18 after  5029 ms
> 14/06/11 23:42:45 INFO BlockFetcherIterator$BasicBlockFetcherIterator: Got
> remote block shuffle_69_197_18 after  5029 ms
> 14/06/11 23:42:45 INFO Executor: Serialized size of result for 23643 is
> 1143
> 14/06/11 23:42:45 INFO Executor: Sending result for 23643 directly to
> driver
> 14/06/11 23:42:45 INFO Executor: Finished task ID 23643
>
>
>
>
> --
> View this message in context:
> http://apache-spark-user-list.1001560.n3.nabble.com/How-to-achieve-reasonable-performance-on-Spark-Streaming-tp7262p7454.html
> Sent from the Apache Spark User List mailing list archive at Nabble.com.
>
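
Every fetch in the quoted excerpt reports the same 5029 ms, so it may help to
aggregate these lines per reduce partition rather than read them one by one.
Below is a rough sketch of how that could be done in Python. It assumes the
block IDs follow the shuffle_<shuffleId>_<mapId>_<reduceId> layout, and the
function name summarize_fetches is made up for illustration; it is not part of
Spark.

```python
import re
from collections import defaultdict

# Matches the "Got remote block ... after N ms" message, even when the
# mail client wrapped it across two lines (\s+ also matches newlines).
FETCH_RE = re.compile(
    r"Got\s+remote\s+block\s+shuffle_(\d+)_(\d+)_(\d+)\s+after\s+(\d+)\s+ms"
)


def summarize_fetches(log_text):
    """Group remote-block fetch times by (shuffle id, reduce id).

    Hypothetical helper: assumes block IDs are laid out as
    shuffle_<shuffleId>_<mapId>_<reduceId>.
    """
    # Drop e-mail quote markers so wrapped log lines match as one run.
    unquoted = re.sub(r"^>\s?", "", log_text, flags=re.MULTILINE)

    times = defaultdict(list)
    for shuffle_id, _map_id, reduce_id, ms in FETCH_RE.findall(unquoted):
        times[(int(shuffle_id), int(reduce_id))].append(int(ms))

    return {
        key: {
            "blocks": len(values),
            "max_ms": max(values),
            "mean_ms": sum(values) / len(values),
        }
        for key, values in times.items()
    }


# Two entries copied from the quoted log, including the quote markers.
sample = """\
> 14/06/11 23:42:45 INFO BlockFetcherIterator$BasicBlockFetcherIterator: Got
> remote block shuffle_69_88_18 after  5029 ms
> 14/06/11 23:42:45 INFO BlockFetcherIterator$BasicBlockFetcherIterator: Got
> remote block shuffle_69_89_18 after  5029 ms
"""

print(summarize_fetches(sample))
```

Running this over a full executor log and sorting by max_ms should show
whether the slow fetches cluster on particular shuffles or reduce partitions.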
