Hi Jeff,

Am Di., 24. März 2020 um 09:39 Uhr schrieb Jeff Zhang <zjf...@gmail.com>:

> Do you use the same version of spark for zeppelin 0.7 & 0.8 ? What kind of
> job do you run ? Is it easy to reproduce it ?
>

Zeppelin comes with spark 2.1.0 in 0.7.3 and spark 2.2.1 in 0.8.2, so
different version.
In this case it is a 'cache table ...' statement in the %spark.sql
interpreter and i see the larger delay in task executions everytime i
execute it with the different zeppelin versions.

BR,
Sebastian.


>
> Sebastian Albrecht <albrecht.sebast...@googlemail.com> 于2020年3月24日周二
> 下午4:24写道:
>
>> Hi,
>> i just upgraded from zeppelin 0.7.3 to 0.8.2. I am using zeppelin for
>> spark in local mode and 0.8.2 came with spark updated to 2.2.1. I now
>> experience slower spark job executions. I can show it in these log
>> snippets: task executions with 0.7.3: milliseconds, 0.8.2: seconds
>>
>> 0.7.3
>> INFO [2020-03-12 00:01:22,105] ({Executor task launch worker-2}
>> Logging.scala[logInfo]:54) - Code generated in 24.742647 ms
>>  INFO [2020-03-12 00:01:22,139] ({Executor task launch worker-2}
>> Logging.scala[logInfo]:54) - Code generated in 23.640976 ms
>>  INFO [2020-03-12 00:01:22,174] ({Executor task launch worker-2}
>> Logging.scala[logInfo]:54) - Code generated in 11.667064 ms
>>  INFO [2020-03-12 00:01:22,193] ({Executor task launch worker-0}
>> Logging.scala[logInfo]:54) - Code generated in 14.959374 ms
>>  INFO [2020-03-12 00:01:22,872] ({Executor task launch worker-6}
>> Logging.scala[logInfo]:54) - Finished task 6.0 in stage 8.0 (TID 14). 2593
>> bytes result sent to driver
>>  INFO [2020-03-12 00:01:22,873] ({dispatcher-event-loop-2}
>> Logging.scala[logInfo]:54) - Starting task 8.0 in stage 8.0 (TID 16,
>> localhost, executor driver, partition 8, NODE_LOCAL, 9086 bytes)
>>  INFO [2020-03-12 00:01:22,875] ({Executor task launch worker-6}
>> Logging.scala[logInfo]:54) - Running task 8.0 in stage 8.0 (TID 16)
>>  INFO [2020-03-12 00:01:22,876] ({task-result-getter-0}
>> Logging.scala[logInfo]:54) - Finished task 6.0 in stage 8.0 (TID 14) in
>> 2061 ms on localhost (executor driver) (1/97)
>>  INFO [2020-03-12 00:01:22,938] ({Executor task launch worker-2}
>> Logging.scala[logInfo]:54) - Finished task 2.0 in stage 8.0 (TID 10). 2593
>> bytes result sent to driver
>>  INFO [2020-03-12 00:01:22,939] ({dispatcher-event-loop-7}
>> Logging.scala[logInfo]:54) - Starting task 9.0 in stage 8.0 (TID 17,
>> localhost, executor driver, partition 9, NODE_LOCAL, 8838 bytes)
>>  INFO [2020-03-12 00:01:22,946] ({task-result-getter-1}
>> Logging.scala[logInfo]:54) - Finished task 2.0 in stage 8.0 (TID 10) in
>> 2135 ms on localhost (executor driver) (2/97)
>>  INFO [2020-03-12 00:01:22,962] ({Executor task launch worker-8}
>> Logging.scala[logInfo]:54) - Running task 9.0 in stage 8.0 (TID 17)
>>  INFO [2020-03-12 00:01:22,983] ({Executor task launch worker-3}
>> Logging.scala[logInfo]:54) - Finished task 3.0 in stage 8.0 (TID 11). 2506
>> bytes result sent to driver
>>  INFO [2020-03-12 00:01:22,984] ({dispatcher-event-loop-3}
>> Logging.scala[logInfo]:54) - Starting task 10.0 in stage 8.0 (TID 18,
>> localhost, executor driver, partition 10, NODE_LOCAL, 8839 bytes)
>>  INFO [2020-03-12 00:01:22,985] ({task-result-getter-2}
>> Logging.scala[logInfo]:54) - Finished task 3.0 in stage 8.0 (TID 11) in
>> 2173 ms on localhost (executor driver) (3/97)
>>  INFO [2020-03-12 00:01:22,986] ({Executor task launch worker-2}
>> Logging.scala[logInfo]:54) - Running task 10.0 in stage 8.0 (TID 18)
>>  INFO [2020-03-12 00:01:23,068] ({Executor task launch worker-0}
>> Logging.scala[logInfo]:54) - Finished task 0.0 in stage 8.0 (TID 8). 2506
>> bytes result sent to driver
>>  INFO [2020-03-12 00:01:23,069] ({dispatcher-event-loop-6}
>> Logging.scala[logInfo]:54) - Starting task 11.0 in stage 8.0 (TID 19,
>> localhost, executor driver, partition 11, NODE_LOCAL, 9333 bytes)
>>  INFO [2020-03-12 00:01:23,070] ({Executor task launch worker-3}
>> Logging.scala[logInfo]:54) - Running task 11.0 in stage 8.0 (TID 19)
>>  INFO [2020-03-12 00:01:23,070] ({task-result-getter-3}
>> Logging.scala[logInfo]:54) - Finished task 0.0 in stage 8.0 (TID 8) in 2262
>> ms on localhost (executor driver) (4/97)
>>  INFO [2020-03-12 00:01:23,088] ({Executor task launch worker-7}
>> Logging.scala[logInfo]:54) - Finished task 7.0 in stage 8.0 (TID 15). 2506
>> bytes result sent to driver
>>  INFO [2020-03-12 00:01:23,089] ({dispatcher-event-loop-2}
>> Logging.scala[logInfo]:54) - Starting task 12.0 in stage 8.0 (TID 20,
>> localhost, executor driver, partition 12, NODE_LOCAL, 8591 bytes)
>>  INFO [2020-03-12 00:01:23,090] ({task-result-getter-0}
>> Logging.scala[logInfo]:54) - Finished task 7.0 in stage 8.0 (TID 15) in
>> 2275 ms on localhost (executor driver) (5/97)
>>  INFO [2020-03-12 00:01:23,090] ({Executor task launch worker-7}
>> Logging.scala[logInfo]:54) - Running task 12.0 in stage 8.0 (TID 20)
>>  INFO [2020-03-12 00:01:23,266] ({Executor task launch worker-1}
>> Logging.scala[logInfo]:54) - Finished task 1.0 in stage 8.0 (TID 9). 2506
>> bytes result sent to driver
>>  INFO [2020-03-12 00:01:23,267] ({dispatcher-event-loop-7}
>> Logging.scala[logInfo]:54) - Starting task 13.0 in stage 8.0 (TID 21,
>> localhost, executor driver, partition 13, NODE_LOCAL, 9458 bytes)
>>  INFO [2020-03-12 00:01:23,268] ({task-result-getter-1}
>> Logging.scala[logInfo]:54) - Finished task 1.0 in stage 8.0 (TID 9) in 2457
>> ms on localhost (executor driver) (6/97)
>>  INFO [2020-03-12 00:01:23,269] ({Executor task launch worker-1}
>> Logging.scala[logInfo]:54) - Running task 13.0 in stage 8.0 (TID 21)
>>  INFO [2020-03-12 00:01:23,278] ({Executor task launch worker-4}
>> Logging.scala[logInfo]:54) - Finished task 4.0 in stage 8.0 (TID 12). 2506
>> bytes result sent to driver
>>  INFO [2020-03-12 00:01:23,279] ({dispatcher-event-loop-3}
>> Logging.scala[logInfo]:54) - Starting task 14.0 in stage 8.0 (TID 22,
>> localhost, executor driver, partition 14, NODE_LOCAL, 9086 bytes)
>>  INFO [2020-03-12 00:01:23,279] ({Executor task launch worker-0}
>> Logging.scala[logInfo]:54) - Running task 14.0 in stage 8.0 (TID 22)
>>  INFO [2020-03-12 00:01:23,280] ({task-result-getter-2}
>> Logging.scala[logInfo]:54) - Finished task 4.0 in stage 8.0 (TID 12) in
>> 2467 ms on localhost (executor driver) (7/97)
>>  INFO [2020-03-12 00:01:23,773] ({Executor task launch worker-5}
>> Logging.scala[logInfo]:54) - Finished task 5.0 in stage 8.0 (TID 13). 2506
>> bytes result sent to driver
>>  INFO [2020-03-12 00:01:23,775] ({dispatcher-event-loop-6}
>> Logging.scala[logInfo]:54) - Starting task 15.0 in stage 8.0 (TID 23,
>> localhost, executor driver, partition 15, NODE_LOCAL, 8962 bytes)
>>  INFO [2020-03-12 00:01:23,775] ({Executor task launch worker-5}
>> Logging.scala[logInfo]:54) - Running task 15.0 in stage 8.0 (TID 23)
>>  INFO [2020-03-12 00:01:23,775] ({task-result-getter-3}
>> Logging.scala[logInfo]:54) - Finished task 5.0 in stage 8.0 (TID 13) in
>> 2961 ms on localhost (executor driver) (8/97)
>>  INFO [2020-03-12 00:01:24,026] ({Executor task launch worker-2}
>> Logging.scala[logInfo]:54) - Finished task 10.0 in stage 8.0 (TID 18). 2506
>> bytes result sent to driver
>>  INFO [2020-03-12 00:01:24,028] ({dispatcher-event-loop-2}
>> Logging.scala[logInfo]:54) - Starting task 16.0 in stage 8.0 (TID 24,
>> localhost, executor driver, partition 16, NODE_LOCAL, 8963 bytes)
>>
>> 0.8.2
>>
>> INFO [2020-03-17 15:39:14,699] ({Executor task launch worker for task
>> 410} Logging.scala[logInfo]:54) - Code generated in 7.739789 ms
>>  INFO [2020-03-17 15:39:14,715] ({Executor task launch worker for task
>> 410} Logging.scala[logInfo]:54) - Code generated in 14.050658 ms
>>  INFO [2020-03-17 15:39:14,921] ({Executor task launch worker for task
>> 410} Logging.scala[logInfo]:54) - Finished task 0.0 in stage 7.0 (TID 410).
>> 2055 bytes result sent to driver
>>  INFO [2020-03-17 15:39:14,922] ({dispatcher-event-loop-3}
>> Logging.scala[logInfo]:54) - Starting task 1.0 in stage 7.0 (TID 411,
>> localhost, executor driver, partition 1, NODE_LOCAL, 7296 bytes)
>>  INFO [2020-03-17 15:39:14,922] ({Executor task launch worker for task
>> 411} Logging.scala[logInfo]:54) - Running task 1.0 in stage 7.0 (TID 411)
>>  INFO [2020-03-17 15:39:14,922] ({task-result-getter-2}
>> Logging.scala[logInfo]:54) - Finished task 0.0 in stage 7.0 (TID 410) in
>> 1171 ms on localhost (executor driver) (1/113)
>>  INFO [2020-03-17 15:39:16,679] ({Executor task launch worker for task
>> 411} Logging.scala[logInfo]:54) - Finished task 1.0 in stage 7.0 (TID 411).
>> 2012 bytes result sent to driver
>>  INFO [2020-03-17 15:39:16,680] ({dispatcher-event-loop-1}
>> Logging.scala[logInfo]:54) - Starting task 2.0 in stage 7.0 (TID 412,
>> localhost, executor driver, partition 2, NODE_LOCAL, 6923 bytes)
>>  INFO [2020-03-17 15:39:16,680] ({Executor task launch worker for task
>> 412} Logging.scala[logInfo]:54) - Running task 2.0 in stage 7.0 (TID 412)
>>  INFO [2020-03-17 15:39:16,680] ({task-result-getter-3}
>> Logging.scala[logInfo]:54) - Finished task 1.0 in stage 7.0 (TID 411) in
>> 1758 ms on localhost (executor driver) (2/113)
>>  INFO [2020-03-17 15:39:17,584] ({Executor task launch worker for task
>> 412} Logging.scala[logInfo]:54) - Finished task 2.0 in stage 7.0 (TID 412).
>> 2012 bytes result sent to driver
>>  INFO [2020-03-17 15:39:17,585] ({dispatcher-event-loop-7}
>> Logging.scala[logInfo]:54) - Starting task 3.0 in stage 7.0 (TID 413,
>> localhost, executor driver, partition 3, NODE_LOCAL, 7171 bytes)
>>  INFO [2020-03-17 15:39:17,585] ({Executor task launch worker for task
>> 413} Logging.scala[logInfo]:54) - Running task 3.0 in stage 7.0 (TID 413)
>>  INFO [2020-03-17 15:39:17,585] ({task-result-getter-0}
>> Logging.scala[logInfo]:54) - Finished task 2.0 in stage 7.0 (TID 412) in
>> 906 ms on localhost (executor driver) (3/113)
>>  INFO [2020-03-17 15:39:18,811] ({Executor task launch worker for task
>> 413} Logging.scala[logInfo]:54) - Finished task 3.0 in stage 7.0 (TID 413).
>> 2012 bytes result sent to driver
>>  INFO [2020-03-17 15:39:18,811] ({dispatcher-event-loop-6}
>> Logging.scala[logInfo]:54) - Starting task 4.0 in stage 7.0 (TID 414,
>> localhost, executor driver, partition 4, NODE_LOCAL, 6924 bytes)
>>  INFO [2020-03-17 15:39:18,812] ({Executor task launch worker for task
>> 414} Logging.scala[logInfo]:54) - Running task 4.0 in stage 7.0 (TID 414)
>>  INFO [2020-03-17 15:39:18,812] ({task-result-getter-1}
>> Logging.scala[logInfo]:54) - Finished task 3.0 in stage 7.0 (TID 413) in
>> 1228 ms on localhost (executor driver) (4/113)
>>  INFO [2020-03-17 15:39:19,459] ({Executor task launch worker for task
>> 414} Logging.scala[logInfo]:54) - Finished task 4.0 in stage 7.0 (TID 414).
>> 2012 bytes result sent to driver
>>  INFO [2020-03-17 15:39:19,460] ({dispatcher-event-loop-5}
>> Logging.scala[logInfo]:54) - Starting task 5.0 in stage 7.0 (TID 415,
>> localhost, executor driver, partition 5, NODE_LOCAL, 6923 bytes)
>>  INFO [2020-03-17 15:39:19,460] ({Executor task launch worker for task
>> 415} Logging.scala[logInfo]:54) - Running task 5.0 in stage 7.0 (TID 415)
>>  INFO [2020-03-17 15:39:19,460] ({task-result-getter-2}
>> Logging.scala[logInfo]:54) - Finished task 4.0 in stage 7.0 (TID 414) in
>> 649 ms on localhost (executor driver) (5/113)
>>  INFO [2020-03-17 15:39:21,712] ({Executor task launch worker for task
>> 415} Logging.scala[logInfo]:54) - Finished task 5.0 in stage 7.0 (TID 415).
>> 2055 bytes result sent to driver
>>  INFO [2020-03-17 15:39:21,712] ({dispatcher-event-loop-4}
>> Logging.scala[logInfo]:54) - Starting task 6.0 in stage 7.0 (TID 416,
>> localhost, executor driver, partition 6, NODE_LOCAL, 7915 bytes)
>>  INFO [2020-03-17 15:39:21,712] ({Executor task launch worker for task
>> 416} Logging.scala[logInfo]:54) - Running task 6.0 in stage 7.0 (TID 416)
>>  INFO [2020-03-17 15:39:21,712] ({task-result-getter-3}
>> Logging.scala[logInfo]:54) - Finished task 5.0 in stage 7.0 (TID 415) in
>> 2252 ms on localhost (executor driver) (6/113)
>>  INFO [2020-03-17 15:39:22,406] ({Executor task launch worker for task
>> 416} Logging.scala[logInfo]:54) - Finished task 6.0 in stage 7.0 (TID 416).
>> 2012 bytes result sent to driver
>>  INFO [2020-03-17 15:39:22,406] ({dispatcher-event-loop-6}
>> Logging.scala[logInfo]:54) - Starting task 7.0 in stage 7.0 (TID 417,
>> localhost, executor driver, partition 7, NODE_LOCAL, 7171 bytes)
>>  INFO [2020-03-17 15:39:22,406] ({Executor task launch worker for task
>> 417} Logging.scala[logInfo]:54) - Running task 7.0 in stage 7.0 (TID 417)
>>  INFO [2020-03-17 15:39:22,406] ({task-result-getter-0}
>> Logging.scala[logInfo]:54) - Finished task 6.0 in stage 7.0 (TID 416) in
>> 694 ms on localhost (executor driver) (7/113)
>>  INFO [2020-03-17 15:39:23,407] ({Executor task launch worker for task
>> 417} Logging.scala[logInfo]:54) - Finished task 7.0 in stage 7.0 (TID 417).
>> 2012 bytes result sent to driver
>>  INFO [2020-03-17 15:39:23,407] ({dispatcher-event-loop-5}
>> Logging.scala[logInfo]:54) - Starting task 8.0 in stage 7.0 (TID 418,
>> localhost, executor driver, partition 8, NODE_LOCAL, 7047 bytes)
>>  INFO [2020-03-17 15:39:23,407] ({Executor task launch worker for task
>> 418} Logging.scala[logInfo]:54) - Running task 8.0 in stage 7.0 (TID 418)
>>  INFO [2020-03-17 15:39:23,407] ({task-result-getter-1}
>> Logging.scala[logInfo]:54) - Finished task 7.0 in stage 7.0 (TID 417) in
>> 1001 ms on localhost (executor driver) (8/113)
>>  INFO [2020-03-17 15:39:24,450] ({Executor task launch worker for task
>> 418} Logging.scala[logInfo]:54) - Finished task 8.0 in stage 7.0 (TID 418).
>> 2012 bytes result sent to driver
>>  INFO [2020-03-17 15:39:24,450] ({dispatcher-event-loop-2}
>> Logging.scala[logInfo]:54) - Starting task 9.0 in stage 7.0 (TID 419,
>> localhost, executor driver, partition 9, NODE_LOCAL, 6924 bytes)
>>
>> I already gave zeppelin itself as well as the spark interpreter more Xmx.
>> Is this a spark only issue or what would you recommend to narrow down the
>> possible source for that performance fall.
>>
>> Thanks + BR,
>> Sebastian.
>>
>>
>
> --
> Best Regards
>
> Jeff Zhang
>

Reply via email to