Thank you for your reply.
I would also like to know whether there is a demo or documentation on how to 
profile and debug Zeppelin on a local machine.
Thanks a lot.

From: Jeff Zhang <zjf...@gmail.com>
Reply-To: "users@zeppelin.apache.org" <users@zeppelin.apache.org>
Date: Tuesday, March 24, 2020 at 23:40
To: users <users@zeppelin.apache.org>
Subject: Re: Spark performance after update to 0.8.2(Internet mail)

Hi lewinma,


The most comprehensive documentation is the official Zeppelin docs. Let us know 
what kind of problem or concern you would like to address.
BTW, we do need to improve the docs; we should do better on them in the next 
release.

lewinma(lewinma) <lewi...@tencent.com> wrote on Tue, Mar 24, 2020 at 9:52 PM:
Hi Jeff, are there any docs about the architecture of Zeppelin? Any kind of 
help would be greatly appreciated.

From: Sebastian Albrecht <albrecht.sebast...@googlemail.com>
Reply-To: "users@zeppelin.apache.org" <users@zeppelin.apache.org>
Date: Tuesday, March 24, 2020 at 19:11
To: "users@zeppelin.apache.org" <users@zeppelin.apache.org>
Subject: Re: Spark performance after update to 0.8.2(Internet mail)

Hi Jeff,

On Tue, Mar 24, 2020 at 09:39, Jeff Zhang <zjf...@gmail.com> wrote:
Do you use the same version of Spark for Zeppelin 0.7 and 0.8? What kind of job 
do you run? Is it easy to reproduce?

Zeppelin 0.7.3 comes with Spark 2.1.0 and 0.8.2 comes with Spark 2.2.1, so the 
versions differ.
In this case it is a 'cache table ...' statement in the %spark.sql interpreter, 
and I see the larger delay in task executions every time I execute it with the 
different Zeppelin versions.

BR,
Sebastian.


Sebastian Albrecht <albrecht.sebast...@googlemail.com> wrote on Tue, Mar 24, 
2020 at 4:24 PM:
Hi,
I just upgraded from Zeppelin 0.7.3 to 0.8.2. I am using Zeppelin for Spark in 
local mode, and 0.8.2 came with Spark updated to 2.2.1. I now experience slower 
Spark job executions, as these log snippets show: task executions take 
milliseconds with 0.7.3 and seconds with 0.8.2.

0.7.3
INFO [2020-03-12 00:01:22,105] ({Executor task launch worker-2} 
Logging.scala[logInfo]:54) - Code generated in 24.742647 ms
 INFO [2020-03-12 00:01:22,139] ({Executor task launch worker-2} 
Logging.scala[logInfo]:54) - Code generated in 23.640976 ms
 INFO [2020-03-12 00:01:22,174] ({Executor task launch worker-2} 
Logging.scala[logInfo]:54) - Code generated in 11.667064 ms
 INFO [2020-03-12 00:01:22,193] ({Executor task launch worker-0} 
Logging.scala[logInfo]:54) - Code generated in 14.959374 ms
 INFO [2020-03-12 00:01:22,872] ({Executor task launch worker-6} 
Logging.scala[logInfo]:54) - Finished task 6.0 in stage 8.0 (TID 14). 2593 
bytes result sent to driver
 INFO [2020-03-12 00:01:22,873] ({dispatcher-event-loop-2} 
Logging.scala[logInfo]:54) - Starting task 8.0 in stage 8.0 (TID 16, localhost, 
executor driver, partition 8, NODE_LOCAL, 9086 bytes)
 INFO [2020-03-12 00:01:22,875] ({Executor task launch worker-6} 
Logging.scala[logInfo]:54) - Running task 8.0 in stage 8.0 (TID 16)
 INFO [2020-03-12 00:01:22,876] ({task-result-getter-0} 
Logging.scala[logInfo]:54) - Finished task 6.0 in stage 8.0 (TID 14) in 2061 ms 
on localhost (executor driver) (1/97)
 INFO [2020-03-12 00:01:22,938] ({Executor task launch worker-2} 
Logging.scala[logInfo]:54) - Finished task 2.0 in stage 8.0 (TID 10). 2593 
bytes result sent to driver
 INFO [2020-03-12 00:01:22,939] ({dispatcher-event-loop-7} 
Logging.scala[logInfo]:54) - Starting task 9.0 in stage 8.0 (TID 17, localhost, 
executor driver, partition 9, NODE_LOCAL, 8838 bytes)
 INFO [2020-03-12 00:01:22,946] ({task-result-getter-1} 
Logging.scala[logInfo]:54) - Finished task 2.0 in stage 8.0 (TID 10) in 2135 ms 
on localhost (executor driver) (2/97)
 INFO [2020-03-12 00:01:22,962] ({Executor task launch worker-8} 
Logging.scala[logInfo]:54) - Running task 9.0 in stage 8.0 (TID 17)
 INFO [2020-03-12 00:01:22,983] ({Executor task launch worker-3} 
Logging.scala[logInfo]:54) - Finished task 3.0 in stage 8.0 (TID 11). 2506 
bytes result sent to driver
 INFO [2020-03-12 00:01:22,984] ({dispatcher-event-loop-3} 
Logging.scala[logInfo]:54) - Starting task 10.0 in stage 8.0 (TID 18, 
localhost, executor driver, partition 10, NODE_LOCAL, 8839 bytes)
 INFO [2020-03-12 00:01:22,985] ({task-result-getter-2} 
Logging.scala[logInfo]:54) - Finished task 3.0 in stage 8.0 (TID 11) in 2173 ms 
on localhost (executor driver) (3/97)
 INFO [2020-03-12 00:01:22,986] ({Executor task launch worker-2} 
Logging.scala[logInfo]:54) - Running task 10.0 in stage 8.0 (TID 18)
 INFO [2020-03-12 00:01:23,068] ({Executor task launch worker-0} 
Logging.scala[logInfo]:54) - Finished task 0.0 in stage 8.0 (TID 8). 2506 bytes 
result sent to driver
 INFO [2020-03-12 00:01:23,069] ({dispatcher-event-loop-6} 
Logging.scala[logInfo]:54) - Starting task 11.0 in stage 8.0 (TID 19, 
localhost, executor driver, partition 11, NODE_LOCAL, 9333 bytes)
 INFO [2020-03-12 00:01:23,070] ({Executor task launch worker-3} 
Logging.scala[logInfo]:54) - Running task 11.0 in stage 8.0 (TID 19)
 INFO [2020-03-12 00:01:23,070] ({task-result-getter-3} 
Logging.scala[logInfo]:54) - Finished task 0.0 in stage 8.0 (TID 8) in 2262 ms 
on localhost (executor driver) (4/97)
 INFO [2020-03-12 00:01:23,088] ({Executor task launch worker-7} 
Logging.scala[logInfo]:54) - Finished task 7.0 in stage 8.0 (TID 15). 2506 
bytes result sent to driver
 INFO [2020-03-12 00:01:23,089] ({dispatcher-event-loop-2} 
Logging.scala[logInfo]:54) - Starting task 12.0 in stage 8.0 (TID 20, 
localhost, executor driver, partition 12, NODE_LOCAL, 8591 bytes)
 INFO [2020-03-12 00:01:23,090] ({task-result-getter-0} 
Logging.scala[logInfo]:54) - Finished task 7.0 in stage 8.0 (TID 15) in 2275 ms 
on localhost (executor driver) (5/97)
 INFO [2020-03-12 00:01:23,090] ({Executor task launch worker-7} 
Logging.scala[logInfo]:54) - Running task 12.0 in stage 8.0 (TID 20)
 INFO [2020-03-12 00:01:23,266] ({Executor task launch worker-1} 
Logging.scala[logInfo]:54) - Finished task 1.0 in stage 8.0 (TID 9). 2506 bytes 
result sent to driver
 INFO [2020-03-12 00:01:23,267] ({dispatcher-event-loop-7} 
Logging.scala[logInfo]:54) - Starting task 13.0 in stage 8.0 (TID 21, 
localhost, executor driver, partition 13, NODE_LOCAL, 9458 bytes)
 INFO [2020-03-12 00:01:23,268] ({task-result-getter-1} 
Logging.scala[logInfo]:54) - Finished task 1.0 in stage 8.0 (TID 9) in 2457 ms 
on localhost (executor driver) (6/97)
 INFO [2020-03-12 00:01:23,269] ({Executor task launch worker-1} 
Logging.scala[logInfo]:54) - Running task 13.0 in stage 8.0 (TID 21)
 INFO [2020-03-12 00:01:23,278] ({Executor task launch worker-4} 
Logging.scala[logInfo]:54) - Finished task 4.0 in stage 8.0 (TID 12). 2506 
bytes result sent to driver
 INFO [2020-03-12 00:01:23,279] ({dispatcher-event-loop-3} 
Logging.scala[logInfo]:54) - Starting task 14.0 in stage 8.0 (TID 22, 
localhost, executor driver, partition 14, NODE_LOCAL, 9086 bytes)
 INFO [2020-03-12 00:01:23,279] ({Executor task launch worker-0} 
Logging.scala[logInfo]:54) - Running task 14.0 in stage 8.0 (TID 22)
 INFO [2020-03-12 00:01:23,280] ({task-result-getter-2} 
Logging.scala[logInfo]:54) - Finished task 4.0 in stage 8.0 (TID 12) in 2467 ms 
on localhost (executor driver) (7/97)
 INFO [2020-03-12 00:01:23,773] ({Executor task launch worker-5} 
Logging.scala[logInfo]:54) - Finished task 5.0 in stage 8.0 (TID 13). 2506 
bytes result sent to driver
 INFO [2020-03-12 00:01:23,775] ({dispatcher-event-loop-6} 
Logging.scala[logInfo]:54) - Starting task 15.0 in stage 8.0 (TID 23, 
localhost, executor driver, partition 15, NODE_LOCAL, 8962 bytes)
 INFO [2020-03-12 00:01:23,775] ({Executor task launch worker-5} 
Logging.scala[logInfo]:54) - Running task 15.0 in stage 8.0 (TID 23)
 INFO [2020-03-12 00:01:23,775] ({task-result-getter-3} 
Logging.scala[logInfo]:54) - Finished task 5.0 in stage 8.0 (TID 13) in 2961 ms 
on localhost (executor driver) (8/97)
 INFO [2020-03-12 00:01:24,026] ({Executor task launch worker-2} 
Logging.scala[logInfo]:54) - Finished task 10.0 in stage 8.0 (TID 18). 2506 
bytes result sent to driver
 INFO [2020-03-12 00:01:24,028] ({dispatcher-event-loop-2} 
Logging.scala[logInfo]:54) - Starting task 16.0 in stage 8.0 (TID 24, 
localhost, executor driver, partition 16, NODE_LOCAL, 8963 bytes)

0.8.2

INFO [2020-03-17 15:39:14,699] ({Executor task launch worker for task 410} 
Logging.scala[logInfo]:54) - Code generated in 7.739789 ms
 INFO [2020-03-17 15:39:14,715] ({Executor task launch worker for task 410} 
Logging.scala[logInfo]:54) - Code generated in 14.050658 ms
 INFO [2020-03-17 15:39:14,921] ({Executor task launch worker for task 410} 
Logging.scala[logInfo]:54) - Finished task 0.0 in stage 7.0 (TID 410). 2055 
bytes result sent to driver
 INFO [2020-03-17 15:39:14,922] ({dispatcher-event-loop-3} 
Logging.scala[logInfo]:54) - Starting task 1.0 in stage 7.0 (TID 411, 
localhost, executor driver, partition 1, NODE_LOCAL, 7296 bytes)
 INFO [2020-03-17 15:39:14,922] ({Executor task launch worker for task 411} 
Logging.scala[logInfo]:54) - Running task 1.0 in stage 7.0 (TID 411)
 INFO [2020-03-17 15:39:14,922] ({task-result-getter-2} 
Logging.scala[logInfo]:54) - Finished task 0.0 in stage 7.0 (TID 410) in 1171 
ms on localhost (executor driver) (1/113)
 INFO [2020-03-17 15:39:16,679] ({Executor task launch worker for task 411} 
Logging.scala[logInfo]:54) - Finished task 1.0 in stage 7.0 (TID 411). 2012 
bytes result sent to driver
 INFO [2020-03-17 15:39:16,680] ({dispatcher-event-loop-1} 
Logging.scala[logInfo]:54) - Starting task 2.0 in stage 7.0 (TID 412, 
localhost, executor driver, partition 2, NODE_LOCAL, 6923 bytes)
 INFO [2020-03-17 15:39:16,680] ({Executor task launch worker for task 412} 
Logging.scala[logInfo]:54) - Running task 2.0 in stage 7.0 (TID 412)
 INFO [2020-03-17 15:39:16,680] ({task-result-getter-3} 
Logging.scala[logInfo]:54) - Finished task 1.0 in stage 7.0 (TID 411) in 1758 
ms on localhost (executor driver) (2/113)
 INFO [2020-03-17 15:39:17,584] ({Executor task launch worker for task 412} 
Logging.scala[logInfo]:54) - Finished task 2.0 in stage 7.0 (TID 412). 2012 
bytes result sent to driver
 INFO [2020-03-17 15:39:17,585] ({dispatcher-event-loop-7} 
Logging.scala[logInfo]:54) - Starting task 3.0 in stage 7.0 (TID 413, 
localhost, executor driver, partition 3, NODE_LOCAL, 7171 bytes)
 INFO [2020-03-17 15:39:17,585] ({Executor task launch worker for task 413} 
Logging.scala[logInfo]:54) - Running task 3.0 in stage 7.0 (TID 413)
 INFO [2020-03-17 15:39:17,585] ({task-result-getter-0} 
Logging.scala[logInfo]:54) - Finished task 2.0 in stage 7.0 (TID 412) in 906 ms 
on localhost (executor driver) (3/113)
 INFO [2020-03-17 15:39:18,811] ({Executor task launch worker for task 413} 
Logging.scala[logInfo]:54) - Finished task 3.0 in stage 7.0 (TID 413). 2012 
bytes result sent to driver
 INFO [2020-03-17 15:39:18,811] ({dispatcher-event-loop-6} 
Logging.scala[logInfo]:54) - Starting task 4.0 in stage 7.0 (TID 414, 
localhost, executor driver, partition 4, NODE_LOCAL, 6924 bytes)
 INFO [2020-03-17 15:39:18,812] ({Executor task launch worker for task 414} 
Logging.scala[logInfo]:54) - Running task 4.0 in stage 7.0 (TID 414)
 INFO [2020-03-17 15:39:18,812] ({task-result-getter-1} 
Logging.scala[logInfo]:54) - Finished task 3.0 in stage 7.0 (TID 413) in 1228 
ms on localhost (executor driver) (4/113)
 INFO [2020-03-17 15:39:19,459] ({Executor task launch worker for task 414} 
Logging.scala[logInfo]:54) - Finished task 4.0 in stage 7.0 (TID 414). 2012 
bytes result sent to driver
 INFO [2020-03-17 15:39:19,460] ({dispatcher-event-loop-5} 
Logging.scala[logInfo]:54) - Starting task 5.0 in stage 7.0 (TID 415, 
localhost, executor driver, partition 5, NODE_LOCAL, 6923 bytes)
 INFO [2020-03-17 15:39:19,460] ({Executor task launch worker for task 415} 
Logging.scala[logInfo]:54) - Running task 5.0 in stage 7.0 (TID 415)
 INFO [2020-03-17 15:39:19,460] ({task-result-getter-2} 
Logging.scala[logInfo]:54) - Finished task 4.0 in stage 7.0 (TID 414) in 649 ms 
on localhost (executor driver) (5/113)
 INFO [2020-03-17 15:39:21,712] ({Executor task launch worker for task 415} 
Logging.scala[logInfo]:54) - Finished task 5.0 in stage 7.0 (TID 415). 2055 
bytes result sent to driver
 INFO [2020-03-17 15:39:21,712] ({dispatcher-event-loop-4} 
Logging.scala[logInfo]:54) - Starting task 6.0 in stage 7.0 (TID 416, 
localhost, executor driver, partition 6, NODE_LOCAL, 7915 bytes)
 INFO [2020-03-17 15:39:21,712] ({Executor task launch worker for task 416} 
Logging.scala[logInfo]:54) - Running task 6.0 in stage 7.0 (TID 416)
 INFO [2020-03-17 15:39:21,712] ({task-result-getter-3} 
Logging.scala[logInfo]:54) - Finished task 5.0 in stage 7.0 (TID 415) in 2252 
ms on localhost (executor driver) (6/113)
 INFO [2020-03-17 15:39:22,406] ({Executor task launch worker for task 416} 
Logging.scala[logInfo]:54) - Finished task 6.0 in stage 7.0 (TID 416). 2012 
bytes result sent to driver
 INFO [2020-03-17 15:39:22,406] ({dispatcher-event-loop-6} 
Logging.scala[logInfo]:54) - Starting task 7.0 in stage 7.0 (TID 417, 
localhost, executor driver, partition 7, NODE_LOCAL, 7171 bytes)
 INFO [2020-03-17 15:39:22,406] ({Executor task launch worker for task 417} 
Logging.scala[logInfo]:54) - Running task 7.0 in stage 7.0 (TID 417)
 INFO [2020-03-17 15:39:22,406] ({task-result-getter-0} 
Logging.scala[logInfo]:54) - Finished task 6.0 in stage 7.0 (TID 416) in 694 ms 
on localhost (executor driver) (7/113)
 INFO [2020-03-17 15:39:23,407] ({Executor task launch worker for task 417} 
Logging.scala[logInfo]:54) - Finished task 7.0 in stage 7.0 (TID 417). 2012 
bytes result sent to driver
 INFO [2020-03-17 15:39:23,407] ({dispatcher-event-loop-5} 
Logging.scala[logInfo]:54) - Starting task 8.0 in stage 7.0 (TID 418, 
localhost, executor driver, partition 8, NODE_LOCAL, 7047 bytes)
 INFO [2020-03-17 15:39:23,407] ({Executor task launch worker for task 418} 
Logging.scala[logInfo]:54) - Running task 8.0 in stage 7.0 (TID 418)
 INFO [2020-03-17 15:39:23,407] ({task-result-getter-1} 
Logging.scala[logInfo]:54) - Finished task 7.0 in stage 7.0 (TID 417) in 1001 
ms on localhost (executor driver) (8/113)
 INFO [2020-03-17 15:39:24,450] ({Executor task launch worker for task 418} 
Logging.scala[logInfo]:54) - Finished task 8.0 in stage 7.0 (TID 418). 2012 
bytes result sent to driver
 INFO [2020-03-17 15:39:24,450] ({dispatcher-event-loop-2} 
Logging.scala[logInfo]:54) - Starting task 9.0 in stage 7.0 (TID 419, 
localhost, executor driver, partition 9, NODE_LOCAL, 6924 bytes)
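
To compare the two excerpts with numbers rather than by eye, a minimal Python 
sketch along the following lines could pull the per-task durations out of the 
"Finished task ... in N ms" lines. The function names and the two short 
sample strings are only illustrative (the samples are copied from the logs 
above), and the regular expression assumes the log layout shown here.

```python
import re
from statistics import mean

# Matches the scheduler's per-task completion line, e.g.
# "Finished task 6.0 in stage 8.0 (TID 14) in 2061 ms on localhost".
# The executor's "... (TID 14). 2593 bytes result sent to driver" line
# does not match, because it has no "in N ms" part.
FINISHED = re.compile(
    r"Finished task (\d+)\.\d+ in stage (\d+)\.\d+ \(TID (\d+)\) in (\d+) ms"
)


def task_durations_ms(log_text):
    """Return the task durations (ms) found in a possibly line-wrapped log."""
    # The mail client wrapped long log lines, so collapse all whitespace first.
    flat = re.sub(r"\s+", " ", log_text)
    return [int(m.group(4)) for m in FINISHED.finditer(flat)]


def summarize(label, durations):
    """Format count, min, mean and max for one set of durations."""
    if not durations:
        return f"{label}: no finished tasks found"
    return (f"{label}: n={len(durations)} min={min(durations)} ms "
            f"mean={mean(durations):.0f} ms max={max(durations)} ms")


# Two records from each excerpt above, wrapped the same way as in the mail.
sample_073 = """
 INFO [2020-03-12 00:01:22,876] ({task-result-getter-0}
Logging.scala[logInfo]:54) - Finished task 6.0 in stage 8.0 (TID 14) in 2061 ms
on localhost (executor driver) (1/97)
 INFO [2020-03-12 00:01:22,946] ({task-result-getter-1}
Logging.scala[logInfo]:54) - Finished task 2.0 in stage 8.0 (TID 10) in 2135 ms
on localhost (executor driver) (2/97)
"""
sample_082 = """
 INFO [2020-03-17 15:39:14,922] ({task-result-getter-2}
Logging.scala[logInfo]:54) - Finished task 0.0 in stage 7.0 (TID 410) in 1171
ms on localhost (executor driver) (1/113)
 INFO [2020-03-17 15:39:16,680] ({task-result-getter-3}
Logging.scala[logInfo]:54) - Finished task 1.0 in stage 7.0 (TID 411) in 1758
ms on localhost (executor driver) (2/113)
"""

print(summarize("0.7.3", task_durations_ms(sample_073)))
print(summarize("0.8.2", task_durations_ms(sample_082)))
```

Running it over the full interpreter log of each version, rather than these 
short samples, would give a fairer picture of per-task time.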

I already gave Zeppelin itself as well as the Spark interpreter more heap 
(Xmx). Is this a Spark-only issue, or what would you recommend to narrow down 
the possible source of this performance drop?
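
One thing that might help narrow this down is checking how many tasks overlap 
in time in each log. The Python sketch below is only an assumption-laden 
illustration: it rebuilds each task's interval from the timestamp of its 
"Finished task ... in N ms" line minus the reported duration, then counts the 
peak overlap. The function names and the `slack_ms` tolerance (to absorb 
millisecond jitter between log timestamps and reported durations) are 
hypothetical choices, and the samples are copied from the logs in this mail.

```python
import re
from datetime import datetime, timedelta

# One scheduler completion record: log timestamp, thread name, logger, then
# "Finished task ... (TID n) in N ms".
RECORD = re.compile(
    r"\[(\d{4}-\d{2}-\d{2} \d{2}:\d{2}:\d{2},\d{3})\] \(\{[^}]*\} \S+ - "
    r"Finished task \S+ in stage \S+ \(TID \d+\) in (\d+) ms"
)


def task_intervals(log_text):
    """Return (start, end) datetimes for each finished task in the log."""
    # Collapse whitespace so records wrapped by the mail client match.
    flat = re.sub(r"\s+", " ", log_text)
    intervals = []
    for stamp, dur in RECORD.findall(flat):
        end = datetime.strptime(stamp, "%Y-%m-%d %H:%M:%S,%f")
        intervals.append((end - timedelta(milliseconds=int(dur)), end))
    return intervals


def peak_concurrency(intervals, slack_ms=5):
    """Largest number of intervals that overlap at any instant.

    Each interval's end is pulled in by slack_ms so that back-to-back tasks
    are not counted as overlapping; slack_ms should be much smaller than the
    task durations.
    """
    slack = timedelta(milliseconds=slack_ms)
    events = []
    for start, end in intervals:
        events.append((start, 1))
        events.append((end - slack, -1))
    running = peak = 0
    # Tuples sort by time first; at equal times, ends (-1) come before starts.
    for _, delta in sorted(events):
        running += delta
        peak = max(peak, running)
    return peak


sample_073 = """
 INFO [2020-03-12 00:01:22,876] ({task-result-getter-0}
Logging.scala[logInfo]:54) - Finished task 6.0 in stage 8.0 (TID 14) in 2061 ms
on localhost (executor driver) (1/97)
 INFO [2020-03-12 00:01:22,946] ({task-result-getter-1}
Logging.scala[logInfo]:54) - Finished task 2.0 in stage 8.0 (TID 10) in 2135 ms
on localhost (executor driver) (2/97)
 INFO [2020-03-12 00:01:22,985] ({task-result-getter-2}
Logging.scala[logInfo]:54) - Finished task 3.0 in stage 8.0 (TID 11) in 2173 ms
on localhost (executor driver) (3/97)
"""
sample_082 = """
 INFO [2020-03-17 15:39:14,922] ({task-result-getter-2}
Logging.scala[logInfo]:54) - Finished task 0.0 in stage 7.0 (TID 410) in 1171
ms on localhost (executor driver) (1/113)
 INFO [2020-03-17 15:39:16,680] ({task-result-getter-3}
Logging.scala[logInfo]:54) - Finished task 1.0 in stage 7.0 (TID 411) in 1758
ms on localhost (executor driver) (2/113)
 INFO [2020-03-17 15:39:17,585] ({task-result-getter-0}
Logging.scala[logInfo]:54) - Finished task 2.0 in stage 7.0 (TID 412) in 906 ms
on localhost (executor driver) (3/113)
"""

print("0.7.3 peak overlap:", peak_concurrency(task_intervals(sample_073)))
print("0.8.2 peak overlap:", peak_concurrency(task_intervals(sample_082)))
```

On these three-record samples the sketch reports 3 overlapping tasks for 0.7.3 
and 1 for 0.8.2. That is far too little data to draw a conclusion from, but if 
the full logs show the same pattern, the number of tasks running in parallel 
would be worth comparing between the two setups.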

Thanks + BR,
Sebastian.



--
Best Regards

Jeff Zhang

