seancxmao commented on issue #23379: [SPARK-26440][WEBUI] Show total CPU time 
across all tasks on stage pages
URL: https://github.com/apache/spark/pull/23379#issuecomment-452617564
 
 
   Let me share some context. When we tune Spark applications, we usually focus 
on 3 kinds of performance metrics.
   * Run Time (at stage level, this is total run time across all tasks)
   * CPU * Time (at stage level, this is total cpu time across all tasks)
   * Memory * Time (maybe hard to measure this at stage level, since executor 
memory are shared across concurrently running stages)
   
   We found at stage level both run time and cpu time are useful. We can get 
both run time (in mili-seconds) and cpu time (in nano-seconds) via REST API, 
below is an example I ran locally.
   <img width="584" alt="screen shot 2019-01-09 at 4 26 46 pm" 
src="https://user-images.githubusercontent.com/12194089/50887029-0450d180-142d-11e9-9947-ecf55257519b.png";>
   run time is already displayed on stage page, however cpu time is not.
   <img width="373" alt="screen shot 2019-01-09 at 4 28 33 pm" 
src="https://user-images.githubusercontent.com/12194089/50887076-2b0f0800-142d-11e9-9366-e52066f03a2a.png";>
   Since REST API is usually targeted for monitoring tools rather than Spark 
users, maybe it's helpful to display cpu time side by side with run time. IIUC, 
currently we can't get stage level cpu time from the executors directly.
   
   

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
[email protected]


With regards,
Apache Git Services

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to