[GitHub] [spark] liupc edited a comment on issue #27871: [SPARK-31105][CORE]Respect sql execution id for FIFO scheduling mode

GitBox Fri, 20 Mar 2020 02:24:25 -0700

liupc edited a comment on issue #27871: [SPARK-31105][CORE]Respect sql 
execution id for FIFO scheduling mode
URL: https://github.com/apache/spark/pull/27871#issuecomment-601598502
 
 
   > For small queries, usually they won't hit this problem. For big queries, 
the query latency shouldn't matter too much?
   > 
   > @liupc have you tried this in real-world workloads?
   
   Yes, we tried in real workloads, it does better especially there are lots of 
taskSets to be scheduled for one round scheduling. This is obvious for adaptive 
execution. Also, I think this is what FIFO should do.
   usually queries may mapping to several jobs, if several jobs being delayed 
due to this reason, the total delay is obvious. Suppose each job duration would 
be 2 min, then if there are 10 jobs in front of the job and the cores is fully 
used. then due to this reason, it wait 20min to be scheduled.
   Also, users will see lots of jobs running for later comming queries in 
SparkUI, that's confusing.


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
[email protected]


With regards,
Apache Git Services

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

[GitHub] [spark] liupc edited a comment on issue #27871: [SPARK-31105][CORE]Respect sql execution id for FIFO scheduling mode

Reply via email to