squito commented on issue #23677: [SPARK-26755][SCHEDULER] : Optimize Spark 
Scheduler to dequeue speculative tasks…
URL: https://github.com/apache/spark/pull/23677#issuecomment-511553220
 
 
   Can you also update the PR description, to something which just focuses more 
on the actual improvement and less on the discovery of the issue?   I think you 
should just mention something like
   
   "This improves the performance of scheduling speculative tasks to be O(1) 
instead of O(numSpeculativeTasks), using the same approach used for scheduling 
regular tasks.  The performance of this method is particularly important 
because a lock is held on the TaskSchedulerImpl which is a bottleneck for all 
scheduling operations.
   
   In particular, this works by storing a separate stack of tasks by executor, 
node, and rack locality preferences.  Then when trying to schedule a 
speculative task, rather than scanning *all* speculative tasks to find ones 
which match the given executor (or node, or rack) preference, we can jump to a 
quick check of tasks matching the resource offer.  This technique was already 
used for regular tasks -- this change refactors the code to allow sharing with 
regular and speculative task execution."

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
[email protected]


With regards,
Apache Git Services

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to