I did check it out and although I did get a general understanding of the various classes used to implement Sort and Hash shuffles, however these slides lack details as to how they are implemented and why sort generally has better performance than hash
On Sun, Aug 16, 2015 at 4:31 AM, Ravi Kiran <ravikiranmag...@gmail.com> wrote: > Have a look at this presentation. > http://www.slideshare.net/colorant/spark-shuffle-introduction . Can be of > help to you. > > On Sat, Aug 15, 2015 at 1:42 PM, Muhammad Haseeb Javed < > 11besemja...@seecs.edu.pk> wrote: > >> What are the major differences between how Sort based and Hash based >> shuffle operate and what is it that cause Sort Shuffle to perform better >> than Hash? >> Any talks that discuss both shuffles in detail, how they are implemented >> and the performance gains ? >> > >