See https://issues.apache.org/jira/browse/SPARK-2045 and https://issues.apache.org/jira/browse/SPARK-3280
On Thu, Nov 13, 2014 at 4:19 PM, Debasish Das <[email protected]> wrote: > Hi, > > I am noticing the first step for Spark jobs does a TimSort in 1.2 > branch...and there is some time spent doing the TimSort...Is this assigning > the RDD blocks to different nodes based on a sort order ? > > Could someone please point to a JIRA about this change so that I can read > more about it ? > > Thanks. > Deb >
