Josh Rosen created SPARK-15748:
----------------------------------

             Summary: Replace inefficient foldLeft() call in PartitionStatistics
                 Key: SPARK-15748
                 URL: https://issues.apache.org/jira/browse/SPARK-15748
             Project: Spark
          Issue Type: Improvement
          Components: SQL
            Reporter: Josh Rosen
            Assignee: Josh Rosen


PartitionStatistics uses foldLeft and list concatenation to flatten an iterator
of lists, but this is extremely inefficient compared to simply calling
flatMap/flatten because it performs many unnecessary object allocations.
Replacing this foldLeft with a flatMap yields performance gains when
constructing PartitionStatistics instances for tables with many columns.
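
The difference between the two approaches can be sketched roughly as follows.
This is a minimal illustration, not the actual PartitionStatistics code: the
object name, the helper names, and the use of plain strings in place of Spark's
attribute types are all assumptions made for the example.

```scala
// Hypothetical sketch; FlattenSketch and its helpers are not part of Spark.
object FlattenSketch {
  // Assumption for illustration: each column contributes a small list of
  // statistics entries, modelled here as plain strings.
  def statsFor(column: String): Seq[String] =
    Seq("lowerBound", "upperBound", "nullCount", "count", "sizeInBytes")
      .map(stat => s"$column.$stat")

  // foldLeft plus concatenation: every step creates a new intermediate
  // sequence, and with an immutable List accumulator the elements gathered
  // so far are copied again on each step.
  def flattenWithFold(columns: Seq[String]): Seq[String] =
    columns.map(statsFor).foldLeft(Seq.empty[String])(_ ++ _)

  // flatMap: appends every inner list into a single result in one pass,
  // with no intermediate accumulators.
  def flattenWithFlatMap(columns: Seq[String]): Seq[String] =
    columns.flatMap(statsFor)

  def main(args: Array[String]): Unit = {
    val columns = (1 to 1000).map(i => s"col$i")
    // Both approaches produce the same flattened sequence.
    assert(flattenWithFold(columns) == flattenWithFlatMap(columns))
    println(flattenWithFlatMap(columns).take(3))
  }
}
```

Both functions return the same result; only the number of intermediate
allocations differs, and that gap grows with the number of columns.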
