[ 
https://issues.apache.org/jira/browse/SPARK-7323?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Sean Owen updated SPARK-7323:
-----------------------------
    Description: Currently we invoke combiners.insert() for each tuple in 
Aggregator - which results in creation of an Iterator per tuple, and iterating 
over this iterator : instead, we can directly call insertAll to avoid the 
object creation, method invocation and iteration overhead - for each tuple when 
combiners are used.  (was: 
Currently we invoke combiners.insert() for each tuple in Aggregator - which 
results in creation of an Iterator per tuple, and iterating over this iterator 
: instead, we can directly call insertAll to avoid the object creation, method 
invocation and iteration overhead - for each tuple when combiners are used.)
       Priority: Minor  (was: Major)

> Use insertAll instead of individual insert while merging combiners
> ------------------------------------------------------------------
>
>                 Key: SPARK-7323
>                 URL: https://issues.apache.org/jira/browse/SPARK-7323
>             Project: Spark
>          Issue Type: Improvement
>          Components: Spark Core
>            Reporter: Mridul Muralidharan
>            Assignee: Mridul Muralidharan
>            Priority: Minor
>             Fix For: 1.3.2, 1.4.0
>
>
> Currently we invoke combiners.insert() for each tuple in Aggregator - which 
> results in creation of an Iterator per tuple, and iterating over this 
> iterator : instead, we can directly call insertAll to avoid the object 
> creation, method invocation and iteration overhead - for each tuple when 
> combiners are used.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to