[ 
https://issues.apache.org/jira/browse/SPARK-3920?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Xiangrui Meng updated SPARK-3920:
---------------------------------
    Fix Version/s:     (was: 1.2.0)

> Add option to support aggregation using treeAggregate in decision tree
> ----------------------------------------------------------------------
>
>                 Key: SPARK-3920
>                 URL: https://issues.apache.org/jira/browse/SPARK-3920
>             Project: Spark
>          Issue Type: Improvement
>          Components: MLlib
>            Reporter: Qiping Li
>
> In [SPARK-3366|https://issues.apache.org/jira/browse/SPARK-3366], we used
> distributed aggregation to aggregate node stats, which can save computation
> and communication time when the shuffle size is very large. But experiments
> have shown that if the shuffle size is not large enough (e.g., shallow trees),
> this causes some performance loss (greater than 20% in some cases). We should
> support both options for aggregation so that users can choose the proper one
> based on their needs.
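
The trade-off described above can be illustrated with a small sketch in plain
Python rather than Spark. It compares merging every partition's partial result
directly at the driver with merging partials in groups over several levels
first. The function names and the fan_in parameter are hypothetical and are
not Spark's API; they only model why extra merge levels help when partial
results are large and add overhead when they are small.

```python
from functools import reduce


def flat_aggregate(partitions, zero, seq_op, comb_op):
    """Fold each partition, then merge all partials at the driver in one step."""
    partials = [reduce(seq_op, part, zero) for part in partitions]
    # The driver receives one partial result per partition.
    return reduce(comb_op, partials, zero), len(partials)


def tree_aggregate(partitions, zero, seq_op, comb_op, fan_in=2):
    """Fold each partition, then merge partials in groups of fan_in per level.

    fan_in is a hypothetical knob for this sketch only.
    """
    partials = [reduce(seq_op, part, zero) for part in partitions]
    levels = 0
    while len(partials) > fan_in:
        # Each extra level costs another round of communication,
        # but shrinks the number of partials the driver must merge.
        partials = [
            reduce(comb_op, partials[i:i + fan_in])
            for i in range(0, len(partials), fan_in)
        ]
        levels += 1
    return reduce(comb_op, partials, zero), len(partials), levels


def add(a, b):
    return a + b


# Eight partitions holding the integers 1..16.
partitions = [[2 * i + 1, 2 * i + 2] for i in range(8)]
flat_total, flat_at_driver = flat_aggregate(partitions, 0, add, add)
tree_total, tree_at_driver, tree_levels = tree_aggregate(partitions, 0, add, add)
```

Both strategies produce the same total; the tree variant hands the driver
fewer partial results at the price of extra merge levels, which is the
overhead that can dominate when the aggregated data is small.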



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
