[jira] [Commented] (SPARK-17090) Make tree aggregation level in linear/logistic regression configurable

DB Tsai (JIRA) Wed, 17 Aug 2016 22:47:58 -0700

    [ 
https://issues.apache.org/jira/browse/SPARK-17090?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15425936#comment-15425936
 ]


DB Tsai commented on SPARK-17090:
---------------------------------

We could make it default to 0 which automatically figure out the best 
aggregation depth based on the dimensions of features, and the number of 
partition. Before that, we can make default 0 as 2 which is the current 
behavior. 

> Make tree aggregation level in linear/logistic regression configurable
> ----------------------------------------------------------------------
>
>                 Key: SPARK-17090
>                 URL: https://issues.apache.org/jira/browse/SPARK-17090
>             Project: Spark
>          Issue Type: Improvement
>          Components: ML
>            Reporter: Seth Hendrickson
>            Priority: Minor
>
> Linear/logistic regression use treeAggregate with default aggregation depth 
> for collecting coefficient gradient updates to the driver. For high 
> dimensional problems, this can case OOM error on the driver. We should make 
> it configurable, perhaps via an expert param, so that users can avoid this 
> problem if their data has many features.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

[jira] [Commented] (SPARK-17090) Make tree aggregation level in linear/logistic regression configurable

Reply via email to