[ 
https://issues.apache.org/jira/browse/SPARK-2361?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Xiangrui Meng updated SPARK-2361:
---------------------------------

    Assignee: Xiangrui Meng

> Decide whether to broadcast or serialize the weights directly in MLlib 
> algorithms
> ---------------------------------------------------------------------------------
>
>                 Key: SPARK-2361
>                 URL: https://issues.apache.org/jira/browse/SPARK-2361
>             Project: Spark
>          Issue Type: Improvement
>          Components: MLlib
>            Reporter: Xiangrui Meng
>            Assignee: Xiangrui Meng
>
> In the current implementation, MLlib serializes weights directly into 
> closure. This is okay for small feature dimension, but not efficient for 
> feature dimensions beyond 1M. Especially the default akka.frameSize is 10m. 
> We should use broadcast when the size of the serialized task is going to be 
> large.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

Reply via email to