[
https://issues.apache.org/jira/browse/SPARK-10014?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14741291#comment-14741291
]
Sameer Abhyankar commented on SPARK-10014:
------------------------------------------
Sounds good [~josephkb] . If we rebroadcast on each predict/transform then
there wouldn't be any change right? The existing code for predict will
rebroadcast the model every time (val bcModel =
testData.context.broadcast(this)).
> ML model broadcasts should be stored in private vars
> ----------------------------------------------------
>
> Key: SPARK-10014
> URL: https://issues.apache.org/jira/browse/SPARK-10014
> Project: Spark
> Issue Type: Umbrella
> Components: ML, MLlib
> Reporter: Joseph K. Bradley
> Priority: Minor
>
> Multiple places in MLlib, we broadcast a model before prediction. Since
> prediction may be called many times, we should store the broadcast variable
> in a private var so that we broadcast at most once.
> I'll link subtasks for each problem case I find.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]