It gets serialized once per physical container, Instead of being serialized
once per task of every stage that uses it.
On Sat, Feb 20, 2016 at 8:15 AM jeff saremi <jeffsar...@hotmail.com> wrote:

> Is the broadcasted variable distributed to every executor or every worker?
> Now i'm more confused
> I thought it was supposed to save memory by distributing it to every
> worker and the executors would share that copy
>
>
> ------------------------------
> Date: Fri, 19 Feb 2016 16:48:59 -0800
> Subject: Re: Access to broadcasted variable
> From: shixi...@databricks.com
> To: jeffsar...@hotmail.com
> CC: user@spark.apache.org
>
>
> The broadcasted object is serialized in driver and sent to the executors.
> And in the executor, it will deserialize the bytes to get the broadcasted
> object.
>
> On Fri, Feb 19, 2016 at 5:54 AM, jeff saremi <jeffsar...@hotmail.com>
> wrote:
>
> could someone please comment on this? thanks
>
> ------------------------------
> From: jeffsar...@hotmail.com
> To: user@spark.apache.org
> Subject: Access to broadcasted variable
> Date: Thu, 18 Feb 2016 14:44:07 -0500
>
>
>
> I'd like to know if the broadcasted object gets serialized when accessed
> by the executor during the execution of a task?
> I know that it gets serialized from the driver to the worker. This
> question is inside worker when executor JVM's are accessing it
>
> thanks
> Jeff
>
>
>

Reply via email to