The parser does not need to be serializable. In the line:

lines.map(line => JSONParser.parse(line))

... the parser is called, but there is no parser object with state
that would need to be serialized. Are you sure it does not work?
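To illustrate the point, here is a minimal sketch. It assumes the parser
is reached through a Scala "object"; ThirdPartyParser and ParserHolder are
hypothetical names, not anything from your code. A Scala object is looked
up statically in whichever JVM runs the task, so it is not captured in the
closure and is never serialized:

```scala
import org.apache.spark.rdd.RDD

// Hypothetical stand-in for a third-party parser class that is NOT Serializable.
class ThirdPartyParser {
  def parse(line: String): Map[String, String] =
    Map("raw" -> line) // placeholder logic, for illustration only
}

// A Scala object is resolved statically inside each JVM. It is not part of
// the closure that Spark serializes and ships with the task.
object ParserHolder {
  // Built lazily on first use, once per executor JVM.
  lazy val parser: ThirdPartyParser = new ThirdPartyParser
}

object SingletonExample {
  def parseAll(lines: RDD[String]): RDD[Map[String, String]] =
    lines.map(line => ParserHolder.parser.parse(line))
}
```

Since tasks running in the same executor JVM would share that one
instance, this only makes sense if the parser is safe to call from
several threads at once.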

The error message alluded to originally refers to an object not shown
in the code, so I'm not 100% sure this was the original issue.

If you want an equivalent of "setup()", it is really just "writing some
code at the start of the function you pass to mapPartitions()".
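A rough sketch of that idea follows. ThirdPartyParser is a hypothetical
stand-in for your non-serializable parser, and the Map return type is
only an assumption for the example:

```scala
import org.apache.spark.rdd.RDD

// Hypothetical stand-in for a third-party parser class that is NOT Serializable.
class ThirdPartyParser {
  def parse(line: String): Map[String, String] =
    Map("raw" -> line) // placeholder logic, for illustration only
}

object SetupExample {
  def parseAll(lines: RDD[String]): RDD[Map[String, String]] =
    lines.mapPartitions { iter =>
      // "setup()": runs on the executor, once per partition, before any
      // record is processed. The parser is created here, so it is never
      // serialized on the driver.
      val parser = new ThirdPartyParser
      // "map()": reuse the same parser for every line in this partition.
      iter.map(line => parser.parse(line))
    }
}
```

This creates one parser per partition (that is, per task) rather than one
per record, which is the closest match to what setup() gives you in
MapReduce.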

On Mon, Aug 4, 2014 at 8:40 AM, Fengyun RAO <raofeng...@gmail.com> wrote:
> Thanks, Ron.
>
> The problem is that the "parser" is written in another package which is not
> serializable.
>
> In mapreduce, I could create the "parser" in the map setup() method.
>
> Now in spark, I want to create it once for each worker, and share it among
> all the tasks on the same worker node.
>
> I know different workers run on different machines, but the workers don't
> have to communicate with each other.
