Re: Foreachpartition in spark streaming

2017-03-20 Thread Ryan
foreachPartition is an action but run on each worker, which means you won't
see anything on driver.

mapPartitions is a transformation which is lazy and won't do anything until
an action.

it depends on the specific use case which is better. To output sth(like a
print in single machine) you could refer to  take, collect, foreach, etc..

On Mon, Mar 20, 2017 at 2:20 PM, Diwakar Dhanuskodi <
diwakar.dhanusk...@gmail.com> wrote:

> Just wanted to clarify!!!
>
> Is foreachPartition in spark  an output operation?
>
> Which one is better use mapPartitions or foreachPartitions?
>
> Regards
> Diwakar
>


Foreachpartition in spark streaming

2017-03-20 Thread Diwakar Dhanuskodi
Just wanted to clarify!!!

Is foreachPartition in spark  an output operation?

Which one is better use mapPartitions or foreachPartitions?

Regards
Diwakar