So each action (in driver node) creates a job that can still be executed by 1:N worker node(s) ?
On Sat, Jan 18, 2014 at 10:56 PM, Tathagata Das <[email protected] > wrote: > Yes, RDD actions can be called only in the driver program, therefore only > in the driver node. However, they can be parallelized within the driver > program by calling multiple actions from multiple threads. The jobs > corresponding to each action will be executed simultaneously in the Spark > cluster, sharing the available resources. > > TD > > > > > On Sat, Jan 18, 2014 at 10:34 PM, Manoj Samel <[email protected]>wrote: > >> Are RDD actions like count etc. run only on driver node or can they be >> parallelized ? >> >> Thanks, >> > >
