Re: How does shuffle work in spark ?

suman bharadwaj Thu, 16 Jan 2014 06:25:55 -0800

Hi,

Is this behavior the same when the data is in memory?
If the data is written to disk, then how is it different from Hadoop
MapReduce?
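
To make the behavior in question concrete, here is a toy sketch in plain Python of the pattern described in the quoted reply below: each map task writes its partitioned output to disk, and only then do the reduce tasks read it back and group by key. This is an illustration only, not Spark's actual implementation; the function names (map_task, reduce_task, group_by_key), the file naming scheme, and the use of pickle are all assumptions made for the example.

```python
import os
import pickle
import tempfile
from collections import defaultdict


def _bucket_path(shuffle_dir, map_id, reducer_id):
    # Hypothetical naming scheme: one file per (map task, reducer) pair.
    return os.path.join(shuffle_dir, f"map{map_id}_reduce{reducer_id}.bin")


def map_task(map_id, records, num_reducers, shuffle_dir):
    """Partition (key, value) records by reducer and write each bucket to disk."""
    buckets = defaultdict(list)
    for key, value in records:
        buckets[hash(key) % num_reducers].append((key, value))
    for reducer_id in range(num_reducers):
        with open(_bucket_path(shuffle_dir, map_id, reducer_id), "wb") as f:
            pickle.dump(buckets.get(reducer_id, []), f)


def reduce_task(reducer_id, num_maps, shuffle_dir):
    """Read this reducer's bucket from every map output file and group by key."""
    grouped = defaultdict(list)
    for map_id in range(num_maps):
        with open(_bucket_path(shuffle_dir, map_id, reducer_id), "rb") as f:
            for key, value in pickle.load(f):
                grouped[key].append(value)
    return dict(grouped)


def group_by_key(partitions, num_reducers=2):
    """Run all map tasks (writing to disk), then all reduce tasks (reading back)."""
    with tempfile.TemporaryDirectory() as shuffle_dir:
        for map_id, records in enumerate(partitions):
            map_task(map_id, records, num_reducers, shuffle_dir)
        result = {}
        for reducer_id in range(num_reducers):
            result.update(reduce_task(reducer_id, len(partitions), shuffle_dir))
        return result


if __name__ == "__main__":
    partitions = [[("a", 1), ("b", 2)], [("a", 3), ("c", 4)]]
    print(group_by_key(partitions))
```

In this sketch the input partitions start out in memory, yet the map output still goes through files before any reducer sees it, which is the distinction the question above is asking about.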


Regards,
SB


On Thu, Jan 16, 2014 at 5:11 PM, Archit Thakur <[email protected]> wrote:

> For any shuffle operation (groupByKey, etc.), Spark writes the map output to
> disk before performing the reduce task on the data.
>
>
> On Thu, Jan 16, 2014 at 4:03 PM, suman bharadwaj <[email protected]> wrote:
>
>> Hi,
>>
>> I'm new to Spark and wanted to understand more about how shuffle works in
>> Spark.
>>
>> In Hadoop MapReduce, while performing a reduce operation, the
>> intermediate data from the map phase gets written to disk. How does the
>> same happen in Spark?
>>
>> Does Spark write the intermediate data to disk?
>>
>> Thanks in advance.
>>
>> Regards,
>> SB
>>
>
>
