our experience is that unless you can benefit from spark features such as co-partitioning that allow for more efficient execution that spark is slightly slower for disk to disk. On Apr 27, 2015 10:34 PM, "bit1...@163.com" <bit1...@163.com> wrote:
> Hi, > > I am frequently asked why spark is also much faster than Hadoop MapReduce > on disk (without the use of memory cache). I have no convencing answer for > this question, could you guys elaborate on this? Thanks! > > ------------------------------ > >