Hi, Spark runs programs up to 100x faster than Hadoop MapReduce in memory, or 10x faster on disk. (Please see the graph below.)
Does anyone know what test dataset and cluster configuration were used to get the results above? Also, where can I get the test dataset? Thanks in advance. Best regards.