Huge joins would be interesting. I do all my Shark demos on the Wikipedia dataset. Joins are typically a pain to showcase & show off :)

Mayur Rustagi
Ph: +1 (760) 203 3257
http://www.sigmoidanalytics.com
@mayur_rustagi <https://twitter.com/mayur_rustagi>

On Wed, Apr 23, 2014 at 10:33 AM, Ajay Nair <prodig...@gmail.com> wrote:

> I am going to perform some test experiments on the wikipedia dataset using
> the spark framework. I know wikipedia data set might already have been
> analyzed, but what are the potential explored/unexplored aspects of spark
> that can be tested and benchmarked on wikipedia dataset?
>
> Thanks
> AJ