Hi everyone, I notice the benchmark page for AMPLab provides some numbers on Gbs of data: https://amplab.cs.berkeley.edu/benchmark/ I was wondering if similar benchmark numbers existed for even larger data sets, in the terabytes if possible.
Also, are there any for just raw spark, i.e. No shark? Thanks, -Matt Chetah
