Is there a recommended performance test for sort based shuffle? Something similar to terasort on Hadoop. I couldn't find one on the spark-perf code base.
https://github.com/databricks/spark-perf -- Kannan
Is there a recommended performance test for sort based shuffle? Something similar to terasort on Hadoop. I couldn't find one on the spark-perf code base.
https://github.com/databricks/spark-perf -- Kannan