os
slave needs to access the hdfs based input data.
So this leads into another question on how hdfs is setup in a standard way.
Is there any docs to summarize how to setup a standard runtime env for
terasort on Spark?
thanks.
---
er to simulate Terasort on spark would be of great
help..
Kindly help with the same..
Regards
Harsha
Hi all , i tried to run a terasort benchmark on my spark cluster, but i
found it is hard to find a standard spark terasort program except a PR from
rxin and ewan higgs:
https://github.com/apache/spark/pull/1242
https://github.com/ehiggs/spark/tree/terasort
The example which rxin provided without