Hi,
We are going to deploy a cluster in standalone mode. We know Spark can read local data files into RDDs, but where should we put the data file: on the server from which we submit our application, or on the server where the master service runs?

Regards,
Xiaobo Gu
