What's the recommended way to save a RDD as a CSV on say HDFS? Do I have to collect the RDD and save it from the master, or is there someway I can write out the CSV file in parallel to HDFS?
tks shay
What's the recommended way to save a RDD as a CSV on say HDFS? Do I have to collect the RDD and save it from the master, or is there someway I can write out the CSV file in parallel to HDFS?
tks shay