Hingorani, Vineet created SPARK-3202:
----------------------------------------
Summary: Manipulating columns in CSV file or Transpose of
Array[Array[String]] RDD
Key: SPARK-3202
URL: https://issues.apache.org/jira/browse/SPARK-3202
Project: Spark
Issue Type: Documentation
Components: Documentation
Reporter: Hingorani, Vineet
Hello all,
Could someone help me with the manipulation of csv file data. I have
'semicolon' separated csv data including doubles and strings. I want to
calculate the maximum/average of a column. When I read the file using
sc.textFile(test.csv).map(_.split(";"), each field is read as string. Could
someone help me with the above manipulation and how to do that.
Or may be if there is some way to take the transpose of the data and then
manipulating the rows in some way?
Thank you in advance, I am struggling with this thing for quite sometime
Regards,
Vineet
--
This message was sent by Atlassian JIRA
(v6.2#6252)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]