[jira] [Commented] (SPARK-3202) Manipulating columns in CSV file or Transpose of Array[Array[String]] RDD
[ https://issues.apache.org/jira/browse/SPARK-3202?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14109128#comment-14109128 ]

Sean Owen commented on SPARK-3202:
----------------------------------

JIRA is not a good place to ask questions -- please use u...@spark.apache.org. This is for reporting issues, so I'd recommend closing this.

> Manipulating columns in CSV file or Transpose of Array[Array[String]] RDD
> -------------------------------------------------------------------------
>
>                 Key: SPARK-3202
>                 URL: https://issues.apache.org/jira/browse/SPARK-3202
>             Project: Spark
>          Issue Type: Documentation
>          Components: Documentation
>            Reporter: Hingorani, Vineet
>
> Hello all,
> Could someone help me with manipulating CSV file data? I have
> semicolon-separated CSV data containing doubles and strings. I want to
> calculate the maximum/average of a column. When I read the file using
> sc.textFile("test.csv").map(_.split(";")), each field is read as a string.
> Could someone explain how to do the above manipulation?
> Or maybe there is some way to take the transpose of the data and then
> manipulate the rows?
> Thank you in advance, I have been struggling with this for quite some time.
> Regards,
> Vineet

--
This message was sent by Atlassian JIRA
(v6.2#6252)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org
[jira] [Commented] (SPARK-3202) Manipulating columns in CSV file or Transpose of Array[Array[String]] RDD
[ https://issues.apache.org/jira/browse/SPARK-3202?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14109129#comment-14109129 ]

Hingorani, Vineet commented on SPARK-3202:
------------------------------------------

Thank you, Sean, for the help regarding the platform. :)
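
For anyone landing on this issue with the same question, here is a minimal sketch of one way to approach what the reporter describes: split each line on ';', convert one column to Double, and compute its maximum and average. It uses plain Scala collections so that it runs without a cluster. The object name `CsvColumnStats`, the helper names, and the sample rows are all hypothetical and are not taken from the issue.

```scala
import scala.util.Try

// Hypothetical helper object; names and sample data are illustrative only.
object CsvColumnStats {

  // Split a semicolon-separated line into trimmed fields.
  def parseLine(line: String): Array[String] =
    line.split(";").map(_.trim)

  // Extract column `idx` as Doubles, skipping rows where the field
  // is missing or cannot be parsed as a number.
  def numericColumn(rows: Seq[Array[String]], idx: Int): Seq[Double] =
    rows.flatMap(r => r.lift(idx).flatMap(s => Try(s.toDouble).toOption))

  // Arithmetic mean; None for an empty column.
  def mean(xs: Seq[Double]): Option[Double] =
    if (xs.isEmpty) None else Some(xs.sum / xs.size)

  // Turn rows into columns. Assumes every row has the same number of fields;
  // the standard library's transpose throws otherwise.
  def transpose(rows: Seq[Array[String]]): Seq[Seq[String]] =
    rows.map(_.toSeq).transpose

  def main(args: Array[String]): Unit = {
    // Sample lines standing in for the contents of test.csv (assumed data).
    val lines = Seq("a;1.5;x", "b;2.5;y", "c;n/a;z", "d;5.0;w")
    val rows  = lines.map(parseLine)

    val col = numericColumn(rows, 1)
    println(s"max  = ${col.max}")
    println(s"mean = ${mean(col)}")
    println(s"first column = ${transpose(rows).head}")
  }
}
```

With Spark, the same `parseLine` and per-field conversion could be applied to the RDD returned by `sc.textFile` via `map` and `flatMap`, and an `RDD[Double]` provides `max()` and `mean()` directly, so a transpose is probably not needed for per-column statistics. Transposing a large distributed dataset is a separate problem that this sketch does not cover.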