[jira] [Commented] (SPARK-2360) CSV import to SchemaRDDs

Hingorani, Vineet (JIRA) Fri, 22 Aug 2014 07:06:20 -0700

    [ 
https://issues.apache.org/jira/browse/SPARK-2360?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14106851#comment-14106851
 ]


Hingorani, Vineet commented on SPARK-2360:
------------------------------------------

Hello Michael,

I saw your comment thread on a mail archive regarding having to be able to 
manipulate csv files using spark. Could you please give some information as to 
do have this functionality now in the latest release of Spark? I have installed 
the lates version as of now and running it on my local machine.

Thank you

Regards,

Vineet Hingorani
Developer Associate
Custom Development & Strategic Projects group (CD&SP)
Products & Innovation (P&I)
SAP SE
WDF 03, C3.03
E [email protected]<mailto:[email protected]>



> CSV import to SchemaRDDs
> ------------------------
>
>                 Key: SPARK-2360
>                 URL: https://issues.apache.org/jira/browse/SPARK-2360
>             Project: Spark
>          Issue Type: New Feature
>          Components: SQL
>            Reporter: Michael Armbrust
>            Assignee: Hossein Falaki
>
> I think the first step it to design the interface that we want to present to 
> users.  Mostly this is defining options when importing.  Off the top of my 
> head:
> - What is the separator?
> - Provide column names or infer them from the first row.
> - how to handle multiple files with possibly different schemas
> - do we have a method to let users specify the datatypes of the columns or 
> are they just strings?
> - what types of quoting / escaping do we want to support?



--
This message was sent by Atlassian JIRA
(v6.2#6252)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

[jira] [Commented] (SPARK-2360) CSV import to SchemaRDDs

Reply via email to