[jira] [Comment Edited] (SPARK-17969) I think it's user unfriendly to process standard json file with DataFrame

Jianfei Wang (JIRA) Sun, 16 Oct 2016 23:28:26 -0700

    [ 
https://issues.apache.org/jira/browse/SPARK-17969?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15581357#comment-15581357
 ]


Jianfei Wang edited comment on SPARK-17969 at 10/17/16 6:27 AM:
----------------------------------------------------------------

I can do this mini job base on some top api, such as 
sparkContext.wholeTextFiles without modify some low-level method, is this ok?. 
thank you!


was (Author: codlife):
I can do this mini job. thank you!

> I think it's user unfriendly to process standard json file with DataFrame 
> --------------------------------------------------------------------------
>
>                 Key: SPARK-17969
>                 URL: https://issues.apache.org/jira/browse/SPARK-17969
>             Project: Spark
>          Issue Type: New Feature
>          Components: SQL
>    Affects Versions: 2.0.1
>            Reporter: Jianfei Wang
>            Priority: Minor
>
> Currently, with DataFrame API,  we can't load standard json file directly, 
> maybe we can provide an override method to process this, the logic is as 
> below:
> ```
> val df = spark.sparkContext.wholeTextFiles("data/test.json") 
>  val json_rdd = df.map( x => x.toString.replaceAll("\\s+","")).map{ x => 
>       val index = x.indexOf(',') 
>       x.substring(index + 1, x.length - 1) 
>     } 
>     val json_df = spark.read.json(json_rdd) 
> ```



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

[jira] [Comment Edited] (SPARK-17969) I think it's user unfriendly to process standard json file with DataFrame

Reply via email to