[GitHub] spark issue #20125: [SPARK-17967][SQL] Support for array as an option in SQL...

HyukjinKwon Sun, 31 Dec 2017 05:55:19 -0800

Github user HyukjinKwon commented on the issue:

    https://github.com/apache/spark/pull/20125
  
    Yup, I was thinking of a Spark SQL-only feature.
    
    For more details, the original intention was to support multiple values for 
`nullValue`, but I realised that such option support can be generalised. There have 
been several issues about this, since CSV is a third-party library (I will find and 
give some links if requested). Also, there is a reference for this in R:
    
    ```R
    > d <- "col1,col2
    + 1,3
    + 2,4"
    > df <- read.csv(text=d, na.strings=c("3", "2"))
    > df
    ```
    ```
      col1 col2
    1    1   NA
    2   NA    4
    ```
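
    For illustration only, the same behaviour can be sketched in plain Python with 
the standard-library `csv` module. This is not Spark's API: the helper 
`read_csv_with_nulls` and its `null_values` parameter are hypothetical names, used 
here just to show a list of strings that are all read as null.

```python
import csv
import io


def read_csv_with_nulls(text, null_values):
    """Parse CSV text, mapping every cell listed in null_values to None.

    Hypothetical helper for illustration; it mirrors R's `na.strings`,
    not any existing Spark option.
    """
    nulls = set(null_values)
    reader = csv.DictReader(io.StringIO(text))
    return [
        {col: (None if cell in nulls else cell) for col, cell in row.items()}
        for row in reader
    ]


# Same data as the R example: "3" and "2" are both treated as null.
data = "col1,col2\n1,3\n2,4\n"
rows = read_csv_with_nulls(data, ["3", "2"])
for row in rows:
    print(row)
```

    As in the R example, the `3` in `col2` and the `2` in `col1` both come back as 
nulls, while the remaining cells are kept as strings.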
    
    For more context, the original proposal (Scala/SQL/Python/Java), 
https://github.com/apache/spark/pull/16611, touched many files, and I was 
advised to make it smaller, which I liked.




