[GitHub] spark issue #14118: [SPARK-16462][SPARK-16460][SPARK-15144][SQL] Make CSV ca...

HyukjinKwon Fri, 19 Aug 2016 00:54:07 -0700

Github user HyukjinKwon commented on the issue:

    https://github.com/apache/spark/pull/14118
  
    @rxin Please let me share why I thought this looks good to me, in case 
it is helpful.
    
    Yes, but we need `nullValue` to be set in order to write `null` at all. So 
I think setting `nullValue` to `""` means treating `""` as null.
    
    For example, if we have the dataframe as below:
    
    ```
    +------+
    |     a|
    +------+
    |   abc|
    |  null|
    +------+
    ``` 
    
    with `nullValue` set to `"abc"`, this will write
    
    ```
    abc
    abc
    ```
    
    Here, we end up with no difference between `null` and `abc`. But since 
users set `nullValue` to `abc` for the output, they would understand this 
behaviour.
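
To make the round trip concrete, here is a minimal sketch in plain Python. It is only a stand-in for the behaviour described above, not Spark's CSV writer: `write_column` and `read_column` are hypothetical helpers, and the assumption is that the reader maps every token equal to `nullValue` back to null.

```python
import csv
import io


def write_column(values, null_value):
    """Write a single-column CSV, emitting null_value in place of None."""
    buf = io.StringIO()
    writer = csv.writer(buf, lineterminator="\n")
    for v in values:
        writer.writerow([null_value if v is None else v])
    return buf.getvalue()


def read_column(text, null_value):
    """Read the column back, mapping every null_value token to None.

    Assumption: reading applies the same null_value that writing used.
    """
    reader = csv.reader(io.StringIO(text))
    return [None if row[0] == null_value else row[0] for row in reader]


column = ["abc", None]

# null_value collides with a real value: both rows are written as "abc",
# so the original "abc" string cannot be recovered on read.
written = write_column(column, null_value="abc")
print(written, end="")
print(read_column(written, null_value="abc"))

# A null_value that does not occur in the data round-trips cleanly.
safe = write_column(column, null_value="NULL")
print(read_column(safe, null_value="NULL"))
```

The collision only happens because the chosen `nullValue` also occurs as real data, which is the user's explicit choice in this scenario.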
    
    I mean, there is no representation for an actual `null` in the output, so 
we are explicitly giving one. That is why I thought it is okay even if we 
cannot differentiate `nullValue` from an actual `null`.



