[GitHub] [spark] wayneguow commented on pull request #34853: [SPARK-37575][SQL] null values should be saved as nothing rather than quoted empty Strings "" by default settings

GitBox Mon, 13 Dec 2021 07:53:38 -0800


wayneguow commented on pull request #34853:
URL: https://github.com/apache/spark/pull/34853#issuecomment-992615673



   @MaxGekk Thank you for your careful reviews and providing a significant test 
case.
   
   But in my opinion, the root cause is  that the parameter `skipEmptyLines` is 
set to true by default.
   
   Before the 2.4 version, when a row has only one column with null values, 
this row is also not to be saved in the final csv files.
   
   Since 2.4 version, we save null values and empty strings both as quoted 
empty strings "" _by mistake_, so the row which has only one null values column 
can be saved in files. 
    
   The change what I made this time is just to distinguish null values and 
empty strings in saved in csv files really and it's stiil to keep the behavior 
with before.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]



---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

[GitHub] [spark] wayneguow commented on pull request #34853: [SPARK-37575][SQL] null values should be saved as nothing rather than quoted empty Strings "" by default settings

Reply via email to