Github user maropu commented on a diff in the pull request:
https://github.com/apache/spark/pull/17136#discussion_r104356333
--- Diff: R/pkg/inst/tests/testthat/test_sparkSQL.R ---
@@ -246,8 +246,8 @@ test_that("read/write csv as DataFrame", {
mockLinesCsv <- c("year,make,model,comment,blank",
"\"2012\",\"Tesla\",\"S\",\"No comment\",",
"1997,Ford,E350,\"Go get one now they are going
fast\",",
- "2015,Chevy,Volt",
- "NA,Dummy,Placeholder")
+ "2015,Chevy,Volt,,",
--- End diff --
I probably think that dropping the extra tokens in the **longer** case is
an incorrect behaviour by referring the json behaviour. But, I know this change
could affect current users, so we might need to do something for that, e.g.,
adding a new option to keep the current behaviour. WDYT? cc: @HyukjinKwon
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at [email protected] or file a JIRA ticket
with INFRA.
---
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]