David Wilcox created SPARK-25086:

             Summary: Incorrect Default Value For "escape" For CSV Files
                 Key: SPARK-25086
                 URL: https://issues.apache.org/jira/browse/SPARK-25086
             Project: Spark
          Issue Type: Improvement
          Components: Spark Core
    Affects Versions: 2.3.1
            Reporter: David Wilcox

The RFC for CSV files ([https://tools.ietf.org/html/rfc4180]) indicates that 
the way that a double-quote is escaped is by preceding it with another 
7. If double-quotes are used to enclose fields, then a double-quote appearing 
inside a field must be escaped by preceding it with another double quote. For 
example: "aaa","b""bb","ccc"{code}
Your default value for "escape" violates the RFC. I think that we should fix 
the default value to be {{"}}, and those that want {{\}} to escape can override 
for non-RFC-conforming CSV files.

This message was sent by Atlassian JIRA

To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org

Reply via email to