Ruslan Dautkhanov created ZEPPELIN-1803:
-------------------------------------------

             Summary: CSV export doesn't conform to RFC-4180: exported csv is 
broken in some cases
                 Key: ZEPPELIN-1803
                 URL: https://issues.apache.org/jira/browse/ZEPPELIN-1803
             Project: Zeppelin
          Issue Type: Bug
          Components: Core, front-end, zeppelin-server
    Affects Versions: 0.7.0
            Reporter: Ruslan Dautkhanov


CSV export doesn't conform to RFC-4180: exported csv is broken in some cases

RFC-4180:
{quote}
If double-quotes are used to enclose fields, then a double-quote appearing 
inside a field must be escaped by preceding it with another double quote.
{quote}

It makes CSV with double quotes (") exported from Zeppelin not importable by 
any tools, including Excel.

It looks like CSV export has other issues too, like in some cases exported 
column value was a negative number instead of a character field. It could be a 
new bug or related again to the fact that Zeppelin CSV exported doesn't conform 
to RFC-4180 standard.

Some related quotes from RFC-4180:
{noformat}
   5.  Each field may or may not be enclosed in double quotes (however
       some programs, such as Microsoft Excel, do not use double quotes
       at all).  If fields are not enclosed with double quotes, then
       double quotes may not appear inside the fields.  For example:

       "aaa","bbb","ccc" CRLF
       zzz,yyy,xxx

   6.  Fields containing line breaks (CRLF), double quotes, and commas
       should be enclosed in double-quotes.  For example:

       "aaa","b CRLF
       bb","ccc" CRLF
       zzz,yyy,xxx

   7.  If double-quotes are used to enclose fields, then a double-quote
       appearing inside a field must be escaped by preceding it with
       another double quote.  For example:

       "aaa","b""bb","ccc"
{noformat}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to