[ 
https://issues.apache.org/jira/browse/SPARK-13638?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Hyukjin Kwon updated SPARK-13638:
---------------------------------
    Description: 
https://github.com/databricks/spark-csv/pull/254

tobithiel reported this.

{quote}
I'm dealing with some messy csv files and being able to just quote all fields 
is very useful,
so that other applications don't misunderstand the file because of some sketchy 
characters....
{quote}

When writing there are several quote modes in apache commons csv. (See 
https://commons.apache.org/proper/commons-csv/apidocs/org/apache/commons/csv/QuoteMode.html)

This might have to be supported.

However, it looks univocity parser used for writing does not support this quote 
mode. I think we can drop this backwards compatibility if we are not going to 
add apache commons csv.

This is a reminder that it will break backwards compatibility for the options, 
{{quoteMode}} and {{parserLib}}.

  was:
https://github.com/databricks/spark-csv/pull/254

tobithiel reported this.

>I'm dealing with some messy csv files and being able to just quote all fields 
>is very useful, so that other applications don't misunderstand the file 
>because of some sketchy characters....

When writing there are several quote modes in apache commons csv. (See 
https://commons.apache.org/proper/commons-csv/apidocs/org/apache/commons/csv/QuoteMode.html)

This might have to be supported.

However, it looks univocity parser used for writing does not support this quote 
mode. I think we can drop this backwards compatibility if we are not going to 
add apache commons csv.

This is a reminder that it will break backwards compatibility for the options, 
{{quoteMode}} and {{parserLib}}.


> Support for saving with a quote mode
> ------------------------------------
>
>                 Key: SPARK-13638
>                 URL: https://issues.apache.org/jira/browse/SPARK-13638
>             Project: Spark
>          Issue Type: Sub-task
>          Components: SQL
>    Affects Versions: 2.0.0
>            Reporter: Hyukjin Kwon
>            Priority: Minor
>
> https://github.com/databricks/spark-csv/pull/254
> tobithiel reported this.
> {quote}
> I'm dealing with some messy csv files and being able to just quote all fields 
> is very useful,
> so that other applications don't misunderstand the file because of some 
> sketchy characters....
> {quote}
> When writing there are several quote modes in apache commons csv. (See 
> https://commons.apache.org/proper/commons-csv/apidocs/org/apache/commons/csv/QuoteMode.html)
> This might have to be supported.
> However, it looks univocity parser used for writing does not support this 
> quote mode. I think we can drop this backwards compatibility if we are not 
> going to add apache commons csv.
> This is a reminder that it will break backwards compatibility for the 
> options, {{quoteMode}} and {{parserLib}}.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org

Reply via email to