M. Le Bihan created SPARK-26968:
-----------------------------------

             Summary: option("quoteMode", "NON_NUMERIC") have no effect on a 
CSV generation
                 Key: SPARK-26968
                 URL: https://issues.apache.org/jira/browse/SPARK-26968
             Project: Spark
          Issue Type: Bug
          Components: Spark Core
    Affects Versions: 2.4.0
            Reporter: M. Le Bihan


I have a CSV to write that has that schema :
{code:java}
StructType s = schema.add("codeCommuneCR", StringType, false);
s = s.add("nomCommuneCR", StringType, false);
s = s.add("populationCR", IntegerType, false);
s = s.add("resultatComptable", IntegerType, false);{code}
If I don't provide an option "_quoteMode_" or even if I set it to 
{{NON_NUMERIC}}, this way :
{code:java}
ds.coalesce(1).write().mode(SaveMode.Overwrite) .option("header", "true") 
.option("quoteMode", "NON_NUMERIC").option("quote", "\"") 
.csv("./target/out_200071470.csv");{code}
the CSV written by {{Spark}} is this one :
{code:java}
codeCommuneCR,nomCommuneCR,populationCR,resultatComptable
03142,LENAX,267,43{code}
If I set an option "_quoteAll_" instead, like that :
{code:java}
ds.coalesce(1).write().mode(SaveMode.Overwrite) .option("header", "true") 
.option("quoteAll", true).option("quote", "\"") 
.csv("./target/out_200071470.csv");{code}
it generates :
{code:java}
"codeCommuneCR","nomCommuneCR","populationCR","resultatComptable" 
"03142","LENAX","267","43"{code}
It seems that the {{.option("quoteMode", "NON_NUMERIC")}} is broken. It should 
generate:

 
{code:java}
"codeCommuneCR","nomCommuneCR","populationCR","resultatComptable"
"03142","LENAX",267,43
{code}
 



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org

Reply via email to