Kazuaki Ishizaki created SPARK-22935:
----------------------------------------

             Summary: Dataset with Java Beans for java.sql.Date produces 
incorrect result
                 Key: SPARK-22935
                 URL: https://issues.apache.org/jira/browse/SPARK-22935
             Project: Spark
          Issue Type: Bug
          Components: SQL
    Affects Versions: 2.2.1, 2.3.0
            Reporter: Kazuaki Ishizaki


The following code prints {{c=0}}. The value of {{c}} must be 2. {ds} must not 
be {{empty}}.

{code}
  public void SPARK22935() {
    Dataset<CDR> cdr = spark
            .read()
            .format("csv")
            .option("header", "true")
            .option("inferSchema", "true")
            .option("delimiter", ";")
            .csv("CDR_SAMPLE.csv")
            .as(Encoders.bean(CDR.class));
    Dataset<CDR> ds = cdr.filter((FilterFunction<CDR>) x -> (x.timestamp != 
null));
    long c = ds.count();
    cdr.show(2);
    ds.show(2);
    System.out.println("cnt=" + c);
  }

// CDR.java
public class CDR implements java.io.Serializable {
  public java.sql.Date timestamp;
}

// CDR_SAMPLE.csv
timestamp
2017-10-29T02:37:07.815Z
2017-10-29T02:38:07.815Z
{code}

result
{code}
+--------------------+
|           timestamp|
+--------------------+
|2017-10-29 11:37:...|
|2017-10-29 11:38:...|
+--------------------+

+---------+
|timestamp|
+---------+
+---------+

c=0
{code}



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org

Reply via email to