Josh Mahonin created PHOENIX-2567:
-------------------------------------

             Summary: phoenix-spark: DataFrame API should handle 'DATE' columns
                 Key: PHOENIX-2567
                 URL: https://issues.apache.org/jira/browse/PHOENIX-2567
             Project: Phoenix
          Issue Type: Bug
    Affects Versions: 4.7.0
            Reporter: Josh Mahonin
            Assignee: Josh Mahonin
             Fix For: 4.7.0


The current implementation had the 'DATE' datatype bound to a Spark SQL 
'TimestampType', which causes a casting error trying to convert from 
java.sql.Date to java.sql.Timestamp when using the DataFrame API with Phoenix 
DATE columns.

This patch modifies the schema handling to treat DATE columns as the Spark 
'DateType' instead. Note that Spark *drops* the hour, minute and second values 
from these when interfacing using DataFrames. This follows the java.sql.Date 
spec, but might not useful to folks who rely on the hour/minute/second fields 
working using the DataFrame API and DATE columns. A future improvement would be 
to force these to be TimestampTypes instead to preserve information, but it's 
less intuitive and probably shouldn't be the default behaviour.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to