Martin Tapp created SPARK-11788:
-----------------------------------
Summary: Using java.sql.Timestamp and java.sql.Date in where
clauses on JDBC dataframes causes SQLServerException
Key: SPARK-11788
URL: https://issues.apache.org/jira/browse/SPARK-11788
Project: Spark
Issue Type: Bug
Components: SQL
Affects Versions: 1.5.1
Reporter: Martin Tapp
I have an MSSQL table with a timestamp column and am reading it using
DataFrameReader.jdbc. Adding a where clause that compares the column against a
timestamp range causes a SQLServerException.
The problem is in compileValue
(https://github.com/apache/spark/blob/master/sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/jdbc/JDBCRDD.scala#L264),
which wraps only strings in quotes. It should also surround timestamps and
dates with quotes.
Sample pseudo-code:
val beg = new java.sql.Timestamp(...)
val end = new java.sql.Timestamp(...)
val filtered = jdbcdf.where($"TIMESTAMP_COLUMN" >= beg && $"TIMESTAMP_COLUMN" < end)
Generated SQL query: "TIMESTAMP_COLUMN >= 2015-01-01 00:00:00.0"
Query should use quotes around timestamp: "TIMESTAMP_COLUMN >= '2015-01-01 00:00:00.0'"
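
To make the suggested change concrete, here is a minimal standalone sketch of a compileValue that also quotes timestamps and dates. It is not the actual Spark source: the object name CompileValueSketch and the escapeSql helper are assumptions made for illustration, and it relies only on the toString output of java.sql.Timestamp and java.sql.Date.

```scala
import java.sql.{Date, Timestamp}

// Standalone sketch, not the Spark implementation.
object CompileValueSketch {
  // Hypothetical helper: doubles single quotes so the text is a valid SQL string literal.
  def escapeSql(s: String): String = s.replace("'", "''")

  // Converts a filter value into the text spliced into the generated WHERE clause.
  def compileValue(value: Any): Any = value match {
    case s: String    => s"'${escapeSql(s)}'"
    case t: Timestamp => s"'$t'" // Timestamp.toString gives yyyy-mm-dd hh:mm:ss.f...
    case d: Date      => s"'$d'" // Date.toString gives yyyy-mm-dd
    case other        => other   // numbers and other values pass through unquoted
  }

  def main(args: Array[String]): Unit = {
    val beg = Timestamp.valueOf("2015-01-01 00:00:00")
    // Prints: TIMESTAMP_COLUMN >= '2015-01-01 00:00:00.0'
    println(s"TIMESTAMP_COLUMN >= ${compileValue(beg)}")
  }
}
```

With this shape, the timestamp from the example above is emitted as a quoted literal, which is the form the server accepts.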
The fallback is to filter client-side, which is extremely inefficient because
the whole table has to be downloaded to the Spark executors.
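
Until the quoting is fixed, one possible way to keep the filter on the server is to pass a hand-written, already quoted predicate to the DataFrameReader.jdbc overload that takes an array of predicates. The sketch below only builds that predicate string. The object and method names, the end timestamp, and the placeholders in the commented-out read call (url, MY_TABLE, props) are assumptions for illustration, and I have not verified this against SQL Server.

```scala
import java.sql.Timestamp

// Standalone sketch of a possible workaround; names are illustrative.
object PredicateWorkaroundSketch {
  // Builds a half-open range predicate with the timestamps quoted as SQL literals.
  def timestampRange(column: String, beg: Timestamp, end: Timestamp): String =
    s"$column >= '$beg' AND $column < '$end'"

  def main(args: Array[String]): Unit = {
    val beg = Timestamp.valueOf("2015-01-01 00:00:00")
    val end = Timestamp.valueOf("2015-02-01 00:00:00") // illustrative end of range
    val predicate = timestampRange("TIMESTAMP_COLUMN", beg, end)
    println(predicate)
    // Assumed usage with Spark on the classpath (url, table name and props are placeholders):
    // val filtered = sqlContext.read.jdbc(url, "MY_TABLE", Array(predicate), props)
  }
}
```

Each entry in the predicates array defines one partition, so a single predicate reads the filtered rows through one connection.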
Thanks