[ 
https://issues.apache.org/jira/browse/SPARK-30668?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17028731#comment-17028731
 ] 

Maxim Gekk commented on SPARK-30668:
------------------------------------

Behind of the removed config _spark.sql.legacy.timeParser.enabled_, there are 2 
more fallbacks to behaviors since Spark 1.5, see LegacyFallbackDateFormatter:
1. *s.toInt* - In Spark 1.5.0, we store the data as number of days since epoch 
in string. So, we just convert it to Int.
2. *DateTimeUtils.millisToDays(DateTimeUtils.stringToTime(s).getTime)* - the 
way used in 2.0 and 1.x
3. FastDateFormat or *SimpleDateFormat*
Should we allow users to switch to SimpleDateFormat only or other legacy ways 
too?

 

> to_timestamp failed to parse 2020-01-27T20:06:11.847-0800 using pattern 
> "yyyy-MM-dd'T'HH:mm:ss.SSSz"
> ----------------------------------------------------------------------------------------------------
>
>                 Key: SPARK-30668
>                 URL: https://issues.apache.org/jira/browse/SPARK-30668
>             Project: Spark
>          Issue Type: Bug
>          Components: SQL
>    Affects Versions: 3.0.0
>            Reporter: Xiao Li
>            Priority: Blocker
>
> {code:java}
> SELECT to_timestamp("2020-01-27T20:06:11.847-0800", 
> "yyyy-MM-dd'T'HH:mm:ss.SSSz")
> {code}
> This can return a valid value in Spark 2.4 but return NULL in the latest 
> master
> **2.4.5 RC2**
> {code}
> scala> sql("""SELECT to_timestamp("2020-01-27T20:06:11.847-0800", 
> "yyyy-MM-dd'T'HH:mm:ss.SSSz")""").show
> +----------------------------------------------------------------------------+
> |to_timestamp('2020-01-27T20:06:11.847-0800', 'yyyy-MM-dd\'T\'HH:mm:ss.SSSz')|
> +----------------------------------------------------------------------------+
> |                                                         2020-01-27 20:06:11|
> +----------------------------------------------------------------------------+
> {code}
> **2.2.3 ~ 2.4.4** (2.0.2 ~ 2.1.3 doesn't have `to_timestamp`).
> {code}
> spark-sql> SELECT to_timestamp("2020-01-27T20:06:11.847-0800", 
> "yyyy-MM-dd'T'HH:mm:ss.SSSz");
> 2020-01-27 20:06:11
> {code}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to