[
https://issues.apache.org/jira/browse/SPARK-34392?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17281838#comment-17281838
]
Maxim Gekk edited comment on SPARK-34392 at 2/9/21, 3:26 PM:
-------------------------------------------------------------
The "GMT+8:00" string is unsupported format in 3.0, see docs for the
to_utc_timestamp() function
(https://github.com/apache/spark/blob/30468a901577e82c855fbc4cb78e1b869facb44c/sql/core/src/main/scala/org/apache/spark/sql/functions.scala#L3397-L3402):
{code:scala}
@param tz A string detailing the time zone ID that the input should be adjusted
to. It should
be in the format of either region-based zone IDs or zone offsets. Region IDs
must
have the form 'area/city', such as 'America/Los_Angeles'. Zone offsets must
be in
the format '(+|-)HH:mm', for example '-08:00' or '+01:00'. Also 'UTC' and 'Z'
are
supported as aliases of '+00:00'. Other short names are not recommended to use
because they can be ambiguous.
{code}
was (Author: maxgekk):
The "GMT+8:00" string is unsupported format in 3.0, see docs for the
to_utc_timestamp() function:
{code:scala}
* @param tz A string detailing the time zone ID that the input should be
adjusted to. It should
* be in the format of either region-based zone IDs or zone
offsets. Region IDs must
* have the form 'area/city', such as 'America/Los_Angeles'. Zone
offsets must be in
* the format '(+|-)HH:mm', for example '-08:00' or '+01:00'. Also
'UTC' and 'Z' are
* supported as aliases of '+00:00'. Other short names are not
recommended to use
* because they can be ambiguous.
{code}
> Invalid ID for offset-based ZoneId since Spark 3.0
> --------------------------------------------------
>
> Key: SPARK-34392
> URL: https://issues.apache.org/jira/browse/SPARK-34392
> Project: Spark
> Issue Type: Bug
> Components: SQL
> Affects Versions: 3.0.0, 3.0.1
> Reporter: Yuming Wang
> Priority: Major
>
> How to reproduce this issue:
> {code:sql}
> select to_utc_timestamp("2020-02-07 16:00:00", "GMT+8:00");
> {code}
> Spark 2.4:
> {noformat}
> spark-sql> select to_utc_timestamp("2020-02-07 16:00:00", "GMT+8:00");
> 2020-02-07 08:00:00
> Time taken: 0.089 seconds, Fetched 1 row(s)
> {noformat}
> Spark 3.x:
> {noformat}
> spark-sql> select to_utc_timestamp("2020-02-07 16:00:00", "GMT+8:00");
> 21/02/07 01:24:32 ERROR SparkSQLDriver: Failed in [select
> to_utc_timestamp("2020-02-07 16:00:00", "GMT+8:00")]
> java.time.DateTimeException: Invalid ID for offset-based ZoneId: GMT+8:00
> at java.time.ZoneId.ofWithPrefix(ZoneId.java:437)
> at java.time.ZoneId.of(ZoneId.java:407)
> at java.time.ZoneId.of(ZoneId.java:359)
> at java.time.ZoneId.of(ZoneId.java:315)
> at
> org.apache.spark.sql.catalyst.util.DateTimeUtils$.getZoneId(DateTimeUtils.scala:53)
> at
> org.apache.spark.sql.catalyst.util.DateTimeUtils$.toUTCTime(DateTimeUtils.scala:814)
> {noformat}
--
This message was sent by Atlassian Jira
(v8.3.4#803005)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]