Stamatis Zampetakis created HIVE-27673:
------------------------------------------

             Summary: Configurable datetime formatter for date_format
                 Key: HIVE-27673
                 URL: https://issues.apache.org/jira/browse/HIVE-27673
             Project: Hive
          Issue Type: Improvement
          Components: HiveServer2
    Affects Versions: 4.0.0-beta-1
            Reporter: Stamatis Zampetakis
            Assignee: Stamatis Zampetakis


HIVE-25268 switched the internal implementation of date_format from 
java.text.SimpleDateFormat to java.time.format.DateTimeFormatter in order to 
avoid some inconsistencies (arguably wrong results) for dates prior to 1900.

However, the API of the underlying formatter is exposed to the user since they 
need to pass patterns that are valid for the respective formatter.

Changing the formatter implementation resolves the bugs in HIVE-25268 but also 
leads to backward incompatible behavior.

Consider for example the following query where the letter 'u' is used to format 
the date:

{code:sql}
select date_format('2023-09-08','u');
{code}

The query above will return different result depending on the formatter that is 
used underneath.

In 
[SimpleDateFormat|https://docs.oracle.com/javase/8/docs/api/java/text/SimpleDateFormat.html],
 the letter 'u' means day of the week so the query returns 5.

In 
[DateTimeFormatter|https://docs.oracle.com/javase/8/docs/api/java/time/format/DateTimeFormatter.html],
 the letter 'u' means year so the query returns 2023.

The goal of this ticket is to make the underlying formatter of date_format 
function configurable by the end-user via property, similarly to what was done 
in HIVE-25576. For this purpose we could reuse the same property: 
hive.datetime.formatter



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to