[ https://issues.apache.org/jira/browse/HIVE-20174?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Matt McCline updated HIVE-20174:
--------------------------------
Description:
Write new unit tests that use random data and intentional isRepeating batches to
check for NULL and wrong results in vectorized aggregation functions.
BUGs found:
1) AVG/VARIANCE (family) in PARTIAL1 mode was returning NULL instead of count =
0, sum = 0 (All data types). For AVG DECIMAL, only return NULL if there was an
overflow.
2) AVG/MIN/MAX were not detecting repeated NULLs correctly for the TIMESTAMP and
INTERVAL_DAY_TIME types. Eliminated redundant code.
3) Fix incorrect calculation for VARIANCE (family) in PARTIAL2 and FINAL modes.
4) Fix row-mode AVG DECIMAL to enforce output type precision and scale in
COMPLETE and FINAL modes.
was:
Write new unit tests that use random data and intentional isRepeating batches to
check for NULL and wrong results in vectorized aggregation functions.
BUGs found:
1) AVG/VARIANCE (family) in PARTIAL1 mode was returning NULL instead of count =
0, sum = 0 (All data types). For AVG DECIMAL, only return NULL if there was an
overflow.
2) AVG/MIN/MAX were not detecting repeated NULLs correctly for the TIMESTAMP and
INTERVAL_DAY_TIME types.
3) Fix incorrect calculation for VARIANCE (family) in PARTIAL2 and FINAL modes.
4) Fix row-mode AVG DECIMAL to enforce output type precision and scale in
COMPLETE and FINAL modes.
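
To illustrate item 2, here is a minimal sketch of how a MIN over a batch might
treat a repeated NULL. LongBatch is a hypothetical, simplified stand-in modeled
on the isRepeating / noNulls / isNull flags of a vectorized column; it is not
the actual Hive class, and the code is not taken from the patch.

```java
public class RepeatingNullSketch {

    /** Hypothetical, simplified stand-in for a vectorized column of long values. */
    public static final class LongBatch {
        public final long[] vector;
        public final boolean[] isNull;
        public boolean noNulls = true;      // true means isNull[] can be ignored
        public boolean isRepeating = false; // true means entry 0 stands for every row
        public final int size;

        public LongBatch(int size) {
            this.size = size;
            this.vector = new long[size];
            this.isNull = new boolean[size];
        }
    }

    /** Returns the minimum of the non-NULL values, or null when every row is NULL. */
    public static Long min(LongBatch batch) {
        if (batch.size == 0) {
            return null;
        }
        if (batch.isRepeating) {
            // Entry 0 represents all rows, so its NULL flag has to be checked as well.
            if (!batch.noNulls && batch.isNull[0]) {
                return null; // repeated NULL: no value to aggregate
            }
            return batch.vector[0];
        }
        Long result = null;
        for (int i = 0; i < batch.size; i++) {
            if (!batch.noNulls && batch.isNull[i]) {
                continue; // skip NULL rows
            }
            if (result == null || batch.vector[i] < result) {
                result = batch.vector[i];
            }
        }
        return result;
    }

    public static void main(String[] args) {
        LongBatch repeatedNull = new LongBatch(1024);
        repeatedNull.isRepeating = true;
        repeatedNull.noNulls = false;
        repeatedNull.isNull[0] = true;
        System.out.println("MIN over repeated NULL batch: " + min(repeatedNull));

        LongBatch repeatedValue = new LongBatch(1024);
        repeatedValue.isRepeating = true;
        repeatedValue.vector[0] = 7L;
        System.out.println("MIN over repeated value batch: " + min(repeatedValue));
    }
}
```

A test that fills batches with random data and deliberately sets isRepeating,
with and without a NULL in entry 0, exercises exactly this branch.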
> Vectorization: Fix NULL / Wrong Results issues in GROUP BY Aggregation
> Functions
> --------------------------------------------------------------------------------
>
> Key: HIVE-20174
> URL: https://issues.apache.org/jira/browse/HIVE-20174
> Project: Hive
> Issue Type: Bug
> Components: Hive
> Reporter: Matt McCline
> Assignee: Matt McCline
> Priority: Critical
> Attachments: HIVE-20174.01.patch
>
>
> Write new unit tests that use random data and intentional isRepeating batches
> to check for NULL and wrong results in vectorized aggregation functions.
>
> BUGs found:
> 1) AVG/VARIANCE (family) in PARTIAL1 mode was returning NULL instead of count
> = 0, sum = 0 (All data types). For AVG DECIMAL, only return NULL if there
> was an overflow.
> 2) AVG/MIN/MAX were not detecting repeated NULLs correctly for the TIMESTAMP
> and INTERVAL_DAY_TIME types. Eliminated redundant code.
> 3) Fix incorrect calculation for VARIANCE (family) in PARTIAL2 and FINAL
> modes.
> 4) Fix row-mode AVG DECIMAL to enforce output type precision and scale in
> COMPLETE and FINAL modes.
>
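
For items 1 and 3, the following sketch shows one standard way to merge partial
variance states, as a PARTIAL2 or FINAL step has to do. It uses the well-known
pairwise merge of (count, sum, sum of squared deviations); whether the patch
uses this exact form is an assumption. The Partial class and its field names
are hypothetical. An empty partial keeps count = 0 and sum = 0 rather than
turning into NULL, so merging it is a no-op.

```java
public class VarianceMergeSketch {

    /** Hypothetical partial state: row count, sum, and sum of squared deviations (m2). */
    public static final class Partial {
        public long count;
        public double sum;
        public double m2;

        /** Folds one raw value into the state, as a PARTIAL1-style pass would. */
        public void add(double x) {
            count++;
            sum += x;
            if (count > 1) {
                double t = count * x - sum;
                m2 += (t * t) / ((double) count * (count - 1));
            }
        }

        /** Merges another partial state into this one, as PARTIAL2 / FINAL would. */
        public void merge(Partial other) {
            if (other.count == 0) {
                return; // empty partial (count = 0, sum = 0) contributes nothing
            }
            if (count == 0) {
                count = other.count;
                sum = other.sum;
                m2 = other.m2;
                return;
            }
            // Difference of the two partial means drives the cross term.
            double delta = other.sum / other.count - sum / count;
            m2 += other.m2
                + delta * delta * ((double) count * other.count) / (count + other.count);
            count += other.count;
            sum += other.sum;
        }

        /** Population variance, or null when no rows were aggregated. */
        public Double variancePop() {
            if (count == 0) {
                return null;
            }
            return m2 / count;
        }
    }

    public static void main(String[] args) {
        Partial a = new Partial();
        for (double x : new double[] {1, 2, 3, 4}) {
            a.add(x);
        }
        Partial b = new Partial();
        for (double x : new double[] {10, 20}) {
            b.add(x);
        }
        a.merge(b);
        System.out.println("merged count = " + a.count + ", var_pop = " + a.variancePop());
    }
}
```

Comparing the merged result against a single-pass computation over the same
random data is the kind of check that would surface a wrong merge formula.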