Victoria Markman created DRILL-2148:
---------------------------------------
Summary: Wrong result with grouping on a column of date type with
streaming aggregation
Key: DRILL-2148
URL: https://issues.apache.org/jira/browse/DRILL-2148
Project: Apache Drill
Issue Type: Bug
Components: Execution - Operators
Affects Versions: 0.8.0
Reporter: Victoria Markman
Assignee: Chris Westin
Priority: Critical
Disable hash aggregation and run the query below:
{code}
alter system set `planner.enable_hashagg` = false;
select
c_date,
COUNT(*)
from t1
group by
c_date
order by
c_date;
{code}
You will get a wrong result. Because NULLs are sorted in the middle (see
DRILL-2084), they are folded into one of the unrelated groups.
We might have the same problem with merge join on date, time and timestamp
columns.
Attached is a parquet file that was used in this query.
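To illustrate why NULLs sorted in the middle can break streaming aggregation, here is a minimal Python sketch. It is not taken from Drill's source. The row model `(raw_value, is_null)`, the assumption that a NULL slot still carries a raw value, and the helpers `streaming_count`, `raw_only` and `null_aware` are all hypothetical and exist only for this example. The sketch shows a one-pass aggregation that starts a new group whenever the key changes, and how NULL rows sitting next to a non-NULL row can be counted in that row's group if the boundary check ignores the NULL flag.

```python
from datetime import date

# Hypothetical row model: (raw_value, is_null). A NULL slot is assumed to
# still carry some raw value; here it happens to equal 2015-01-02.
# The list is ordered by raw value only, so the NULLs land in the middle.
rows = [
    (date(2015, 1, 1), False),
    (date(2015, 1, 2), False),
    (date(2015, 1, 2), True),   # NULL
    (date(2015, 1, 2), True),   # NULL
    (date(2015, 1, 3), False),
]

def streaming_count(sorted_rows, same_group):
    """Count rows per group in one pass over already-sorted input.

    A new group starts whenever same_group(previous_row, current_row)
    is false, so correctness depends on equal keys being adjacent.
    """
    groups = []
    prev = None
    for row in sorted_rows:
        if prev is not None and same_group(prev, row):
            groups[-1][1] += 1
        else:
            key = None if row[1] else row[0]
            groups.append([key, 1])
        prev = row
    return [(key, count) for key, count in groups]

def raw_only(a, b):
    # Assumed faulty check: compares raw values and ignores the NULL flag.
    return a[0] == b[0]

def null_aware(a, b):
    # NULL matches only NULL; otherwise compare the raw values.
    if a[1] or b[1]:
        return a[1] and b[1]
    return a[0] == b[0]

# NULLs in the middle plus a NULL-unaware check: the two NULL rows are
# counted in the 2015-01-02 group.
buggy = streaming_count(rows, raw_only)

# NULLs sorted last plus a NULL-aware check: NULL gets its own group.
nulls_last = sorted(rows, key=lambda r: (r[1], r[0]))
fixed = streaming_count(nulls_last, null_aware)

print(buggy)
print(fixed)
```

In this toy model the first call reports three rows for 2015-01-02 and no NULL group, while the second reports one row per date and a separate NULL group of two rows. Whether Drill's streaming aggregate fails in exactly this way is an assumption; the point is only that a one-pass aggregation relies on NULLs forming their own contiguous run.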
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)