[
https://issues.apache.org/jira/browse/DRILL-4473?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Jacques Nadeau updated DRILL-4473:
----------------------------------
Assignee: Sean Hsuan-Yi Chu
> Removing trivial projects reveals bugs in handling of nonexistent columns in
> StreamingAggregate
> -----------------------------------------------------------------------------------------------
>
> Key: DRILL-4473
> URL: https://issues.apache.org/jira/browse/DRILL-4473
> Project: Apache Drill
> Issue Type: Bug
> Reporter: Jacques Nadeau
> Assignee: Sean Hsuan-Yi Chu
>
> We see a couple unit test failures in working with nonexistent columns once
> DRILL-4467 is fixed. This is because trivial projects no longer protect
> StreamingAggregate from non-existent columns. This is likely due to an
> incorrect check before throwing a Unsupported error. An unknown/ANY type
> should probably be allowed in the case of using sum/max/stddev
> {code:title=Plan before DRILL-4467}
> VOLCANO:Physical Planning (71ms):
> ScreenPrel: rowcount = 1.0, cumulative cost = {464.1 rows, 2375.1 cpu, 0.0
> io, 0.0 network, 0.0 memory}, id = 185
> ProjectPrel(col1=[$0], col2=[$1], col3=[$2], col4=[$3], col5=[$4]):
> rowcount = 1.0, cumulative cost = {464.0 rows, 2375.0 cpu, 0.0 io, 0.0
> network, 0.0 memory}, id = 184
> StreamAggPrel(group=[{}], col1=[SUM($0)], col2=[SUM($1)], col3=[SUM($2)],
> col4=[SUM($3)], col5=[SUM($4)]): rowcount = 1.0, cumulative cost = {464.0
> rows, 2375.0 cpu, 0.0 io, 0.0 network, 0.0 memory}, id = 183
> LimitPrel(offset=[0], fetch=[0]): rowcount = 1.0, cumulative cost =
> {463.0 rows, 2315.0 cpu, 0.0 io, 0.0 network, 0.0 memory}, id = 182
> ProjectPrel(int_col=[$0], bigint_col=[$3], float4_col=[$4],
> float8_col=[$1], interval_year_col=[$2]): rowcount = 463.0, cumulative cost =
> {463.0 rows, 2315.0 cpu, 0.0 io, 0.0 network, 0.0 memory}, id = 181
> ScanPrel(groupscan=[EasyGroupScan
> [selectionRoot=classpath:/employee.json, numFiles=1, columns=[`int_col`,
> `bigint_col`, `float4_col`, `float8_col`, `interval_year_col`],
> files=[classpath:/employee.json]]]): rowcount = 463.0, cumulative cost =
> {463.0 rows, 2315.0 cpu, 0.0 io, 0.0 network, 0.0 memory}, id = 160
> {code}
> {code:title=Plan after DRILL-4467}
> VOLCANO:Physical Planning (63ms):
> ScreenPrel: rowcount = 1.0, cumulative cost = {464.1 rows, 2375.1 cpu, 0.0
> io, 0.0 network, 0.0 memory}, id = 151
> ProjectPrel(col1=[$0], col2=[$1], col3=[$2], col4=[$3], col5=[$4]):
> rowcount = 1.0, cumulative cost = {464.0 rows, 2375.0 cpu, 0.0 io, 0.0
> network, 0.0 memory}, id = 150
> StreamAggPrel(group=[{}], col1=[SUM($0)], col2=[SUM($1)], col3=[SUM($2)],
> col4=[SUM($3)], col5=[SUM($4)]): rowcount = 1.0, cumulative cost = {464.0
> rows, 2375.0 cpu, 0.0 io, 0.0 network, 0.0 memory}, id = 149
> LimitPrel(offset=[0], fetch=[0]): rowcount = 1.0, cumulative cost =
> {463.0 rows, 2315.0 cpu, 0.0 io, 0.0 network, 0.0 memory}, id = 148
> ScanPrel(groupscan=[EasyGroupScan
> [selectionRoot=classpath:/employee.json, numFiles=1, columns=[`int_col`,
> `bigint_col`, `float4_col`, `float8_col`, `interval_year_col`],
> files=[classpath:/employee.json]]]): rowcount = 463.0, cumulative cost =
> {463.0 rows, 2315.0 cpu, 0.0 io, 0.0 network, 0.0 memory}, id = 141
> Tests disabled referring to this bug in TestAggregateFunctions show multiple
> examples of this behavior.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)