Austin Haas created BEAM-3481:
---------------------------------
Summary: Query with subquery and aggregates cannot be implemented.
Key: BEAM-3481
URL: https://issues.apache.org/jira/browse/BEAM-3481
Project: Beam
Issue Type: Bug
Components: dsl-sql
Affects Versions: 2.2.0
Reporter: Austin Haas
Assignee: Xu Mingmin
This query results in the error below:
{noformat}
"SELECT (COUNT(`p`))
FROM (SELECT `p`
FROM `contains`
GROUP BY `p`) AS `t1`"{noformat}
This works correctly:
{noformat}
"SELECT (COUNT(`p`))
FROM (SELECT `p`, CURRENT_TIME
FROM `contains`
GROUP BY `p`) AS `t1`"{noformat}
Error:
{noformat}
[nREPL-worker-5] INFO
org.apache.beam.sdk.extensions.sql.impl.planner.BeamQueryPlanner - SQL:
SELECT COUNT(`t1`.`p`)
FROM (SELECT `contains`.`p`
FROM `contains` AS `contains`
GROUP BY `contains`.`p`) AS `t1`
[nREPL-worker-5] INFO
org.apache.beam.sdk.extensions.sql.impl.planner.BeamQueryPlanner - SQLPlan>
LogicalAggregate(group=[{}], EXPR$0=[COUNT()])
LogicalAggregate(group=[{0}])
LogicalProject(p=[$0])
LogicalTableScan(table=[[contains]])
CannotPlanException Node [rel#157:Subset#3.BEAM_LOGICAL.[]] could not be
implemented; planner state:
Root: rel#157:Subset#3.BEAM_LOGICAL.[]
Original rel:
LogicalAggregate(subset=[rel#157:Subset#3.BEAM_LOGICAL.[]], group=[{}],
EXPR$0=[COUNT()]): rowcount = 1.0, cumulative cost = {1.125 rows, 0.0 cpu, 0.0
io}, id = 155
LogicalAggregate(subset=[rel#154:Subset#2.NONE.[]], group=[{0}]): rowcount =
10.0, cumulative cost = {10.0 rows, 0.0 cpu, 0.0 io}, id = 153
LogicalProject(subset=[rel#152:Subset#1.NONE.[]], p=[$0]): rowcount = 100.0,
cumulative cost = {100.0 rows, 100.0 cpu, 0.0 io}, id = 151
LogicalTableScan(subset=[rel#150:Subset#0.NONE.[]], table=[[contains]]):
rowcount = 100.0, cumulative cost = {100.0 rows, 101.0 cpu, 0.0 io}, id = 146
Sets:
Set#0, type: RecordType(VARCHAR p, VARCHAR s, BIGINT c)
rel#150:Subset#0.NONE.[], best=null, importance=0.6561
rel#146:LogicalTableScan.NONE.[](table=[contains]), rowcount=100.0, cumulative
cost={inf}
rel#162:Subset#0.BEAM_LOGICAL.[], best=rel#164, importance=0.32805
rel#164:BeamIOSourceRel.BEAM_LOGICAL.[](table=[contains]), rowcount=100.0,
cumulative cost={100.0 rows, 101.0 cpu, 0.0 io}
Set#1, type: RecordType(VARCHAR p)
rel#152:Subset#1.NONE.[], best=null, importance=0.7290000000000001
rel#151:LogicalProject.NONE.[](input=rel#150:Subset#0.NONE.[],p=$0),
rowcount=100.0, cumulative cost={inf}
rel#159:Subset#1.BEAM_LOGICAL.[], best=rel#163, importance=0.36450000000000005
rel#163:BeamProjectRel.BEAM_LOGICAL.[](input=rel#162:Subset#0.BEAM_LOGICAL.[],p=$0),
rowcount=100.0, cumulative cost={200.0 rows, 201.0 cpu, 0.0 io}
Set#2, type: RecordType(VARCHAR p)
rel#154:Subset#2.NONE.[], best=null, importance=0.81
rel#153:LogicalAggregate.NONE.[](input=rel#152:Subset#1.NONE.[],group={0}),
rowcount=10.0, cumulative cost={inf}
rel#161:Subset#2.BEAM_LOGICAL.[], best=rel#160, importance=0.405
rel#160:BeamAggregationRel.BEAM_LOGICAL.[](group={0},window=org.apache.beam.sdk.transforms.windowing.GlobalWindows,trigger=Repeatedly.forever(AfterWatermark.pastEndOfWindow())),
rowcount=10.0, cumulative cost={210.0 rows, 201.0 cpu, 0.0 io}
Set#3, type: RecordType(BIGINT EXPR$0)
rel#156:Subset#3.NONE.[], best=null, importance=0.9
rel#155:LogicalAggregate.NONE.[](input=rel#154:Subset#2.NONE.[],group={},EXPR$0=COUNT()),
rowcount=1.0, cumulative cost={inf}
rel#157:Subset#3.BEAM_LOGICAL.[], best=null,
importance=1.0
rel#158:AbstractConverter.BEAM_LOGICAL.[](input=rel#156:Subset#3.NONE.[],convention=BEAM_LOGICAL,sort=[]),
rowcount=1.0, cumulative cost={inf}
org.apache.beam.sdks.java.extensions.sql.repackaged.org.apache.calcite.plan.volcano.RelSubset$CheapestPlanReplacer.visit
(RelSubset.java:441)
{noformat}
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)