Peter Ebert created IMPALA-8956:
-----------------------------------
Summary: Row count incorrect in summary while query running
Key: IMPALA-8956
URL: https://issues.apache.org/jira/browse/IMPALA-8956
Project: IMPALA
Issue Type: Bug
Reporter: Peter Ebert
Attachments: image-2019-09-18-20-48-39-069.png,
image-2019-09-18-21-28-33-720.png
For the query below, the summary shows an incorrect row count for the nested
loop join while the query is running; once the query completes, the row count
is corrected:
select g.name, g.start, g.aa,
  sum(case when p.pn < 125.0 then 1 else 0 end) / cast(max(lf_count) as decimal(10,6)) as lf,
  sum(case when p.pn >= 125.0 then 1 else 0 end) / cast(max(hf_count) as decimal(10,6)) as hff
from g
join p on (g.sampleid = p.sample_id)
cross join (select count(*) as lf_count from p where pn < 125.0) lff
cross join (select count(*) as hf_count from p where pn >= 125.0) hf
group by g.name, g.start, g.aa
order by name desc
!image-2019-09-18-20-48-39-069.png!
Compared to the summary of the finished query:
!image-2019-09-18-21-28-33-720.png!
--
This message was sent by Atlassian Jira
(v8.3.4#803005)