[ 
https://issues.apache.org/jira/browse/HAWQ-1616?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Lin Wen updated HAWQ-1616:
--------------------------
    Description: 
Wrong result of Hash Join when enable Bloom filter in some cases, e.g join key 
"l_partkey" is not in select list:

select  l_quantity, l_partkey,  l_extendedprice  from part, lineitem where 
p_partkey = l_partkey and p_brand = 'Brand#23' and p_container = 'MED BOX' 
limit 10; 

 select  l_quantity,  l_extendedprice  from part, lineitem where p_partkey = 
l_partkey and p_brand = 'Brand#23' and p_container = 'MED BOX' limit 10;

The SQL statement and data are from TPCH workload, the correct result should be:
l_quantity | l_extendedprice
------------+-----------------
       3.00 |         5399.55
       6.00 |         8318.58
      38.00 |        57927.20
      49.00 |        90545.63
      44.00 |        76197.88
      10.00 |        17146.20
      26.00 |        34376.94
      35.00 |        56332.85
       9.00 |        11999.88
      14.00 |        24020.92
(10 rows)

The projection information hasn't been pushed down to parquet scan correctly, 
so current result is none.

  was:
Wrong result of Hash Join when enable Bloom filter in some cases, e.g join key 
"l_partkey" is not in select list:

select  l_quantity, l_partkey,  l_extendedprice  from part, lineitem where 
p_partkey = l_partkey and p_brand = 'Brand#23' and p_container = 'MED BOX' 
limit 10; 

 select  l_quantity,  l_extendedprice  from part, lineitem where p_partkey = 
l_partkey and p_brand = 'Brand#23' and p_container = 'MED BOX' limit 10;


> Wrong Result of Hash Join When Enable Bloom filter
> --------------------------------------------------
>
>                 Key: HAWQ-1616
>                 URL: https://issues.apache.org/jira/browse/HAWQ-1616
>             Project: Apache HAWQ
>          Issue Type: Bug
>          Components: Query Execution
>            Reporter: Lin Wen
>            Assignee: Lin Wen
>            Priority: Major
>
> Wrong result of Hash Join when enable Bloom filter in some cases, e.g join 
> key "l_partkey" is not in select list:
> select  l_quantity, l_partkey,  l_extendedprice  from part, lineitem where 
> p_partkey = l_partkey and p_brand = 'Brand#23' and p_container = 'MED BOX' 
> limit 10; 
>  select  l_quantity,  l_extendedprice  from part, lineitem where p_partkey = 
> l_partkey and p_brand = 'Brand#23' and p_container = 'MED BOX' limit 10;
> The SQL statement and data are from TPCH workload, the correct result should 
> be:
> l_quantity | l_extendedprice
> ------------+-----------------
>        3.00 |         5399.55
>        6.00 |         8318.58
>       38.00 |        57927.20
>       49.00 |        90545.63
>       44.00 |        76197.88
>       10.00 |        17146.20
>       26.00 |        34376.94
>       35.00 |        56332.85
>        9.00 |        11999.88
>       14.00 |        24020.92
> (10 rows)
> The projection information hasn't been pushed down to parquet scan correctly, 
> so current result is none.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Reply via email to