Hi ,

    

      Recently I came across a case where the planner choose larger table on 
hash side. I am not sure whether it is an intended  behavior or we are missing 
something. 

  

     I have two tables (a and b) each with  single column in it. One table 'a' 
is large with around 30 million distinct rows and other table 'b' has merely 
70,000 rows with one-seventh (10,000) distinct rows. I have analyzed both the 
table.  But while joining both the table I get the larger table on hash side. 

tpch=# explain select b from b left join a on a = b;

                                               QUERY PLAN                       
                         

---------------------------------------------------------------------------------------------------------

 Hash Left Join  (cost=824863.75..950104.42 rows=78264 width=4)

   Hash Cond: (b.b = a.a)o

   ->  Foreign Scan on b  (cost=0.00..821.64 rows=78264 width=4)

         CStore File: /home/likewise-open/pg96/data/cstore_fdw/1818708/1849879

         CStore File Size: 314587

   ->  Hash  (cost=321721.22..321721.22 rows=30667722 width=4)

         ->  Foreign Scan on a  (cost=0.00..321721.22 rows=30667722 width=4)

               CStore File: 
/home/likewise-open/pg96/data/cstore_fdw/1818708/1849876

               CStore File Size: 123236206

(9 rows)






I would like to know the reason for choosing this plan and Is there a easy fix 
to prevent such plans (especially like this one where it choose a larger hash 
table) ?  



Thanks,

Pradeep


Reply via email to