[HACKERS] hash join hashtable size and work_mem

Timothy J. Kordas Wed, 14 Mar 2007 08:42:52 -0800

in nodeHash.c, the function ExecChooseHashTableSize() uses two differentmethods for determining the number of buckets to use.


the current code looks something like:

if (ntuples * tuplesize > work_mem * 1024)
        buckets = (work_mem * 1024) / (tupsize * 10);
else
        buckets = ntuples/10

So for the case where a spill is expected; we use work_mem to decide on ourhash size. For the case where a spill isn't expected; we rely on the rowestimate alone -- and make no provision for speeding the join by using thememory that we're allowed to use.

When profiling large hash-joins, it often is the case that scanning thehash-buckets is a bottleneck; it would be nice for the user to be able to"throw memory" at a join to improve performance.

Am I missing something about the current implementation ? I would expectthat the bucket count would be calculated something like:


buckets = (work_mem * 1024L) / (tup_size * NTUP_PER_BUCKET)

for both cases ?

making this change appears to improve hash-join performance substantially insome cases, and as far as I can tell doesn't hurt anything (apart from usingmemory that it is "allowed" to use given a particular work_mem setting).


-Tim
--
[EMAIL PROTECTED]


---------------------------(end of broadcast)---------------------------
TIP 6: explain analyze is your friend

[HACKERS] hash join hashtable size and work_mem

Reply via email to