Tom Lane wrote:
> If the planner has correctly predicted the number of rows, the table
> loading should be about NTUP_PER_BUCKET in either regime.  Are you
> sure you aren't just wishing that NTUP_PER_BUCKET were smaller?

Maybe I do wish NTUP_PER_BUCKET were smaller. But I don't think that's the whole story.

The planner estimates definitely play a role in my concern here. For a mis-estimated inner relation, the current calculation can over-subscribe the hash table even when more work_mem is available (that is, there are too many hash collisions *and* memory isn't being used to the fullest extent allowed).

I've been tracking the number of tuples that land in each bucket, and I'd like to see that number go down as I increase work_mem.

For the same data, I would expect a hash join run with a work_mem of 256MB to be faster than one run with 32MB, even if the inner relation is only 30MB.

The implementation I've been experimenting with actually takes the average of the current estimate (ntuples / 10) and a spill-based estimate (work_mem / (tupsize * 10)).

