Re: [HACKERS] Planning large IN lists

Atul Deopujari Thu, 17 May 2007 12:03:14 -0700

Hi,

Tom Lane wrote:

"Atul Deopujari" <[EMAIL PROTECTED]> writes:

Hi,
Tom Lane wrote:
That's the least of the problems.  We really ought to convert such cases
into an IN (VALUES(...)) type of query, since often repeated indexscans
aren't the best implementation.
I thought of giving this a shot and while I was working on it, itoccurred to me that we need to decide on a threshold value of the INlist size above which such transformation should take place.


I see no good reason to suppose that there is/should be a constant
threshold --- most likely it depends on size of table, availability of
indexes, etc.  Having the planner try it both ways and compare costs
would be best.

Yes, letting the planner make its own decision would seem best (inaccordance with what we do for different join paths). But for large INlists, a substantial part of the planner is spent in estimating theselectivity of the ScalarArrayExpr by calling scalararraysel. If we arenot eliminating this step in processing the IN list then we are notdoing any optimization. Asking the planner to do scalararraysel and alsocompute cost of any other way and choose between the two is askingplanner to do more work.

Factors such as size of table, availability of index etc. would affectboth the ways similarly. So, if we see a gain in the execution of the INlist due to an external factor then we will also see a similar gain inthe execution of the transformed IN (VALUES(...)) clause.

I agree that one value would not fit all cases. The problem with thisapproach is that for some cases, large IN list would perform better thanthe transformed IN (VALUES(...)) clause. But we know that thetransformed IN (VALUES(...)) clause has almost a steady state behaviorand it would not blow off the planner estimates. The error would be justmarginal.


--
Atul

EnterpriseDB
www.enterprisedb.com


---------------------------(end of broadcast)---------------------------
TIP 4: Have you searched our list archives?

              http://archives.postgresql.org

Re: [HACKERS] Planning large IN lists

Reply via email to