Re: [PERFORM] Planner should use index on a LIKE 'foo%' query

Moritz Onken Mon, 30 Jun 2008 09:42:49 -0700

The thing here is that you are effectively causing Postgres to run asub-select for each row of the "result" table, each time generatingeither an empty list or a list with one or more identical URLs. Thisis effectively forcing a nested loop. In a way, you have twoconstraints where you only need one.
You can safely take out the constraint in the subquery, so it islike this:
SELECT COUNT(*) FROM result WHERE url IN (SELECT shorturl FROM item);
This will generate equivalent results, because those rows thatdidn't match the constraint wouldn't have affected the IN anyway.However, it will alter the performance, because the subquery willcontain more results, but it will only be run once, rather thanmultiple times. This is effectively forcing a hash join (kind of).
Whereas if you rewrite the query as I demonstrated earlier, then youallow Postgres to make its own choice about which join algorithmwill work best.
Matthew


Thank you! I learned a lot today :-)

I thought the subquery will be run on every row thus I tried to makeit as fast as possible by using a where clause. I didn't try yourfirst query on the hole table so it could be faster than mine approach.


greetings,

moritz

--
Sent via pgsql-performance mailing list ([email protected])
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-performance

Re: [PERFORM] Planner should use index on a LIKE 'foo%' query

Reply via email to