Re: [PERFORM] Perfomance of views

Jan Wieck Thu, 27 Oct 2005 06:48:53 -0700

On 10/27/2005 7:29 AM, Richard Huxton wrote:

Don't forget to CC the list
Svenne Krap wrote:
What do you mean exactly by "pushing conditions inside"?
If I have something like "SELECT * FROM complicated_view WHERE foo = 7" then the planner can look "inside" complicated_view and see where it can attach the condition "foo=7", rather than running the query and applying the condition at the end.

Sorry, but the planner doesn't attach the condition anywhere. It is the rewriter that takes the actual query, replaces the view's rangetable and expression entries with the actual underlying objects, and adds the view's condition with an AND to the query's condition. Simple example:


Given a view

    create view v1 as select a1, b1, c2 from t1, t2 where a1 = a2;

The statement

    select * from v1 where b1 = 'foo';

will result in a parsetree equivalent to what you would get if the original query was


    select a1, b1, c2 from t1, t2 where (b1 = 'foo') and (a1 = a2);

It is the planner's and optimizer's job to recognize where in the execution plan it can push qualifications down into filters or even scankeys. The planner should be able to realize that


    select * from v1 where a1 = 42;

is in fact equivalent to

    select a1, b1, c2 from t1, t2 where a1 = 42 and a1 = a2;

as well as

    select a1, b1, c2 from t1, t2 where a1 = 42 and a1 = a2 and a2 = 42;

This very last addition of "a2 = 42", because of "a2 = a1 = 42", allows it to put a constant scankey onto the scan of t2. The 8.0 planner does that, so the resulting query plan for the last three selects above is absolutely identical.
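
One way to check this on your own installation is to compare the plans with EXPLAIN. A minimal sketch, assuming made-up column types for t1 and t2 (only the column names come from the example above):

```sql
-- Hypothetical table definitions; the column types are an assumption.
create table t1 (a1 integer primary key, b1 text);
create table t2 (a2 integer primary key, c2 text);

create view v1 as select a1, b1, c2 from t1, t2 where a1 = a2;

-- If the planner derives a2 = 42 from a1 = 42 and a1 = a2,
-- all three statements should show the same plan.
explain select * from v1 where a1 = 42;
explain select a1, b1, c2 from t1, t2 where a1 = 42 and a1 = a2;
explain select a1, b1, c2 from t1, t2 where a1 = 42 and a1 = a2 and a2 = 42;
```

With empty tables the plan shape may differ from what you see on real data, so it is worth loading representative rows and running ANALYZE before comparing.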

There are cases where it is safe for the planner to do this, but it isn't smart enough to do so.


Example?


Jan

I don't think I will have the option of testing on the full queries, as these take many days to write (the current ones they are replacing, on MSSQL, take up more than 5 kB of query). The current ones are nightmares from a maintenance standpoint.
Hmm - it sounds like they would be.
Basically what the application is doing is selecting some base data from the "large" table for a point in time (usually a quarter) and selecting all matching auxiliary data from the other tables. They are made in a time-travel-like manner with a first and last usable date.
The ways I have considered were:
1) write a big query by hand (not preferred as it gets hard to manage)
Agreed.
2) write layers of views (still not preferred as I still have to remember to put on the right conditions everywhere)
This is what I'd probably do, but of course I don't have full information about your situation.
3) write layers of sql-functions (returning the right sets of rows from the underlying tables) - which I prefer from a development angle ... it gets very clean and I can't forget a parameter anywhere.
But I seem to remember (and I have used PGSQL in production since 7.0) that the planner has some problems with solution 3 (i.e. estimating the cost and rearranging the query), but frankly that would be the way I would like to go.
Well, 8.x can "inline" a simple sql function into a larger query, but it doesn't sound like that will be enough in your case. Once a function becomes a "black box" then there's not much the planner can do to figure out what to do.
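As a rough illustration of what "simple" means here, a sketch with a hypothetical helper function and a hypothetical table aux(id, first_usable, last_usable); none of these names come from your schema, and whether a given function really gets inlined depends on the version and on how it is declared:

```sql
-- Hypothetical helper: true when date $1 lies within [$2, $3].
-- A single-expression SQL function like this is a candidate for inlining.
create function usable_at(date, date, date) returns boolean as
  'select $1 between $2 and $3'
language sql immutable;

-- If the call is inlined, the planner sees a plain BETWEEN condition
-- on the table's columns instead of an opaque function call.
explain
select * from aux
where usable_at(date '2005-09-30', first_usable, last_usable);
```

If the EXPLAIN output shows the BETWEEN-style comparison in the filter rather than a call to usable_at, the function was inlined.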
Based on the current (non-optimal) design and hardware constraints, I still have to make sure the query runs fairly optimally - that means the planner must use indexes intelligently and other stuff as if it was (well-)written using solution 1.
Well, #1 and #2 are likely to be the most efficient, but you won't know for sure about #2 until you test it.
There are a couple of other options though:
#4 - Write a set-returning function that breaks the query into steps and executes each in turn. So - fetch IDs from the main table in step 1 and store them in a temporary table, join other tables in later steps.
#5 - Write a function that writes your big query for you and either returns the SQL to your application, or runs it and returns the results.
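The stepwise idea in #4 could look roughly like this, shown as plain statements rather than wrapped in a function. The tables big(id, valid_from, valid_to) and aux(big_id, first_usable, last_usable, val) and the quarter dates are invented for illustration:

```sql
-- Step 1: collect the IDs valid during the quarter into a temporary table.
create temp table wanted_ids as
  select id from big
  where valid_from <= date '2005-09-30'
    and valid_to   >= date '2005-07-01';

-- Give the planner row estimates for the temporary table.
analyze wanted_ids;

-- Step 2: join the auxiliary data against the (hopefully small) ID set.
select w.id, a.val
from wanted_ids w
join aux a on a.big_id = w.id
where a.first_usable <= date '2005-09-30'
  and a.last_usable  >= date '2005-07-01';
```

The point of the intermediate table is that each later step is planned against a known, usually small, set of rows.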
What do you think of the three solutions? And is there some resource about the planner's capabilities for someone like me (who is very used to writing reasonably fast and complex sql, can read c-code, but does not really want to dig into the source code)?
There is some stuff in the "Internals" section of the manuals and it might be worth rummaging around on http://techdocs.postgresql.org
--
   Richard Huxton
   Archonet Ltd

---------------------------(end of broadcast)---------------------------
TIP 4: Have you searched our list archives?

               http://archives.postgresql.org



--
#======================================================================#
# It's easier to get forgiveness for being wrong than for being right. #
# Let's break this rule - forgive me.                                  #
#================================================== [EMAIL PROTECTED] #

