On Sep 13, 2006, at 14:44, Gregory Stark wrote:
I think we need a serious statistics jock to pipe up with some standard metrics that do what we need. Otherwise we'll never have a solid footing for the predictions we make and will never know how much we can trust them.

That said, I'm now going to do exactly what I just said we should stop doing and brainstorm an ad-hoc metric that might help:
I wonder if what we need is something like this: sort the sampled values by value and count up the average number of distinct blocks per value. That might let us predict how many pages a fetch of a specific value would retrieve. Or perhaps we need a second histogram where the quantities are of distinct pages rather than total records.
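A rough sketch of the distinct-blocks-per-value idea, assuming the sampler gives us (value, block number) pairs; the function name and input shape here are hypothetical, not anything PostgreSQL actually exposes:

```python
from collections import defaultdict

def avg_distinct_blocks_per_value(samples):
    """samples: iterable of (value, block_number) pairs from a sample scan.

    Returns the average number of distinct heap blocks holding each
    distinct value -- a proxy for how many pages a fetch of one value
    would touch.
    """
    blocks_by_value = defaultdict(set)
    for value, block in samples:
        blocks_by_value[value].add(block)
    if not blocks_by_value:
        return 0.0
    return sum(len(b) for b in blocks_by_value.values()) / len(blocks_by_value)

# 'a' is spread across 3 blocks, 'b' is clustered on 1 block:
samples = [('a', 1), ('a', 7), ('a', 12), ('b', 3), ('b', 3)]
print(avg_distinct_blocks_per_value(samples))  # (3 + 1) / 2 = 2.0
```

A high average suggests an index scan on that column would incur many random page fetches; a value near 1 suggests the values are well clustered.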
We might also need a separate "average number of n-block spans per value" metric to predict how sequential the I/O will be, in addition to how many pages will be fetched.
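One way to read "n-block spans" is maximal runs of consecutive block numbers: fewer spans for the same number of blocks means more sequential I/O. A minimal sketch under that assumption (again with a hypothetical function name and input shape):

```python
from collections import defaultdict

def avg_spans_per_value(samples):
    """samples: iterable of (value, block_number) pairs.

    A "span" is a maximal run of consecutive block numbers for one value.
    Averaging span counts across values estimates how sequential the I/O
    for a typical value-fetch would be.
    """
    blocks_by_value = defaultdict(set)
    for value, block in samples:
        blocks_by_value[value].add(block)
    if not blocks_by_value:
        return 0.0
    total_spans = 0
    for blocks in blocks_by_value.values():
        ordered = sorted(blocks)
        # Each gap larger than one block starts a new span.
        total_spans += 1 + sum(1 for a, b in zip(ordered, ordered[1:]) if b - a > 1)
    return total_spans / len(blocks_by_value)

# 'a' occupies blocks {1, 2} and {10}: two spans; 'b' occupies one span.
samples = [('a', 1), ('a', 2), ('a', 10), ('b', 5)]
print(avg_spans_per_value(samples))  # (2 + 1) / 2 = 1.5
```

Combined with the distinct-blocks metric above, this would distinguish a value scattered over 10 isolated pages (10 spans: mostly random I/O) from one stored on 10 contiguous pages (1 span: one sequential read).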
Currently, statistics are only collected during an "ANALYZE". Why aren't statistics collected during actual query runs, such as seq scans? One could turn such a beast off in order to get repeatable, deterministic optimizer results.
-M