Mark, Stephen, etc:

> > I can see your point, however I wonder if the issue is that the default
> > stats settings of '10' (3000 rows, 10 histogram buckets) is too low, and
> > maybe we should consider making a higher value (say '100') the default.
>
> Personally, I think that'd be reasonable.

I don't really think that we want to increase sampling for *all* columns.
Each column that doesn't get queried but has increased stats adds to the
size of the pg_stats table and the amount of vacuuming it needs, which slows
down stats lookups.  This is especially true when very large text and BYTEA
columns are involved.

Neil and I talked about adding a separate GUC, default_index_stats_target,
for 8.0 that would dramatically increase the stats sampling for indexed
columns only (say, to 250).  Unfortunately, I was unable to come up with a
good test case for choosing this value.
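
Until something like that exists, the same effect can be approximated by
hand with the per-column ALTER TABLE ... ALTER COLUMN ... SET STATISTICS
command.  What follows is only a rough sketch in Python of generating those
statements for a list of indexed columns.  The function names and the
table/column names in the usage note are made up for illustration, and 250
is just the figure floated above, not a tested recommendation.

```python
def quote_ident(name: str) -> str:
    """Quote an SQL identifier, doubling any embedded double quotes."""
    return '"' + name.replace('"', '""') + '"'


def index_stats_statements(indexed_columns, target=250):
    """Build one ALTER TABLE statement per (table, column) pair.

    indexed_columns: iterable of (table_name, column_name) tuples that the
    caller has already determined to be indexed (assumption: this sketch
    does not look them up in the catalogs itself).
    target: the per-column statistics target to apply.
    """
    statements = []
    for table, column in indexed_columns:
        statements.append(
            f"ALTER TABLE {quote_ident(table)} "
            f"ALTER COLUMN {quote_ident(column)} "
            f"SET STATISTICS {int(target)};"
        )
    return statements
```

For example, index_stats_statements([("orders", "customer_id")]) yields a
single statement raising the target on that one column.  The new target
only takes effect the next time the table is ANALYZEd.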

Perhaps the Tiger database would allow us to set up a good test case.  Does 
anyone have a well-indexed PG Tiger I could play with?

-- 
--Josh

Josh Berkus
Aglio Database Solutions
San Francisco
