Re: [PERFORM] TPC-H Scaling Factors X PostgreSQL Cluster Command

Heikki Linnakangas Tue, 24 Apr 2007 01:55:12 -0700

Greg Smith wrote:

On Sat, 21 Apr 2007, Nelson Kotowski wrote:
I identified that the cluster command over the lineitem table (clusteridx_lineitem on lineitem) is the responsible. I got to this conclusionbecause when i run it in the 1GB and 2GB database i am able tocomplete this script in 10 and 30 minutes each. But when i run thiscommand over the 5GB database, it simply seems as it won't end.
Have you looked in the database log files for messages? Unless youchanged some other parameters from the defaults that you didn't mention,I'd expect you've got a constant series of "checkpoint occuring toofrequently" errors in there, which would be a huge slowdown on yourindex rebuild. Slowdowns from checkpoints would get worse with anincrease of shared_buffers, as you report.

Index builds don't write WAL, unless archive_command has been set. Ahigher shared_buffers setting can hurt index build performance, but fora different reason: the memory spent on shared_buffers can't be used forsorting and caching the sort tapes.

The default setting for checkpoint_segments of 3 is extremely low foreven a 1GB database. Try increasing that to 30, restart the server, andrebuild the index to see how much the 1GB case speeds up. If it'ssignificantly faster (it should be), try the 5GB one again.


A good advice, but it's unlikely to make a difference at load time.

BTW: With CVS HEAD, if you create the table in the same transaction (orTRUNCATE) as you load the data, the COPY will skip writing WAL which cangive a nice speedup.


--
  Heikki Linnakangas
  EnterpriseDB   http://www.enterprisedb.com

---------------------------(end of broadcast)---------------------------
TIP 7: You can help support the PostgreSQL project by donating at

               http://www.postgresql.org/about/donate

Re: [PERFORM] TPC-H Scaling Factors X PostgreSQL Cluster Command

Reply via email to