Greg Wooledge ([EMAIL PROTECTED]) wrote:
> PostgreSQL 6.4.2, compiled and installed on AIX 4.3.2 (a PowerPC
> system).
> Everything was going fairly well until this morning.
It's even worse today. *None* of the updates succeeded this morning.
I could still run queries, but the data was unchanged from yesterday's.
Here is part of the ps output:
UID        PID  PPID  C    STIME   TTY  TIME CMD
postgres  4278 39506  0 04:16:24 pts/7  0:06 /usr/local/pgsql/bin/postgres daemon
    WWW.XXX.YYY.ZZZ sysinfo2 DELETE waiting
postgres  5730 39506  0 04:13:56 pts/7  0:05 /usr/local/pgsql/bin/postgres daemon
    WWW.XXX.YYY.ZZZ sysinfo2 DELETE waiting
postgres  7738 39506  0 04:19:26 pts/7  0:04 /usr/local/pgsql/bin/postgres daemon
    WWW.XXX.YYY.ZZZ sysinfo2 DELETE waiting
postgres 19186 39506 60 04:10:02 pts/7  2:52 /usr/local/pgsql/bin/postgres daemon
    WWW.XXX.YYY.ZZZ sysinfo2 INSERT
postgres 24946 39506  0 04:10:44 pts/7  0:05 /usr/local/pgsql/bin/postgres daemon
    WWW.XXX.YYY.ZZZ sysinfo2 DELETE waiting
postgres 25442 39506  0 04:11:34 pts/7  0:05 /usr/local/pgsql/bin/postgres daemon
    WWW.XXX.YYY.ZZZ sysinfo2 DELETE waiting
[...]
There were 26 postgres-owned processes running. (I killed two of them,
but each time a new runaway process stepped up to take over the CPU from
the previous one. As you can see above, process 19186 was the runaway
at that point.)
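
For anyone who wants to summarize a listing like the one above instead
of reading it by eye, here is a rough Python sketch. It assumes each
backend takes two lines (the ps columns, then "host database COMMAND
[waiting]") and that the fourth ps column is the CPU figure, as in my
output. The function name and the sample text are made up for
illustration; adjust the parsing if your ps prints things differently.

```python
from collections import Counter

# Abbreviated sample in the same shape as the listing in this message.
SAMPLE = """\
UID PID PPID C STIME TTY TIME CMD
postgres 4278 39506 0 04:16:24 pts/7 0:06 /usr/local/pgsql/bin/postgres daemon
WWW.XXX.YYY.ZZZ sysinfo2 DELETE waiting
postgres 5730 39506 0 04:13:56 pts/7 0:05 /usr/local/pgsql/bin/postgres daemon
WWW.XXX.YYY.ZZZ sysinfo2 DELETE waiting
postgres 19186 39506 60 04:10:02 pts/7 2:52 /usr/local/pgsql/bin/postgres daemon
WWW.XXX.YYY.ZZZ sysinfo2 INSERT
"""


def summarize_backends(ps_text):
    """Tally backend states and find the backend with the highest C value.

    Hypothetical helper: assumes the two-line-per-process layout above.
    Returns (Counter of status strings, (cpu, pid) of the busiest backend).
    """
    lines = [ln.strip() for ln in ps_text.splitlines() if ln.strip()]
    counts = Counter()
    busiest = None
    i = 0
    while i < len(lines) - 1:
        head = lines[i].split()
        is_backend = (
            len(head) >= 4
            and head[0] == "postgres"
            and head[1].isdigit()
            and head[3].isdigit()
        )
        if not is_backend:
            i += 1  # header line or anything else we do not recognize
            continue
        pid, cpu = int(head[1]), int(head[3])
        # Continuation line: drop the host and database, keep the status.
        status = " ".join(lines[i + 1].split()[2:]) or "unknown"
        counts[status] += 1
        if busiest is None or cpu > busiest[0]:
            busiest = (cpu, pid)
        i += 2
    return counts, busiest


if __name__ == "__main__":
    counts, busiest = summarize_backends(SAMPLE)
    for status, n in counts.most_common():
        print(n, status)
    print("highest C value: pid", busiest[1], "at", busiest[0])
```

A pile of "DELETE waiting" backends behind a single busy INSERT, as in
my listing, is the pattern this is meant to make obvious.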
The errlog in ~postgres was timestamped 06:15, which is when my vacuum
runs for this database. The only error messages in it were from the
vacuum; no other process logged anything, not even after I killed the
two runaways.
There was a pg_vlock file from yesterday morning, when I had to kill
the postmaster to get anything to work. I removed that, but it did not
appear to make any difference. After removing that, and killing off
one or two more runaways, I finally stopped and restarted postmaster
again. I'm running a vacuum now, and hoping that it works. It's been
running for a while....
I'm seriously considering rewriting major hunks of this application so
that it doesn't need to use the PostgreSQL large objects. There are
currently 3462 files in data/base/sysinfo2, and that number is only
going to increase as this application continues to run, unless I do
away with the large objects.
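
To see how much of that file count is really due to large objects, a
small Python sketch like the following could tally the directory by
file name prefix. It rests on an assumption I have not verified against
the 6.4.2 source: that each large object is stored as a pair of files
whose names start with "xinv" (the data) and "xinx" (its index). The
function names and the example path are only illustrative.

```python
import os
from collections import Counter

# Assumed large-object file name prefixes; change these if your
# installation names the files differently.
LO_PREFIXES = ("xinv", "xinx")


def tally_names(names, prefixes=LO_PREFIXES):
    """Count names by matching prefix; everything else goes under 'other'."""
    counts = Counter()
    for name in names:
        for prefix in prefixes:
            if name.startswith(prefix):
                counts[prefix] += 1
                break
        else:
            counts["other"] += 1
    return counts


def tally_directory(path, prefixes=LO_PREFIXES):
    """Apply tally_names to the entries of one database directory."""
    return tally_names(os.listdir(path), prefixes)


if __name__ == "__main__":
    # Hypothetical location; point this at your own data/base/<dbname>.
    db_dir = "/usr/local/pgsql/data/base/sysinfo2"
    if os.path.isdir(db_dir):
        for key, n in sorted(tally_directory(db_dir).items()):
            print(key, n)
```

If nearly all of the files fall under the two prefixes, the growth is
coming from the large objects and not from ordinary tables and indexes.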
But obviously I'm going to have to do *something*. It's not usable
right now.
Does the 6.5 beta version fix any known problems with large objects, or
with reliability under high loads of simultaneous DELETE/INSERT/UPDATE
statements? Or should I stick with 6.4.2?
Are there any known reliability issues with AIX 4.3.2?