On 6/8/13 5:17 PM, Jeff Janes wrote:
> But my gut feeling is that if autovacuum is trying to read faster than the hardware will support, it will just automatically get throttled, by inherent IO waits, at a level which can be comfortably supported. And this will cause minimal interference with other processes.
If this were true all the time, autovacuum tuning would be a lot easier. You can easily make a whole server unresponsive by letting loose one rogue process doing a lot of reads. Right now this isn't a problem for autovacuum, because a single process reading at 7.8MB/s is usually not a big deal; it doesn't take much in the way of read-ahead logic and throughput to satisfy that. But I've seen people try to push the read rate upward, and they didn't get very far beyond that before vacuum became way too obtrusive.
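For reference, that 7.8MB/s figure falls out of the cost-based delay defaults of that era (vacuum_cost_limit = 200, vacuum_cost_page_miss = 10, autovacuum_vacuum_cost_delay = 20ms, 8kB pages). A quick sketch of the arithmetic, assuming the worst case where every page read is charged as a page miss:

```python
# Worked arithmetic for the default autovacuum read throttle.
# Assumes every page read is uncached, so each one is charged
# vacuum_cost_page_miss -- the worst case the limiter targets.
def autovacuum_read_rate_mib(cost_limit=200, cost_page_miss=10,
                             delay_ms=20, page_kib=8):
    pages_per_round = cost_limit / cost_page_miss  # 20 pages, then sleep
    rounds_per_sec = 1000 / delay_ms               # 50 sleep cycles per second
    return pages_per_round * rounds_per_sec * page_kib / 1024

print(autovacuum_read_rate_mib())  # -> 7.8125, i.e. the ~7.8MB/s above
```

Bumping vacuum_cost_limit or shrinking the delay scales this rate linearly, which is why "2 or 3X" translates directly into parameter changes.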
I could collect some data from troubled servers to see how high I can push the read rate before they suffer. Maybe there's a case for increasing the default read rate, on the grounds that the write limit is a good enough secondary throttle. I'd be surprised if we could get away with more than a 2 or 3X increase though, and the idea of going unlimited is really scary. It took me a year of before/after data collection before I was confident that it's OK to run unrestricted in all cache hit situations.
> Why is there so much random IO? Do your systems have autovacuum_vacuum_scale_factor set far below the default? Unless they do, most of the IO (both read and write) should be sequential.
Insert one process doing sequential reads into a stream of other activity and you can easily get random I/O against the disks out of the mix. You don't necessarily need the other activity to be random to get that: N sequential readers eventually act like N random readers, for high enough values of N. On busy servers, autovacuum is normally competing against multiple random I/O processes anyway.
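That "N sequential streams become random" effect is easy to see with a toy model (my own illustration, not a measurement): track a disk head position while a scheduler round-robins between readers that are each perfectly sequential on their own:

```python
# Toy model: each reader scans its own contiguous region of the disk,
# but a round-robin scheduler interleaves their requests. Count head
# movements -- any access that isn't the block right after the last one.
def count_seeks(n_readers, blocks_per_reader=100, region_size=1_000_000):
    head = None
    seeks = 0
    for block in range(blocks_per_reader):
        for r in range(n_readers):
            addr = r * region_size + block  # sequential within each reader
            if head is not None and addr != head + 1:
                seeks += 1
            head = addr
    return seeks

print(count_seeks(1))  # 0   -- a lone reader stays fully sequential
print(count_seeks(8))  # 799 -- nearly every access is now a seek
```

Real kernels mitigate this with per-stream read-ahead, so the breakdown happens at higher N than this worst-case model suggests, but the qualitative behavior is the same.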
Also, the database's theoretical model that block number correlates directly with location on disk can break down. I haven't measured it directly, but systems with vacuum problems seem more likely to show noticeable filesystem-level fragmentation. I've been thinking about collecting data from a few systems with filefrag to see if I'm right about that.
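If anyone wants to check their own systems, something like this is what I have in mind (filefrag comes from e2fsprogs; the data directory path here is just an example, adjust it to your PGDATA):

```shell
# List extent counts for the larger relation files; heavily vacuumed
# tables tend to accumulate extents over time. Example path only.
find /var/lib/postgresql/9.2/main/base -type f -size +100M \
  -exec filefrag {} \; | sort -t: -k2 -rn | head -20
```

A freshly loaded table is typically a handful of extents per 1GB segment; hundreds or thousands of extents per segment would support the fragmentation theory.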
--
Greg Smith   2ndQuadrant US   g...@2ndquadrant.com   Baltimore, MD
PostgreSQL Training, Services, and 24x7 Support   www.2ndQuadrant.com

--
Sent via pgsql-hackers mailing list
To make changes to your subscription: http://www.postgresql.org/mailpref/pgsql-hackers