about 2300 idle connections
(ps auxwww | grep postgres | grep idle)

        [...]

The server that connects to the db is an Apache server using persistent
connections. MaxClients is 2048, hence the high number of connections
needed. The application was written in PHP using the PEAR DB class.

        This is pretty classical.
When your number of threads gets out of control, everything gets slower, so more requests pile up, spawning more threads. This is positive feedback, and in seconds all hell breaks loose. That's why I call it imploding, as if the server collapses under its own weight. There is a threshold effect: it goes from working fine to a crawl rather quickly once you pass the threshold, as you experienced.

Note that the same applies to Apache and PHP as well as Postgres: there is a "sweet spot" in the number of threads for optimum efficiency, depending on how many cores you have. Too few threads and the CPU sits waiting for IO or for the database; too many threads and CPU cache utilization becomes suboptimal while context switches eat your performance.

This sweet spot is certainly not at 500 connections per core, either for Postgres or for PHP. It is much lower, about 5-20 depending on your load.

I will copypaste here an email I wrote to another person with the exact same problem, and the exact same solution.
        Please read this carefully:

*********************************************************************

Basically there are three classes of websites in my book:
1- Low traffic (i.e. a few hits/s on dynamic pages), where performance doesn't matter.
2- High traffic (i.e. 10-100 hits/s on dynamic pages), where you must read the rest of this email.
3- Monster traffic (lots more than that), where you need to give some of your cash to Akamai, get some load balancers, replicate your databases, use lots of caching, etc. This is Yahoo, Flickr, Meetic, etc.

Usually people whose websites are imploding under load think they are in class 3, but really most of them are in class 2, just using inadequate technical solutions like MySQL, etc. I had a website with 200K members that ran perfectly fine on a Celeron 1200 with 512 MB RAM, and lighttpd wasn't even visible in top.

The good news is that the solution to your problem is pretty easy. You should be able to solve it in about 4 hours.

Suppose you have some web servers for static content; obviously you are using lighttpd for that, since it can service an "unlimited" number of concurrent connections (up to the OS limit, something like 64K sockets). You could also use nginx or Zeus. I think Akamai uses Zeus. But lighttpd is perfectly fine (and free). For your static content servers you will want lots of RAM for caching. If you serve images, put the small files (thumbnails, CSS, JavaScript, HTML pages) on a separate server so that they are all served from RAM, and use a cheap CPU, since a Pentium-M with lighttpd will happily push 10K HTTP hits/s if it doesn't have to wait for IO. Large files should go on the second static server, to avoid cache thrashing on the server that holds all the frequently accessed small files.

Then you have some web servers for generating your dynamic content. Let's suppose you have N CPU cores total. With your N cores, the ideal number of threads would be N. However, those threads will also wait for IO and database operations, and you want to fill those wait times with useful work, so maybe you will use something like 2-10 threads per core. This can only be determined by experimentation; it depends on the type and length of your SQL queries, so there is no "one size fits all" answer.

Example. You have pages that take 20 ms to generate, and you have 100 requests for those coming up. Let's suppose you have one CPU core.

(Note: if your pages take longer than 10 ms to generate, you have a problem. On the previously mentioned website, now running on the cheapest Core 2 we could find since the torrent tracker eats lots of CPU, pages take about 2-5 ms to generate, even the forum pages with 30 posts on them. We use PHP with compiled code caching, and the SQL is properly optimized.) And, yes, it uses MySQL. Once I wrote (as an experiment) an extremely simple forum which did 1400 pages/second (which is huge) with a desktop Core 2 as the Postgres 8.2 server.

- You could use Apache in the old-fashioned way and have 100 threads, so all your pages will take 20 ms x 100 = 2 seconds. But the CPU cache utilization will suck because of all those context switches, you'll have 100 processes eating your RAM (count 8 MB for a PHP process), 100 database connections, 100 postgres processes, the locks will stay on longer, transactions will last longer, you'll get more dead rows to vacuum, etc. And actually, since Apache will not buffer the output of your scripts, the PHP or Perl interpreter will stay in memory (and hog a database connection) until the client at the other end of the internets has loaded all the data. If the guy has DSL, this can take 0.5 seconds; if he has 56K, much longer. So you are likely to get many more than 100 processes in your Apache, perhaps 150 or perhaps even 1000 if you are out of luck. In this case the site usually implodes.

- You could have a lighttpd or squid proxy handling the client connections, then funnelling them to a few threads generating the webpages. Then you don't care anymore about the slowness of the clients, because they are not hogging threads anymore. If you have 4 threads, your requests will be processed in order, first come first served, about 20 ms x 4 = 80 ms each on average; the CPU cache will work better since you'll get much less context switching, RAM will not fill up, and Postgres will be happy.

So, the front-end proxy would have a number of max connections, say 200,

    Number of connections to clients => don't set any values, sockets are free in lighttpd.
    Number of connections to PHP/fastcgi or apache/mod_perl backends => number of cores x 2 to 5, adjust to taste.

and it would connect to another httpd/mod_perl server behind with a lower number of connections, say 20. If the backend httpd server was busy, the proxy connection to it would just wait in a queue until it was available.

    Yes, it waits in a queue.

Is that the kind of design you had in mind?

    Yes.
    The two key points are:
    * Perl/PHP processes and their heavy resources (database connections, RAM) are used only when they have work to do, not while waiting for the client.
    * The proxy must work this way:
    1- get and buffer request data from client (slow, up to 500 ms, up to 2000 ms if user has emule or other crap hogging his upload)
    2- send request to backend (fast, on your LAN, < 1 ms)
    3- backend generates HTML and sends it to proxy (fast, LAN), proxy buffers data
    4- backend is now free to process another request
    5- proxy sends buffered data to client (slow, up to 100-3000 ms)
    The slow parts (points 1 and 5) do not hog a perl/PHP backend.

Do not use a transparent proxy! The proxy must buffer requests and data for this to work, and backends must never wait for the client. Lighttpd will buffer everything; I believe Apache can be configured to do so too, but I prefer lighttpd for proxying, as it is faster and the queuing works better.

Also, if you can use FastCGI, use it. I have never used mod_perl, but with mod_php you pay a fixed startup cost every time a PHP interpreter starts. With fastcgi, a number of PHP interpreter threads are spawned at startup, so they are always ready and the startup cost is much smaller. You can serve a small AJAX request with 1-2 database queries in less than 1 ms if you are careful with your code (like no heavyweight session initialization on each page, using mmcache to avoid reparsing the PHP every time, etc).
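
For illustration, here is a minimal sketch of such a lightweight endpoint: one persistent connection, one parameterized query, nothing else on the hot path. The connection string, table and column names are made-up placeholders, not something from your application:

<?php
// Minimal AJAX endpoint sketch: one persistent connection, one query,
// no session or framework initialization on the hot path.
// Connection string, table and column names are hypothetical placeholders.
$db = pg_pconnect('host=127.0.0.1 dbname=app user=web')
    or die('db unavailable');

$id = isset($_GET['id']) ? (int) $_GET['id'] : 0;   // cast to int as cheap sanitization

$res = pg_query_params($db, 'SELECT title, hits FROM items WHERE id = $1', array($id));
$row = pg_fetch_assoc($res);

header('Content-Type: application/json');           // json_encode() needs PHP 5.2+
echo json_encode($row ? $row : array('error' => 'not found'));
?>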

If you have several backend servers generating webpages, use sticky sessions and put the session storage on the backends themselves. If you use files, use ReiserFS, not ext3, which sucks when you have a large number of session files in the same directory. Or use memcached, whatever, but don't put sessions in the database: that gives you a nice tight bottleneck when adding servers. If each and every one of your pages runs an UPDATE query on the sessions table, you have a problem.
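
As an illustration, a sketch of the two session setups mentioned above. The paths and the memcached address are placeholders, and the memcached variant assumes the pecl "memcache" extension (which provides a session save handler) is installed:

<?php
// Option A: local file-based sessions on each backend (requires sticky sessions).
// The path is a placeholder; put it on a filesystem that copes well with
// many small files in one directory.
ini_set('session.save_handler', 'files');
session_save_path('/var/lib/php/sessions');

// Option B: memcached-backed sessions (pecl "memcache" extension assumed).
// The address below is a placeholder for your memcached instance.
// ini_set('session.save_handler', 'memcache');
// ini_set('session.save_path', 'tcp://127.0.0.1:11211');

session_start();
$_SESSION['user_id'] = 42;   // example value
?>

Either way, no page issues an UPDATE against a sessions table in the database.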

As for why I like lighttpd, I am fond of the asynchronous select/poll model for a webserver which needs to handle lots of concurrent connections. When you have 50 open sockets threads are perfectly fine, when you have 1000 a threaded server will implode. I wrote a bittorrent tracker in Python using an asynchronous select/poll model ; it has been handling about 150-400 HTTP hits per second for two years now, it has about 100-200 concurrent opened sockets 24 hours a day, and the average lifetime of a socket connection is 600 ms. There are 3 threads (webserver, backend, deferred database operations) with some queues in between for the plumbing. Serving an /announce HTTP request takes 350 microseconds of CPU time. All using a purely interpreted language, lol. It uses half a core on the Core 2 and about 40 MB of RAM.
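
To make the select/poll idea concrete, here is a tiny single-threaded sketch in PHP (illustrative only; the tracker above is Python, and the port is a placeholder). One call to stream_select() tells us which sockets are ready, so no thread ever sits parked on a slow client:

<?php
// Minimal select/poll event loop: one process, many sockets, nothing blocks.
$server = stream_socket_server('tcp://0.0.0.0:8080', $errno, $errstr)
    or die("listen failed: $errstr\n");

$clients = array();

while (true) {
    // Watch the listening socket plus every connected client.
    $read   = $clients;
    $read[] = $server;
    $write  = null;
    $except = null;

    if (stream_select($read, $write, $except, 1) < 1) {
        continue;                       // nothing ready this second
    }

    foreach ($read as $sock) {
        if ($sock === $server) {
            // New connection: accept it and add it to the watch list.
            $clients[] = stream_socket_accept($server);
        } else {
            $data = fread($sock, 4096);
            if ($data === '' || $data === false) {
                // Client went away: forget the socket.
                unset($clients[array_search($sock, $clients, true)]);
                fclose($sock);
            } else {
                // Echo back; a real server would parse the HTTP request here.
                fwrite($sock, $data);
            }
        }
    }
}
?>

The real thing adds request parsing and hands slow work (the deferred database operations) to other threads through queues, but the event loop is the whole trick.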

When lighttpd is overloaded (well, it's impossible to kill it with static files unless it waits for disk IO, but you can overload the fastcgi processes), requests are kicked out of the queue, so it will serve, for instance, only 50% of the requests. But an overloaded Apache will serve 0%, since you'll get 1000 threads, it'll swap, and everything will time out and crash.

********************************************************

End of copypaste.

        So:

- You need fewer Postgres connections, so Postgres can breathe and spend your CPU power on queries rather than on context switches and cache management.
- You need fewer PHP threads, which will have the same effect on your webserver.

        The way to do this is actually pretty simple.

- Frontend proxy (lighttpd), load balancer, whatever, sending static requests to static servers, and dynamic requests to dynamic servers. If the total size of your static files fits in the RAM of this server, make the static server and the proxy the same lighttpd instance.

- Backends for PHP: a number of servers running PHP/fastcgi, with no web servers on them at all; the lighttpd frontend can hit several PHP/fastcgi backends.

- Use PHP persistent connections (which now appear to work in the latest version, at least in fastcgi mode; I don't know about mod_php's persistent connections though); see the sketch below.
- Or use pgpool or pgbouncer or another connection pooler, but only if PHP's persistent connections do not work for you.
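
With the pg_* functions, persistent connections just mean swapping pg_connect() for pg_pconnect(); a minimal sketch, with placeholder connection parameters:

<?php
// pg_pconnect() reuses an idle connection with the same
// (host, port, dbname, user) combination inside this PHP process
// instead of opening a new one on every request.
// The connection string below is a placeholder.
$db = pg_pconnect('host=127.0.0.1 port=5432 dbname=app user=web');

$res = pg_query($db, 'SELECT now()');
list($now) = pg_fetch_row($res);
echo "connected, server time is $now\n";
?>

Each distinct combination of connection parameters gets its own persistent connection per process, which is exactly point 2 below.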

1:  Each apache / php process maintains its own connections, not
sharing with others.  So it's NOT connection pooling, but people tend
to think it is.

True with mod_php (and sad). With fastcgi you don't really care, since the PHP processes are few and active most of the time; no connection hogging takes place unless you use many different users to connect to Postgres, in which case you should switch to pgpool.

2:  Each unique connection creates another persistent connection for
an apache/php child process.  If you routinely connect to multiple
servers / databases or as > 1 user, then each one of those
combinations that is unique makes another persistent connection.

        True also for fastcgi, but if you don't do that, no problem.

3:  There's no facility in PHP to clean an old connection out and make
sure it's in some kind of consistent state when you get it.  It's in
exactly the same state it was when the previous php script finished
with it.  Half completed transactions, partial sql statements,
sequence functions like currval() may have values that don't apply to
you.

        Apparently now fixed.
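
If your PHP version does not do this cleanup on reuse, a defensive reset at the top of each script is cheap insurance. A sketch, with placeholder credentials; DISCARD ALL needs PostgreSQL 8.3 or later, and on older servers ROLLBACK plus RESET ALL is the closest equivalent:

<?php
// Defensive cleanup of a reused persistent connection (sketch).
$db = pg_pconnect('host=127.0.0.1 dbname=app user=web');   // placeholder credentials

if (pg_connection_status($db) !== PGSQL_CONNECTION_OK) {
    // Stale or broken handle: fall back to a fresh, non-persistent connection.
    $db = pg_connect('host=127.0.0.1 dbname=app user=web');
}

pg_query($db, 'ROLLBACK');      // abort any half-finished transaction (harmless otherwise)
pg_query($db, 'DISCARD ALL');   // clear GUCs, prepared statements, temp tables (8.3+)
?>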

4:  pg_close can't close a persistent connection.  Once it's open, it
stays open until the child process is harvested.

        Don't know about that.

5:  Apache, by default, is configured for 150 child processes.
Postgresql, and many other databases for that matter, are configured
for 100 or less.

        (and for good reason)

Even if apache only opens one connection to one
database with one user account, it will eventually try to open the
101st connection to postgresql and fail.  So, the default
configuration of apache / postgresql for number of connections is
unsafe for pconnect.

fastcgi makes this problem disappear by separating the concept of "client connection" from the concept of "web server thread". Not only will it make Postgres happier, your PHP processing will be much faster too.

6:  The reason for connection pooling is primarily twofold.  One is
to allow very fast connections to your database when doing lots of
small things where connection time will cost too much.  The other is
to prevent your database from having lots of stale / idle connections
that cause it to waste memory and to be slower since each backend
needs to communicate with every other backend some amount of data some
times.  pconnect takes care of the first problem, but exacerbates the
second.

        Moot point with fastcgi.
Unused PHP processes are removed in times of low traffic, along with their connections.

P.s. don't think I'm dogging PHP, cause I'm not.  I use it all the
time, and it's really great for simple small scripts that need to be
done NOW and need to be lightweight.  I even use pconnect a bit.  But
my machine is set for 50 or fewer apache children and 150 postgresql
connects, and I only use pconnect on small, lightweight things that
need to zoom.  Everything else gets regular old connect.

Very true for mod_php, wrong for fastcgi: you can get extreme performance with pconnect and a PHP code cache like Turck MMCache or eAccelerator, down to 1 ms per page.

Especially if you use PEAR, which is very bloated, you need a code cache to avoid reparsing it on every page. On the previously mentioned website it cut the page time from 50 ms to 2 ms on some pages, because there were a lot of includes.
