Some of the things I look at when things are slow in no particular order:

* Check top. Look at memory usage (ensure no swap usage), CPU usage as well
as deadlocked processes.
* Check disk space. Ensure all drives have enough free space.
* Check error logs.
* See which server or component is being slow. Is it just one server? Is it
static or dynamic pages?
* Check the MySQL process list (SHOW PROCESSLIST). Generally this is no
more than a couple of items and any more can indicate an issue.
* Check hit rates in all caches. Is the cache getting filled up too quickly
resulting in a low hit rate and high refresh rate?
* If there's intermittent issues that are hard to track I would manually
benchmark services (ex: ApacheBench) to try spot the issue and the cause.
* See if there's any pattern to when the slow downs occur. See if they
coincide with any cron scripts (ex: Lucene index updates).

Note that most of the above items should be applied to each individual
server to try and narrow down the source. Since you appear to have 3
"content" servers (2 squids and one content) try directly accessing the
wiki from each one too see if that narrows down the issue.


On 20 April 2013 15:29, David Gerard <[email protected]> wrote:

> rationalwiki.org is currently serving pages very slowly. It's
> intermittent, but when it's slow it's a *slug*. Many users are getting
> 502 errors from Apache or 503 from the Squids.
>
> We have one Linode doing Apache/MySQL/Lucene. It's an 8GB box with 8
> cores. (Was 4GB/4 cores, but Linode just doubled everyone's server.)
> In front of that are two Squids fed by a load balancer.
>
> * Sometimes the cause is obvious: when the load average is 30 and top
> shows a pile of Apaches using up CPU, then it's PHP handling a complex
> page request. (No, I still haven't made it PHP via fcgid.)
> * Sometimes it isn't, e.g. this afternoon when the site was running
> like a slug and load average was 0.8 with nothing amiss in top.
> * The squids don't show an unusual rate of hits on the site.
> * We have plenty of memory free - about 4GB on the main box is just
> sitting in file cache.
> * php_errors.log only shows up some processes timing out their 30
> seconds (which would be the 502s).
>
> So where would I start looking to work out what's going on?
>
>
> - d.
>
> _______________________________________________
> MediaWiki-l mailing list
> [email protected]
> https://lists.wikimedia.org/mailman/listinfo/mediawiki-l
>



-- 
Dave Humphrey -- [email protected]
Founder/Server Admin of the Unofficial Elder Scrolls Pages -- www.uesp.net
_______________________________________________
MediaWiki-l mailing list
[email protected]
https://lists.wikimedia.org/mailman/listinfo/mediawiki-l

Reply via email to