(I sent this to -lpsg and -steer in a single message last night but it doesn't appear to have made it to steer, so here it is again. Sorry if this winds up being a dupe.)

Per our discussion at LPSG, I've had some fun with rewrite rules and seem to have gotten the load on the server down significantly.

During the meeting I added a quick Webalizer report to our web server, which can be found here:
http://sparkplug.kernel-panic.org/stats/

That report lets us see how the majority of traffic is reaching us. The answer is Google searches, and if you're easily offended I suggest you skip the section of the report showing what people are searching to find us. :)

Although referrals from Google.com are top on the list, referrals from other foreign Google.<country> sites make up quite a chunk. I've added rules to make Apache deny anyone coming to us via those foreign Googles. My list is not complete, but I used the top 20 list on Webalizer to guide me. If someone has a text processing itch and wants to parse the complete list of official Google sites (http:// www.google.com/supported_domains) into the following format:

RewriteCond %{HTTP_REFERER} ^http://www.google.pl [OR]
RewriteCond %{HTTP_REFERER} ^http://www.google.com.au [OR]

I'd be more than happy to paste the entire list into our config.

I also made explicit rules that deny visits to some of the high traffic Member sites, such as 'allaboutx'.

With these rules in place, I've been able to re-enable the Member sites for everyone else and the server load is staying low.

Tracy, you may not have to do the CacheFu/Squid thing, but you're more than welcome to if you want. One other thought - our Apache server already has mod_proxy installed, giving it similar functionality to Squid... is there a way to integrate that with CacheFu instead? That would save the installation of yet another daemon that takes memory.

That's all for now, heading to bed. I'll get into the Plone specific problems later.

--
Joshua Penix                                http://www.binarytribe.com
Binary Tribe           Linux Integration Services & Network Consulting


--
[email protected]
http://www.kernel-panic.org/cgi-bin/mailman/listinfo/kplug-steer

Reply via email to