Hey all,

Every once in a while I'll notice in my logs that someone comes to one of my 
sites and hits thousands of pages in a short span and then leaves.  This annoys 
me for a few reasons:

(a) It's an unnecessary tax on my server (and we all hate taxes)

(b) It artificially inflates my page hits

(c) What the hell are they doing? Scraping my pages and hosting them on some 
site? The current offending IP reverses to China.


So, to avoid this, I'm considering the following:

o Add a session variable that stores the last page view time down to the 
second.  However, this may not work as the offending behavior is probably 
generated by a bot or desktop app that doesn't store session variables.

o Review my database logs for the current IP's last twenty page views.  This 
may put a small extra hit on the server, but overall not as much as an extra 
2200 page views in an hour every couple of weeks.

If the requesting IP has requested more than twenty pages from the website in 
the current minute, I block the IP for a period of time, say, an hour or two.
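
To make the rule concrete, here is a rough sketch of the logic in Python 
rather than CFML, just to show the shape of it. Everything in it is an 
assumption for illustration: the IpThrottle class and its allow() method are 
made-up names, the hit history lives in memory rather than in my database 
logs, "the current minute" is treated as a sliding 60-second window, and the 
block is set to one hour.

```python
import time
from collections import defaultdict, deque

MAX_HITS = 20         # pages allowed per window (the "twenty pages" above)
WINDOW_SECONDS = 60   # "the current minute", assumed to be a sliding window
BLOCK_SECONDS = 3600  # "an hour or two"; one hour assumed here


class IpThrottle:
    """Hypothetical in-memory throttle keyed by requesting IP."""

    def __init__(self, max_hits=MAX_HITS, window=WINDOW_SECONDS,
                 block=BLOCK_SECONDS):
        self.max_hits = max_hits
        self.window = window
        self.block = block
        self.hits = defaultdict(deque)  # ip -> timestamps of recent requests
        self.blocked_until = {}         # ip -> time at which the block expires

    def allow(self, ip, now=None):
        """Return True to serve the request, False if the IP is blocked."""
        now = time.time() if now is None else now

        # Refuse immediately while an earlier block is still in force.
        expires = self.blocked_until.get(ip)
        if expires is not None:
            if now < expires:
                return False
            del self.blocked_until[ip]

        # Forget page views that have fallen out of the window.
        recent = self.hits[ip]
        while recent and now - recent[0] >= self.window:
            recent.popleft()

        # Record this view, then block if the IP is over the limit.
        recent.append(now)
        if len(recent) > self.max_hits:
            self.blocked_until[ip] = now + self.block
            recent.clear()
            return False
        return True
```

Each page request would call allow() with the remote IP and return an error 
page when it comes back False. The same check could run against the logged 
page views instead of a dictionary; the in-memory version just avoids the 
extra query on every request.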

I have about a dozen sites running the same software, each with multiple 
thousands of pages (community sites, with 21,000 messages and 


I've posed this question a couple of times before on this list and it hasn't 
prompted any response.  I will try again, hoping someone will either tell me 
I'm worrying too much, or that this is a smart idea.

Thanks,

Mik



--------
Michael Muller
office (413) 863-6455
cell (413) 320-5336
skype: michaelBmuller
http://MontagueWebWorks.com

Information is not knowledge
Knowledge is not wisdom

Eschew Obfuscation


Archive: 
http://www.houseoffusion.com/groups/cf-talk/message.cfm/messageid:327267