>>I came across an idea of using HTTP_USER_AGENT to identify a bot
 >>... assuming it gets in despite the robots.txt file.

Not a very good idea. HTTP_USER_AGENT will help you identify ONLY the good
bots, the ones that comply with the robots.txt file anyway.

Any bad bot with ill intent will be smart enough to forge a browser
user agent so that you will not identify it as a bot.

Only behavior analysis can detect bots with acceptable accuracy.
Signals worth checking:
- the rate of HTTP requests
- whether the client fetches images (most good bots don't need to)
- whether the client executes JavaScript
- etc.
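
As a rough illustration of combining the signals above, here is a minimal
Python sketch. Everything in it is an assumption made for the example: the
ClientStats fields, the bot_score function, and the 2 requests/second
threshold are hypothetical, and the counters would have to come from your own
access logs (for instance, a JavaScript-triggered beacon request to tell
whether scripts ran). Treat it as a starting point, not a tuned detector.

```python
from dataclasses import dataclass


@dataclass
class ClientStats:
    """Hypothetical per-client counters gathered over one time window."""
    requests: int = 0              # total HTTP requests seen in the window
    window_seconds: float = 60.0   # length of the observation window
    image_requests: int = 0        # how many of those requests were for images
    ran_javascript: bool = False   # True if a JS-triggered beacon was received


def bot_score(stats: ClientStats, max_rate: float = 2.0) -> int:
    """Return a score from 0 to 3; higher means more bot-like.

    The threshold is an illustrative guess, not a measured value.
    """
    score = 0
    rate = stats.requests / stats.window_seconds
    if rate > max_rate:
        score += 1                 # requesting pages faster than a person would
    if stats.requests > 0 and stats.image_requests == 0:
        score += 1                 # fetched pages but never any images
    if not stats.ran_javascript:
        score += 1                 # never executed the page's JavaScript
    return score


# Example: a typical browser session versus an aggressive crawler.
browser = ClientStats(requests=30, image_requests=12, ran_javascript=True)
crawler = ClientStats(requests=600, image_requests=0, ran_javascript=False)
print(bot_score(browser), bot_score(crawler))
```

No single signal is conclusive (a text-only browser also skips images and
JavaScript), which is why the sketch adds them up rather than blocking on
any one of them.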
