Jens-Uwe Mager wrote:

> I have for the first time encountered the problem that some braindead
> web robot (ExtractorPro) attempted to download all of the site and
> appended some random URL segment at the end of an embedded 
> perl page. I
> use suffix .phtml for these pages, and the url looked like
> <http://mysite//page.phtml/randomotherurl>. The innocent embperl page
> delivered some contents with relative urls and the robot continued to
> fetch the same page with various URL suffixes, causing a loop 
> and doing
> the equivalent of an Apache bench remotely.
> 
> What is the best way to stop these kinds of mishaps? And what the heck
> is this ExtractorPro thing?
> 
> What is the best way to stop these kinds of mishaps? And what the heck
> is this ExtractorPro thing?

It's spamware designed (or ill-designed, as the case may be) for
scraping e-mail addresses from web pages to be added to some future
"100 ZILLION EMAIL ADDRESSES FOR $99!!!" list.  You will want to block
it and its ilk from your site if at all possible.  (Though it sounds
like it's doing an ok job of blocking itself... :-)

Regards,
        Charlie

-- 
|o|/\/\/\/\/\/\/\/\/\/\/\/\/\/\/\|o|
|o|      Charlie Wilkinson       |o|
|o|  TRIS Development SysAdmin   |o|
|o|  [EMAIL PROTECTED] (w)   |o|
|o| [EMAIL PROTECTED] (h) |o|
|o|/\/\/\/\/\/\/\/\/\/\/\/\/\/\/\|o|

Reply via email to