Jens-Uwe Mager wrote:
> I have for the first time encountered the problem that some braindead
> web robot (ExtractorPro) attempted to download all of the site and
> appended some random URL segment at the end of an embedded
> perl page. I
> use suffix .phtml for these pages, and the url looked like
> <http://mysite//page.phtml/randomotherurl>. The innocent embperl page
> delivered some contents with relative urls and the robot continued to
> fetch the same page with various URL suffixes, causing a loop
> and doing
> the equivalent of an Apache bench remotely.
>
> What is the best way to stop these kinds of mishaps? And what the heck
> is this ExtractorPro thing?
>
> What is the best way to stop these kinds of mishaps? And what the heck
> is this ExtractorPro thing?
It's spamware designed (or ill-designed, as the case may be) for
scraping e-mail addresses from web pages to be added to some future
"100 ZILLION EMAIL ADDRESSES FOR $99!!!" list. You will want to block
it and its ilk from your site if at all possible. (Though it sounds
like it's doing an ok job of blocking itself... :-)
Regards,
Charlie
--
|o|/\/\/\/\/\/\/\/\/\/\/\/\/\/\/\|o|
|o| Charlie Wilkinson |o|
|o| TRIS Development SysAdmin |o|
|o| [EMAIL PROTECTED] (w) |o|
|o| [EMAIL PROTECTED] (h) |o|
|o|/\/\/\/\/\/\/\/\/\/\/\/\/\/\/\|o|