I've noticed the same. It went away after I added "script" to
parser.html.outlinks.ignore_tags. This looks like a bug to me.

-- Stefan


> From: Euan Clark <[EMAIL PROTECTED]>
> Organization: nzs.com
> Reply-To: <[email protected]>
> Date: Mon, 28 Apr 2008 10:40:23 +1200
> To: <[email protected]>
> Subject: On-page javascript treated as relative link
> 
> Hi All,
> 
> Webmasters for some of the sites we are crawling are reporting high
> numbers of requests resulting in 404s
> 
> It appears that <script type="text/javascript"> is being treated as a
> relative link to the current page, which is causing the bot to generate
> hundreds of page not found errors.
> 
> E.g. on http://www.mysite.com/page1/
> 
> If there is a <script type="text/javascript"> it will try and access to url
> 
> http://www.mysite.com/page1/text/javascript
> 
> 
> Which of course never resolves to a valid url.
> 
> 
> Cheers,
> 
> 
> Euan Clark


Reply via email to