On 11/12/2001 15:09:25 hniksic wrote:

>Summer Breeze <[EMAIL PROTECTED]> writes:
>
>> I want to know if Wget is a program similar to Mozilla, and if so is
>> there any way to make my pages available to Wget? I use Netscape to
>> create my web pages.
>
>Wget is a command-line downloading utility; it allows you to download
>a page or a part of the site without further user interaction.
>
>> Here is a sample entry:
>>
>> 66.28.29.44 - - [08/Dec/2001:18:21:20 -0500] "GET /index4.html%0A
>> HTTP/1.0" 403 280 "-" "Wget/1.6"
>
>"/index4.html%0A" looks like a page is trying to link to /index4.html,
>but the link contains a trailing newline.

That IP address is assigned to Road Runner (a big cable ISP, I think).

Is /index4.html%0A the *first* error line in the log from 66.28.29.44?
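
One way to check this is to pull every request from that address out of the
access log, in order. The following is only a rough sketch: it assumes the
log is in the common/combined format shown in the sample entry, the function
name requests_from is made up for illustration, and the first sample line
(from 10.0.0.1) is invented filler rather than anything from the real log.

```python
import re

# Matches the start of a common/combined-format access log line:
# client address, two ignored fields, [timestamp], "METHOD PATH ...", status.
LOG_RE = re.compile(r'^(\S+) \S+ \S+ \[([^\]]+)\] "(\S+) (\S+)[^"]*" (\d{3})')

def requests_from(lines, ip):
    """Return (timestamp, path, status) for each request made by `ip`, in log order."""
    hits = []
    for line in lines:
        m = LOG_RE.match(line)
        if m and m.group(1) == ip:
            hits.append((m.group(2), m.group(4), int(m.group(5))))
    return hits

sample = [
    # Invented line, only here so the filter has something to skip.
    '10.0.0.1 - - [08/Dec/2001:18:21:19 -0500] "GET / HTTP/1.0" 200 512 "-" "Mozilla/4.0"',
    # The entry quoted above, joined back onto one line.
    '66.28.29.44 - - [08/Dec/2001:18:21:20 -0500] "GET /index4.html%0A HTTP/1.0" 403 280 "-" "Wget/1.6"',
]

hits = requests_from(sample, "66.28.29.44")
print(hits[0] if hits else "no requests from that address")
```

With a real log, passing the open file object as `lines` works too, and the
first element of the result is the first request that address made.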

Wget will try to download a URL in two cases: either because it was told to
explicitly, or because it was doing a recursive download and found that
link in a page downloaded earlier.

/index4.html%0A looks like something somewhere was misparsed. It might
conceivably be wget (unlikely, as this sort of problem would've surfaced
long ago).

If /index4.html%0A *is* the first URL requested by that IP address, then
the blame is clearly elsewhere (unless -i was used to read URLs from a
file). If not, can you search your site for a link to /index4.html that
might be badly formatted HTML? (Wget should be able to defend itself
against bad HTML, though.)
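
To search the pages for such a link, something along these lines might do.
It is a sketch only: it assumes the pages are ordinary HTML that Python's
standard html.parser can read, the names LinkAudit and suspect_links are
hypothetical, and the sample page is made up to mimic a link with a newline
before the closing quote.

```python
from html.parser import HTMLParser

class LinkAudit(HTMLParser):
    """Collect href/src values that have leading or trailing whitespace."""

    def __init__(self):
        super().__init__()
        self.suspect = []

    def handle_starttag(self, tag, attrs):
        for name, value in attrs:
            # A value that changes when stripped has stray whitespace,
            # such as a newline just before the closing quote.
            if name in ("href", "src") and value and value != value.strip():
                self.suspect.append((tag, value))

def suspect_links(html_text):
    """Return (tag, raw attribute value) pairs for links with stray whitespace."""
    audit = LinkAudit()
    audit.feed(html_text)
    audit.close()
    return audit.suspect

# Made-up page: the first link has a newline inside the quoted href.
page = '<a href="/index4.html\n">Next</a> <a href="/ok.html">OK</a>'
print(suspect_links(page))
```

Running suspect_links over the text of each page on the site should list any
link whose URL would come out with a trailing %0A once it is requested.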


(Please don't CC me; I'm on the list)
--
Csaba Ráduly, Software Engineer                           Sophos Anti-Virus
email: [EMAIL PROTECTED]                        http://www.sophos.com
US Support: +1 888 SOPHOS 9                     UK Support: +44 1235 559933

