On 2008-12-11 09:17 +0100, Morten Lemvigh wrote:

> I'm having a problem retrieving a page, when I use the time
> stamping option.
>
> When I run wget with:
> wget -N 'http://eur-lex.europa.eu/JOHtml.do?uri=OJ:C:2007:306:SOM:EN:HTML'
>
> the file is downloaded, but I get the message:
> "Last-modified header missing -- time-stamps turned off."
>
> If I run the command a second time, I get an "ERROR 500: Internal Server  
> Error." and wget exits. If I leave the time stamping option out, the  
> document is retrieved again.
>
> Is there a way to make wget ignore missing Last-modified headers, and  
> just retrieve the document?

I believe it's what it does by default. Wget only checks for the
Last-modified header here because you told it to (-N).

> When mirroring a site wget will stop and not  follow any links
> on a page, which doesn't send a Last-modified header.

Do you have a log showing that behaviour ? Recursive retrieval of
sites that don't return Last-modified works for me.

-- 
André Majorel <URL:http://www.teaser.fr/~amajorel/>


Reply via email to