On 2008-12-11 09:17 +0100, Morten Lemvigh wrote:
> I'm having a problem retrieving a page, when I use the time
> stamping option.
>
> When I run wget with:
> wget -N 'http://eur-lex.europa.eu/JOHtml.do?uri=OJ:C:2007:306:SOM:EN:HTML'
>
> the file is downloaded, but I get the message:
> "Last-modified header missing -- time-stamps turned off."
>
> If I run the command a second time, I get an "ERROR 500: Internal Server
> Error." and wget exits. If I leave the time stamping option out, the
> document is retrieved again.
>
> Is there a way to make wget ignore missing Last-modified headers, and
> just retrieve the document?

I believe that's what it does by default. Wget only checks for the
Last-modified header here because you told it to (-N).

> When mirroring a site wget will stop and not follow any links
> on a page, which doesn't send a Last-modified header.

Do you have a log showing that behaviour? Recursive retrieval of
sites that don't return Last-modified works for me.

-- 
André Majorel <URL:http://www.teaser.fr/~amajorel/>
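
A log of the kind asked for above could be captured roughly as follows. This is only a minimal sketch: the function name mirror_with_log and the default log file name wget.log are made up for illustration, and -d only produces debug output if wget was built with debug support.

```shell
# Hypothetical helper (not part of wget): run a recursive, time-stamped
# retrieval and write wget's messages to a log file.
#   -r  recursive retrieval
#   -N  time-stamping
#   -d  debug output (needs a wget compiled with debug support)
#   -o  write messages to the given log file instead of stderr
mirror_with_log() {
    url=$1
    log=${2:-wget.log}   # assumed default log file name
    wget -r -N -d -o "$log" "$url"
}
```

Calling it as mirror_with_log 'http://eur-lex.europa.eu/JOHtml.do?uri=OJ:C:2007:306:SOM:EN:HTML' should leave the server's response headers in wget.log, where one can see at which request the retrieval stops.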
