I'm having a problem retrieving a page, when I use the time stamping option.
When I run wget with:
wget -N 'http://eur-lex.europa.eu/JOHtml.do?uri=OJ:C:2007:306:SOM:EN:HTML'
the file is downloaded, but I get the message:
"Last-modified header missing -- time-stamps turned off."
If I run the command a second time, I get an "ERROR 500: Internal Server
Error." and wget exits. If I leave the time stamping option out, the
document is retrieved again.
Is there a way to make wget ignore missing Last-modified headers, and
just retrieve the document? When mirroring a site wget will stop and not
follow any links on a page, which doesn't send a Last-modified header.
The only alternative I've found, is to retrieve all documents without
time stamping - and that's a horrible way to mirror...
Thanks,
Morten