Hi,
I am using wget to mirror a site. In all pages of this site there is a
calendar that provides links for the rest of the site (this site carries
some news selections for the day).
The point is, I want it to bring only today news. I have a link that
points me to them, except for one IFRAME that carries a calendar which
gives me access to all other dates.
I tried to supress the download of the calendar using the --reject
option with:
/usr/local/bin/wget --recursive --no-host-directories --reject
calendar.htm http://my.host/default.htm?date=20030107
When I use this, wget won't save the "calendar.htm" file. During
mirroring I get a message like "calendar.htm retrieved but not saved
because it was rejected". Well, I don't like the idea of retrieving a
file I don't want but I can live with that. What is really being a
problem is the fact that because wget retrieves "calendar.htm", it is
followong the links inside it which point to all other dates.
How can I make wget completly ignore a file? Or am I thinking it all
wrong and the solution for my problem is somewhere else?
TIA,
Rodrigo Severo
