Tobias Tiederle wrote:
Evert Meulie schrieb:

Hi!

Thanks for the reply. Since I have no control over the server from which I'm pulling the mirror AND I do not want to live with these files ( 8-) ), I was wondering whether there's a way to exclude certain file names, so that I can exclude the index.html?* wildcard...?

afaik there's no way (with official releases) to do this.
I have a regex patch for 1.9.1 lying around on my system but its not included in current wget releases (because it used pcre instead of gnu regex/c library regex).
Last thing I heard regex support is planned for 1.11.
(If you mirror this site often, why not use a script and delete them afterwards?)



That is not a bad idea either!  :-)

Does anyone here happen to have a script that does a recursive delete of all of these index.html?* and index.html (but ONLY if there is a index.html?* file in the same directory)? Writing a script like this exceeds my scripting capabilities... :-/

Regards,
        Evert

Reply via email to