Re: How do I prevent wget from creating index.html?C=M;O=A ?

Evert Meulie Tue, 08 Nov 2005 06:02:05 -0800

But that would also wipe out all legitimate index.html files, right?  ;-)


  Evert


Alan.Hall wrote:

A less resource-intensive solution might be:

   find . -name "index*.html" |xargs rm

Alan.
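
A narrower variant of the find approach, as a minimal sketch: it matches only names that start with "index.html?" (the sort-order copies such as index.html?C=M;O=A), so a plain index.html is left alone. The function names list_listing_copies and remove_listing_copies are hypothetical, and the sketch assumes a POSIX find that supports "-exec ... {} +".

```shell
# Hypothetical helpers; the names are made up for illustration.

# Print the files that would be removed, without touching anything.
# [?] matches a literal question mark; a bare ? would match any character.
list_listing_copies() {
    find "${1:-.}" -type f -name 'index.html[?]*' -print
}

# Remove the same set of files. Plain index.html does not match the pattern.
remove_listing_copies() {
    find "${1:-.}" -type f -name 'index.html[?]*' -exec rm -f -- {} +
}
```

Running list_listing_copies first on the mirror directory gives a preview before anything is deleted.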

Evert Meulie wrote the following on 11/8/2005 2:58 AM:
The Gentoo forum provided me with the following script, which seems to do the job:
    # Quote "$dir" and read one path per line so names with spaces survive.
    find /path/to/downloads -type d | while IFS= read -r dir; do
        rm "$dir"/index.html?* && rm "$dir"/index.html
    done

(from  http://forums.gentoo.org/viewtopic-t-399594.html )
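
The same idea can be sketched as a reusable function that only visits directories actually containing an index.html?* file, and removes index.html only there. This is an illustration, not the Gentoo forum script: the name clean_listing_indexes is hypothetical, and the sketch assumes directory names contain no newline characters.

```shell
# Hypothetical helper: remove wget's directory-listing copies
# (index.html?C=M;O=A and friends) below the directory given as $1.
# index.html itself is removed only in directories where such a copy exists.
clean_listing_indexes() {
    root=${1:-.}
    # [?] matches a literal question mark; a bare ? would match any character.
    find "$root" -type f -name 'index.html[?]*' -exec dirname {} \; |
    sort -u |
    while IFS= read -r dir; do
        # Assumes no newlines in directory names (one path per line).
        rm -f -- "$dir"/index.html[?]* "$dir/index.html"
    done
}
```

Called as clean_listing_indexes /path/to/downloads, it leaves any index.html untouched in directories that have no index.html?* sibling.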


Regards,
  Evert


Evert Meulie wrote:
Tobias Tiederle wrote:
Evert Meulie schrieb:
Hi!
Thanks for the reply. Since I have no control over the server from which I'm pulling the mirror AND I do not want to live with these files ( 8-) ), I was wondering whether there's a way to exclude certain file names, so that I can exclude the index.html?* wildcard...?
afaik there's no way (with official releases) to do this.
I have a regex patch for 1.9.1 lying around on my system but it's not included in current wget releases (because it used PCRE instead of GNU regex/C library regex).
Last thing I heard, regex support is planned for 1.11.
(If you mirror this site often, why not use a script and delete them afterwards?)
That is not a bad idea either!  :-)
Does anyone here happen to have a script that does a recursive delete of all of these index.html?* and index.html files (but ONLY if there is an index.html?* file in the same directory)? Writing a script like this exceeds my scripting capabilities... :-/
Regards,
    Evert
