Re: -R and HTML files

2007-08-23 Thread Matthias Vill
Micah Cowan wrote: Josh Williams wrote: On 8/22/07, Micah Cowan [EMAIL PROTECTED] wrote: What would be the appropriate behavior of -R then? I think the default option should be to download the html files to parse the links, but it should discard them afterwards if they do not match the

Re: -R and HTML files

2007-08-23 Thread Matthias Vill
Matthias Vill wrote: Micah Cowan wrote: Josh Williams wrote: On 8/22/07, Micah Cowan [EMAIL PROTECTED] wrote: What would be the appropriate behavior of -R then? I think the default option should be to download the html files to parse the links, but it should discard them afterwards if

RE: -R and HTML files

2007-08-23 Thread Barnett, Rodney
-Original Message- From: Matthias Vill [mailto:[EMAIL PROTECTED] Sent: Thursday, August 23, 2007 1:54 AM To: wget@sunsite.dk Subject: Re: -R and HTML files Micah Cowan wrote: Josh Williams wrote: On 8/22/07, Micah Cowan [EMAIL PROTECTED] wrote: What would

Re: -R and HTML files

2007-08-23 Thread Matthias Vill
Barnett, Rodney wrote: -Original Message- From: Matthias Vill [mailto:[EMAIL PROTECTED] Sent: Thursday, August 23, 2007 1:54 AM To: wget@sunsite.dk Subject: Re: -R and HTML files Micah Cowan wrote: Josh Williams wrote: On 8/22/07, Micah Cowan [EMAIL PROTECTED] wrote

RE: -R and HTML files

2007-08-23 Thread Barnett, Rodney
-Original Message- From: Matthias Vill [mailto:[EMAIL PROTECTED] Sent: Thursday, August 23, 2007 7:41 AM To: wget@sunsite.dk Subject: Re: -R and HTML files Barnett, Rodney wrote: -Original Message- From: Matthias Vill [mailto:[EMAIL PROTECTED] Sent: Thursday

-R and HTML files

2007-08-22 Thread Micah Cowan
It appears that some people (including myself) are confused by the fact that wget will download files that match a rejection pattern (or fail to match an accept pattern), if the file type is text/html. The manual says: Note that these two

Re: -R and HTML files

2007-08-22 Thread Micah Cowan
Micah Cowan wrote: Is there any real reason that we can't just always reject files if they match the reject list? Or, would it be worth adding an extra option to allow even HTML files to be skipped? It may be worth mentioning at this point, that

Re: -R and HTML files

2007-08-22 Thread Josh Williams
On 8/22/07, Micah Cowan [EMAIL PROTECTED] wrote: What would be the appropriate behavior of -R then? I think the default option should be to download the html files to parse the links, but it should discard them afterwards if they do not match the acceptance list. But, as you stated, I believe

Re: -R and HTML files

2007-08-22 Thread Micah Cowan
Josh Williams wrote: On 8/22/07, Micah Cowan [EMAIL PROTECTED] wrote: What would be the appropriate behavior of -R then? I think the default option should be to download the html files to parse the links, but it should discard them afterwards