Re: [Bug-wget] mirroring a Blogger blog without the comments

2014-04-24 Thread Darshit Shah
Hello, (Sorry, I didn't catch your name)! Thanks for your kind words. Regarding your issue, the reason Wget downloads the comment spam pages is that you have enabled the --span-hosts option. By default, Wget does not download pages from a different domain; however, since you explicitly asked it t

[Bug-wget] mirroring a Blogger blog without the comments

2014-04-24 Thread j045233
Dear wget community, I'm playing with wget's mirroring functionality for the first time, and first off, so far it's fantastic. Thanks for the great work! I'm using a command like the following to create a (shallow) offline mirror of my Blogger blog: wget --tries=2 -e robots=off --span-hosts --ti

Re: [Bug-wget] bad filename

2014-04-24 Thread Andries E. Brouwer
On Thu, Apr 24, 2014 at 09:56:15AM +0200, Tim Ruehsen wrote: > On Wednesday 23 April 2014 15:32:47 Andries E. Brouwer wrote: > > On Wed, Apr 23, 2014 at 02:43:21PM +0200, Tim Ruehsen wrote: > > Wget has a serious problem. It creates by default illegal filenames. > > I couldn't read that in your po

Re: [Bug-wget] bad filename

2014-04-24 Thread Tim Ruehsen
On Wednesday 23 April 2014 15:32:47 Andries E. Brouwer wrote: > On Wed, Apr 23, 2014 at 02:43:21PM +0200, Tim Ruehsen wrote: > Wget has a serious problem. It creates by default illegal filenames. I couldn't read that in your post before (I still can't). If Wget puts "illegal" characters into fil

Re: [Bug-wget] bad filename

2014-04-24 Thread Andries E. Brouwer
On Thu, Apr 24, 2014 at 03:43:40PM +0200, Tim Ruehsen wrote: > 1. How do you know, what filesystem you are writing to ? > I just think of these fat32 USB sticks flying around everywhere. > UTF-8 might be a problem (see > http://en.wikipedia.org/wiki/Comparison_of_file_systems). > I just mention

Re: [Bug-wget] bad filename

2014-04-24 Thread Tim Rühsen
Am Donnerstag, 24. April 2014, 20:00:18 schrieb Andries E. Brouwer: > On Thu, Apr 24, 2014 at 03:43:40PM +0200, Tim Ruehsen wrote: > > 1. How do you know, what filesystem you are writing to ? > > I just think of these fat32 USB sticks flying around everywhere. > > UTF-8 might be a problem (see > >

Re: [Bug-wget] [PATCH] wget hangs on HTTP 204

2014-04-24 Thread Giuseppe Scrivano
Tim Ruehsen writes: > On Tuesday 22 April 2014 18:08:08 Giuseppe Scrivano wrote: >> Tim Ruehsen writes: >> > Attached is a patch including a new test case. >> > >> > Guiseppe, I made it for a clone of Darshit's clone of Wget. Not sure if it >> > fits into master. >> >> thanks for your patch.

Re: [Bug-wget] bad filename

2014-04-24 Thread Tim Ruehsen
On Thursday 24 April 2014 12:21:54 Andries E. Brouwer wrote: > > I couldn't read that in your post before (I still can't). If Wget puts > > "illegal" characters into filenames, that is a bug and has to be fixed. > > Then let me clarify this point. Sorry for the length. Andries, first of thanks fo