Re: wget-1.11.4 bug

2008-07-25 Thread Micah Cowan

kuang-cheng chao wrote:
> Dear Micah:
>
> Thanks for your work on wget.
>
> I have a question about two wgets running simultaneously.
> In the method resolve_bind_address, wget assumes that it is called only
> once. However, this causes two domain names to get the same IP if two
> wgets run the same method concurrently.

Have you reproduced this, or is this in theory? If the latter, what has
led you to this conclusion? I don't see anything in the code that would
cause this behavior.

Also, please use the mailing list for discussions about Wget. I've added
it to the recipients list.

--
Micah J. Cowan
Programmer, musician, typesetting enthusiast, gamer,
and GNU Wget Project Maintainer.
http://micah.cowan.name/


RE: wget-1.11.4 bug

2008-07-25 Thread kuang-cheng chao

Micah Cowan wrote:
> Have you reproduced this, or is this in theory? If the latter, what has led
> you to this conclusion? I don't see anything in the code that would cause
> this behavior.

I have reproduced this, but I can't be sure the real problem is in
resolve_bind_address.
In the attached message, both api.yougotphoto.com and farm1.static.flickr.com
get the same IP (74.124.203.218).
The two wgets are called from two threads of a program.
 
Best regards,
k.c. chao
 
p.s. 
 
The log is as follows:
 
wget -4 -t 6 "http://api.yougotphoto.com/device/?action=get_device_new_photo&api=2.2&api_key=f10df554a958fd10050e2d305241c7a3&device_class=2&serial_no=000E2EE5676F&url_no=24616&cksn=44fe191d6cb4e7807f75938b5d72f07c" -O /tmp/webii/ygp_new_photo_list.txt
--1999-11-30 00:04:21--  http://api.yougotphoto.com/device/?action=get_device_new_photo&api=2.2&api_key=f10df554a958fd10050e2d305241c7a3&device_class=2&serial_no=000E2EE5676F&url_no=24616&cksn=44fe191d6cb4e7807f75938b5d72f07c
Resolving api.yougotphoto.com... wget -4 -t 6 "http://farm1.static.flickr.com/33/49038824_e4b04b7d9f_b.jpg" -O /tmp/webii/24616
74.124.203.218
Connecting to api.yougotphoto.com|74.124.203.218|:80... --1999-11-30 00:04:22--  http://farm1.static.flickr.com/33/49038824_e4b04b7d9f_b.jpg
Resolving farm1.static.flickr.com... 74.124.203.218
Connecting to farm1.static.flickr.com|74.124.203.218|:80... connected.
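
Because the two wgets write to the same output, their messages are
interleaved and hard to read by eye. The following is a minimal sketch,
not part of wget, that pulls the "Connecting to host|ip|:port" pairs out
of such a log and reports any address shared by more than one hostname.
The function names are hypothetical, and the pattern assumes the message
format seen in the log above.

```python
import re
from collections import defaultdict

# Matches wget's "Connecting to host|ip|:port" progress text (IPv4 only).
CONNECT_RE = re.compile(r"Connecting to ([^|\s]+)\|([0-9.]+)\|:(\d+)")


def hosts_by_ip(log_text):
    """Map each IPv4 address to the set of hostnames connected to at it."""
    seen = defaultdict(set)
    for host, ip, _port in CONNECT_RE.findall(log_text):
        seen[ip].add(host)
    return dict(seen)


def shared_ips(log_text):
    """Return only the addresses that more than one hostname shares."""
    return {
        ip: sorted(hosts)
        for ip, hosts in hosts_by_ip(log_text).items()
        if len(hosts) > 1
    }


# Excerpt of the interleaved log from this message.
log = (
    "Connecting to api.yougotphoto.com|74.124.203.218|:80... "
    "--1999-11-30 00:04:22--  "
    "http://farm1.static.flickr.com/33/49038824_e4b04b7d9f_b.jpg\n"
    "Resolving farm1.static.flickr.com... 74.124.203.218\n"
    "Connecting to farm1.static.flickr.com|74.124.203.218|:80... connected.\n"
)
print(shared_ips(log))
# {'74.124.203.218': ['api.yougotphoto.com', 'farm1.static.flickr.com']}
```

A shared address is not proof of a bug on its own, since two hostnames
can legitimately be served from one IP; it only flags pairs worth
checking against an independent lookup.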

Patch to allow filtering on content-type header

2008-07-25 Thread Lars Kotthoff
Hi list,

 I've written a patch which allows filtering on the content-type header to
select what is downloaded. E.g.
wget -r --content-type=text/* http://www.foobar.com
will only download things with a content-type header of text/html, text/plain
etc. There's also a content-type-exclude option to not download specific
content-types.
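
To illustrate the matching rule described above, here is a minimal
sketch in Python. It is not the patch itself (which would be C inside
wget); the function names, the include/exclude parameters, and the
choice that an exclude pattern wins over an include pattern are all
assumptions made for the example.

```python
from fnmatch import fnmatchcase


def media_type(header_value):
    """Drop parameters such as '; charset=utf-8' and normalise case."""
    return header_value.split(";", 1)[0].strip().lower()


def should_download(header_value, include=(), exclude=()):
    """Decide whether a response passes the content-type filters.

    include/exclude are glob patterns such as 'text/*'.
    Assumption: exclude takes precedence, and an empty include
    list means "accept everything not excluded".
    """
    mtype = media_type(header_value)
    if any(fnmatchcase(mtype, pattern.lower()) for pattern in exclude):
        return False
    if not include:
        return True
    return any(fnmatchcase(mtype, pattern.lower()) for pattern in include)


print(should_download("text/html; charset=UTF-8", include=["text/*"]))  # True
print(should_download("image/jpeg", include=["text/*"]))                # False
```

On the command line the pattern is best quoted (for example
'text/*'), since an unquoted * may be expanded by the shell before
wget sees it.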

If people are interested in getting this into wget, I could submit it to the bug
tracker.

Thanks,

Lars


Re: wget-1.11.4 bug

2008-07-25 Thread Micah Cowan

k.c. chao wrote:
> Micah Cowan wrote:
>> Have you reproduced this, or is this in theory? If the latter, what has
>> led you to this conclusion? I don't see anything in the code that would
>> cause this behavior.
>
> I have reproduced this, but I can't be sure the real problem is in
> resolve_bind_address. In the attached message, both
> api.yougotphoto.com and farm1.static.flickr.com get the same
> IP (74.124.203.218). The two wgets are called from two threads of a
> program.

Yeah, I get 68.142.213.135 for the flickr.com address, currently.

The thing is, though, those two threads should be running wgets under
separate processes (I'm not sure how they couldn't be, but if they
somehow weren't that would be using Wget other than how it was designed
to be used).
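
As an illustration of "two threads, separate processes", the following
is a minimal sketch of a caller that starts each wget as its own child
process from a worker thread. It is not the reporter's program; the
helper names are hypothetical, and only the wget options that appear in
the log (-4, -t 6, -O) are used.

```python
import subprocess
import sys
import threading


def wget_argv(url, output_path):
    """Build the argument list from the log: IPv4 only, six tries."""
    return ["wget", "-4", "-t", "6", url, "-O", output_path]


def run_all(commands):
    """Run each command in its own child process, one thread per command."""
    results = [None] * len(commands)

    def worker(index, argv):
        # subprocess.run starts a separate process, so the children share
        # no memory with each other or with the calling program.
        done = subprocess.run(argv, capture_output=True, text=True)
        results[index] = (done.returncode, done.stdout, done.stderr)

    threads = [
        threading.Thread(target=worker, args=(i, argv))
        for i, argv in enumerate(commands)
    ]
    for thread in threads:
        thread.start()
    for thread in threads:
        thread.join()
    return results


# Stand-in command so the sketch runs without wget or network access:
# each child simply reports its own process ID.
probe = [sys.executable, "-c", "import os; print(os.getpid())"]
for code, out, _err in run_all([probe, probe]):
    print(code, out.strip())
```

Replacing the probe commands with wget_argv(...) lists gives one wget
process per thread. Capturing each child's output separately also
avoids the interleaved log seen earlier in the thread.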

This problem sounds much more like an issue with the OS's API than an
issue with Wget, to me. But we'd still want to work around it if it were
feasible.
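
One way to separate the two possibilities is to ask the system
resolver directly, without wget involved. The sketch below is a
hypothetical helper, not anything in wget: it looks up the IPv4
addresses for a list of hostnames through the OS's getaddrinfo so the
results can be compared with what the wget log shows. The resolver
parameter exists only so the lookup can be swapped out for testing.

```python
import socket


def ipv4_addresses(host, resolver=socket.getaddrinfo):
    """Return the sorted, de-duplicated IPv4 addresses for host."""
    infos = resolver(host, 80, socket.AF_INET, socket.SOCK_STREAM)
    # Each entry is (family, type, proto, canonname, sockaddr);
    # for AF_INET, sockaddr is an (address, port) pair.
    return sorted({info[4][0] for info in infos})


def compare_hosts(hosts, resolver=socket.getaddrinfo):
    """Map each hostname to the addresses the resolver returns for it."""
    return {host: ipv4_addresses(host, resolver) for host in hosts}


if __name__ == "__main__":
    # Performs real DNS lookups; results depend on the network and date.
    print(compare_hosts(["api.yougotphoto.com", "farm1.static.flickr.com"]))
```

If a lookup like this, run on the affected device, also returns the
same address for both names, the problem lies below wget.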

What operating system are you running? Vista?



Re: Patch to allow filtering on content-type header

2008-07-25 Thread Micah Cowan

Lars Kotthoff wrote:
> Hi list,
>
> I've written a patch which allows filtering on the content-type header to
> select what is downloaded. E.g.
> wget -r --content-type=text/* http://www.foobar.com
> will only download things with a content-type header of text/html, text/plain
> etc. There's also a content-type-exclude option to not download specific
> content-types.

Sounds great, Lars!

In fact, we already have an RFE on the bug-tracker for just such a thing
at https://savannah.gnu.org/bugs/?20378; if you'd like to attach it
there, that'd be great.
