[Sorry for the top post, but for some reason I did not receive your email]

Micah Cowan <[EMAIL PROTECTED]>
> I'm at a loss as to how the above demonstrates a problem. It decided not
> to download the file, because it was no newer than the local copy. Isn't
> that the behavior you were asking for? That's certainly what -N is
> intended for.


Ok let me summarize here:

1. Using a raw text file, the -N option is simply discarded. Steps to reproduce:

$ cat dummy.txt
http://www.medical.philips.com/us/company/connectivity/assets/docs/dicomcs/ENsphere%2520DICOM%25203%2520Conformance%2520Statement.pdf
$ wget -N -i dummy.txt

2. Using a fake HTML file, so that I can explicitly use the space
character (AFAIK I cannot do that in a raw text file), the -N option is
properly taken into account:

$ cat dummy.txt
<a href="http://www.medical.philips.com/us/company/connectivity/assets/docs/dicomcs/ENsphere DICOM 3 Conformance Statement.pdf">dummy</a>
$ wget -N  --force-html -i dummy.txt

So either the documentation for -i (used alone) should be updated to
mention the issue with the space character, or this is just a simple
bug that can be fixed in the wget code base.
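
For what it is worth, here is a minimal sketch (Python, standard library
only) of what the two spellings of the file name decode to. It only
compares the strings from cases 1 and 2 above and makes no claim about how
wget itself parses the -i file. The variable names are mine and purely
illustrative.

```python
from urllib.parse import quote, unquote

# File name as written in the HTML href of case 2 (literal spaces).
name = "ENsphere DICOM 3 Conformance Statement.pdf"

# Last path segment of the URL stored in the raw text file of case 1.
raw_segment = "ENsphere%2520DICOM%25203%2520Conformance%2520Statement.pdf"

# One round of percent-decoding: each %25 becomes a literal '%'.
once = unquote(raw_segment)

# A second round: the remaining %20 sequences become spaces.
twice = unquote(once)

print("decoded once :", once)
print("decoded twice:", twice)
print("href encoded :", quote(name))
```

Decoded once, the name from the raw text file still contains a literal %20,
which matches the local file name shown in the wget log for that URL quoted
below. The name with real spaces only appears after a second decode.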

Thanks
-Mathieu

On Fri, Mar 21, 2008 at 8:03 PM, Mathieu Malaterre
<[EMAIL PROTECTED]> wrote:
> Rahhhh !
>
>  Ok I finally found the issue, AND I WAS RIGHT ! Sorry :(
>
>  Try this:
>
>  $ cat dummy.txt
>  <a href="http://www.medical.philips.com/us/company/connectivity/assets/docs/dicomcs/ENsphere DICOM 3 Conformance Statement.pdf">dummy</a>
>
>  Then
>  $ wget -N  --force-html -i dummy.txt
>
>  --20:03:04--  http://www.medical.philips.com/us/company/connectivity/assets/docs/dicomcs/ENsphere%20DICOM%203%20Conformance%20Statement.pdf
>            => `ENsphere DICOM 3 Conformance Statement.pdf'
>
> Resolving www.medical.philips.com... 161.88.247.197
>  Connecting to www.medical.philips.com|161.88.247.197|:80... connected.
>  HTTP request sent, awaiting response... 200 OK
>  Length: 334,708 (327K) [application/octet-stream]
>  Server file no newer than local file `ENsphere DICOM 3 Conformance Statement.pdf' -- not retrieving.
>
>
>  So please reopen the bug report, as I really believe wget -i with a
>  space in the path is not working.
>
>  Thank you
>  -Mathieu
>
>
>
>  On Fri, Mar 21, 2008 at 7:54 PM, Mathieu Malaterre
>  <[EMAIL PROTECTED]> wrote:
>  > I still do not believe this has anything to do with the server, if you
>  >  have a couple of seconds please try this file instead:
>  >
>  >  http://www.medical.philips.com/us/company/connectivity/assets/docs/dicomcs/ENsphere%2520DICOM%25203%2520Conformance%2520Statement.pdf
>  >  http://www.medical.philips.com/us/company/connectivity/assets/docs/dicomcs/usit15l3_final.pdf
>  >
>  >  You'll see that both files are stored at the exact same location, but
>  >  wget reports two different things (*). I *seriously* doubt the server
>  >  has a per-file configuration...
>  >
>  >  Thanks for your time anyway,
>  >  -Mathieu
>  >
>  >  (*)
>  >  --19:53:33--  http://www.medical.philips.com/us/company/connectivity/assets/docs/dicomcs/ENsphere%2520DICOM%25203%2520Conformance%2520Statement.pdf
>  >            => `ENsphere%20DICOM%203%20Conformance%20Statement.pdf'
>  >  Resolving www.medical.philips.com... 161.88.247.197
>  >  Connecting to www.medical.philips.com|161.88.247.197|:80... connected.
>  >  HTTP request sent, awaiting response... 200 OK
>  >  Length: 1,936 (1.9K) [text/html]
>  >  Last-modified header missing -- time-stamps turned off.
>  >  --19:53:33--  http://www.medical.philips.com/us/company/connectivity/assets/docs/dicomcs/ENsphere%2520DICOM%25203%2520Conformance%2520Statement.pdf
>  >            => `ENsphere%20DICOM%203%20Conformance%20Statement.pdf'
>  >  Reusing existing connection to www.medical.philips.com:80.
>  >  HTTP request sent, awaiting response... 200 OK
>  >  Length: 1,936 (1.9K) [text/html]
>  >
>  >     0K .                                                     100%  694.97 KB/s
>  >
>  >  19:53:33 (694.97 KB/s) -
>  >  `ENsphere%20DICOM%203%20Conformance%20Statement.pdf' saved [1936/1936]
>  >
>  >  --19:53:33--  http://www.medical.philips.com/us/company/connectivity/assets/docs/dicomcs/usit15l3_final.pdf
>  >            => `usit15l3_final.pdf'
>  >  Reusing existing connection to www.medical.philips.com:80.
>  >  HTTP request sent, awaiting response... 200 OK
>  >  Length: 217,998 (213K) [application/octet-stream]
>  >  Server file no newer than local file `usit15l3_final.pdf' -- not retrieving.
>  >
>  >
>  >  FINISHED --19:53:33--
>  >  Downloaded: 1,936 bytes in 1 files
>  >
>  >
>  >
>  >
>  >  On Fri, Mar 21, 2008 at 7:00 PM, Debian Bug Tracking System
>  >  <[EMAIL PROTECTED]> wrote:
>  >  >
>  >  >  This is an automatic notification regarding your Bug report
>  >  >  which was filed against the wget package:
>  >  >
>  >  >  #471970: wget -N and space in the path (HTML encoding)
>  >  >
>  >  >  It has been closed by Micah Cowan <[EMAIL PROTECTED]>.
>  >  >
>  >  >  Their explanation is attached below along with your original report.
>  >  >  If this explanation is unsatisfactory and you have not received a
>  >  >  better one in a separate message then please contact Micah Cowan
>  >  >  <[EMAIL PROTECTED]> by replying to this email.
>  >  >
>  >  >
>  >  >  --
>  >  >  471970: http://bugs.debian.org/cgi-bin/bugreport.cgi?bug=471970
>  >  >  Debian Bug Tracking System
>  >  >  Contact [EMAIL PROTECTED] with problems
>  >  >
>  >  >
>  >  > ---------- Forwarded message ----------
>  >  > From: Micah Cowan <[EMAIL PROTECTED]>
>  >  > To: Mathieu Malaterre <[EMAIL PROTECTED]>, [EMAIL PROTECTED]
>  >  > Date: Fri, 21 Mar 2008 10:56:28 -0700
>  >  > Subject: Re: Bug#471970: wget -N and space in the path (HTML encoding)
>  >  > -----BEGIN PGP SIGNED MESSAGE-----
>  >  >  Hash: SHA1
>  >  >
>  >  >  Mathieu Malaterre wrote:
>  >  >  > Package: wget
>  >  >  > Version: 1.10.2-0bpo1
>  >  >  > Severity: normal
>  >  >  >
>  >  >  >
>  >  >  > wget -N does not work when the filename contains a space.
>  >  >  >
>  >  >  > Steps to reproduce:
>  >  >  >
>  >  >  > $ echo "http://www.medical.philips.com/us/company/connectivity/assets/docs/dicomcs/ENsphere%2520DICOM%25203%2520Conformance%2520Statement.pdf" > dummy.txt
>  >  >  > $ wget -N -i dummy.txt
>  >  >  > $ wget -N -i dummy.txt
>  >  >  >
>  >  >  > The second time, the file should not have been downloaded.
>  >  >
>  >  >  In the log that Wget issues while downloading that file, is the line:
>  >  >
>  >  >   Last-modified header missing -- time-stamps turned off.
>  >  >
>  >  >  Your issue has nothing to do with spaces in the filename (at least, on
>  >  >  Wget's end), and everything to do with the server not telling wget when
>  >  >  it was last modified. Therefore, wget cannot determine whether the file
>  >  >  on the server is newer or older than the local copy.
>  >  >
>  >  >  - --
>  >  >  Micah J. Cowan
>  >  >  Programmer, musician, typesetting enthusiast, gamer...
>  >  >  http://micah.cowan.name/
>  >  >  -----BEGIN PGP SIGNATURE-----
>  >  >  Version: GnuPG v1.4.6 (GNU/Linux)
>  >  >  Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org
>  >  >
>  >  >  iD8DBQFH4/bM7M8hyUobTrERApCdAJsFlWyubh1pnVY8qwgatoZPRWDXBgCdFyVn
>  >  >  yjgZ+itvfDouqQ40WL3C4BE=
>  >  >  =Mn8U
>  >  >  -----END PGP SIGNATURE-----
>  >  >
>  >  >
>  >  >
>  >  > ---------- Forwarded message ----------
>  >  > From: Mathieu Malaterre <[EMAIL PROTECTED]>
>  >  > To: Debian Bug Tracking System <[EMAIL PROTECTED]>
>  >  > Date: Fri, 21 Mar 2008 14:27:51 +0100
>  >  > Subject: wget -N and space in the path (HTML encoding)
>  >  > Package: wget
>  >  >  Version: 1.10.2-0bpo1
>  >  >  Severity: normal
>  >  >
>  >  >
>  >  >  wget -N does not work when the filename contains a space.
>  >  >
>  >  >  Steps to reproduce:
>  >  >
>  >  >  $ echo "http://www.medical.philips.com/us/company/connectivity/assets/docs/dicomcs/ENsphere%2520DICOM%25203%2520Conformance%2520Statement.pdf" > dummy.txt
>  >  >  $ wget -N -i dummy.txt
>  >  >  $ wget -N -i dummy.txt
>  >  >
>  >  >  The second time, the file should not have been downloaded. I suspect
>  >  >  the % used in the URL encoding is not being decoded properly when
>  >  >  the -N option is applied.
>  >  >
>  >  >  Thanks !
>  >  >
>  >  >  -- System Information:
>  >  >  Debian Release: 3.1
>  >  >  Architecture: i386 (i686)
>  >  >  Kernel: Linux 2.6.18-4-686-bigmem
>  >  >  Locale: LANG=C, LC_CTYPE=C (charmap=ANSI_X3.4-1968)
>  >  >
>  >  >  Versions of packages wget depends on:
>  >  >  ii  libc6                     2.5-9+b1       GNU C Library: Shared libraries
>  >  >  ii  libssl0.9.7               0.9.7e-3sarge5 SSL shared libraries
>  >  >
>  >  >  -- no debconf information
>  >  >
>  >  >
>  >  >
>  >  >
>  >
>  >
>  >
>  >  --
>  >  Mathieu
>  >
>
>
>
>  --
>  Mathieu
>



-- 
Mathieu


