Rahhhh !

Ok I finally found the issue, AND I WAS RIGHT ! Sorry :(

Try this:

$ cat dummy.txt
<a 
href="http://www.medical.philips.com/us/company/connectivity/assets/docs/dicomcs/ENsphere
DICOM 3 Conformance Statement.pdf">dummy</a>

Then
$ wget -N  --force-html -i dummy.txt

--20:03:04--  
http://www.medical.philips.com/us/company/connectivity/assets/docs/dicomcs/ENsphere%20DICOM%203%20Conformance%20Statement.pdf
           => `ENsphere DICOM 3 Conformance Statement.pdf'
Resolving www.medical.philips.com... 161.88.247.197
Connecting to www.medical.philips.com|161.88.247.197|:80... connected.
HTTP request sent, awaiting response... 200 OK
Length: 334,708 (327K) [application/octet-stream]
Server file no newer than local file `ENsphere DICOM 3 Conformance
Statement.pdf' -- not retrieving.


So please reopn the bug report, as I really believe wget -i and space
in the path is not working.

Thank you
-Mathieu

On Fri, Mar 21, 2008 at 7:54 PM, Mathieu Malaterre
<[EMAIL PROTECTED]> wrote:
> I still do not believe this has anything to do with the server, if you
>  have a couple of seconds please try this file instead:
>
>  
> http://www.medical.philips.com/us/company/connectivity/assets/docs/dicomcs/ENsphere%2520DICOM%25203%2520Conformance%2520Statement.pdf
>  
> http://www.medical.philips.com/us/company/connectivity/assets/docs/dicomcs/usit15l3_final.pdf
>
>  You'll see that both files are stored at the exact same location, but
>  wget report two different things (*). I *seriously* doubt the server
>  has a per file configuration...
>
>  Thanks for your time anyway,
>  -Mathieu
>
>  (*)
>  --19:53:33--  
> http://www.medical.philips.com/us/company/connectivity/assets/docs/dicomcs/ENsphere%2520DICOM%25203%2520Conformance%2520Statement.pdf
>            => `ENsphere%20DICOM%203%20Conformance%20Statement.pdf'
>  Resolving www.medical.philips.com... 161.88.247.197
>  Connecting to www.medical.philips.com|161.88.247.197|:80... connected.
>  HTTP request sent, awaiting response... 200 OK
>  Length: 1,936 (1.9K) [text/html]
>  Last-modified header missing -- time-stamps turned off.
>  --19:53:33--  
> http://www.medical.philips.com/us/company/connectivity/assets/docs/dicomcs/ENsphere%2520DICOM%25203%2520Conformance%2520Statement.pdf
>            => `ENsphere%20DICOM%203%20Conformance%20Statement.pdf'
>  Reusing existing connection to www.medical.philips.com:80.
>  HTTP request sent, awaiting response... 200 OK
>  Length: 1,936 (1.9K) [text/html]
>
>     0K .                                                     100%  694.97 KB/s
>
>  19:53:33 (694.97 KB/s) -
>  `ENsphere%20DICOM%203%20Conformance%20Statement.pdf' saved [1936/1936]
>
>  --19:53:33--  
> http://www.medical.philips.com/us/company/connectivity/assets/docs/dicomcs/usit15l3_final.pdf
>            => `usit15l3_final.pdf'
>  Reusing existing connection to www.medical.philips.com:80.
>  HTTP request sent, awaiting response... 200 OK
>  Length: 217,998 (213K) [application/octet-stream]
>  Server file no newer than local file `usit15l3_final.pdf' -- not retrieving.
>
>
>  FINISHED --19:53:33--
>  Downloaded: 1,936 bytes in 1 files
>
>
>
>
>  On Fri, Mar 21, 2008 at 7:00 PM, Debian Bug Tracking System
>  <[EMAIL PROTECTED]> wrote:
>  >
>  >  This is an automatic notification regarding your Bug report
>  >  which was filed against the wget package:
>  >
>  >  #471970: wget -N and space in the path (HTML encoding)
>  >
>  >  It has been closed by Micah Cowan <[EMAIL PROTECTED]>.
>  >
>  >  Their explanation is attached below along with your original report.
>  >  If this explanation is unsatisfactory and you have not received a
>  >  better one in a separate message then please contact Micah Cowan <[EMAIL 
> PROTECTED]> by
>  >  replying to this email.
>  >
>  >
>  >  --
>  >  471970: http://bugs.debian.org/cgi-bin/bugreport.cgi?bug=471970
>  >  Debian Bug Tracking System
>  >  Contact [EMAIL PROTECTED] with problems
>  >
>  >
>  > ---------- Forwarded message ----------
>  > From: Micah Cowan <[EMAIL PROTECTED]>
>  > To: Mathieu Malaterre <[EMAIL PROTECTED]>, [EMAIL PROTECTED]
>  > Date: Fri, 21 Mar 2008 10:56:28 -0700
>  > Subject: Re: Bug#471970: wget -N and space in the path (HTML encoding)
>  > -----BEGIN PGP SIGNED MESSAGE-----
>  >  Hash: SHA1
>  >
>  >  Mathieu Malaterre wrote:
>  >  > Package: wget
>  >  > Version: 1.10.2-0bpo1
>  >  > Severity: normal
>  >  >
>  >  >
>  >  > wget -N does not work when filename has a space in the filename.
>  >  >
>  >  > Steps to reproduce:
>  >  >
>  >  > $ echo
>  >  > 
> "http://www.medical.philips.com/us/company/connectivity/assets/docs/dicomcs/ENsphere%2520DICOM%25203%2520Conformance%2520Statement.pdf";
>  > dummy.txt
>  >  > $ wget -N -i dummy.txt
>  >  > $ wget -N -i dummy.txt
>  >  >
>  >  > the second time, the file should not have been downloaded.
>  >
>  >  In the log that Wget issues while downloading that file, is the line:
>  >
>  >   Last-modified header missing -- time-stamps turned off.
>  >
>  >  Your issue has nothing to do with spaces in the filename (at least, on
>  >  Wget's end), and everything to do with the server not telling wget when
>  >  it was last modified. Therefore, wget cannot determine whether the file
>  >  on the server is newer or older than the local copy.
>  >
>  >  - --
>  >  Micah J. Cowan
>  >  Programmer, musician, typesetting enthusiast, gamer...
>  >  http://micah.cowan.name/
>  >  -----BEGIN PGP SIGNATURE-----
>  >  Version: GnuPG v1.4.6 (GNU/Linux)
>  >  Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org
>  >
>  >  iD8DBQFH4/bM7M8hyUobTrERApCdAJsFlWyubh1pnVY8qwgatoZPRWDXBgCdFyVn
>  >  yjgZ+itvfDouqQ40WL3C4BE=
>  >  =Mn8U
>  >  -----END PGP SIGNATURE-----
>  >
>  >
>  >
>  > ---------- Forwarded message ----------
>  > From: Mathieu Malaterre <[EMAIL PROTECTED]>
>  > To: Debian Bug Tracking System <[EMAIL PROTECTED]>
>  > Date: Fri, 21 Mar 2008 14:27:51 +0100
>  > Subject: wget -N and space in the path (HTML encoding)
>  > Package: wget
>  >  Version: 1.10.2-0bpo1
>  >  Severity: normal
>  >
>  >
>  >  wget -N does not work when filename has a space in the filename.
>  >
>  >  Steps to reproduce:
>  >
>  >  $ echo
>  >  
> "http://www.medical.philips.com/us/company/connectivity/assets/docs/dicomcs/ENsphere%2520DICOM%25203%2520Conformance%2520Statement.pdf";
>  > dummy.txt
>  >  $ wget -N -i dummy.txt
>  >  $ wget -N -i dummy.txt
>  >
>  >  the second time, the file should not have been downloaded. I suspect the
>  >  use of % in the HTML URL encoding is not being decoded properly for use
>  >  in the -N option.
>  >
>  >  Thanks !
>  >
>  >  -- System Information:
>  >  Debian Release: 3.1
>  >  Architecture: i386 (i686)
>  >  Kernel: Linux 2.6.18-4-686-bigmem
>  >  Locale: LANG=C, LC_CTYPE=C (charmap=ANSI_X3.4-1968)
>  >
>  >  Versions of packages wget depends on:
>  >  ii  libc6                     2.5-9+b1       GNU C Library: Shared 
> libraries
>  >  ii  libssl0.9.7               0.9.7e-3sarge5 SSL shared libraries
>  >
>  >  -- no debconf information
>  >
>  >
>  >
>  >
>
>
>
>  --
>  Mathieu
>



-- 
Mathieu



-- 
To UNSUBSCRIBE, email to [EMAIL PROTECTED]
with a subject of "unsubscribe". Trouble? Contact [EMAIL PROTECTED]

Reply via email to