> -----Original Message----- > From: Tony Lewis [mailto:[email protected]] > Sent: Tuesday, November 10, 2009 1:58 PM > To: [email protected] > Cc: KARR, DAVID (ATTCINW) > Subject: RE: [Bug-wget] Problems recursively downloading a directory > tree > > KARR, DAVID (ATTCINW) wrote: > > > I must be missing something simple. > > wget translates file name based on valid file name characters for your > local > file system.
I don't think it's as simple as that. There are no "special" characters used in file names in this tree. All the base file names represent the names of Java classes. It seems like it's making an unexpected decision based on the links it's finding in the HTML files. For instance, one of the produced file names "index.h...@atg%2fadapter%2fgsa%2fbcpdbcopier.html" appears to be produced by this request: ----------- --2009-11-10 15:09:03-- http://ecom.cingular.net/wiki_downloads/documentation/ATGPlatform20071do cs/apidoc/index.html?atg/adapter/gsa/BcpDBCopier.html Reusing existing connection to ecom.cingular.net:80. HTTP request sent, awaiting response... 200 OK Length: 1319 (1.3K) [text/html] Saving to: `ecom.cingular.net/wiki_downloads/documentation/ATGPlatform20071docs/api doc/index.h...@atg%2fadapter%2fgsa%2fbcpdbcopier.html' ----------- I don't know why the request has a query parameter. What's even more curious is that earlier in the wget output, I note the request for a similar file: ------------- --2009-11-10 15:05:16-- http://ecom.cingular.net/wiki_downloads/documentation/ATGPlatform20071do cs/apidoc/atg/adapter/gsa/BcpDBCopier.html Reusing existing connection to ecom.cingular.net:80. HTTP request sent, awaiting response... 200 OK Length: 28327 (28K) [text/html] Saving to: `ecom.cingular.net/wiki_downloads/documentation/ATGPlatform20071docs/api doc/atg/adapter/gsa/BcpDBCopier.html' ------------- Note that both of these requests produce a file name ending with "BcpDBCopier.html", but the latter is in the directory I expect, and the other is with the weird name in the root directory. These two files are different. The funny-named one is pretty short, and uses framesets. The normal looking one is longer, and looks like the expected javadoc for the class.
