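Windows FAQ :) wget can't save a file with '?' in its name on a DOS/Windows filesystem, because '?' is a reserved character there, and it won't (yet) remap the '?' to something else. That is why you get the "Cannot write to ... (No such file or directory)" error. If you run wget -O on a single URL it should work, but that won't help you with a recursive crawl.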
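For a flat list of URLs like your url_file1, the single-URL -O workaround could be scripted. The following is only a rough sketch, not something wget does itself: the helper names (safe_filename, wget_command, fetch_all), the choice of '_' as the replacement character, and the ".html" suffix are all my own assumptions. It only uses the -S, -O and --load-cookies options already discussed in this thread.

```python
import subprocess
from urllib.parse import urlsplit

# Characters that DOS/Windows filesystems reject in file names.
FORBIDDEN = '\\/:*?"<>|'


def safe_filename(url, suffix=".html"):
    """Build a Windows-safe local name from the last path segment plus query.

    Hypothetical helper: the '_' replacement and the suffix are assumptions.
    """
    parts = urlsplit(url)
    name = parts.path.rsplit("/", 1)[-1] or "index"
    if parts.query:
        name += "?" + parts.query
    cleaned = "".join("_" if ch in FORBIDDEN else ch for ch in name)
    return cleaned + suffix


def wget_command(url, cookies=None):
    """Return the argument list for one single-URL 'wget -O' call."""
    cmd = ["wget", "-S", "-O", safe_filename(url)]
    if cookies:
        cmd.append("--load-cookies=" + cookies)
    cmd.append(url)
    return cmd


def fetch_all(url_file, cookies=None):
    """Run wget once per non-empty line of url_file (no recursion)."""
    with open(url_file) as fh:
        for line in fh:
            url = line.strip()
            if url:
                subprocess.run(wget_command(url, cookies), check=False)
```

Calling fetch_all("url_file1", "c:/Program Files/Netscape/Users/default/cookies.txt") would then fetch each listed URL into a name without '?'. Like plain -O, this does not follow links, so it is no substitute for a recursive crawl.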
Heiko

-- 
-- PREVINET S.p.A.            [EMAIL PROTECTED]
-- Via Ferretto, 1            ph  x39-041-5907073
-- I-31021 Mogliano V.to (TV) fax x39-041-5907472
-- ITALY

> -----Original Message-----
> From: Steve Mestdagh [mailto:[EMAIL PROTECTED]]
> Sent: Thursday, May 16, 2002 6:07 PM
> To: [EMAIL PROTECTED]
> Subject: crawling servlet based urls
>
>
> Hi,
> I'm trying to get crawl intranet urls of form:
> http://webspherecommunity.raleigh.ibm.com/WKCInternet/WKCCommand?command=getLesson&LessonId=137
>
> running wget 1.8 on cygwin (windoze 2000)
>
> I've tried numerous parameters but base format of command is:
> wget -E -S -l1 -i url_file1 --load-cookies='c:/Program Files/Netscape/Users/default/cookies.txt'
> where url above is in url_file1
> I have cookie set in cookies.txt
>
> output is:
> --10:03:16-- http://webspherecommunity.raleigh.ibm.com/WKCInternet/WKCCommand?command=getLesson&LessonId=137
>            => `WKCCommand?command=getLesson&LessonId=137'
> Resolving webspherecommunity.raleigh.ibm.com... done.
> Connecting to webspherecommunity.raleigh.ibm.com[9.37.240.76]:80... connected.
> HTTP request sent, awaiting response...
>  1 HTTP/1.1 200 OK
>  2 Date: Thu, 16 May 2002 16:03:41 GMT
>  3 Server: IBM_HTTP_SERVER/1.3.19 Apache/1.3.20 (Win32)
>  4 Set-Cookie: JSESSIONID=0000CCL1ZTJD1X20AHQFHYKDD0Q:tiinjahh;Path=/
>  5 Cache-Control: no-cache="set-cookie,set-cookie2"
>  6 Expires: Thu, 01 Dec 1994 16:00:00 GMT
>  7 Connection: close
>  8 Content-Type: text/html;charset=Cp1252
>  9 Content-Language: en
> WKCCommand?command=getLesson&LessonId=137.html: No such file or directory
>
> Cannot write to `WKCCommand?command=getLesson&LessonId=137.html' (No such file or directory).
>
> I've tried using --referer option also but it seems to hang after
> displaying above .. until I hit enter key, then it exits
>
> Any ideas?
> thanks,
> steve
