Windows FAQ :)
wget can't save a file containing '?' in its name on a dos/windows
filesystem, and won't (yet) remap the '?' to something else.
If you try a wget -O of a single url it should work, but that won't help you
with recursive crawl.

Heiko

-- 
-- PREVINET S.p.A.            [EMAIL PROTECTED]
-- Via Ferretto, 1            ph  x39-041-5907073
-- I-31021 Mogliano V.to (TV) fax x39-041-5907472
-- ITALY

> -----Original Message-----
> From: Steve Mestdagh [mailto:[EMAIL PROTECTED]]
> Sent: Thursday, May 16, 2002 6:07 PM
> To: [EMAIL PROTECTED]
> Subject: crawling servlet based urls
> 
> 
> Hi,
> I'm trying to get crawl intranet urls of form:
> http://webspherecommunity.raleigh.ibm.com/WKCInternet/WKCComma
> nd?command=getLesson&LessonId=137
> 
> running wget 1.8 on cygwin (windoze 2000)
> 
> I've tried numerous parameters but base format of command is:
> wget  -E -S  -l1 -i url_file1 --load-cookies='c:/Program
> Files/Netscape/Users/default/cookies.txt'
> where url above is in url_file1
> I have cookie set in cookies.txt
> 
> output is:
> --10:03:16--
> http://webspherecommunity.raleigh.ibm.com/WKCInternet/WKCComma
> nd?command=getLesson&Les
> sonId=137
>            => `WKCCommand?command=getLesson&LessonId=137'
> Resolving webspherecommunity.raleigh.ibm.com... done.
> Connecting to webspherecommunity.raleigh.ibm.com[9.37.240.76]:80...
> connected.
> HTTP request sent, awaiting response...
>  1 HTTP/1.1 200 OK
>  2 Date: Thu, 16 May 2002 16:03:41 GMT
>  3 Server: IBM_HTTP_SERVER/1.3.19  Apache/1.3.20 (Win32)
>  4 Set-Cookie: JSESSIONID=0000CCL1ZTJD1X20AHQFHYKDD0Q:tiinjahh;Path=/
>  5 Cache-Control: no-cache="set-cookie,set-cookie2"
>  6 Expires: Thu, 01 Dec 1994 16:00:00 GMT
>  7 Connection: close
>  8 Content-Type: text/html;charset=Cp1252
>  9 Content-Language: en
> WKCCommand?command=getLesson&LessonId=137.html: No such file 
> or directory
> 
> Cannot write to 
> `WKCCommand?command=getLesson&LessonId=137.html' (No such
> file or directory).
> 
> I've tried using --referer option also but it seems to hang after
> displaying above .. until I hit enter key, then it exits
> 
> Any ideas?
> thanks,
> steve
> 

Reply via email to