Re: interesting bug

2002-06-09 Thread Hack Kampbjørn

[EMAIL PROTECTED] wrote:
 
 I was using wget to suck a website, and found an interesting problem
 some of the URLs it found contained a question mark, after which it
 responded with cannot write to '... insert file/URL here?more
 text  ...'  (invalid argument).
 
 And - it didn't save any of those URLs to files (on my NTFS/windows XP
 machine) ...

It may also have said Illegal filename. Note that not all characters
are allowed in Windows filenames, among them '?'. As '?' is quite common
in data driven web-sites most Windows binaries have included a patch to
deal with it.

The latest wget release 1.8.2 includes now such a patch. But the rest of
illegal characters are not deal with, nor is other special windows
features.
 
 what can I do in order to spider/crawl these pages and save them to my
 local disk ?

Use wget version 1.8.2
 
 Alex

-- 
Med venlig hilsen / Kind regards

Hack Kampbjørn



interesting bug

2002-06-07 Thread alex

I was using wget to suck a website, and found an interesting problem
some of the URLs it found contained a question mark, after which it
responded with cannot write to '... insert file/URL here?more 
text  ...'  (invalid argument).

And - it didn't save any of those URLs to files (on my NTFS/windows XP 
machine) ...

what can I do in order to spider/crawl these pages and save them to my 
local disk ?

Alex