Hi
I have set up Nutch and the crawler (following the intranet tutorial) and
can fetch results fine for the few URLs I have tested, but for some reason I
get no results at all when I try to crawl this URL:
http://www.comlaw.gov.au/ComLaw/legislation/actcompilation1.nsf/sh/browse&VIEW=current&ORDER=bytitle&CATEGORY=actcompilation


I think it might have something to do with the file extension ".nsf", which
appears in the middle of the URL rather than at the end. My guess is that the
crawler cannot deal with it. Has anybody else had this problem, or can anyone help?

Much obliged if anybody knows the answer.

Cheers
Paul
