I am trying to index files local intranet using nutch 1.0, hence, i m
giving path as file:hostname/shared/ as seed.
Now when i use AdaptiveScheduler and crawl the intranet for the first
time, it works fine but when i recrawl, it gives me malformedURL
exception. But when i use the Default
may be you can try with file:/hostname// or file:///hostname
Looks like you have 4 slashes...just a guess..
On Sat, May 1, 2010 at 2:36 PM, arpit khurdiya arpitkhurd...@gmail.comwrote:
I am trying to index files local intranet using nutch 1.0, hence, i m
giving path as