I'm trying to crawl a local file system.  I've made the changes so that
file URLs are not ignored and added protocol-file to the plugins list.
I've included file:///data/mydir in my URL file.

However, when I run the fetch, Nutch tries to connect to file://data/mydir
and therefore returns a 404 error.  I think the root slash is being
stripped during the injection, but I can't seem to find out why.
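
To show why the missing slash matters, here is a small sketch using
Python's standard urllib.parse.  This is not Nutch code, only an
illustration of generic URL parsing, and split_file_url is a helper name
I made up for the example.  With three slashes the host part is empty and
the path is /data/mydir; with two slashes "data" is read as the host and
the path becomes /mydir.

```python
from urllib.parse import urlparse


def split_file_url(url):
    """Return (host, path) as a generic URL parser sees them.

    Hypothetical helper for illustration only; Nutch's own parsing
    may differ in detail.
    """
    parts = urlparse(url)
    return parts.netloc, parts.path


if __name__ == "__main__":
    # Three slashes: empty host, absolute path on the local machine.
    print(split_file_url("file:///data/mydir"))  # ('', '/data/mydir')
    # Two slashes: "data" is taken as the host name, path is "/mydir".
    print(split_file_url("file://data/mydir"))   # ('data', '/mydir')
```

So if the slash really is being dropped, the fetcher would be looking for
/mydir on a host called "data" rather than /data/mydir locally.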

Anybody have any suggestions or ideas?

Thanks.
