Hi MOHIT!
I haven't any experience on file-protocol crawl.But i think you should check
out the  plugin.includes property in the nutch-default.xml . Are you sure
you have enabled the protocol-file plugin ?

Regards ,
Keven


MOHIT GOYAL wrote:
> 
> 
> 
> -- 
> I tried to crawl the local directory files by giving links to local 
> directory in urls.I got the following error.
> 
> command:
> bin/nutch crawl ../urls -dir crawlresult_localfs1
> 
> 
> please help
> 
> -------------------------------------------------
> failed with: org.apache.nutch.protocol.ProtocolNotFound: protocol not 
> found for url=file
> fetching file:///root/Desktop/csiro-split/CSIRO002
> 
> 
> 
> 
> 
> 
> 
> MOHIT GOYAL
> CSE
> 200502013
> 
> 
> 
> 
> 
> 
> 

-- 
View this message in context: 
http://www.nabble.com/why-did-nutch-miss-so-many-links-when-crawling--tf4322916.html#a12322096
Sent from the Nutch - User mailing list archive at Nabble.com.

Reply via email to