This should help you: http://wiki.apache.org/nutch/FAQ#head-c721b23b43b15885f5ea7d8da62c1c40a37878e6
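
In case the link goes stale, here is a rough sketch of what activating the plugin usually comes down to, as far as I understand it: the `plugin.includes` property is a regular expression listing the plugins to load, and `protocol-file` has to be part of it. The override normally goes into conf/nutch-site.xml, inside that file's root element. The rest of the value shown here is only an assumption for illustration; copy the value from your own nutch-default.xml and add `protocol-file` to it.

```xml
<!-- conf/nutch-site.xml: site-specific override of nutch-default.xml -->
<!-- Assumption: every plugin below except protocol-file is illustrative; -->
<!-- start from the plugin.includes value in your own nutch-default.xml. -->
<property>
  <name>plugin.includes</name>
  <value>protocol-file|protocol-http|parse-(text|html)|index-basic|query-(basic|site|url)</value>
</property>
```

It is also worth checking your URL filter file: if it contains a rule that skips `file:` URLs, the local files will probably still be rejected even with the plugin loaded.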

-----Original Message-----
From: Arun Kaundal [mailto:[EMAIL PROTECTED]]
Sent: Monday, December 05, 2005 11:23 PM
To: nutch-user@lucene.apache.org
Subject: Re: fetch of file:///F:/xxx/xxx/xxx.txt failed with: org.apache.nutch.protocol.ProtocolNotFound: protocol not found for url=file

Jerome,

Thanks for replying. How can I activate the protocol-file plugin? I am new to Nutch, please suggest some way.

Thanks a ton once again.

On 12/5/05, Jérôme Charron <[EMAIL PROTECTED]> wrote:
>
> It seems that you are trying to fetch some local files, but that the
> protocol-file plugin is not activated in your configuration.
>
> Regards
>
> Jérôme
>
>
> On 12/5/05, Arun Kaundal <[EMAIL PROTECTED]> wrote:
> >
> > I am getting a "protocol not found" error. What configuration settings
> > are required for my case? Please come up with a solution soon, I have
> > been waiting on my posting for a long time.
> >
> > Log is attached.
> > 051205 181723 logging at INFO
> > 051205 181723 fetching file:///F:/Atalntis_scheduler/Crawl_Files/FetcherTask.html
> > 051205 181723 fetching file:///F:/Atalntis_scheduler/Crawl_Files/Voltix_4n_network.txt
> > 051205 181723 fetch of file:///F:/Atalntis_scheduler/Crawl_Files/Voltix_4n_network.txt failed with: org.apache.nutch.protocol.ProtocolNotFound: protocol not found for url=file
> > 051205 181723 fetch of file:///F:/Atalntis_scheduler/Crawl_Files/FetcherTask.html failed with: org.apache.nutch.protocol.ProtocolNotFound: protocol not found for url=file
> > 051205 181723 Could not clean the content-type [], Reason is [org.apache.nutch.util.mime.MimeTypeException: The type can not be null or empty]. Using its raw version...
> > 051205 181723 Could not clean the content-type [], Reason is [org.apache.nutch.util.mime.MimeTypeException: The type can not be null or empty]. Using its raw version...
> > 051205 181723 Parsing [file:///F:/Atalntis_scheduler/Crawl_Files/Voltix_4n_network.txt] with [EMAIL PROTECTED]
> > 051205 181723 Parsing [file:///F:/Atalntis_scheduler/Crawl_Files/FetcherTask.html] with [EMAIL PROTECTED]
> >
>
> --
> http://motrech.free.fr/
> http://www.frutch.org/