As you can see, I have exactly the same problem as Stephen Fittich in the
preceding thread. Congratulations, you have been 18 minutes faster, Stephen!

-----Ursprüngliche Nachricht-----
Von: Alfred Ostermeier [mailto:[EMAIL PROTECTED] 
Gesendet: Sonntag, 18. Dezember 2005 00:40
An: [email protected]
Betreff: PluginRuntimeException: org.apache.nutch.indexer.IndexingFilter
does not exist


Hello,

I have just installed nutch 0.7.1. I'm running it on Win XP and cygwin.
Crawling an http-URL worked well. But: Crawling an file-URL failed. I did
configure nutch exactly as described in
http://wiki.apache.org/nutch/FAQ#head-c721b23b43b15885f5ea7d8da62c1c40a37878
e6. That means, I activated the "protocol-file" plugin. Below is the content
of the log-file with the errors. 

The only Google hit for "IndexingFilter does not exist" (
http://www.mail-archive.com/[email protected]/msg00878.htmlsuspect
ed ) suspected the CLASSPATH among other things. To which folders or jar
file(s) has the CLASSPATH to be set - if yes? Currently mine is set to the
current directory. I unfortunately couldn't find the jar-file with the class
IndexingFilter.

Regards,
Alfred

----------------------------------------------------------------------------
-------------------------------------------------

run java in c:\j2sdk1.4.2_04\jre
051217 235048 parsing file:/C:/nutch-0.7.1/conf/nutch-default.xml
051217 235049 parsing file:/C:/nutch-0.7.1/conf/crawl-tool.xml
051217 235049 parsing file:/C:/nutch-0.7.1/conf/nutch-site.xml
051217 235049 No FS indicated, using default:local
051217 235049 crawl started in: crawl.test
051217 235049 rootUrlFile = urls
051217 235049 threads = 10
051217 235049 depth = 3
051217 235049 Created webdb at LocalFS,C:\nutch-0.7.1\crawl.test\db
051217 235049 Starting URL processing
051217 235049 Plugins: looking in: C:\nutch-0.7.1\plugins 051217 235049 not
including: C:\nutch-0.7.1\plugins\clustering-carrot2
051217 235049 not including: C:\nutch-0.7.1\plugins\creativecommons
051217 235049 parsing: C:\nutch-0.7.1\plugins\index-basic\plugin.xml
051217 235049 impl: point=org.apache.nutch.indexer.IndexingFilter
class=org.apache.nutch.indexer.basic.BasicIndexingFilter
051217 235049 not including: C:\nutch-0.7.1\plugins\index-more 051217 235049
not including: C:\nutch-0.7.1\plugins\language-identifier
051217 235049 not including: C:\nutch-0.7.1\plugins\nutch-extensionpoints
051217 235049 not including: C:\nutch-0.7.1\plugins\ontology 051217 235049
not including: C:\nutch-0.7.1\plugins\parse-ext 051217 235049 parsing:
C:\nutch-0.7.1\plugins\parse-html\plugin.xml
051217 235049 impl: point=org.apache.nutch.parse.Parser
class=org.apache.nutch.parse.html.HtmlParser
051217 235049 not including: C:\nutch-0.7.1\plugins\parse-js 051217 235049
not including: C:\nutch-0.7.1\plugins\parse-msword
051217 235049 not including: C:\nutch-0.7.1\plugins\parse-pdf 051217 235049
not including: C:\nutch-0.7.1\plugins\parse-rss 051217 235049 parsing:
C:\nutch-0.7.1\plugins\parse-text\plugin.xml
051217 235049 impl: point=org.apache.nutch.parse.Parser
class=org.apache.nutch.parse.text.TextParser
051217 235049 parsing: C:\nutch-0.7.1\plugins\protocol-file\plugin.xml
051217 235049 impl: point=org.apache.nutch.protocol.Protocol
class=org.apache.nutch.protocol.file.File
051217 235049 not including: C:\nutch-0.7.1\plugins\protocol-ftp
051217 235049 parsing: C:\nutch-0.7.1\plugins\protocol-http\plugin.xml
051217 235049 impl: point=org.apache.nutch.protocol.Protocol
class=org.apache.nutch.protocol.http.Http
051217 235049 not including: C:\nutch-0.7.1\plugins\protocol-httpclient
051217 235049 parsing: C:\nutch-0.7.1\plugins\query-basic\plugin.xml
051217 235049 impl: point=org.apache.nutch.searcher.QueryFilter
class=org.apache.nutch.searcher.basic.BasicQueryFilter
051217 235049 not including: C:\nutch-0.7.1\plugins\query-more 051217 235049
parsing: C:\nutch-0.7.1\plugins\query-site\plugin.xml
051217 235049 impl: point=org.apache.nutch.searcher.QueryFilter
class=org.apache.nutch.searcher.site.SiteQueryFilter
051217 235049 parsing: C:\nutch-0.7.1\plugins\query-url\plugin.xml
051217 235049 impl: point=org.apache.nutch.searcher.QueryFilter
class=org.apache.nutch.searcher.url.URLQueryFilter
051217 235049 not including: C:\nutch-0.7.1\plugins\urlfilter-prefix
051217 235049 not including: C:\nutch-0.7.1\plugins\urlfilter-regex
051217 235049 SEVERE org.apache.nutch.plugin.PluginRuntimeException:
extension point: org.apache.nutch.indexer.IndexingFilter does not exist.
java.lang.ExceptionInInitializerError
        at org.apache.nutch.db.WebDBInjector.addPage(WebDBInjector.java:437)
        at
org.apache.nutch.db.WebDBInjector.injectURLFile(WebDBInjector.java:378)
        at org.apache.nutch.db.WebDBInjector.main(WebDBInjector.java:535)
        at org.apache.nutch.tools.CrawlTool.main(CrawlTool.java:134)
Caused by: java.lang.RuntimeException:
org.apache.nutch.plugin.PluginRuntimeException: extension point:
org.apache.nutch.indexer.IndexingFilter does not exist.
        at
org.apache.nutch.plugin.PluginRepository.getInstance(PluginRepository.java:1
47)
        at org.apache.nutch.net.URLFilters.<clinit>(URLFilters.java:40)
        ... 4 more
Caused by: org.apache.nutch.plugin.PluginRuntimeException: extension point:
org.apache.nutch.indexer.IndexingFilter does not exist.
        at
org.apache.nutch.plugin.PluginRepository.installExtensions(PluginRepository.
java:78)
        at
org.apache.nutch.plugin.PluginRepository.<init>(PluginRepository.java:61)
        at
org.apache.nutch.plugin.PluginRepository.getInstance(PluginRepository.java:1
44)
        ... 5 more
Exception in thread "main" 




-------------------------------------------------------
This SF.net email is sponsored by: Splunk Inc. Do you grep through log files
for problems?  Stop!  Download the new AJAX search engine that makes
searching your log files as easy as surfing the  web.  DOWNLOAD SPLUNK!
http://ads.osdn.com/?ad_idv37&alloc_id865&op=click
_______________________________________________
Nutch-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-general

Reply via email to