As you can see, I have exactly the same problem as Stephen Fittich in the
preceding thread. Congratulations, you have been 18 minutes faster, Stephen!

-----Ursprüngliche Nachricht-----
Von: Alfred Ostermeier [mailto:[EMAIL PROTECTED] 
Gesendet: Sonntag, 18. Dezember 2005 00:40
An: [email protected]
Betreff: PluginRuntimeException: org.apache.nutch.indexer.IndexingFilter
does not exist


Hello,

I have just installed nutch 0.7.1. I'm running it on Win XP and cygwin.
Crawling an http-URL worked well. But: Crawling an file-URL failed. I did
configure nutch exactly as described in
http://wiki.apache.org/nutch/FAQ#head-c721b23b43b15885f5ea7d8da62c1c40a37878
e6. That means, I activated the "protocol-file" plugin. Below is the content
of the log-file with the errors. 

The only Google hit for "IndexingFilter does not exist" (
http://www.mail-archive.com/[email protected]/msg00878.htmlsuspect
ed ) suspected the CLASSPATH among other things. To which folders or jar
file(s) has the CLASSPATH to be set - if yes? Currently mine is set to the
current directory. I unfortunately couldn't find the jar-file with the class
IndexingFilter.

Regards,
Alfred

----------------------------------------------------------------------------
-------------------------------------------------

run java in c:\j2sdk1.4.2_04\jre
051217 235048 parsing file:/C:/nutch-0.7.1/conf/nutch-default.xml
051217 235049 parsing file:/C:/nutch-0.7.1/conf/crawl-tool.xml
051217 235049 parsing file:/C:/nutch-0.7.1/conf/nutch-site.xml
051217 235049 No FS indicated, using default:local
051217 235049 crawl started in: crawl.test
051217 235049 rootUrlFile = urls
051217 235049 threads = 10
051217 235049 depth = 3
051217 235049 Created webdb at LocalFS,C:\nutch-0.7.1\crawl.test\db
051217 235049 Starting URL processing
051217 235049 Plugins: looking in: C:\nutch-0.7.1\plugins 051217 235049 not
including: C:\nutch-0.7.1\plugins\clustering-carrot2
051217 235049 not including: C:\nutch-0.7.1\plugins\creativecommons
051217 235049 parsing: C:\nutch-0.7.1\plugins\index-basic\plugin.xml
051217 235049 impl: point=org.apache.nutch.indexer.IndexingFilter
class=org.apache.nutch.indexer.basic.BasicIndexingFilter
051217 235049 not including: C:\nutch-0.7.1\plugins\index-more 051217 235049
not including: C:\nutch-0.7.1\plugins\language-identifier
051217 235049 not including: C:\nutch-0.7.1\plugins\nutch-extensionpoints
051217 235049 not including: C:\nutch-0.7.1\plugins\ontology 051217 235049
not including: C:\nutch-0.7.1\plugins\parse-ext 051217 235049 parsing:
C:\nutch-0.7.1\plugins\parse-html\plugin.xml
051217 235049 impl: point=org.apache.nutch.parse.Parser
class=org.apache.nutch.parse.html.HtmlParser
051217 235049 not including: C:\nutch-0.7.1\plugins\parse-js 051217 235049
not including: C:\nutch-0.7.1\plugins\parse-msword
051217 235049 not including: C:\nutch-0.7.1\plugins\parse-pdf 051217 235049
not including: C:\nutch-0.7.1\plugins\parse-rss 051217 235049 parsing:
C:\nutch-0.7.1\plugins\parse-text\plugin.xml
051217 235049 impl: point=org.apache.nutch.parse.Parser
class=org.apache.nutch.parse.text.TextParser
051217 235049 parsing: C:\nutch-0.7.1\plugins\protocol-file\plugin.xml
051217 235049 impl: point=org.apache.nutch.protocol.Protocol
class=org.apache.nutch.protocol.file.File
051217 235049 not including: C:\nutch-0.7.1\plugins\protocol-ftp
051217 235049 parsing: C:\nutch-0.7.1\plugins\protocol-http\plugin.xml
051217 235049 impl: point=org.apache.nutch.protocol.Protocol
class=org.apache.nutch.protocol.http.Http
051217 235049 not including: C:\nutch-0.7.1\plugins\protocol-httpclient
051217 235049 parsing: C:\nutch-0.7.1\plugins\query-basic\plugin.xml
051217 235049 impl: point=org.apache.nutch.searcher.QueryFilter
class=org.apache.nutch.searcher.basic.BasicQueryFilter
051217 235049 not including: C:\nutch-0.7.1\plugins\query-more 051217 235049
parsing: C:\nutch-0.7.1\plugins\query-site\plugin.xml
051217 235049 impl: point=org.apache.nutch.searcher.QueryFilter
class=org.apache.nutch.searcher.site.SiteQueryFilter
051217 235049 parsing: C:\nutch-0.7.1\plugins\query-url\plugin.xml
051217 235049 impl: point=org.apache.nutch.searcher.QueryFilter
class=org.apache.nutch.searcher.url.URLQueryFilter
051217 235049 not including: C:\nutch-0.7.1\plugins\urlfilter-prefix
051217 235049 not including: C:\nutch-0.7.1\plugins\urlfilter-regex
051217 235049 SEVERE org.apache.nutch.plugin.PluginRuntimeException:
extension point: org.apache.nutch.indexer.IndexingFilter does not exist.
java.lang.ExceptionInInitializerError
        at org.apache.nutch.db.WebDBInjector.addPage(WebDBInjector.java:437)
        at
org.apache.nutch.db.WebDBInjector.injectURLFile(WebDBInjector.java:378)
        at org.apache.nutch.db.WebDBInjector.main(WebDBInjector.java:535)
        at org.apache.nutch.tools.CrawlTool.main(CrawlTool.java:134)
Caused by: java.lang.RuntimeException:
org.apache.nutch.plugin.PluginRuntimeException: extension point:
org.apache.nutch.indexer.IndexingFilter does not exist.
        at
org.apache.nutch.plugin.PluginRepository.getInstance(PluginRepository.java:1
47)
        at org.apache.nutch.net.URLFilters.<clinit>(URLFilters.java:40)
        ... 4 more
Caused by: org.apache.nutch.plugin.PluginRuntimeException: extension point:
org.apache.nutch.indexer.IndexingFilter does not exist.
        at
org.apache.nutch.plugin.PluginRepository.installExtensions(PluginRepository.
java:78)
        at
org.apache.nutch.plugin.PluginRepository.<init>(PluginRepository.java:61)
        at
org.apache.nutch.plugin.PluginRepository.getInstance(PluginRepository.java:1
44)
        ... 5 more
Exception in thread "main" 


Reply via email to