Hi,
the extension point plugin need to be included in the includes also.
Please note that nutc-site do not extend parameters but overwrite it
and it is not a good idea to have just the parser plugins installed,
at least you need one protocol plugin, a query and a index filter also.
Stefan
Am 10.03.2006 um 12:51 schrieb Peter Swoboda:
I tried to include two more plugins.
changed the nutch-site.xml to
<?xml version="1.0"?>
<?xml-stylesheet type="text/xsl" href="nutch-conf.xsl"?>
<!-- Put site-specific property overrides in this file. -->
<nutch-conf>
<property>
<name>plugin.includes</name>
<value>parse-(js)|analysis-(de)</value>
<description>Regular expression naming plugin directory names to
include. Any plugin not matching this expression is excluded.
In any case you need at least include the nutch-extensionpoints
plugin. By
default Nutch includes crawling just HTML and plain text via HTTP,
and basic indexing and search plugins.
</description>
</property>
</nutch-conf>
starting crawling gets following errormesage:
060310 122551 SEVERE org.apache.nutch.plugin.PluginRuntimeException:
extension point: org.apache.nutch.parse.Parser does not exist.
Exception in thread "main" java.lang.ExceptionInInitializerError
at org.apache.nutch.db.WebDBInjector.addPage(WebDBInjector.java:437)
at org.apache.nutch.db.WebDBInjector.injectURLFile
(WebDBInjector.java:378)
at org.apache.nutch.db.WebDBInjector.main(WebDBInjector.java:535)
at nutch.Test.main(Test.java:128)
Caused by: java.lang.RuntimeException:
org.apache.nutch.plugin.PluginRuntimeException: extension point:
org.apache.nutch.parse.Parser does not exist.
at
org.apache.nutch.plugin.PluginRepository.getInstance
(PluginRepository.java:147)
at org.apache.nutch.net.URLFilters.<clinit>(URLFilters.java:40)
... 4 more
Caused by: org.apache.nutch.plugin.PluginRuntimeException:
extension point:
org.apache.nutch.parse.Parser does not exist.
at
org.apache.nutch.plugin.PluginRepository.installExtensions
(PluginRepository.java:78)
at
org.apache.nutch.plugin.PluginRepository.<init>
(PluginRepository.java:61)
at
org.apache.nutch.plugin.PluginRepository.getInstance
(PluginRepository.java:144)
... 5 more
nutch.default.xml is still unchanged.
What to do?
greetings
Peter
--
"Feel free" mit GMX FreeMail!
Monat für Monat 10 FreeSMS inklusive! http://www.gmx.net
---------------------------------------------------------------
company: http://www.media-style.com
forum: http://www.text-mining.org
blog: http://www.find23.net