Hi,
the extension point plugin need to be included in the includes also.
Please note that nutc-site do not extend parameters but overwrite it and it is not a good idea to have just the parser plugins installed, at least you need one protocol plugin, a query and a index filter also.

Stefan
Am 10.03.2006 um 12:51 schrieb Peter Swoboda:

I tried to include two more plugins.
changed the nutch-site.xml to

<?xml version="1.0"?>
<?xml-stylesheet type="text/xsl" href="nutch-conf.xsl"?>

<!-- Put site-specific property overrides in this file. -->

<nutch-conf>

<property>
  <name>plugin.includes</name>
  <value>parse-(js)|analysis-(de)</value>
  <description>Regular expression naming plugin directory names to
  include.  Any plugin not matching this expression is excluded.
In any case you need at least include the nutch-extensionpoints plugin. By
  default Nutch includes crawling just HTML and plain text via HTTP,
  and basic indexing and search plugins.
  </description>
</property>


</nutch-conf>

starting crawling gets following errormesage:

060310 122551 SEVERE org.apache.nutch.plugin.PluginRuntimeException:
extension point: org.apache.nutch.parse.Parser does not exist.
Exception in thread "main" java.lang.ExceptionInInitializerError
        at org.apache.nutch.db.WebDBInjector.addPage(WebDBInjector.java:437)
at org.apache.nutch.db.WebDBInjector.injectURLFile (WebDBInjector.java:378)
        at org.apache.nutch.db.WebDBInjector.main(WebDBInjector.java:535)
        at nutch.Test.main(Test.java:128)
Caused by: java.lang.RuntimeException:
org.apache.nutch.plugin.PluginRuntimeException: extension point:
org.apache.nutch.parse.Parser does not exist.
        at
org.apache.nutch.plugin.PluginRepository.getInstance (PluginRepository.java:147)
        at org.apache.nutch.net.URLFilters.<clinit>(URLFilters.java:40)
        ... 4 more
Caused by: org.apache.nutch.plugin.PluginRuntimeException: extension point:
org.apache.nutch.parse.Parser does not exist.
        at
org.apache.nutch.plugin.PluginRepository.installExtensions (PluginRepository.java:78)
        at
org.apache.nutch.plugin.PluginRepository.<init> (PluginRepository.java:61)
        at
org.apache.nutch.plugin.PluginRepository.getInstance (PluginRepository.java:144)
        ... 5 more


nutch.default.xml is still unchanged.
What to do?


greetings
Peter

--
"Feel free" mit GMX FreeMail!
Monat für Monat 10 FreeSMS inklusive! http://www.gmx.net


---------------------------------------------------------------
company:        http://www.media-style.com
forum:        http://www.text-mining.org
blog:            http://www.find23.net


Reply via email to