Nicolas, you could find help if you were more specific. Which plugin point did you extend and which version of nutch/hadoop are you using? conf/log4j.properties file determines the logging factor. Maybe setting logging level to INFO or DEBUG may help.

Oh, sorry =). I was trying to index some extra field I calculate from the URLs. So I followed the instructions at http://wiki.apache.org/nutch/CreateNewFilter .

I've created two classes: one implementing IndexingFilter and another extending RawFieldQueryFilter. And this is the plugin.xml file:

<?xml version="1.0" encoding="UTF-8"?>

<plugin
  id="my-plugin"
  name="My Plug-in"
  version="1.0.0"
  provider-name="com.pierpoint">

  <runtime>
     <library name="my-plugin.jar">
        <export name="*"/>
     </library>
  </runtime>

  <requires>
     <import plugin="nutch-extensionpoints"/>
  </requires>

  <extension id="com.pierpoint.datahunter.nutch.analyzer"
             name="DataHunter Filter"
             point="org.apache.nutch.indexer.IndexingFilter">

<implementation id="com.pierpoint.datahunter.nutch.DataHunterIndexingFilter" class="com.pierpoint.datahunter.nutch.DataHunterIndexingFilter">
     </implementation>
  </extension>

  <extension id="com.pierpoint.datahunter.nutch.filter"
             name="Zip3 Query Filter"
             point="org.apache.nutch.searcher.site.SiteQueryFilter">

     <implementation id="com.pierpoint.datahunter.nutch.Zip3QueryFilter"
class="com.pierpoint.datahunter.nutch.Zip3QueryFilter">
     </implementation>

  </extension>
</plugin>

Nutch version is the latest: 0.8.1.

Thanks!

Reply via email to