Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Nutch Wiki" for change 
notification.

The "IndexMetatags" page has been changed by JulienNioche:
http://wiki.apache.org/nutch/IndexMetatags?action=diff&rev1=2&rev2=3

  
  == Plugin Configuration ==
   1. In the file `conf/nutch-site.xml`, edit the property `plugin.includes` to 
contain the plugins `parse-metatags` and `index-metadata`, so that it looks 
like this, for example:
- 
   {{{
  <property>
  <name>plugin.includes</name>
@@ -16, +15 @@

  </property>
  }}}
   1. In the file `conf/nutch-site.xml`, specify which metatags should be 
indexed. You can either list specific metatags or index all of them. To index 
all, set the value of the property `metatags.names` to '*'; otherwise, provide 
the list of names separated by ';'. For example, to index only the metatag 
'role', add the following configuration to `conf/nutch-site.xml`:
- 
   {{{
  <!-- Used only if plugin parse-metatags is enabled. -->
  <property>
@@ -31, +29 @@

  </property>
  }}}
   1. In the same file, configure the `index-metadata` plugin. The values are 
stored in the parse metadata, so we need to specify:
- 
   {{{
  <property>
    <name>index.parse.md</name>
@@ -44, +41 @@

  </property>
  }}}
   '''CAUTION:''' the names of the fields must be prefixed with 'metatags.'
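   As a minimal sketch of how the two settings fit together, assuming the 
single metatag 'role' from the example above and assuming the 'metatags.' 
prefix applies to the value of `index.parse.md`, the relevant part of 
`conf/nutch-site.xml` might look like the following. The values are 
illustrative and are not copied from the wiki page:
   {{{
  <!-- Sketch only: the values below are assumptions for illustration. -->
  <property>
    <name>metatags.names</name>
    <!-- One metatag; separate several names with ';', or use '*' for all. -->
    <value>role</value>
  </property>
  <property>
    <name>index.parse.md</name>
    <!-- The field name carries the 'metatags.' prefix. -->
    <value>metatags.role</value>
  </property>
  }}}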
-  1. You can test that the fields are generated correctly by using the 
IndexingFiltersChecker
+  1. You can test that the fields are generated correctly by using the 
[[bin/nutch indexchecker]] command
   1. To have the specified metatags indexed by Solr, edit your Solr 
`schema.xml` (located in `$SOLR_HOME$/conf`) and add a new field for each 
metatag you want to index. For example, for the field 'role', add the 
following lines:
- 
   {{{
  ...
  <fields>
