Dear Wiki user, You have subscribed to a wiki page or wiki category on "Nutch Wiki" for change notification.
The "IndexMetatags" page has been changed by JulienNioche:
http://wiki.apache.org/nutch/IndexMetatags?action=diff&rev1=2&rev2=3

  == Plugin Configuration ==
   1. In the file `conf/nutch-site.xml`, edit the property `plugin.includes` to contain the following plugins: `parse-metatags` and `index-metadata`, so that it looks like this, for example:
-
  {{{
  <property>
    <name>plugin.includes</name>
@@ -16, +15 @@
  </property>
  }}}
   1. In the file `conf/nutch-site.xml`, specify which metatags should be indexed. You can either list the specific metatags you want to index or index all of them. To index all, provide a '*' for the value of the property "metatags.names"; otherwise provide the list of names separated by ';'. For example, to index only the metatag 'role', add the following configuration to conf/nutch-site.xml:
-
  {{{
  <!-- Used only if plugin parse-metatags is enabled. -->
  <property>
@@ -31, +29 @@
  </property>
  }}}
   1. In the same file you need to configure the `index-metadata` plugin. The values are stored in the parse metadata, so we need to specify:
-
  {{{
  <property>
    <name>index.parse.md</name>
@@ -44, +41 @@
  </property>
  }}}
  '''CAUTION: '''the names of the fields must be prefixed with 'metatags.'
-  1. You can test that the fields are generated correctly by using the IndexingFiltersChecker
+  1. You can test that the fields are generated correctly by using the [[bin/nutch indexchecker]] command
   1. In order to have the specified metatags indexed by Solr, edit your Solr `schema.xml` (located in `$SOLR_HOME$/conf`) and include a new field for each metatag you want to index. For example, for the field 'role', add the following lines:
-
  {{{
  ...
  <fields>
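
The hunks in this diff elide the property values, so the following is only a minimal sketch of how the two metatag-related properties described above might fit together. It assumes the single metatag 'role' from the example text; the values on the actual wiki page are not shown in this notification and may differ.

```xml
<!-- Illustrative sketch only: values are assumed from the 'role' example in the text. -->
<property>
  <name>metatags.names</name>
  <!-- '*' would index all metatags; otherwise give a ';'-separated list of names -->
  <value>role</value>
</property>

<property>
  <name>index.parse.md</name>
  <!-- field names carry the 'metatags.' prefix, as the CAUTION note requires -->
  <value>metatags.role</value>
</property>
```

Both properties would go in `conf/nutch-site.xml`, alongside the `plugin.includes` entry that enables `parse-metatags` and `index-metadata`.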

