Dear Wiki user, You have subscribed to a wiki page or wiki category on "Nutch Wiki" for change notification.
The "IndexReplace" page has been changed by PeterCiuffetti: https://wiki.apache.org/nutch/IndexReplace?action=diff&rev1=1&rev2=2 = Index Replace = - The '''index-replace''' plugin is an indexing filter that allows regexp replace manipulation of metadata fields. The use cases would include adjusting the Nutch document field set and structure to conform to a field set used by a target core that was different than the default fieldset used by Nutch. With this plugin you can modify the structure of existing fields and copy modified fields into a new fields. It allows these replacements to be done globally for all parsed pages and for modifications to be done only for certain host or URL patterns. + The '''index-replace''' plugin is an indexing filter that allows regexp replace manipulation of metadata fields. The use cases would include adjusting the Nutch document field set and structure to conform to a field set used by a target core that was different than the default fieldset used by Nutch. With this plugin you can modify the structure of existing fields and copy modified fields into new fields. It allows these replacements to be done globally for all parsed pages and for modifications to be done only for certain host or URL patterns. Related plugins include [[https://issues.apache.org/jira/browse/NUTCH-940|index-static]] which allows you to add one or more fields with static values. Also the `indexer-solr` plugin has a config file `solrindex-mapping.xml` which allows you to rename and copy fields. The '''index-replace''' plugin allows you to make modifications to the fields.

