Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Nutch Wiki" for change 
notification.

The "IndexReplace" page has been changed by PeterCiuffetti:
https://wiki.apache.org/nutch/IndexReplace?action=diff&rev1=1&rev2=2

  = Index Replace =
  
- The '''index-replace''' plugin is an indexing filter that allows regexp 
replace manipulation of metadata fields.  The use cases would include adjusting 
the Nutch document field set and structure to conform to a field set used by a 
target core that was different than the default fieldset used by Nutch.  With 
this plugin you can modify the structure of existing fields and copy modified 
fields into a new fields.  It allows these replacements to be done globally for 
all parsed pages and for modifications to be done only for certain host or URL 
patterns.
+ The '''index-replace''' plugin is an indexing filter that allows regexp 
replace manipulation of metadata fields.  The use cases would include adjusting 
the Nutch document field set and structure to conform to a field set used by a 
target core that was different than the default fieldset used by Nutch.  With 
this plugin you can modify the structure of existing fields and copy modified 
fields into new fields.  It allows these replacements to be done globally for 
all parsed pages and for modifications to be done only for certain host or URL 
patterns.
  
  Related plugins include 
[[https://issues.apache.org/jira/browse/NUTCH-940|index-static]] which allows 
you to add one or more fields with static values.  Also the `indexer-solr` 
plugin has a config file `solrindex-mapping.xml` which allows you to rename and 
copy fields.  The '''index-replace''' plugin allows you to make modifications 
to the fields.
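
The idea of a per-field regexp replacement, optionally copied into a new field and optionally limited to certain URLs, can be sketched in a few lines. This is only an illustration of the concept, not the plugin's code or its configuration syntax: the function `replace_field`, its parameters, and the sample document are all hypothetical, and a plain dict stands in for a Nutch document.

```python
import re


def replace_field(doc, field, pattern, replacement, target=None, url_pattern=None):
    """Return a copy of doc with a regex replacement applied to one field.

    Hypothetical helper for illustration only (not the index-replace API).
    - target: write the result to this field instead of overwriting `field`.
    - url_pattern: only touch documents whose "url" value matches this regex.
    """
    result = dict(doc)
    # Skip documents outside the requested host/URL pattern.
    if url_pattern is not None and not re.search(url_pattern, doc.get("url", "")):
        return result
    # Nothing to do if the source field is absent.
    if field not in doc:
        return result
    result[target or field] = re.sub(pattern, replacement, doc[field])
    return result


# Assumed sample document; field names are made up for the example.
doc = {"url": "https://example.com/a", "title": "Home | Example Site"}

# Global replacement: strip a trailing " | Site name" suffix from the title.
doc = replace_field(doc, "title", r"\s*\|.*$", "")

# URL-restricted replacement copied into a new field, leaving "url" unchanged.
doc = replace_field(
    doc, "url", r"^https?://([^/]+)/.*$", r"\1",
    target="site", url_pattern=r"example\.com",
)
```

The first call modifies an existing field in place; the second derives a new field from an existing one and applies only to documents whose URL matches the given pattern, mirroring the global and per-host cases described above.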
  
