Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Nutch Wiki" for change 
notification.

The "IndexStructure" page has been changed by LewisJohnMcgibbney:
http://wiki.apache.org/nutch/IndexStructure?action=diff&rev1=15&rev2=16

  ||    subType         ||      NO      ||      Indexed, Un-Tokenized   ||      
index-more      ||      subType (mime-type) ||
  ||      tld             ||     YES      || Un-Tokenized / NotStored(based on 
conf) || tld || Adds a '''top level domain''' field to the document.  ||
  ||      subcollection   ||    YES || Tokenized || subcollection || For 
Comprehensive description see 
src/java/org/apache/nutch/collection/'''package.html'''   ||
- 
+ ||    urlmeta ||      NO      ||      Indexed, Un-Tokenized   ||      urlmeta 
        || Adds any specified '''url metadata tags''' to the document in the 
index.||
  ----
  Jira Issues about indexing and IndexingFilterPlugins are 
  
   * [[http://issues.apache.org/jira/browse/NUTCH-422|index-extra plugin]]
+  * [[https://issues.apache.org/jira/browse/NUTCH-940|index-static plugin]]
  
  ----
  
  The index plugins to include are : 
  
-  index-(anchor | basic | more ) | tld | subcollection | creativecommons | 
language-identifier
+  index-(anchor | basic | more | static ) | tld | subcollection | 
creativecommons | language-identifier | urlmeta
  
