Dear Wiki user, You have subscribed to a wiki page or wiki category on "Nutch Wiki" for change notification.
The "IndexStructure" page has been changed by LewisJohnMcgibbney: http://wiki.apache.org/nutch/IndexStructure?action=diff&rev1=15&rev2=16 || subType || NO || Indexed, Un-Tokenized || index-more || subType (mime-type) || || tld || YES || Un-Tokenized / NotStored(based on conf) || tld || Adds a '''top level domain''' field to the document. || || subcollection || YES || Tokenized || subcollection || For Comprehensive description see src/java/org/apache/nutch/collection/'''package.html''' || - + || urlmeta || NO || Indexed, Un-Tokenized || urlmeta || Adds any specified '''url metadata tags" to the document in the index.|| ---- Jira Issues about indexing and IndexingFilterPlugins are * [[http://issues.apache.org/jira/browse/NUTCH-422|index-extra plugin]] + * [[https://issues.apache.org/jira/browse/NUTCH-940|index-static plugin]] ---- The index plugins to include are : - index-(anchor | basic | more ) | tld | subcollection | creativecommons | language-identifier + index-(anchor | basic | more | static ) | tld | subcollection | creativecommons | language-identifier | urlmeta

