Lewis John McGibbney created NUTCH-1917:
-------------------------------------------
Summary: index.parse.md, index.content.md and index.db.md should
support wildcard
Key: NUTCH-1917
URL: https://issues.apache.org/jira/browse/NUTCH-1917
Project: Nutch
Issue Type: Bug
Components: indexer
Affects Versions: 1.9
Reporter: Lewis John McGibbney
Fix For: 1.10
Right now metatags.names supports the '*' character for a catch all.
I believe that the above index properties should also support catch all as a
mechanism for quickly building augmented data models from crawl data.
Individual identification and manual inclusion of tags one by one is error
prone and time consuming.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)