Do not index empty values for title field
-----------------------------------------
Key: NUTCH-1004
URL: https://issues.apache.org/jira/browse/NUTCH-1004
Project: Nutch
Issue Type: Bug
Components: indexer
Affects Versions: 1.3, 2.0
Reporter: Markus Jelsma
Fix For: 2.0
Tika can generate multiple values for the title field for some files such as
certain PDF's and index-basic happily adds an empty value first and then the
title value. We should add a check on this to prevent empty values for the
title field.
--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira