Do not index empty values for title field
-----------------------------------------

                 Key: NUTCH-1004
                 URL: https://issues.apache.org/jira/browse/NUTCH-1004
             Project: Nutch
          Issue Type: Bug
          Components: indexer
    Affects Versions: 1.3, 2.0
            Reporter: Markus Jelsma
             Fix For: 2.0


Tika can generate multiple values for the title field for some files such as 
certain PDF's and index-basic happily adds an empty value first and then the 
title value. We should add a check on this to prevent empty values for the 
title field.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

Reply via email to