Sergey Beryozkin created TIKA-1351:
--------------------------------------

             Summary: Parser implementations should accept null content handlers
                 Key: TIKA-1351
                 URL: https://issues.apache.org/jira/browse/TIKA-1351
             Project: Tika
          Issue Type: Improvement
          Components: parser
            Reporter: Sergey Beryozkin
            Priority: Minor


Applications that let users search documents based only on their metadata do 
not need the document content parsed. 
The only workaround I've found so far is to pass a no-op content handler that 
ignores the content events, but this does not stop a parser such as PDFParser 
from parsing the content.

Proposal: update the parser API docs to let implementers know that the 
ContentHandler can be null, and update the shipped implementations to parse 
only the metadata when the ContentHandler is null.
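
A minimal sketch of the contract proposed above, using only the JDK's SAX 
types so that it runs on its own. MetadataFirstParser, its contentPasses 
counter, and the toy "key: value" input format are hypothetical and exist only 
for illustration; they are not Tika classes. A real Tika parser's parse method 
also takes an InputStream, a Metadata object and a ParseContext.

```java
import java.util.HashMap;
import java.util.Map;
import org.xml.sax.ContentHandler;
import org.xml.sax.SAXException;
import org.xml.sax.helpers.DefaultHandler;

// Hypothetical toy parser, NOT a Tika class: it only illustrates the proposed contract.
class MetadataFirstParser {
    // Counts how often the (potentially expensive) content pass ran.
    int contentPasses = 0;

    // Assumed input format for this sketch: "key: value" header lines,
    // then a blank line, then the body text.
    void parse(String document, ContentHandler handler, Map<String, String> metadata)
            throws SAXException {
        String[] parts = document.split("\n\n", 2);
        for (String line : parts[0].split("\n")) {
            int colon = line.indexOf(':');
            if (colon > 0) {
                metadata.put(line.substring(0, colon).trim(),
                             line.substring(colon + 1).trim());
            }
        }

        // Proposed behavior: a null handler means "metadata only", so skip the body.
        if (handler == null) {
            return;
        }

        // Content pass: emit the body as SAX events.
        contentPasses++;
        char[] body = (parts.length > 1 ? parts[1] : "").toCharArray();
        handler.startDocument();
        handler.characters(body, 0, body.length);
        handler.endDocument();
    }
}

class NullHandlerDemo {
    public static void main(String[] args) throws SAXException {
        String doc = "title: Report\nauthor: Sergey\n\nLong body text";
        MetadataFirstParser parser = new MetadataFirstParser();

        // Current workaround: the no-op handler discards the events,
        // but the content pass still runs.
        Map<String, String> viaNoOp = new HashMap<>();
        parser.parse(doc, new DefaultHandler(), viaNoOp);

        // Proposed: a null handler skips the content pass entirely.
        Map<String, String> viaNull = new HashMap<>();
        parser.parse(doc, null, viaNull);

        System.out.println(viaNoOp.equals(viaNull)); // same metadata either way
        System.out.println(parser.contentPasses);    // only the no-op call ran the content pass
    }
}
```

Both calls yield the same metadata, but only the call with the no-op handler 
pays for the content pass. In a real parser the saving would depend on how 
much of the format has to be read before the metadata is available.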



--
This message was sent by Atlassian JIRA
(v6.2#6252)
