Tim Allison created TIKA-1786:
---------------------------------

             Summary: Downgrade logging severity in FileResourceConsumer and 
fix handling of illegal xml characters
                 Key: TIKA-1786
                 URL: https://issues.apache.org/jira/browse/TIKA-1786
             Project: Tika
          Issue Type: Improvement
          Components: tika-batch
            Reporter: Tim Allison
            Assignee: Tim Allison
            Priority: Trivial


FileResourceConsumer logs an xmlified snippet to record problems encountered 
during parsing.  If a parser includes illegal XML characters in the 
ParseException, this exception is caught by the xmlification code and then 
logged as an error.

The xmlification code should be robust against illegal characters, and we should 
downgrade the logging severity from error to warning when a parser did not 
actually throw an error.
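
One way to make the xmlification step robust is to replace anything outside the XML 1.0 Char production before the snippet is written. The sketch below is only an illustration, not the code in tika-batch: the class name XmlCharSanitizer and its methods are hypothetical, and substituting U+FFFD (rather than dropping the character) is an assumption.

```java
// Hypothetical helper, not part of Tika: replaces characters that are
// illegal in XML 1.0 so that serializing a log snippet cannot fail on them.
final class XmlCharSanitizer {

    private XmlCharSanitizer() {}

    // True if the code point matches the XML 1.0 "Char" production:
    // #x9 | #xA | #xD | [#x20-#xD7FF] | [#xE000-#xFFFD] | [#x10000-#x10FFFF]
    static boolean isLegalXmlChar(int cp) {
        return cp == 0x9 || cp == 0xA || cp == 0xD
                || (cp >= 0x20 && cp <= 0xD7FF)
                || (cp >= 0xE000 && cp <= 0xFFFD)
                || (cp >= 0x10000 && cp <= 0x10FFFF);
    }

    // Returns a copy of s with every illegal code point (including unpaired
    // surrogates) replaced by U+FFFD. A null input yields an empty string.
    static String sanitize(String s) {
        if (s == null) {
            return "";
        }
        StringBuilder sb = new StringBuilder(s.length());
        for (int i = 0; i < s.length(); ) {
            int cp = s.codePointAt(i);
            sb.appendCodePoint(isLegalXmlChar(cp) ? cp : 0xFFFD);
            i += Character.charCount(cp);
        }
        return sb.toString();
    }
}
```

A caller would pass the exception message or stack trace through sanitize(...) before handing it to the XML writer. This only removes illegal characters; markup characters such as & and < still need the usual escaping by the writer.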



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
