Calls to Charset.isSupported() will throw exceptions for invalid charset names
------------------------------------------------------------------------------

                 Key: TIKA-359
                 URL: https://issues.apache.org/jira/browse/TIKA-359
             Project: Tika
          Issue Type: Bug
    Affects Versions: 0.5
            Reporter: Ken Krugler
            Assignee: Ken Krugler
             Fix For: 0.6


The HtmlParser and TXTParser code currently call Charset.isSupported() to 
determine if charset hint info (from meta tags or incoming metadata).

But this method throws IllegalCharsetNameException for unknown (versus 
unsupported) encoding names, which kills the parsing process.

What's needed is a wrapper that catches this exception and returns false.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to