[ 
https://issues.apache.org/jira/browse/TIKA-319?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jukka Zitting resolved TIKA-319.
--------------------------------

       Resolution: Fixed
    Fix Version/s: 0.5
         Assignee: Jukka Zitting

Good point! Fixed as suggested in revision 835726.

> HtmlParser - use encoding hint only if charset is supported
> -----------------------------------------------------------
>
>                 Key: TIKA-319
>                 URL: https://issues.apache.org/jira/browse/TIKA-319
>             Project: Tika
>          Issue Type: Bug
>          Components: parser
>    Affects Versions: 0.4
>            Reporter: Piotr B.
>            Assignee: Jukka Zitting
>             Fix For: 0.5
>
>
> Encoding hint should be considered only if that encoding is supported.
> Diff of my fix:
> --- HtmlParser.java   (wersja 835302)
> +++ HtmlParser.java   (kopia robocza)
> @@ -46,7 +46,7 @@
>          // Prepare the input source using the encoding hint if available
>          InputSource source = new InputSource(stream); 
>          String encoding = metadata.get(Metadata.CONTENT_ENCODING); 
> -        if (encoding != null) { 
> +        if (encoding != null && Charset.isSupported(encoding)) { 
>              source.setEncoding(encoding);
>          }

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to