[ 
https://issues.apache.org/jira/browse/TIKA-1529?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14289429#comment-14289429
 ] 

Konstantin Gribov commented on TIKA-1529:
-----------------------------------------

[~talli...@mitre.org], it works with {{ISO-8859-1}} because only the presence 
of {{<html>}} tags is checked. It should also work for UTF-8 and any 
single-byte encoding, so I think it's safe to decode with this encoding.
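
A minimal sketch of why this holds, not the actual Tika code: {{ISO-8859-1}} maps every byte to exactly one character, so decoding never fails, and in UTF-8 all bytes of multi-byte sequences are >= 0x80, so the ASCII bytes of {{<html>}} come through unchanged. The class {{HtmlTagCheck}} and the method {{containsHtmlTag}} are hypothetical names used only for illustration.

```java
import java.nio.charset.StandardCharsets;

class HtmlTagCheck {
    // Hypothetical helper: decode raw bytes as ISO-8859-1 and look for an <html> tag.
    // ISO-8859-1 maps each byte 0x00-0xFF to one char, so this never throws
    // and never drops or merges bytes.
    static boolean containsHtmlTag(byte[] bytes) {
        String text = new String(bytes, StandardCharsets.ISO_8859_1);
        return text.contains("<html>");
    }

    public static void main(String[] args) {
        // UTF-8 content with non-ASCII (Cyrillic) text inside the body.
        byte[] utf8 = "<html><body>\u041f\u0440\u0438\u0432\u0435\u0442</body></html>"
                .getBytes(StandardCharsets.UTF_8);
        System.out.println(containsHtmlTag(utf8)); // prints true
    }
}
```

The non-ASCII text is decoded to the wrong characters, but that does not matter as long as only the tag is inspected. The same would not hold for UTF-16, where ASCII characters take two bytes each.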

In OpenJDK 8, {{new String(bytes)}}:
- first tries to decode using the default charset ({{Charset.defaultCharset().name()}}),
- if that charset is not supported, prints a warning and decodes using {{ISO-8859-1}}.

We could use the same pattern in {{ChmDirectoryListingSet}}.
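
A rough sketch of that fallback pattern with an explicit charset, assuming the preferred charset name is passed in by the caller. The class {{FallbackDecode}} and its {{decode}} method are hypothetical and are not part of Tika or of {{ChmDirectoryListingSet}}.

```java
import java.nio.charset.Charset;
import java.nio.charset.StandardCharsets;

class FallbackDecode {
    // Hypothetical helper: try the preferred charset first and fall back to
    // ISO-8859-1 (which can decode any byte sequence) if it is unavailable.
    static String decode(byte[] bytes, String preferredCharsetName) {
        Charset charset;
        try {
            charset = Charset.forName(preferredCharsetName);
        } catch (IllegalArgumentException e) {
            // Covers IllegalCharsetNameException and UnsupportedCharsetException,
            // both of which extend IllegalArgumentException.
            System.err.println("WARNING: charset '" + preferredCharsetName
                    + "' is not available, falling back to ISO-8859-1");
            charset = StandardCharsets.ISO_8859_1;
        }
        return new String(bytes, charset);
    }

    public static void main(String[] args) {
        byte[] data = "<html></html>".getBytes(StandardCharsets.US_ASCII);
        // "no-such-charset-xyz" is a made-up name used to trigger the fallback.
        System.out.println(decode(data, "no-such-charset-xyz"));
    }
}
```

Passing the charset explicitly means the result does not depend on the platform default charset.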

> Turn forbidden-apis back on
> ---------------------------
>
>                 Key: TIKA-1529
>                 URL: https://issues.apache.org/jira/browse/TIKA-1529
>             Project: Tika
>          Issue Type: Bug
>            Reporter: Tim Allison
>            Priority: Minor
>
> [~thetaphi] recently noticed that forbidden-apis was turned off in r1624185, 
> and he submitted a patch to the dev list.  Let's turn it back on.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
