[ https://issues.apache.org/jira/browse/COMPRESS-471?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16691230#comment-16691230 ]
Gaurav Mittal commented on COMPRESS-471:
----------------------------------------
We are concerned about cases where CP850 encoding is not needed but we apply
it anyway, so that file names can end up containing undesirable characters.
Query:
Is there any way to tell that a particular zip file contains non-UTF-8
characters in its file names, so that we can handle it at the application level?
Currently I do not see any method that reports non-UTF-8 characters, and
hence we are not able to decide when to apply UTF-8 and when to apply another
character encoding.
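To make the question concrete, here is a minimal sketch of the kind of application-level check we have in mind. It uses only the JDK and assumes the raw, undecoded name bytes of each entry are available (in Commons Compress these could presumably come from ZipArchiveEntry.getRawName()). The class and method names are hypothetical, and the fallback charset (for example Charset.forName("Cp850"), if the JRE provides it) is our assumption, not something the archive tells us.

```java
import java.nio.ByteBuffer;
import java.nio.charset.CharacterCodingException;
import java.nio.charset.Charset;
import java.nio.charset.CharsetDecoder;
import java.nio.charset.CodingErrorAction;
import java.nio.charset.StandardCharsets;

// Hypothetical helper, not part of Commons Compress.
final class NameEncodingGuess {

    // Returns true if the bytes form a well-formed UTF-8 sequence.
    static boolean isValidUtf8(byte[] raw) {
        // REPORT makes the decoder throw instead of silently substituting
        // a replacement character for malformed input.
        CharsetDecoder decoder = StandardCharsets.UTF_8.newDecoder()
                .onMalformedInput(CodingErrorAction.REPORT)
                .onUnmappableCharacter(CodingErrorAction.REPORT);
        try {
            decoder.decode(ByteBuffer.wrap(raw));
            return true;
        } catch (CharacterCodingException e) {
            return false;
        }
    }

    // Decodes as UTF-8 when the bytes are well-formed UTF-8,
    // otherwise falls back to the charset chosen by the caller.
    static String decodeName(byte[] raw, Charset fallback) {
        Charset chosen = isValidUtf8(raw) ? StandardCharsets.UTF_8 : fallback;
        return new String(raw, chosen);
    }
}
```

This is only a heuristic: a name that fails strict UTF-8 decoding is certainly not UTF-8, but a name in another encoding can happen to be valid UTF-8 as well, and pure ASCII names look the same either way.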
Is it possible to fix this on the library side?
Thanks
> Zipped files names having non UTF-8 encoding are being replaced with '?'
> while previewing file.
> -----------------------------------------------------------------------------------------------
>
> Key: COMPRESS-471
> URL: https://issues.apache.org/jira/browse/COMPRESS-471
> Project: Commons Compress
> Issue Type: Bug
> Affects Versions: 1.18
> Reporter: Gaurav Mittal
> Priority: Major
> Attachments: Document(▒Γ║╗)_20150226_11.zip, Incorrect.JPG,
> correct.JPG
>
>
> | * All characters that cannot be decoded as UTF-8 are being replaced by the
> '?' symbol.
> In the issue scenario the charset is 'Cp850'. Since the Commons Compress
> library cannot identify the 'Cp850' charset, it falls back to the default
> charset 'UTF-8', and therefore
> we see the '?' symbol.
> In our code:
> ZipFile ret = new ZipFile(path);
> However, if we pass the encoding explicitly as shown below, it works
> fine:
> ZipFile ret = new ZipFile(new File(path), "Cp850", false);
> But the second approach, where we force the encoding to 'Cp850',
> may cause side effects in some cases.
> --------------------------------------------------------------------------
> The code below does not seem to resolve the UTF-8 conflicts and does not
> produce correct file names:
>
> try {
>     final Map<ZipArchiveEntry, NameAndComment> entriesWithoutUTF8Flag =
>         populateFromCentralDirectory();
>     resolveLocalFileHeaderData(entriesWithoutUTF8Flag);
>     success = true;
> } finally {
>     closed = !success;
>     if (!success && closeOnError) {
>         IOUtils.closeQuietly(archive);
>     }
> }|
> | |
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)