[ 
https://issues.apache.org/jira/browse/COMPRESS-620?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17536633#comment-17536633
 ] 

Michael Osipov commented on COMPRESS-620:
-----------------------------------------

For the time being, supply the encoding explicitly to the constructor.

> ArchiveInputStream fails reading filenames with ANSI characters
> ---------------------------------------------------------------
>
>                 Key: COMPRESS-620
>                 URL: https://issues.apache.org/jira/browse/COMPRESS-620
>             Project: Commons Compress
>          Issue Type: Bug
>          Components: Archivers
>    Affects Versions: 1.21
>            Reporter: Avi
>            Priority: Major
>
> I attempted to extract ANSI art packs from [SixteenColors ANSI 
> archive|https://github.com/sixteencolors/sixteencolors-archive] but many of 
> them fail.
>  
> Upon some debugging it appears that as many of the file names contain ANSI 
> characters which are parsed by the ArchiveInputStream as question marks, the 
> file fails to be saved to disk as question mark is a bad character to be had 
> in a filename.
> Specific code:
> ArchiveInputStream archiveInputStream = 
> archiveStreamFactory.createArchiveInputStream(ArchiveStreamFactory.ZIP, 
> inputStream);
> ArchiveEntry archiveEntry = null;
> while((archiveEntry = archiveInputStream.getNextEntry()) != null) {
> Path path = Paths.get(extractDirectory, archiveEntry.getName());
> example of a non parseable filename in an archive:
> https://github.com/sixteencolors/sixteencolors-archive/blob/master/1992/ace-r%232.zip
> A∙C∙E.ANS
> Bad ZIP file example:



--
This message was sent by Atlassian Jira
(v8.20.7#820007)

Reply via email to