PAX header parser fails for non-ASCII values
--------------------------------------------
Key: COMPRESS-184
URL: https://issues.apache.org/jira/browse/COMPRESS-184
Project: Commons Compress
Issue Type: Bug
Components: Archivers
Affects Versions: 1.3
Reporter: Stefan Bodewig
Assignee: Stefan Bodewig
Fix For: 1.4
The current logic parsing PAX extension headers fails if the number of bytes
used to encode an entry is different from the number of characters - i.e. for
any character outside of the ASCII range as the headers are UTF-8 encoded. E.g.
{noformat}
11 path=รค
{noformat}
takes 11 bytes (one has to account for the trailing newline) for 10 characters
and the parser fails with "Expected 3 chars, read 2"
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators:
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira