[
https://issues.apache.org/jira/browse/HADOOP-6925?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Todd Lipcon updated HADOOP-6925:
--------------------------------
Attachment: hadoop-6925.txt
Patch fixes the read() implementation to correctly mask with 0xff before
upcasting to int. Also augments the unit test to check the single-byte read()
function - the new test fails before this patch.
> BZip2Codec incorrectly implements read()
> ----------------------------------------
>
> Key: HADOOP-6925
> URL: https://issues.apache.org/jira/browse/HADOOP-6925
> Project: Hadoop Common
> Issue Type: Bug
> Components: io
> Affects Versions: 0.21.0, 0.22.0
> Reporter: Todd Lipcon
> Assignee: Todd Lipcon
> Priority: Critical
> Attachments: hadoop-6925.txt
>
>
> HADOOP-4012 added an implementation of read() in BZip2InputStream that
> doesn't work correctly when reading bytes > 0x80. This causes EOFExceptions
> when working with BZip2 compressed data inside of sequence files in some
> datasets.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.