[
https://issues.apache.org/jira/browse/MAPREDUCE-6643?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15326400#comment-15326400
]
Kai Sasaki commented on MAPREDUCE-6643:
---------------------------------------
I found that {{BZip2Codec}} always backs up 10 bytes before searching for a
compression marker, even though every marker except the first is only 6 bytes
long. This can let {{BZip2CompressionInputStream}} find the same marker twice,
because the search start point is moved back by more than the marker size. I
fixed this so that the compression stream backs up only 6 bytes, except at the
start of the file.
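
To illustrate the effect, here is a minimal, self-contained sketch. It is not the Hadoop code: the class {{MarkerBackoffSketch}} and its methods are hypothetical, the sample offsets are made up, and it assumes byte-aligned markers (real bzip2 block markers can start at any bit offset). It also assumes the 10-byte figure is the 4-byte stream header ("BZh" plus a level digit) followed by the 6-byte block magic 0x314159265359.

```java
import java.util.Arrays;

// Hypothetical, simplified model of the marker search; not the Hadoop implementation.
class MarkerBackoffSketch {
    // bzip2 block-header magic, 6 bytes.
    static final byte[] BLOCK_MARKER = {0x31, 0x41, 0x59, 0x26, 0x53, 0x59};

    // Returns the offset of the first marker starting at or after 'from', or -1.
    static int findMarker(byte[] data, int from) {
        for (int i = Math.max(0, from); i + BLOCK_MARKER.length <= data.length; i++) {
            byte[] window = Arrays.copyOfRange(data, i, i + BLOCK_MARKER.length);
            if (Arrays.equals(window, BLOCK_MARKER)) {
                return i;
            }
        }
        return -1;
    }

    // Where the search begins when a split starts at 'splitStart'.
    static int searchStart(int splitStart, int backoff) {
        return Math.max(0, splitStart - backoff);
    }

    // Made-up layout: stream header at 0, markers at offsets 4, 20 and 40.
    static byte[] sampleData() {
        byte[] data = new byte[64];
        byte[] header = {'B', 'Z', 'h', '9'};
        System.arraycopy(header, 0, data, 0, header.length);
        for (int pos : new int[] {4, 20, 40}) {
            System.arraycopy(BLOCK_MARKER, 0, data, pos, BLOCK_MARKER.length);
        }
        return data;
    }

    public static void main(String[] args) {
        byte[] data = sampleData();
        int splitStart = 28; // the marker at 20..25 lies entirely before this split

        // Backing up 10 bytes starts the search at 18 and re-finds the marker at 20.
        System.out.println("backoff 10 -> " + findMarker(data, searchStart(splitStart, 10)));
        // Backing up 6 bytes starts the search at 22 and skips to the marker at 40.
        System.out.println("backoff 6  -> " + findMarker(data, searchStart(splitStart, 6)));
    }
}
```

In this model, a 6-byte backoff can only reach a marker that ends at or after the split start, while a 10-byte backoff can also reach one that ends up to 4 bytes before it, which the previous split has presumably already consumed.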
> org.apache.hadoop.mapred.TestTextInputFormat.testSplitableCodecs failed
> -----------------------------------------------------------------------
>
> Key: MAPREDUCE-6643
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-6643
> Project: Hadoop Map/Reduce
> Issue Type: Bug
> Reporter: Haibo Chen
> Assignee: Kai Sasaki
> Attachments: MAPREDUCE-6643.01.patch, MAPREDUCE-6643.02.patch,
> MAPREDUCE-6643.03.patch
>
>
> Unit test TestTextInputFormat.testSplitableCodecs() failed when the seed is
> 1313094493.
> Stacktrace
> java.lang.AssertionError: Key in multiple partitions.
> at org.junit.Assert.fail(Assert.java:88)
> at org.junit.Assert.assertTrue(Assert.java:41)
> at org.junit.Assert.assertFalse(Assert.java:64)
> at org.apache.hadoop.mapred.TestTextInputFormat.testSplitableCodecs(TestTextInputFormat.java:223)
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)