Sandy Ryza created MAPREDUCE-5049:
-------------------------------------
Summary: CombineFileInputFormat counts all compressed files
non-splitable
Key: MAPREDUCE-5049
URL: https://issues.apache.org/jira/browse/MAPREDUCE-5049
Project: Hadoop Map/Reduce
Issue Type: Bug
Affects Versions: 1.1.1
Reporter: Sandy Ryza
Assignee: Sandy Ryza
In branch-1, CombineFileInputFormat doesn't take SplittableCompressionCodec
into account and thinks that all compressible input files aren't splittable.
This is a regression from when handling for non-splitable compression codecs
was originally added in MAPREDUCE-1597, and seems to have somehow gotten in
when the code was pulled from 0.22 to branch-1.
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira