Jason Dere created MAPREDUCE-5853:
-------------------------------------
Summary: ChecksumFileSystem.getContentSummary() including contents
for crc files
Key: MAPREDUCE-5853
URL: https://issues.apache.org/jira/browse/MAPREDUCE-5853
Project: Hadoop Map/Reduce
Issue Type: Bug
Reporter: Jason Dere
I am trying to track down some differences in Hive statistics between
hadoop-1 and hadoop-2. It looks like, although ChecksumFileSystem.listStatus()
filters out CRC files, getContentSummary() falls back to the
FilterFileSystem.getContentSummary() implementation, which delegates to
fs.getContentSummary() on the wrapped file system. The underlying fs may not
apply the same filter as ChecksumFileSystem, so the CRC files can be included
in the content summary.
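
To illustrate the size discrepancy, here is a minimal stand-alone sketch. It
does not use any Hadoop classes: it only models the two behaviours with
java.nio. The class name CrcSummaryDemo, the helper names, and the sample file
sizes are all made up for illustration. The one assumption carried over from
Hadoop is the naming of checksum sidecar files as ".<name>.crc".

```java
import java.io.IOException;
import java.io.UncheckedIOException;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.stream.Stream;

// Stand-alone model of the mismatch; not Hadoop's implementation.
class CrcSummaryDemo {

    // Assumed naming rule for a checksum sidecar file: ".<name>.crc".
    static boolean isChecksumFile(String name) {
        return name.startsWith(".") && name.endsWith(".crc");
    }

    // Sum the lengths of all regular files under dir (recursively).
    // With skipCrc=true this mimics a listing that filters CRC files;
    // with skipCrc=false it mimics a summary taken on the raw file system.
    static long totalLength(Path dir, boolean skipCrc) {
        try (Stream<Path> paths = Files.walk(dir)) {
            return paths
                .filter(Files::isRegularFile)
                .filter(p -> !(skipCrc && isChecksumFile(p.getFileName().toString())))
                .mapToLong(p -> {
                    try {
                        return Files.size(p);
                    } catch (IOException e) {
                        throw new UncheckedIOException(e);
                    }
                })
                .sum();
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    // Hypothetical fixture: a 10-byte data file plus a 6-byte checksum sidecar.
    static Path makeSampleDir() {
        try {
            Path dir = Files.createTempDirectory("crc-demo");
            Files.write(dir.resolve("part-00000"), new byte[10]);
            Files.write(dir.resolve(".part-00000.crc"), new byte[6]);
            return dir;
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    public static void main(String[] args) {
        Path dir = makeSampleDir();
        System.out.println("unfiltered length: " + totalLength(dir, false));
        System.out.println("filtered length:   " + totalLength(dir, true));
    }
}
```

With this fixture the unfiltered total is 16 bytes and the filtered total is
10 bytes. A gap of this kind would show up in any statistic derived from the
content summary rather than from a filtered listing.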
--
This message was sent by Atlassian JIRA
(v6.2#6252)