Jason Dere created MAPREDUCE-5853:
-------------------------------------

             Summary: ChecksumFileSystem.getContentSummary() including contents for crc files
                 Key: MAPREDUCE-5853
                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-5853
             Project: Hadoop Map/Reduce
          Issue Type: Bug
            Reporter: Jason Dere


I am trying to track down some differences in Hive statistics between
hadoop-1 and hadoop-2.  It looks like, although ChecksumFileSystem.listStatus()
filters out CRC files, getContentSummary() falls back to the
FilterFileSystem.getContentSummary() implementation, which calls
fs.getContentSummary() on the underlying file system.  The underlying fs may
not apply the same filter as ChecksumFileSystem, so the CRC files can be
included in the content summary.
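
The mismatch described above can be illustrated with a small, self-contained
sketch. This is a toy model and not Hadoop code: the class CrcSummarySketch,
its methods, and the file names and sizes are all hypothetical. The only
detail taken from Hadoop is the assumption that checksum side files are named
".<name>.crc". The "raw" total stands in for a summary delegated to the
underlying fs; the "filtered" total stands in for one that applies the same
filter as listStatus().

```java
import java.util.LinkedHashMap;
import java.util.Map;

public class CrcSummarySketch {
    // Assumed naming convention for checksum side files: ".<name>.crc".
    public static boolean isChecksumFile(String name) {
        return name.startsWith(".") && name.endsWith(".crc");
    }

    // Models the delegating path: sums every file the underlying fs stores,
    // CRC side files included.
    public static long rawLength(Map<String, Long> files) {
        long total = 0;
        for (long len : files.values()) {
            total += len;
        }
        return total;
    }

    // Models a summary that skips CRC files, as listStatus() does.
    public static long filteredLength(Map<String, Long> files) {
        long total = 0;
        for (Map.Entry<String, Long> e : files.entrySet()) {
            if (!isChecksumFile(e.getKey())) {
                total += e.getValue();
            }
        }
        return total;
    }

    // Hypothetical directory contents: file name -> length in bytes.
    public static Map<String, Long> sampleDir() {
        Map<String, Long> files = new LinkedHashMap<>();
        files.put("part-00000", 1000L);
        files.put(".part-00000.crc", 16L);
        files.put("part-00001", 2000L);
        files.put(".part-00001.crc", 24L);
        return files;
    }

    public static void main(String[] args) {
        Map<String, Long> dir = sampleDir();
        System.out.println("raw=" + rawLength(dir));
        System.out.println("filtered=" + filteredLength(dir));
    }
}
```

With these made-up sizes the program prints raw=3040 and filtered=3000; the
40-byte gap is the CRC data, which is the kind of discrepancy that could show
up in statistics derived from the content summary.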



--
This message was sent by Atlassian JIRA
(v6.2#6252)