Prasanth J created HIVE-6979:
--------------------------------

             Summary: Hadoop-2 test failures related to quick stats not being 
populated correctly
                 Key: HIVE-6979
                 URL: https://issues.apache.org/jira/browse/HIVE-6979
             Project: Hive
          Issue Type: Bug
    Affects Versions: 0.14.0
            Reporter: Prasanth J
            Assignee: Prasanth J


The test failures that are currently reported by Hive QA running on hadoop-2 
(https://issues.apache.org/jira/browse/HIVE-6968?focusedCommentId=13980570&page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-13980570)
 are related to difference in the way hadoop FileSystem.globStatus() api 
behaves. For a directory structure like below
{code}
dir1/file1
dir1/file2
{code}
Two level of path pattern like dir1/*/* will return both files in hadoop 1.x 
but will return empty result in hadoop 2.x (in fact it will say no such file or 
directory and return empty file status array). Hadoop 2.x seems to be compliant 
to linux behaviour (ls dir1/*/*) but hadoop 1.x is not.

As a result of this, the fast statistics (NUM_FILES and TOTAL_SIZE) are 
populated wrongly causing diffs in qfile tests for hadoop-1 and hadoop-2.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

Reply via email to