[ 
https://issues.apache.org/jira/browse/HIVE-6979?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13981809#comment-13981809
 ] 

Prasanth J commented on HIVE-6979:
----------------------------------

[~ashutoshc]/[~jdere] Could either of you take a look at the fix for these test failures?

> Hadoop-2 test failures related to quick stats not being populated correctly
> ---------------------------------------------------------------------------
>
>                 Key: HIVE-6979
>                 URL: https://issues.apache.org/jira/browse/HIVE-6979
>             Project: Hive
>          Issue Type: Bug
>    Affects Versions: 0.14.0
>            Reporter: Prasanth J
>            Assignee: Prasanth J
>         Attachments: HIVE-6979.1.patch
>
>
> The test failures currently reported by Hive QA running on hadoop-2 
> (https://issues.apache.org/jira/browse/HIVE-6968?focusedCommentId=13980570&page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-13980570)
>  are related to a difference in the way the Hadoop FileSystem.globStatus() API 
> behaves. Consider a directory structure like the one below:
> {code}
> dir1/file1
> dir1/file2
> {code}
> A two-level path pattern such as dir1/*/* returns both files in Hadoop 1.x 
> but returns an empty result in Hadoop 2.x (in fact, it reports "no such file 
> or directory" and returns an empty file status array). Hadoop 2.x appears to be 
> consistent with Linux behaviour (ls dir1/*/*), whereas Hadoop 1.x is not.
> As a result, the fast statistics (NUM_FILES and TOTAL_SIZE) are 
> populated incorrectly, causing diffs in the qfile tests between hadoop-1 and hadoop-2.
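
For illustration only, here is a minimal sketch of the Linux-like glob semantics described above. It does not call Hadoop's FileSystem.globStatus(); it uses the JDK's java.nio.file PathMatcher as a stand-in, on the assumption that its `*` (which does not cross directory boundaries) mirrors the Hadoop 2.x behaviour closely enough for this layout. The class name GlobDepthDemo and its helper methods are hypothetical. With only dir1/file1 and dir1/file2 present, the one-level pattern should match both files and the two-level pattern should match nothing.

```java
import java.io.IOException;
import java.nio.file.FileSystems;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.PathMatcher;
import java.util.List;
import java.util.stream.Collectors;
import java.util.stream.Stream;

// Hypothetical demo class; a stand-in for the globStatus() behaviour, not Hadoop code.
public class GlobDepthDemo {

    // Returns every path under root (relative to root) that matches the glob pattern.
    public static List<String> glob(Path root, String pattern) throws IOException {
        PathMatcher matcher = FileSystems.getDefault().getPathMatcher("glob:" + pattern);
        try (Stream<Path> walk = Files.walk(root)) {
            return walk.map(root::relativize)
                       .filter(matcher::matches)
                       .map(p -> p.toString().replace('\\', '/'))
                       .sorted()
                       .collect(Collectors.toList());
        }
    }

    // Creates the layout from the issue description: dir1/file1 and dir1/file2.
    public static Path makeLayout() throws IOException {
        Path root = Files.createTempDirectory("globdemo");
        Path dir1 = Files.createDirectory(root.resolve("dir1"));
        Files.createFile(dir1.resolve("file1"));
        Files.createFile(dir1.resolve("file2"));
        return root;
    }

    public static void main(String[] args) throws IOException {
        Path root = makeLayout();
        // One level below dir1: both files match.
        System.out.println("dir1/*   -> " + glob(root, "dir1/*"));
        // Two levels below dir1: nothing exists at that depth, so no matches.
        System.out.println("dir1/*/* -> " + glob(root, "dir1/*/*"));
    }
}
```

Code that computes NUM_FILES and TOTAL_SIZE from a fixed two-level pattern would see an empty listing for such a table directory under these semantics, which is consistent with the stats diffs described above.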



--
This message was sent by Atlassian JIRA
(v6.2#6252)
