[ 
https://issues.apache.org/jira/browse/HIVE-27365?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17725257#comment-17725257
 ] 

Krisztian Kasa commented on HIVE-27365:
---------------------------------------

[~zabetak]
The test failed in our internal branch.
I can identify two non-deterministic parts:
 * The table is partitioned and may have more than one partition; the order of 
the directories representing the partitions is non-deterministic.
[https://github.com/apache/hive/blob/4f1dba31b25fb2c3b071492ff06b54f76a55c99a/ql/src/java/org/apache/hadoop/hive/ql/hooks/PostExecOrcFileDump.java#LL92C12-L92C12]
 * The order of the files and subdirectories contained in a directory is also 
non-deterministic, but this one is addressed by HIVE-23220.
[https://github.com/apache/hive/blob/4f1dba31b25fb2c3b071492ff06b54f76a55c99a/ql/src/java/org/apache/hadoop/hive/ql/hooks/PostExecOrcFileDump.java#L103-L104]
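
To illustrate the first point, here is a minimal sketch of making a dump order independent of the listing order by sorting the listed paths before processing them. It is not the actual change made to PostExecOrcFileDump; the class name, the helper sortedForDump, and the example paths are all hypothetical, and plain strings stand in for the file system entries.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Collections;
import java.util.List;

public class DeterministicDumpOrder {

    // Hypothetical helper: returns a sorted copy of the listed paths so that
    // the processing order does not depend on the order in which the file
    // system happened to return them.
    static List<String> sortedForDump(List<String> listedPaths) {
        List<String> sorted = new ArrayList<>(listedPaths);
        Collections.sort(sorted);
        return sorted;
    }

    public static void main(String[] args) {
        // Two listings of the same (made-up) partition directories,
        // returned in different orders.
        List<String> listingA = Arrays.asList(
                "warehouse/t/ds=2/delta_0000002_0000002",
                "warehouse/t/ds=1/delta_0000001_0000001");
        List<String> listingB = Arrays.asList(
                "warehouse/t/ds=1/delta_0000001_0000001",
                "warehouse/t/ds=2/delta_0000002_0000002");

        // After sorting, both listings produce the same processing order.
        System.out.println(sortedForDump(listingA).equals(sortedForDump(listingB)));
    }
}
```

With the order fixed this way, the test output no longer depends on how the partition directories are enumerated.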

> Fix test acid_bloom_filter_orc_file_dump
> ----------------------------------------
>
>                 Key: HIVE-27365
>                 URL: https://issues.apache.org/jira/browse/HIVE-27365
>             Project: Hive
>          Issue Type: Bug
>            Reporter: Kokila N
>            Assignee: Krisztian Kasa
>            Priority: Minor
>              Labels: pull-request-available
>
> This test dumps ORC file data, and the table in the test has 2 delta 
> directories with one ORC file in each.
> Hive has a post-execution hook which scans all the directories of a table and 
> dumps all ORC files to the output, but the order of the directory list is not 
> deterministic.
> https://github.com/apache/hive/blob/master/ql/src/java/org/apache/hadoop/hive/ql/hooks/PostExecOrcFileDump.java



--
This message was sent by Atlassian Jira
(v8.20.10#820010)
