[
https://issues.apache.org/jira/browse/HIVE-27365?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17725257#comment-17725257
]
Krisztian Kasa commented on HIVE-27365:
---------------------------------------
[~zabetak]
The test failed in our internal branch.
I can identify two non-deterministic parts:
* The table is partitioned and may have more than one partition; the order of
the directories representing the partitions is non-deterministic
[https://github.com/apache/hive/blob/4f1dba31b25fb2c3b071492ff06b54f76a55c99a/ql/src/java/org/apache/hadoop/hive/ql/hooks/PostExecOrcFileDump.java#LL92C12-L92C12]
* The order of the files and subdirectories contained in a directory is also
non-deterministic, but this one is addressed by HIVE-23220
[https://github.com/apache/hive/blob/4f1dba31b25fb2c3b071492ff06b54f76a55c99a/ql/src/java/org/apache/hadoop/hive/ql/hooks/PostExecOrcFileDump.java#L103-L104]
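To illustrate the kind of change that removes this sort of non-determinism, here is a minimal sketch that sorts a directory listing before iterating over it. It is not the actual patch: it uses java.nio.file as a stand-in for the Hadoop FileSystem API used by the hook, and the class name SortedListing, the helper listSorted, and the partition directory names are all hypothetical.

```java
import java.io.IOException;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.List;
import java.util.stream.Collectors;
import java.util.stream.Stream;

final class SortedListing {

    // Hypothetical helper: returns the direct children of dir sorted by path.
    // Files.list makes no promise about ordering, so the sort is what makes
    // the result stable from run to run.
    static List<Path> listSorted(Path dir) throws IOException {
        try (Stream<Path> children = Files.list(dir)) {
            return children.sorted().collect(Collectors.toList());
        }
    }

    public static void main(String[] args) throws IOException {
        // Assumed layout: a table directory with two partition directories,
        // created here in "wrong" order on purpose.
        Path table = Files.createTempDirectory("table");
        Files.createDirectory(table.resolve("p=2"));
        Files.createDirectory(table.resolve("p=1"));

        // Visit the partitions in sorted order instead of listing order.
        for (Path partition : listSorted(table)) {
            System.out.println(partition.getFileName());
        }
    }
}
```

The same idea would apply to both points above: sort the partition directories first, then sort the entries inside each directory, so the dump output no longer depends on the order the file system happens to return.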
> Fix test acid_bloom_filter_orc_file_dump
> ----------------------------------------
>
> Key: HIVE-27365
> URL: https://issues.apache.org/jira/browse/HIVE-27365
> Project: Hive
> Issue Type: Bug
> Reporter: Kokila N
> Assignee: Krisztian Kasa
> Priority: Minor
> Labels: pull-request-available
>
> This test dumps ORC file data, and the table in the test has 2 delta
> directories with one ORC file in each.
> Hive has a post-execution hook which scans all the directories of a table and
> dumps all ORC files to the output, but the order of the directory list is not
> deterministic.
> https://github.com/apache/hive/blob/master/ql/src/java/org/apache/hadoop/hive/ql/hooks/PostExecOrcFileDump.java