[ 
https://issues.apache.org/jira/browse/HIVE-5502?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13790703#comment-13790703
 ] 

Brock Noland commented on HIVE-5502:
------------------------------------

Hey [~prasanth_j], this is generated by running the test twice in a row in the 
exact same environment, meaning I executed these exact commands in succession:

{noformat}
$ rm -rf /tmp/fail; mkdir /tmp/fail; while mvn -o clean test -Dtest=TestFileDump; do ls -l target/tmp/TestFileDump.testDump.orc; done; cp -R target/* /tmp/fail
$ rm -rf /tmp/pass; mkdir /tmp/pass; while ! mvn -o clean test -Dtest=TestFileDump; do ls -l target/tmp/TestFileDump.testDump.orc; done; cp -R target/* /tmp/pass
{noformat}

to generate my data and the failures. Note, however, that the above commands 
won't work on the maven branch, since I am working on a patch to fix the branch 
post vectorization.
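
For anyone reproducing this, the two loops above could be folded into one 
helper along these lines. This is only a sketch: capture_until and MAX_RUNS are 
hypothetical names, not part of the Hive build, and the run cap is an added 
assumption so the loop cannot spin forever if the outcome never flips.

```shell
#!/bin/sh
# Hypothetical helper (not part of Hive): rerun a command until it produces
# the wanted outcome, then snapshot a directory for later comparison.
# usage: capture_until <pass|fail> <src_dir> <dest_dir> <command...>
capture_until() {
    want=$1; src=$2; dest=$3; shift 3

    # Start from an empty destination, as the original one-liners do.
    rm -rf "$dest" && mkdir -p "$dest" || return 1

    runs=0
    # MAX_RUNS is an assumed safety cap; it defaults to 100 attempts.
    while [ "$runs" -lt "${MAX_RUNS:-100}" ]; do
        runs=$((runs + 1))
        if "$@"; then outcome=pass; else outcome=fail; fi
        if [ "$outcome" = "$want" ]; then
            # Copy the contents of src (including dotfiles) into dest.
            cp -R "$src"/. "$dest"/
            return 0
        fi
    done

    echo "no '$want' outcome after $runs runs" >&2
    return 1
}
```

Under those assumptions, the failing and passing runs would be captured with:

{noformat}
$ capture_until fail target /tmp/fail mvn -o clean test -Dtest=TestFileDump
$ capture_until pass target /tmp/pass mvn -o clean test -Dtest=TestFileDump
{noformat}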

> ORC TestFileDump is flaky
> -------------------------
>
>                 Key: HIVE-5502
>                 URL: https://issues.apache.org/jira/browse/HIVE-5502
>             Project: Hive
>          Issue Type: Bug
>            Reporter: Brock Noland
>            Priority: Minor
>         Attachments: TestFileDump.tar.gz
>
>
> I found in my maven work that TestFileDump is non-deterministic. For example, 
> sometimes the output ORC file is much larger:
> {noformat}
> pass:
> -rwxrwxrwx 1 brock brock 290055 Oct  9 12:02 TestFileDump.testDump.orc
> fail:
> -rwxrwxrwx 1 brock brock 1938634 Oct  9 12:08 TestFileDump.testDump.orc
> {noformat}



--
This message was sent by Atlassian JIRA
(v6.1#6144)
