[ https://issues.apache.org/jira/browse/HIVE-5502?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13790703#comment-13790703 ]
Brock Noland commented on HIVE-5502: ------------------------------------ Hey [~prasanth_j] this is generated by running the test twice in a row in the exact same environment. Meaning I executed these exact commands in succession: {noformat} $ rm -rf /tmp/fail; mkdir /tmp/fail; while mvn -o clean test -Dtest=TestFileDump; do ls -l target/tmp/TestFileDump.testDump.orc;done; cp -R target/* /tmp/fail $ rm -rf /tmp/pass; mkdir /tmp/pass; while ! mvn -o clean test -Dtest=TestFileDump; do ls -l target/tmp/TestFileDump.testDump.orc;done; cp -R target/* /tmp/pass {noformat} to generate my data and and the failures. Note however the above commands won't work on the maven branch since I am working on a patch to fix the branch post vectorization. > ORC TestFileDump is flaky > ------------------------- > > Key: HIVE-5502 > URL: https://issues.apache.org/jira/browse/HIVE-5502 > Project: Hive > Issue Type: Bug > Reporter: Brock Noland > Priority: Minor > Attachments: TestFileDump.tar.gz > > > I found in my maven work that TestFileDump is non-deterministic. For example > sometimes the output ORC file is much larger > {noformat} > pass: > -rwxrwxrwx 1 brock brock 290055 Oct 9 12:02 TestFileDump.testDump.orc > fail: > -rwxrwxrwx 1 brock brock 1938634 Oct 9 12:08 TestFileDump.testDump.orc > {noformat} -- This message was sent by Atlassian JIRA (v6.1#6144)