Joe McDonnell has submitted this change and it was merged. ( http://gerrit.cloudera.org:8080/16292 )
Change subject: IMPALA-9923: Load ORC serially to hack around flakiness ...................................................................... IMPALA-9923: Load ORC serially to hack around flakiness ORC dataload has been intermittently failing with "Fail to get checksum, since file .../_orc_acid_version is under construction." This is due to some Hive/HDFS interaction that seems to get worse with parallelism. This has been hitting a lot of developer tests. As a temporary workaround, this changes dataload to load ORC serially. This is slightly slower, but it should be more reliable. Testing: - Ran precommit tests, manually verified dataload logs Change-Id: I15eff1ec6cab32c1216ed7400e4c4b57bb81e4cd Reviewed-on: http://gerrit.cloudera.org:8080/16292 Reviewed-by: Tim Armstrong <[email protected]> Tested-by: Impala Public Jenkins <[email protected]> --- M bin/load-data.py 1 file changed, 11 insertions(+), 0 deletions(-) Approvals: Tim Armstrong: Looks good to me, approved Impala Public Jenkins: Verified -- To view, visit http://gerrit.cloudera.org:8080/16292 To unsubscribe, visit http://gerrit.cloudera.org:8080/settings Gerrit-Project: Impala-ASF Gerrit-Branch: master Gerrit-MessageType: merged Gerrit-Change-Id: I15eff1ec6cab32c1216ed7400e4c4b57bb81e4cd Gerrit-Change-Number: 16292 Gerrit-PatchSet: 3 Gerrit-Owner: Joe McDonnell <[email protected]> Gerrit-Reviewer: Impala Public Jenkins <[email protected]> Gerrit-Reviewer: Joe McDonnell <[email protected]> Gerrit-Reviewer: Tim Armstrong <[email protected]>
