Marcel Kornacker has submitted this change and it was merged. Change subject: IMPALA-4223: Handle truncated file read from HDFS cache ......................................................................
IMPALA-4223: Handle truncated file read from HDFS cache While overwriting files on HDFS via Hive it can happen that Impala sees a partially written, cached file. In these cases we did not correctly handle the partial cached read. This change adds a check and triggers a fall back to disk reads for such errors. If the file is partially written to disk, too, then the query will report a file corruption warning through the disk read path. Change-Id: Id1e1fdb0211819c5938956abb13b512350a46f1a Reviewed-on: http://gerrit.cloudera.org:8080/4828 Reviewed-by: Dan Hecht <[email protected]> Reviewed-by: Tim Armstrong <[email protected]> Tested-by: Marcel Kornacker <[email protected]> --- M be/src/runtime/disk-io-mgr-scan-range.cc 1 file changed, 13 insertions(+), 5 deletions(-) Approvals: Marcel Kornacker: Verified Tim Armstrong: Looks good to me, but someone else must approve Dan Hecht: Looks good to me, approved -- To view, visit http://gerrit.cloudera.org:8080/4828 To unsubscribe, visit http://gerrit.cloudera.org:8080/settings Gerrit-MessageType: merged Gerrit-Change-Id: Id1e1fdb0211819c5938956abb13b512350a46f1a Gerrit-PatchSet: 2 Gerrit-Project: Impala-ASF Gerrit-Branch: master Gerrit-Owner: Lars Volker <[email protected]> Gerrit-Reviewer: Dan Hecht <[email protected]> Gerrit-Reviewer: Lars Volker <[email protected]> Gerrit-Reviewer: Marcel Kornacker <[email protected]> Gerrit-Reviewer: Tim Armstrong <[email protected]>
