Yongjun Zhang has uploaded a new patch set (#6). ( 
http://gerrit.cloudera.org:8080/11379 )

Change subject: IMPALA-6442: Misleading file offset reporting in error messages.
......................................................................

IMPALA-6442: Misleading file offset reporting in error messages.

The error message described in IMPALA-6442 incorrectly reported the file offset 
where the
Parquet footer starts, as if the offset is counted from the file end instead of 
from the
file beginning. The fix changed the reported file offset to be counted from the 
beginning
of the Parquet file.

Testing:
Create a small table that contains one row of data with a single column that's 
bigint and
store it as Parquet. Manually changed the footer size field to be
  1) smaller than the original footer size by 1, to trigger the error message 
fixed by
this jira to be printed, to verify that the fix functions correctly;
  2) bigger than the file size, thus to trigger another related error message 
to be
printed.

Change-Id: I35235e99ea9ceb0d31961dd3b8069f7194f5a2de
---
M be/src/exec/hdfs-parquet-scanner.cc
M testdata/data/README
A testdata/data/corrupt_footer_len_decr.parquet
A testdata/data/corrupt_footer_len_incr.parquet
A 
testdata/workloads/functional-query/queries/QueryTest/parquet-corrupt-footer-len-decr.test
A 
testdata/workloads/functional-query/queries/QueryTest/parquet-corrupt-footer-len-incr.test
M tests/query_test/test_scanners.py
7 files changed, 86 insertions(+), 18 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/79/11379/6
--
To view, visit http://gerrit.cloudera.org:8080/11379
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: I35235e99ea9ceb0d31961dd3b8069f7194f5a2de
Gerrit-Change-Number: 11379
Gerrit-PatchSet: 6
Gerrit-Owner: Yongjun Zhang <yjzhan...@apache.org>
Gerrit-Reviewer: Impala Public Jenkins <impala-public-jenk...@cloudera.com>
Gerrit-Reviewer: Joe McDonnell <joemcdonn...@cloudera.com>
Gerrit-Reviewer: Lars Volker <l...@cloudera.com>
Gerrit-Reviewer: Michael Ho <k...@cloudera.com>
Gerrit-Reviewer: Tim Armstrong <tarmstr...@cloudera.com>
Gerrit-Reviewer: Yongjun Zhang <yjzhan...@apache.org>

Reply via email to