Wenzhe Zhou has uploaded this change for review. ( http://gerrit.cloudera.org:8080/15087 )

Change subject: IMPALA-8110 Fix the Parquet stats filtering issue to correctly handle narrowed int types
......................................................................

IMPALA-8110 Fix the Parquet stats filtering issue to correctly handle narrowed int types

This patch adds validation of the paired stats values for the int8 and
int16 data types when reading min/max column stats values from a
Parquet file.

Testing:
1) Manual test - create a table with an int32 column, insert some
   values, then alter the table to change the column data type to
   tinyint (int8) or smallint (int16), and make sure the query returns
   the correct number of rows when PARQUET_READ_STATISTICS is set to 1.
2) Passed pre-review-test on Jenkins.

Change-Id: Id8bdaf4c4b2d0c6ea26d6e9bf013afca647e53a1
---
M be/src/exec/parquet/hdfs-parquet-scanner.cc
M be/src/exec/parquet/parquet-column-stats.cc
M be/src/exec/parquet/parquet-column-stats.h
3 files changed, 40 insertions(+), 9 deletions(-)

  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/87/15087/1
--
To view, visit http://gerrit.cloudera.org:8080/15087
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newchange
Gerrit-Change-Id: Id8bdaf4c4b2d0c6ea26d6e9bf013afca647e53a1
Gerrit-Change-Number: 15087
Gerrit-PatchSet: 1
Gerrit-Owner: Wenzhe Zhou <[email protected]>
Gerrit-Reviewer: Bikramjeet Vig <[email protected]>
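
For readers unfamiliar with the issue, the kind of check the commit message describes can be sketched as follows. This is only an illustration under stated assumptions, not the code in this patch: the helper names StatsFitInType and CanSkipRowGroupForLessThan are hypothetical, and the sketch assumes the min/max stats are stored in the file as 32-bit integers while the table column has since been narrowed to tinyint or smallint.

```cpp
#include <cstdint>
#include <limits>

// Hypothetical helper (not Impala's actual API): returns true only when both
// stats values, stored as 32-bit integers in the Parquet file, fit in the
// narrower column type T (for example int8_t or int16_t).
template <typename T>
bool StatsFitInType(int32_t min_value, int32_t max_value) {
  static_assert(sizeof(T) <= sizeof(int32_t), "T must not be wider than int32");
  constexpr int32_t lo = std::numeric_limits<T>::min();
  constexpr int32_t hi = std::numeric_limits<T>::max();
  return min_value >= lo && min_value <= hi &&
         max_value >= lo && max_value <= hi;
}

// Hypothetical decision function for the predicate "col < bound": the row
// group may be skipped only if the stats are valid for T and even the
// minimum value fails the predicate. Out-of-range stats are treated as
// unusable, so the row group is read instead of being filtered.
template <typename T>
bool CanSkipRowGroupForLessThan(int32_t min_value, int32_t max_value, T bound) {
  if (!StatsFitInType<T>(min_value, max_value)) return false;
  return static_cast<T>(min_value) >= bound;
}
```

With this sketch, a file written with the int32 value 1000 and later read as tinyint yields stats that fail the range check, so the row group is scanned rather than skipped on the basis of stats that cannot be represented in the narrowed type.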
