Sailesh Mukil has uploaded a new patch set (#5). Change subject: IMPALA-2466: Add more tests for the HDFS parquet scanner. ......................................................................
IMPALA-2466: Add more tests for the HDFS parquet scanner. These tests functionally test whether the following type of files are able to be scanned properly: 1) Add a parquet file with multiple blocks such that each node has to scan multiple blocks. 2) Add a parquet file with multiple blocks but only one row group that spans the entire file. Only one scan range should do any work in this case. Change-Id: I4faccd9ce3fad42402652c8f17d4e7aa3d593368 --- M testdata/LineItemMultiBlock/README.dox A testdata/LineItemMultiBlock/lineitem_one_row_group.parquet A testdata/LineItemMultiBlock/lineitem_sixblocks.parquet M testdata/bin/create-load-data.sh M testdata/datasets/functional/functional_schema_template.sql M testdata/datasets/functional/schema_constraints.csv M testdata/workloads/functional-query/queries/QueryTest/parquet.test M tests/query_test/test_scanners.py 8 files changed, 163 insertions(+), 17 deletions(-) git pull ssh://gerrit.cloudera.org:29418/Impala refs/changes/00/1500/5 -- To view, visit http://gerrit.cloudera.org:8080/1500 To unsubscribe, visit http://gerrit.cloudera.org:8080/settings Gerrit-MessageType: newpatchset Gerrit-Change-Id: I4faccd9ce3fad42402652c8f17d4e7aa3d593368 Gerrit-PatchSet: 5 Gerrit-Project: Impala Gerrit-Branch: cdh5-trunk Gerrit-Owner: Sailesh Mukil <[email protected]> Gerrit-Reviewer: Dan Hecht <[email protected]> Gerrit-Reviewer: Marcel Kornacker <[email protected]> Gerrit-Reviewer: Sailesh Mukil <[email protected]> Gerrit-Reviewer: Skye Wanderman-Milne <[email protected]>
