Skye Wanderman-Milne has uploaded a new patch set (#2). Change subject: IMPALA-2069: add USE_UTF8_PARQUET_STRINGS query option ......................................................................
IMPALA-2069: add USE_UTF8_PARQUET_STRINGS query option This option toggles whether the parquet writer will use the UTF8 annotation for string columns. This patch includes a test that writes a table using this option and verifies that it can read it back correctly, but doesn't actually check that the annotation is used. I manually verified that the annotation is there, but am not sure how to progamatically check this without introducing a dependency on parquet-tools or something else that will output a parquet file's schema. Change-Id: I030c9f5c6272e09c1ce133f66234e3cfb26b68d4 --- M be/src/exec/hdfs-parquet-scanner.cc M be/src/exec/hdfs-parquet-table-writer.cc M be/src/service/query-options.cc M be/src/service/query-options.h M common/thrift/ImpalaInternalService.thrift M common/thrift/ImpalaService.thrift A testdata/workloads/functional-query/queries/QueryTest/parquet-use-utf8.test M tests/query_test/test_scanners.py 8 files changed, 43 insertions(+), 6 deletions(-) git pull ssh://gerrit.cloudera.org:29418/Impala refs/changes/31/2531/2 -- To view, visit http://gerrit.cloudera.org:8080/2531 To unsubscribe, visit http://gerrit.cloudera.org:8080/settings Gerrit-MessageType: newpatchset Gerrit-Change-Id: I030c9f5c6272e09c1ce133f66234e3cfb26b68d4 Gerrit-PatchSet: 2 Gerrit-Project: Impala Gerrit-Branch: cdh5-trunk Gerrit-Owner: Skye Wanderman-Milne <[email protected]> Gerrit-Reviewer: Dan Hecht <[email protected]> Gerrit-Reviewer: Skye Wanderman-Milne <[email protected]>
