Skye Wanderman-Milne has uploaded a new change for review. http://gerrit.cloudera.org:8080/2531
Change subject: IMPALA-2069: add USE_UTF8_PARQUET_STRINGS query option ...................................................................... IMPALA-2069: add USE_UTF8_PARQUET_STRINGS query option This option toggles whether the parquet writer will use the UTF8 annotation for string columns. This patch includes a test that writes a table using this option and verifies that it can read it back correctly, but doesn't actually check that the annotation is used. I manually verified that the annotation is there, but am not sure how to progamatically check this without introducing a dependency on parquet-tools or something else that will output a parquet file's schema. Change-Id: I030c9f5c6272e09c1ce133f66234e3cfb26b68d4 --- M be/src/exec/hdfs-parquet-scanner.cc M be/src/exec/hdfs-parquet-table-writer.cc M be/src/exec/hdfs-parquet-table-writer.h M be/src/service/query-options.cc M be/src/service/query-options.h M common/thrift/ImpalaInternalService.thrift M common/thrift/ImpalaService.thrift A testdata/workloads/functional-query/queries/QueryTest/parquet-use-utf8.test M tests/query_test/test_scanners.py 9 files changed, 48 insertions(+), 6 deletions(-) git pull ssh://gerrit.cloudera.org:29418/Impala refs/changes/31/2531/1 -- To view, visit http://gerrit.cloudera.org:8080/2531 To unsubscribe, visit http://gerrit.cloudera.org:8080/settings Gerrit-MessageType: newchange Gerrit-Change-Id: I030c9f5c6272e09c1ce133f66234e3cfb26b68d4 Gerrit-PatchSet: 1 Gerrit-Project: Impala Gerrit-Branch: cdh5-trunk Gerrit-Owner: Skye Wanderman-Milne <[email protected]>
