Internal Jenkins has submitted this change and it was merged.

Change subject: IMPALA-2069: add USE_UTF8_PARQUET_STRINGS query option
......................................................................

IMPALA-2069: add USE_UTF8_PARQUET_STRINGS query option

This option toggles whether the parquet writer will use the UTF8
annotation for string columns. This patch includes a test that writes a
table with or without this option, then verifies that the annotation is
or isn't present using a new get_parquet_metadata Python utility.

Change-Id: I030c9f5c6272e09c1ce133f66234e3cfb26b68d4
Reviewed-on: http://gerrit.cloudera.org:8080/2531
Reviewed-by: Dan Hecht <[email protected]>
Tested-by: Internal Jenkins
---
M be/src/exec/hdfs-parquet-scanner.cc
M be/src/exec/hdfs-parquet-table-writer.cc
M be/src/service/query-options.cc
M be/src/service/query-options.h
M common/thrift/ImpalaInternalService.thrift
M common/thrift/ImpalaService.thrift
M tests/query_test/test_scanners.py
A tests/util/get_parquet_metadata.py
8 files changed, 143 insertions(+), 7 deletions(-)

Approvals:
  Internal Jenkins: Verified
  Dan Hecht: Looks good to me, approved

--
To view, visit http://gerrit.cloudera.org:8080/2531
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-MessageType: merged
Gerrit-Change-Id: I030c9f5c6272e09c1ce133f66234e3cfb26b68d4
Gerrit-PatchSet: 8
Gerrit-Project: Impala
Gerrit-Branch: cdh5-trunk
Gerrit-Owner: Skye Wanderman-Milne <[email protected]>
Gerrit-Reviewer: Casey Ching <[email protected]>
Gerrit-Reviewer: Dan Hecht <[email protected]>
Gerrit-Reviewer: Internal Jenkins
Gerrit-Reviewer: Skye Wanderman-Milne <[email protected]>
