Skye Wanderman-Milne has uploaded a new change for review.

  http://gerrit.cloudera.org:8080/2531

Change subject: IMPALA-2069: add USE_UTF8_PARQUET_STRINGS query option
......................................................................

IMPALA-2069: add USE_UTF8_PARQUET_STRINGS query option

This option toggles whether the parquet writer will use the UTF8
annotation for string columns. This patch includes a test that writes
a table using this option and verifies that it can read it back
correctly, but doesn't actually check that the annotation is used. I
manually verified that the annotation is there, but am not sure how to
progamatically check this without introducing a dependency on
parquet-tools or something else that will output a parquet file's
schema.

Change-Id: I030c9f5c6272e09c1ce133f66234e3cfb26b68d4
---
M be/src/exec/hdfs-parquet-scanner.cc
M be/src/exec/hdfs-parquet-table-writer.cc
M be/src/exec/hdfs-parquet-table-writer.h
M be/src/service/query-options.cc
M be/src/service/query-options.h
M common/thrift/ImpalaInternalService.thrift
M common/thrift/ImpalaService.thrift
A testdata/workloads/functional-query/queries/QueryTest/parquet-use-utf8.test
M tests/query_test/test_scanners.py
9 files changed, 48 insertions(+), 6 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala refs/changes/31/2531/1
-- 
To view, visit http://gerrit.cloudera.org:8080/2531
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-MessageType: newchange
Gerrit-Change-Id: I030c9f5c6272e09c1ce133f66234e3cfb26b68d4
Gerrit-PatchSet: 1
Gerrit-Project: Impala
Gerrit-Branch: cdh5-trunk
Gerrit-Owner: Skye Wanderman-Milne <[email protected]>

Reply via email to