Quanlong Huang has uploaded this change for review. ( http://gerrit.cloudera.org:8080/18228
Change subject: IMPALA-11120: Fix codec not set in generating ORC tables ...................................................................... IMPALA-11120: Fix codec not set in generating ORC tables We use 'mapred.output.compression.codec' to set the compression codec in generating test files by Hive. However, it doesn't affect ORC files. Instead, we need to set 'orc.compress' in tblproperties for each ORC tables. The default value of 'orc.compress' is ZLIB which corresponds to our 'def' codec. We only need to set it for non-def codecs. This patch also fixes a bug in build_compression_codec_statement() that would raise KeyError when loading lz4 non-avro tables. Tests - Loaded tpch data in orc/none/none, orc/def/block, orc/snap/block, orc/lz4/block and verified there compression codecs. Change-Id: I02bd5d9400864145133ff019a3d076a6cab36fcc --- M testdata/bin/generate-schema-statements.py 1 file changed, 9 insertions(+), 1 deletion(-) git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/28/18228/1 -- To view, visit http://gerrit.cloudera.org:8080/18228 To unsubscribe, visit http://gerrit.cloudera.org:8080/settings Gerrit-Project: Impala-ASF Gerrit-Branch: master Gerrit-MessageType: newchange Gerrit-Change-Id: I02bd5d9400864145133ff019a3d076a6cab36fcc Gerrit-Change-Number: 18228 Gerrit-PatchSet: 1 Gerrit-Owner: Quanlong Huang <[email protected]>
