Adar Dembo has submitted this change and it was merged.

Change subject: KUDU-1751: Change default encodings
......................................................................


KUDU-1751: Change default encodings

* Change numeric (int/float/double) encodings to BIT_SHUFFLE

BIT_SHUFFLE is a better default than PLAIN since it's much more compact
and generally performs better.

* Change BINARY encodings to DICT_ENCODING

This is the default in Parquet, and we've seen that it's a common reason
that Kudu performs poorly. This automatically falls back to
non-dict-encoded for high-cardinality data.

Change-Id: I32db89337026eb6be13333ff450a6cb2b2862f7a
Reviewed-on: http://gerrit.cloudera.org:8080/5169
Tested-by: Kudu Jenkins
Reviewed-by: Adar Dembo <[email protected]>
---
M src/kudu/cfile/cfile-test.cc
M src/kudu/cfile/type_encodings.cc
M src/kudu/tablet/diskrowset-test.cc
3 files changed, 14 insertions(+), 14 deletions(-)

Approvals:
  Adar Dembo: Looks good to me, approved
  Kudu Jenkins: Verified



-- 
To view, visit http://gerrit.cloudera.org:8080/5169
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-MessageType: merged
Gerrit-Change-Id: I32db89337026eb6be13333ff450a6cb2b2862f7a
Gerrit-PatchSet: 11
Gerrit-Project: kudu
Gerrit-Branch: master
Gerrit-Owner: Todd Lipcon <[email protected]>
Gerrit-Reviewer: Adar Dembo <[email protected]>
Gerrit-Reviewer: Dan Burkert <[email protected]>
Gerrit-Reviewer: David Ribeiro Alves <[email protected]>
Gerrit-Reviewer: Jean-Daniel Cryans <[email protected]>
Gerrit-Reviewer: Kudu Jenkins
Gerrit-Reviewer: Todd Lipcon <[email protected]>

Reply via email to