Yao Xu has uploaded this change for review. (
http://gerrit.cloudera.org:8080/14046 )


Change subject: KUDU-2917 Split a tablet into primary key ranges by number of 
rows
......................................................................

KUDU-2917 Split a tablet into primary key ranges by number of rows

Since KUDU-2437 and KUDU-2670 were implemented, a Spark job can read
the data inside a tablet in parallel.

However, we found in actual use that splitting key ranges by size may
produce long-tail Spark tasks. This is caused by column-wise encoding
and compression: chunks of similar on-disk size can hold very different
numbers of rows.

So I added the ability to split key ranges by number of rows. A new
key range is created as soon as either the 'target_chunk_size' or the
'target_chunk_rows' threshold is reached.

Change-Id: Ib859c615743ebf7c3b481dfe1bd428769b02f0d7
---
M src/kudu/common/common.proto
M src/kudu/common/key_range.cc
M src/kudu/common/key_range.h
M src/kudu/tablet/mock-rowsets.h
M src/kudu/tablet/rowset_info.cc
M src/kudu/tablet/rowset_info.h
M src/kudu/tablet/tablet-test.cc
M src/kudu/tablet/tablet.cc
M src/kudu/tablet/tablet.h
M src/kudu/tserver/tablet_server-test.cc
M src/kudu/tserver/tablet_service.cc
M src/kudu/tserver/tserver.proto
12 files changed, 266 insertions(+), 133 deletions(-)



  git pull ssh://gerrit.cloudera.org:29418/kudu refs/changes/46/14046/1
--
To view, visit http://gerrit.cloudera.org:8080/14046
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: kudu
Gerrit-Branch: master
Gerrit-MessageType: newchange
Gerrit-Change-Id: Ib859c615743ebf7c3b481dfe1bd428769b02f0d7
Gerrit-Change-Number: 14046
Gerrit-PatchSet: 1
Gerrit-Owner: Yao Xu <[email protected]>
