Hello Kudu Jenkins, Andrew Wong, Adar Dembo,

I'd like you to reexamine a change. Please visit

    http://gerrit.cloudera.org:8080/12489

to look at the new patch set (#2).

Change subject: Enable --rowset_metadata_store_keys by default
......................................................................

Enable --rowset_metadata_store_keys by default

While starting up a cluster where each tablet server had ~4000 replicas,
I noticed many replicas spent seconds opening. I traced this to having
to read the rowset bounds for each rowset from disk. It appears to be
the only I/O that occurs when opening the tablet replica. I think we
should try turning this flag on (at the beginning of the 1.10 release
cycle) to see if it would be a good idea to have it on by default. We
can always revert this change if there are problems. If it goes well
enough and there's no use case for omitted the rowset bounds, maybe
the flag should be removed altogether and the only behavior will be
to store the bounds.

Expected benefit:
1. Significantly decreased startup time when there are a lot of
   replicas. See c127477b52 for some numbers.

Expected cost:
1. Bigger tablet metadata files, and therefore bigger flushes of tablet
   metadata. Note that tablet metadata size is already (in part)
   proportional to the number of rowsets, so the metadata will not grow
   differently.
2. Initting the key file or ad hoc index is done at first scan instead
   of at tablet start.

Also:
Good: This is backwards compatible-- old servers will ignore the new
      field and read the bounds from disk.
Good: This is forwards compatible-- new servers will read the bounds
      from disk if the new field is missing.
Bad:  This won't affect rowsets that are already on disk, so read-only
      tablets written in previous versions will never see any benefit
      from this patch.

Change-Id: I2c4ac4b58845666cd10102fb125fc787c637e473
---
M src/kudu/tablet/diskrowset.cc
1 file changed, 1 insertion(+), 2 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/kudu refs/changes/89/12489/2
--
To view, visit http://gerrit.cloudera.org:8080/12489
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: kudu
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: I2c4ac4b58845666cd10102fb125fc787c637e473
Gerrit-Change-Number: 12489
Gerrit-PatchSet: 2
Gerrit-Owner: Will Berkeley <[email protected]>
Gerrit-Reviewer: Adar Dembo <[email protected]>
Gerrit-Reviewer: Andrew Wong <[email protected]>
Gerrit-Reviewer: Kudu Jenkins (120)

Reply via email to