Hello Adar Dembo,
I'd like you to do a code review. Please visit
to review the following change.
Change subject: KUDU-1538 (WIP): prevent LBM block ID reuse to avoid potential
KUDU-1538 (WIP): prevent LBM block ID reuse to avoid potential data loss
This changes the LogBlockManager to allocate block IDs sequentially
rather than randomly. Given our 64-bit block IDs, this prevents ever
reusing an ID (it would take thousands of years even at unrealistically
high allocation rates).
The trickiness of this patch is that, in many unit tests, the BlockCache
singleton ends up persisting across multiple separate LogBlockManagers.
Even though the test has torn down and recreated a new block manager,
the BlockCache continues to cache entries from the previous block manager.
With the block IDs starting from '1', we would be sure to have a collision
and many tests failed.
The workaround is for the LBM to notice when it is running in a gtest
(by way of some weak symbol magic) and start its allocation at a
random point in block space, rather than starting at 1.
Unfortunately there's no real way to write a regression test for this:
it would only produce itself after inserting tens of terabytes of data
in the presence of lots of remote bootstraps, etc.
WIP because we probably should do the same fix for FBM.
4 files changed, 38 insertions(+), 9 deletions(-)
git pull ssh://gerrit.cloudera.org:29418/kudu refs/changes/19/3719/1
To view, visit http://gerrit.cloudera.org:8080/3719
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings
Gerrit-Owner: Todd Lipcon <t...@apache.org>
Gerrit-Reviewer: Adar Dembo <a...@cloudera.com>