Re: Clarification: Merging and getRowKeyAtOrBefore

Andra Adams Mon, 21 Jul 2008 17:14:46 -0700

Thanks Bryan and Stack for the answers to my questions!

About Question 1, I fully understand the need for a Merge tool fornon-adjacent regions in broken tables. Thanks for the extra clarification.

About Question 2, I understand the need for getRowKeyAtOrBefore and itsoverall goal, and ditto for getFull. What I don't understand is theprocess of finding the best candidate keys in getRowKeyAtOrBefore.

In my previous post, I was comparing getRowKeyAtOrBefore to getFullsince they are slightly similar operations. They both must look throughthe entire Memcache (mc and snapshot) as well as every single HStoreFileon disk (as Bryan confirmed). And they both must handle deletes, whichis where I get lost with getRowKeyAtOrBefore.

getFull keeps a set of deleted items and since it is looking in reversechronological order over the data set, these deleted items are alwaysmore recent than the current item being analyzed. Thus laterdiscoveries of items that match entries in te deleted set will be ignored.

getRowKeyAtOrBefore takes the opposite approach, collecting candidatekeys from the sources in no chronological order, and removing items fromthis candidate set if they are later found to be deleted. Whatguarantees that there will always remain at least one key in the set ofcandidate keys when the Memcache is finally searched? What if theMemcache contains only deletes, such that every candidate key that waschosen from the HStoreFiles is now discovered to have been deleted?Isn't this a possible scenario, and couldn't it be avoided by searchingthe data in reverse chronological order like getFull (keeping a list ofdeletes and moving backwards in time, rather than keeping a list ofcandidate keys and moving forwards)?


Thanks,
Andra


stack wrote:

An attempt at answering first question is inlined below.

Bryan Duxbury wrote:

My replies to the second question inline. Feel free to ask follow ups.

-Bryan

On Jun 26, 2008, at 5:24 PM, Andra Adams wrote:
Hi,
I've been looking through the HBase code and I was wondering if Icould get some clarification on two points.
1. Why doesn't HRegion's static merge method check that the tworegions specified are adjacent?

Originally, merge would only allow merging of adjacent regions but acouple of months back, we had bugs that could manufacture regions withoverlapping keys. To fix damaged clusters, the merge tool was amendedto remove the adjacency check and refactored so it could merge overlapsas well as adjacents (See unit tests for merge tool. IIRC, it includestests that merge adjacent and overlapping regions).

Yes, if an operator tries to merge non-adjacents, they'll do damage. Weshould add back some smarts that guard against this (If you don't filean issue, I will).


Thanks for reminding us of this hole,
St.Ack

As far as I can tell, HRegion's merge method is called from the Mergetool which gets its region names from command line arguments. As faras I can see, merging non-adjacent regions would break many of theassertions that HBase depends on, yet all calls to HRegion's mergemethod result in a merged region. So how come the caller of theMerge tool is being trusted to ensure the adjacency of the regions itis specifying on the command line? ( Although admittedly, theadjacency check could be quite computationally-expensive since itwould involve a complete scan of all regions in the "parent" METAtable (either .META. or -ROOT-) to ensure that there are no regionsin the "daughter" (either a user table or .META.) table that have astart key between the end key and start key of the regions beingasked to merge).
2. Can I get an overview of the algorithm used to determine the bestcandidate key in HStore's getRowKeyAtOrBefore (including Memcache'sinternalGetRowKeyAtOrBefore, and HStore's rowAtOrBeforeFromMapFile)?
I'm having trouble figuring out why HStore's getFull method looksthrough the mc, snapshot and storefiles in reverse chronologicalorder (i.e. mc, then snapshot, then store files), while thegetRowKeyAtOrBefore looks through the storefiles, then the mc, thenthe snapshot (in apparently no chronological order...?). Why doesgetFull create a map of deletes (and older entries check this mapbefore inserting their values in the results map), whilegetRowAtOrBefore opts to remove entries from the results map if adelete is found at a later time?
Aside from the difference in style between getFull andgetRowAtOrBefore, I'm also wondering why the discovery of a deletedvalue sometimes removes that key from the candidateKeys map, andother times is simply ignored. (It could be that I'm missing some ofthe concepts behind the algorithm).
The idea of getRowKeyAtOrBefore is to discover the row that comesimmediately before or right upon the search row. This is usedexclusively when trying to locate which region a key resides in. Thereasoning behind this is a little tricky. Regions in HBase are keyedon their start row, which is inclusive. The end row is implied by thepresence of the next region. So, when you have an arbitrary key you'dlike to perform some operation on, you need to find the region whichcontains it, which you can only know by scanning past it.
getRowKeyAtOrBefore is a specific, internal-only RPC method that doesthis operation. In order to actually do the work, at the HStore level,we have to decide amongst the possible keys that presented by thememcache (including the snapshot) and all of the store files. Theorder here is unimportant, because ultimately, we're going to have tolook at every one of those things unless we encounter a precise match.Moreover, there could be deletes in any one of them, so we have tocarry the candidates along with us and apply the deletes where theyare required. The reasoning here is that if a row is completelydeleted, that is, all cells are suppressed by deletes, even if itmatches precisely, we don't want to return it as a candidate key.Deletes are ignored when the don't apply to the data we've alreadyfound, usually because there's a newer piece of data than there is adelete (this is simply a memory optimization).
Likewise, getFull tries to find a whole row of information about a keyat a time. We need to follow deletes around here for the same reasonthat we do it in regular get: we don't want to return deleted data. Wego in reverse chronological order here because that allows the mostrecent data to easily take precedence.
Thanks,
Andra

[EMAIL PROTECTED]

Re: Clarification: Merging and getRowKeyAtOrBefore

Reply via email to