Toby White wrote:
Thanks - yes, the symptoms do look very similar to HBASE-29. However, by my reading, that problem ought to go away after compaction; but in this case it doesn't: after compaction, the duplicate cells are all still there. (I *think* the order in which they are reported sometimes changes after compaction, but I can't tell reliably.) I might be misunderstanding, though.
Regarding 'major' compactions (these happen every 24 hours by default, and are distinct from the 'minor' compactions triggered by the count of files in the filesystem): on a major compaction we clear any cells beyond the designated MAX_VERSIONS and anything older than the designated TTL. For the latter, it's the cell's timestamp/version that we use. Another tangle is that 'our' keys are made of row/column/timestamp, and edits first go into a sorted Map. If two edits have the same r/c/t, the later one will overwrite the older (unless there have been flushes between the inserts -- have there been in your case?).
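In toy Python (nothing like our actual internals, just the shape of the behaviour), the overwrite looks like this:

memcache = {}

def insert(row, column, timestamp, value):
    # Edits are keyed on exactly (row, column, timestamp), so a second
    # edit with the same key silently replaces the first one in memory.
    memcache[(row, column, timestamp)] = value

insert('row1', 'value:', 1224504133000, '1013.0')
insert('row1', 'value:', 1224504133000, '9999.0')  # same r/c/t: overwrites

assert len(memcache) == 1  # only the latest edit survives in memory

# But if a flush writes the first edit out to a file before the second
# edit arrives, both copies end up on disk, and a read can see duplicates.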
I'm thinking each edit needs to carry two timestamps to solve HBASE-29: the user-designated one, and another that records the actual insert time. The latter would be used during major compactions to figure a cell's TTL, and to distinguish two edits of the same r/c/t so we don't overwrite edits of an older vintage.
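A rough sketch of what I mean (all the names here are made up; none of this exists in the code yet):

import time
from dataclasses import dataclass, field

@dataclass(frozen=True)
class Edit:
    row: str
    column: str
    user_timestamp: int  # the client-designated version; still the sort key
    # Wall-clock arrival time, recorded when the edit is inserted.
    insert_time: int = field(default_factory=lambda: int(time.time() * 1000))

    def expired(self, ttl_ms: int, now_ms: int) -> bool:
        # Major compaction judges TTL on actual insert time,
        # not on the user-designated timestamp.
        return now_ms - self.insert_time > ttl_ms

edits = {}

def insert(e, value):
    # With insert_time part of the key, two edits sharing the same
    # row/column/user_timestamp no longer collide and overwrite.
    edits[(e.row, e.column, e.user_timestamp, e.insert_time)] = value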
St.Ack
Toby

On 14 Dec 2008, at 22:41, stack wrote:

The below looks like the known issue, HBASE-29 'HStore#get and HStore#getFull may not return expected values by timestamp when there is more than one MapFile'. What do you think Toby? Basically, if updates do not go in in chronological order, you'll get unexpected results. We need to fix this, but we first need to get ourselves set up with some smarter internals before we can address it (though, that said, I'd think your particular case shouldn't be that hard to make work).

St.Ack

Toby White wrote:

Sorry for the very slow response - local priorities changed and I didn't have a chance to respond properly before.

The issue described previously is still occurring (brief recap: hbase is reporting cells with duplicate timestamps; see the quoted output below). I originally saw this with 0.18.0. I've now checked, and I still see it with 0.18.1 (both on hadoop 0.18.1) and current trunk:r725828 (with hdfs upgraded to run on hadoop 0.19.0). This is running in pseudo-distributed mode.

I've been able to narrow down the trigger a bit. I can't cause it to happen entirely reproducibly, but it seems to occur only when I've done the following (a rough Python version of the cycle is at the end of this mail):

* Create a row.
* Add lots of data at different timestamps into one column (thrift mutateRowTs or shell put).
* Delete all data in that column, or indeed the entire row (thrift deleteAll or deleteAllRow or shell deleteall).
* At this point, hbase reports that the row has indeed been removed.
* Recreate the row, and put data back into the same column, at the same timestamps, but with potentially different values (thrift mutateRowTs / shell put).
* On reading the row, hbase seems to see and report back both the newly-added data and the data previously deleted (thrift getVer / shell get).

On a row where this has happened once, it seems to happen almost always thereafter, each time appending a whole new set of data. So, if you're adding/removing 100 cells at a time, the total number of cells hbase reports back will grow by 100 every time you repeat the cycle. On a row where it hasn't happened yet, the delete behaviour usually seems correct.

The problem is observable working either through the Python thrift interface or directly through the hbase JRuby shell.

The HDFS filesystem on which I'm observing this is fairly small (under 100Mb compressed); I can forward it for debugging off-list if that's helpful. I'd be grateful for any help sorting this out.
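The cycle, roughly, as I drive it from Python (the hbase module names are whatever the Thrift compiler generated from Hbase.thrift, and the table/row names here are placeholders, not my real ones):

from thrift.transport import TSocket, TTransport
from thrift.protocol import TBinaryProtocol
from hbase import Hbase
from hbase.ttypes import Mutation

transport = TTransport.TBufferedTransport(TSocket.TSocket('localhost', 9090))
protocol = TBinaryProtocol.TBinaryProtocol(transport)
client = Hbase.Client(protocol)
transport.open()

TABLE, ROW, COL = 'my_table', 'row1', 'value:'
TIMESTAMPS = [1224414200000, 1224415701000, 1224415961000]

def put_series():
    # One cell per fixed timestamp, all into the same column.
    for i, ts in enumerate(TIMESTAMPS):
        client.mutateRowTs(TABLE, ROW,
                           [Mutation(column=COL, value=str(1008 + i))], ts)

put_series()                     # populate the column
client.deleteAllRow(TABLE, ROW)  # delete the whole row; a read now finds nothing
put_series()                     # re-insert at the very same timestamps

cells = client.getVer(TABLE, ROW, COL, 4000)
print(len(cells))  # should be 3; on an affected row it grows by 3 per cycle

transport.close()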
Toby

On 20 Oct 2008, at 17:51, Jean-Daniel Cryans wrote:

Toby,

Can you tell us more about your setup? Numbers of machines, if NTP is installed and running, number of regions in your table and other useful stuff.

Thx,
J-D

On Mon, Oct 20, 2008 at 11:11 AM, Toby White <[email protected]> wrote:

I'm seeing a strange effect on my hbase instance. Sometimes, on requesting the full history of a column, I get back individual cells several times over. That is, I'm getting results like this:

hbase(main):006:0* get 'my_table', 'scw9npU7Q4ma_khXqlDGXg', {COLUMN => 'value:', VERSIONS => 4000}
 timestamp=1224504133000, value=1013.0
 timestamp=1224502749000, value=1012.0
 timestamp=1224502749000, value=1012.0
 timestamp=1224499880000, value=1011.0
 timestamp=1224499880000, value=1011.0
 timestamp=1224499880000, value=1011.0
 timestamp=1224415961000, value=1010.0
 timestamp=1224415961000, value=1010.0
 timestamp=1224415961000, value=1010.0
 timestamp=1224415701000, value=1009.0
 timestamp=1224415701000, value=1009.0
 timestamp=1224415701000, value=1009.0
 timestamp=1224414200000, value=1008.0
 timestamp=1224414200000, value=1008.0
 timestamp=1224414200000, value=1008.0

This happens both through the hbase shell, as shown here, and when communicating with the server via thrift. In either case, the cells are reported either as shown above, that is, with each cell simply repeated several times (in this case, 3), or sometimes with the whole series repeated, something like this:

hbase(main):006:0* get 'golddigger', 'scw9npU7Q4ma_khXqlDGXg', {COLUMN => 'value:', VERSIONS => 4000}
 timestamp=1224504133000, value=1013.0
 timestamp=1224502749000, value=1012.0
 timestamp=1224499880000, value=1011.0
 timestamp=1224415961000, value=1010.0
 timestamp=1224415701000, value=1009.0
 timestamp=1224414200000, value=1008.0
 timestamp=1224504133000, value=1013.0
 timestamp=1224502749000, value=1012.0
 timestamp=1224499880000, value=1011.0
 timestamp=1224415961000, value=1010.0
 timestamp=1224415701000, value=1009.0
 timestamp=1224414200000, value=1008.0

or sometimes a combination of both, i.e. an entire series with each cell repeated a couple of times, and then the whole lot repeated again.

This doesn't happen with all rows, only some of them, apparently at random. Sometimes, restarting hbase and the underlying hdfs makes the problem go away; sometimes it doesn't, and the issue persists.

This is with hbase 0.18.0 on hadoop 0.18.1. Is this a known issue?
