Hello St.Ack,
thanks for your answer.
I will vote on HBASE-880, that's exactly what I needed :)
Except of the little glitch in the webinterface regarding the ending key,
that table seems to be ok then. I thought regions would get deleted. I'm
adding new data (Timestamp of current time is the key), and when I have
processed that data, I'm deleting the data. That's the reason why I get more
and more regions... (because as soon as I add data that spans over two
regions, the first region won't be removed when I have deleted that data).
Here is the meta info of that region (NAME =>
'tobeprocessed,12293840182411045696639,1229385024829') and the regions
around that region
.........
tobeprocessed,1229383104274 column=info:regioninfo,
timestamp=1229384271404, value=REGION => {NAME => 'tobepr
796785789,1229384269280 ocessed,1229383104274796785789,1229384269280',
STARTKEY => '122938310427479678578
9', ENDKEY => '12293837601871695303679',
ENCODED => 1839260151, TABLE => {{NAME =
> 'tobeprocessed', IS_ROOT => 'false', IS_META
=> 'false', FAMILIES => [{NAME =>
'data', BLOOMFILTER => 'false', COMPRESSION =>
'NONE', VERSIONS => '1', LENGTH =>
'2147483647', TTL => '-1', IN_MEMORY =>
'false', BLOCKCACHE => 'false'}], INDEXE
S => []}}
tobeprocessed,1229383760187 column=info:regioninfo,
timestamp=1229385036201, value=REGION => {NAME => 'tobepr
1695303679,1229385024829 ocessed,12293837601871695303679,1229385024829',
STARTKEY => '12293837601871695303
679', ENDKEY => '12293840182411045696639',
ENCODED => 1597707466, TABLE => {{NAME
=> 'tobeprocessed', IS_ROOT => 'false',
IS_META => 'false', FAMILIES => [{NAME =
> 'data', BLOOMFILTER => 'false', COMPRESSION
=> 'NONE', VERSIONS => '1', LENGTH
=> '2147483647', TTL => '-1', IN_MEMORY =>
'false', BLOCKCACHE => 'false'}], INDE
XES => []}}
tobeprocessed,1229384018241 column=info:regioninfo,
timestamp=1229385036202, value=REGION => {NAME => 'tobepr
1045696639,1229385024829 ocessed,12293840182411045696639,1229385024829',
STARTKEY => '12293840182411045696
639', ENDKEY => '', ENCODED => 2091963726,
TABLE => {{NAME => 'tobeprocessed', IS
_ROOT => 'false', IS_META => 'false', FAMILIES
=> [{NAME => 'data', BLOOMFILTER =
> 'false', COMPRESSION => 'NONE', VERSIONS =>
'1', LENGTH => '2147483647', TTL =>
'-1', IN_MEMORY => 'false', BLOCKCACHE =>
'false'}], INDEXES => []}}
tobeprocessed,1229387389554 column=info:regioninfo,
timestamp=1229423191262, value=REGION => {NAME => 'tobepr
1519264796,1229423166282 ocessed,12293873895541519264796,1229423166282',
STARTKEY => '12293873895541519264
796', ENDKEY => '12293964745931154000190',
ENCODED => 136070661, TABLE => {{NAME
=> 'tobeprocessed', IS_ROOT => 'false', IS_META
=> 'false', FAMILIES => [{NAME =>
'data', BLOOMFILTER => 'false', COMPRESSION =>
'NONE', VERSIONS => '1', LENGTH =
> '2147483647', TTL => '-1', IN_MEMORY =>
'false', BLOCKCACHE => 'false'}], INDEX
ES => []}}
tobeprocessed,1229396474593 column=info:regioninfo,
timestamp=1229427246552, value=REGION => {NAME => 'tobepr
1154000190,1229427238214 ocessed,12293964745931154000190,1229427238214',
STARTKEY => '12293964745931154000
190', ENDKEY => '12294032229291208031240',
ENCODED => 1505006567, TABLE => {{NAME
=> 'tobeprocessed', IS_ROOT => 'false',
IS_META => 'false', FAMILIES => [{NAME =
> 'data', BLOOMFILTER => 'false', COMPRESSION
=> 'NONE', VERSIONS => '1', LENGTH
=> '2147483647', TTL => '-1', IN_MEMORY =>
'false', BLOCKCACHE => 'false'}], INDE
XES => []}}
............
But I have another table (rsssources), which when I scanned it yesterday had
> 400 000 entries (count 'rsssources'), and today only has 180 000 entries
(after killing hbase (kill -9), because it was unresponsive. I'm also
logging the DEBUG entries now to see what's happening at that point).
When I execute a mapreduce job, a few regions don't seem to have any data in
them. I did however never delete any data in that table (just replaces).
I did increase the timeout values, because I read somewhere else that it
would help in some cases. But I will reset the values to their original
values.
What's the best way to stop hbase when the hbase-stop script doesn't work.
(Sometimes it just runs for hours... (probably a deadlock somewhere)?
I'm waiting now for hbase to shut down, and will try to run the merge script
on those two tables.
Thanks,
Thibaut
--
View this message in context:
http://www.nabble.com/Table-with-80-regions-having-nearly-no-data-in-it-tp21058687p21074408.html
Sent from the HBase User mailing list archive at Nabble.com.