Hello St.Ack,
thanks for your answer.

I will vote on HBASE-880, that's exactly what I needed :)

Except for the little glitch in the web interface regarding the end key,
that table seems to be OK then. I thought regions would get deleted. I'm
adding new data (the key is the timestamp of the current time), and once I
have processed that data, I delete it. That's why I get more and more
regions: as soon as I add data that spans two regions, the first region
isn't removed after I have deleted its data.
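
To make the pattern concrete, here is a toy model in Python. It is only a
sketch of my understanding, not HBase code: MAX_ROWS, insert,
delete_processed and simulate are names made up for illustration, and the
model splits on row count, whereas HBase splits on region size. The only
assumption that matters is that a region is never dropped when its rows are
deleted.

```python
# Toy model only, not HBase code. A region splits once it holds more than
# MAX_ROWS rows, and a region is never dropped when its rows are deleted.
MAX_ROWS = 4  # hypothetical split threshold (row count, for simplicity)


def insert(regions, key):
    """Append an ever-increasing key to the last region; split it if too big."""
    last = regions[-1]
    last.append(key)
    if len(last) > MAX_ROWS:
        mid = len(last) // 2
        # Replace the last region by its two halves.
        regions[-1:] = [last[:mid], last[mid:]]


def delete_processed(regions, up_to):
    """Delete every row with key < up_to; the regions themselves stay."""
    for i, rows in enumerate(regions):
        regions[i] = [k for k in rows if k >= up_to]


def simulate(batches, batch_size):
    """Insert batches of increasing keys, deleting each batch once processed."""
    regions = [[]]
    key = 0
    for _ in range(batches):
        for _ in range(batch_size):
            insert(regions, key)
            key += 1
        delete_processed(regions, key)  # batch processed, rows removed
    return regions
```

With simulate(batches=5, batch_size=10) this toy model ends up with 16
regions and no rows at all, which is the shape of what I see in my table.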

Here is the meta info for that region (NAME =>
'tobeprocessed,12293840182411045696639,1229385024829') and for the regions
around it:
.........
 tobeprocessed,1229383104274796785789,1229384269280
     column=info:regioninfo, timestamp=1229384271404,
     value=REGION => {NAME => 'tobeprocessed,1229383104274796785789,1229384269280',
       STARTKEY => '1229383104274796785789', ENDKEY => '12293837601871695303679',
       ENCODED => 1839260151,
       TABLE => {{NAME => 'tobeprocessed', IS_ROOT => 'false', IS_META => 'false',
         FAMILIES => [{NAME => 'data', BLOOMFILTER => 'false', COMPRESSION => 'NONE',
           VERSIONS => '1', LENGTH => '2147483647', TTL => '-1', IN_MEMORY => 'false',
           BLOCKCACHE => 'false'}], INDEXES => []}}
 tobeprocessed,12293837601871695303679,1229385024829
     column=info:regioninfo, timestamp=1229385036201,
     value=REGION => {NAME => 'tobeprocessed,12293837601871695303679,1229385024829',
       STARTKEY => '12293837601871695303679', ENDKEY => '12293840182411045696639',
       ENCODED => 1597707466,
       TABLE => {{NAME => 'tobeprocessed', IS_ROOT => 'false', IS_META => 'false',
         FAMILIES => [{NAME => 'data', BLOOMFILTER => 'false', COMPRESSION => 'NONE',
           VERSIONS => '1', LENGTH => '2147483647', TTL => '-1', IN_MEMORY => 'false',
           BLOCKCACHE => 'false'}], INDEXES => []}}
 tobeprocessed,12293840182411045696639,1229385024829
     column=info:regioninfo, timestamp=1229385036202,
     value=REGION => {NAME => 'tobeprocessed,12293840182411045696639,1229385024829',
       STARTKEY => '12293840182411045696639', ENDKEY => '',
       ENCODED => 2091963726,
       TABLE => {{NAME => 'tobeprocessed', IS_ROOT => 'false', IS_META => 'false',
         FAMILIES => [{NAME => 'data', BLOOMFILTER => 'false', COMPRESSION => 'NONE',
           VERSIONS => '1', LENGTH => '2147483647', TTL => '-1', IN_MEMORY => 'false',
           BLOCKCACHE => 'false'}], INDEXES => []}}
 tobeprocessed,12293873895541519264796,1229423166282
     column=info:regioninfo, timestamp=1229423191262,
     value=REGION => {NAME => 'tobeprocessed,12293873895541519264796,1229423166282',
       STARTKEY => '12293873895541519264796', ENDKEY => '12293964745931154000190',
       ENCODED => 136070661,
       TABLE => {{NAME => 'tobeprocessed', IS_ROOT => 'false', IS_META => 'false',
         FAMILIES => [{NAME => 'data', BLOOMFILTER => 'false', COMPRESSION => 'NONE',
           VERSIONS => '1', LENGTH => '2147483647', TTL => '-1', IN_MEMORY => 'false',
           BLOCKCACHE => 'false'}], INDEXES => []}}
 tobeprocessed,12293964745931154000190,1229427238214
     column=info:regioninfo, timestamp=1229427246552,
     value=REGION => {NAME => 'tobeprocessed,12293964745931154000190,1229427238214',
       STARTKEY => '12293964745931154000190', ENDKEY => '12294032229291208031240',
       ENCODED => 1505006567,
       TABLE => {{NAME => 'tobeprocessed', IS_ROOT => 'false', IS_META => 'false',
         FAMILIES => [{NAME => 'data', BLOOMFILTER => 'false', COMPRESSION => 'NONE',
           VERSIONS => '1', LENGTH => '2147483647', TTL => '-1', IN_MEMORY => 'false',
           BLOCKCACHE => 'false'}], INDEXES => []}}
............



But I have another table (rsssources) that had more than 400 000 entries
when I scanned it yesterday (count 'rsssources') and has only 180 000
entries today, after I killed hbase (kill -9) because it was unresponsive.
(I'm also logging the DEBUG entries now to see what's happening at that
point.) When I run a mapreduce job, a few regions don't seem to have any
data in them. However, I never deleted any data in that table (I only
replaced existing rows).

I did increase the timeout values because I read elsewhere that it would
help in some cases, but I will reset them to their original values.


What's the best way to stop hbase when the hbase-stop script doesn't work?
(Sometimes it just runs for hours, probably because of a deadlock somewhere.)

I'm waiting now for hbase to shut down, and will try to run the merge script
on those two tables.

Thanks,
Thibaut



-- 
View this message in context: 
http://www.nabble.com/Table-with-80-regions-having-nearly-no-data-in-it-tp21058687p21074408.html
Sent from the HBase User mailing list archive at Nabble.com.
