Re: Table with 80 regions having nearly no data in it

Thibaut_ Mon, 22 Dec 2008 04:49:17 -0800

Hello St.Ack,

unfortunately I don't have the tables anymore (I did do a clean restart and
deleted all the tables).


If I encouter the missing region issue again, I will open up an issue. I
will also have DEBUG output available then.

As for the processes not stopping, I posted a stack dump and the the tail of
the log file in HBASE-1078.

Thibaut



stack-3 wrote:
> 
> Thibaut_ wrote:
>> Hello St.Ack,
>> thanks for your answer.
>>
>> I will vote on HBASE-880, that's exactly what I needed :)
>>
>> Except of the little glitch in the webinterface regarding the ending key,
>> that table seems to be ok then. I thought regions would get deleted. I'm
>> adding new data (Timestamp of current time is the key), and when I have
>> processed that data, I'm deleting the data. That's the reason why I get
>> more
>> and more regions... (because as soon as I add data that spans over two
>> regions, the first region won't be removed when I have deleted that
>> data).
>>   
> 
> We'd need a cluster cleaner process that would notice empty regions and 
> would merge them into adjacents.  That seems reasonable to me (Sounds 
> like HBASE-420).
> 
> The below is a bit hard to read but from what I can make out, its not 
> right.  The ENDKEY of a region should be the STARTKEY of the next.  The 
> ENDKEY of '', should be last in table.  Neither seems to be going on 
> here.   Perhaps make an issue and paste a clean output from .META. and 
> I'll take a look.
>> Here is the meta info of that region (NAME =>
>> 'tobeprocessed,12293840182411045696639,1229385024829') and the regions
>> around that region
>> .........
>>  tobeprocessed,1229383104274 column=info:regioninfo,
>> timestamp=1229384271404, value=REGION => {NAME => 'tobepr
>>  796785789,1229384269280    
>> ocessed,1229383104274796785789,1229384269280',
>> STARTKEY => '122938310427479678578
>>                              9', ENDKEY => '12293837601871695303679',
>> ENCODED => 1839260151, TABLE => {{NAME =
>>                              > 'tobeprocessed', IS_ROOT => 'false',
>> IS_META
>> => 'false', FAMILIES => [{NAME =>
>>                              'data', BLOOMFILTER => 'false', COMPRESSION
>> =>
>> 'NONE', VERSIONS => '1', LENGTH =>
>>                               '2147483647', TTL => '-1', IN_MEMORY =>
>> 'false', BLOCKCACHE => 'false'}], INDEXE
>>                              S => []}}
>>  tobeprocessed,1229383760187 column=info:regioninfo,
>>   
> 
> ....
>> But I have another table (rsssources), which when I scanned it yesterday
>> had
>>   
>>> 400 000 entries (count 'rsssources'), and today only has 180 000 entries
>>>     
>> (after killing hbase (kill -9), because it was unresponsive. 
> 
> Well, the above listing would seem to have missing regions.  Lets figure 
> whats gong on.  Make an issue 'missing regions' and paste in your scan 
> '.META.' output.  Do you think you can reproduce this?
> 
> 
>> I'm also
>> logging the DEBUG entries now to see what's happening at that point).
>>   
> 
> Good.  Can you see anything about the regions that go missing?
> 
>> When I execute a mapreduce job, a few regions don't seem to have any data
>> in
>> them. I did however never delete any data in that table (just replaces). 
>>
>> I did increase the timeout values, because I read somewhere else that it
>> would help in some cases. But I will reset the values to their original
>> values.
>>
>>
>> What's the best way to stop hbase when the hbase-stop script doesn't
>> work.
>> (Sometimes it just runs for hours... (probably a deadlock somewhere)?
>>
>>   
> Tail the master log with DEBUG enabled.  Should point at what is taking 
> so long to go down.  May indicate a particular regionserver.  Thread 
> dump it ("kill -QUIT PID").  Stick that in the issue to.
> 
> 
>> I'm waiting now for hbase to shut down, and will try to run the merge
>> script
>> on those two tables.
>>   
> Good stuff,
> St.Ack
> 
> 

-- 
View this message in context: 
http://www.nabble.com/Table-with-80-regions-having-nearly-no-data-in-it-tp21058687p21127237.html
Sent from the HBase User mailing list archive at Nabble.com.

Re: Table with 80 regions having nearly no data in it

Reply via email to