[ 
https://issues.apache.org/jira/browse/HBASE-866?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

stack updated HBASE-866:
------------------------

    Priority: Minor  (was: Major)

Ok. This issue is not as bad as I thought.   I had 
hbase.hregion.memcache.block.multiplier set to 1 -- and optional flushes to 
every ten minutes -- so was seeing loads of blocking.  Downgrading to minor.  
The possibility for our blocking forever is still there so needs to be fixed; 
its just not too likely.  Have been able to load up a 4 node cluster w/ 1/4B 
rows using default heap sizes with about 300 regions per server without my MR 
loader failing a single task.

> Blocking for ten minutes at a time
> ----------------------------------
>
>                 Key: HBASE-866
>                 URL: https://issues.apache.org/jira/browse/HBASE-866
>             Project: Hadoop HBase
>          Issue Type: Bug
>            Reporter: stack
>            Priority: Minor
>
> I've been testing running biggish MR jobs uploading into hbase.  My jobs 
> consistently fail with child task timing out its ten minute period.  Adding 
> logging, was able to see that we're actual stuck in a commit.  Following the 
> thread of the row we're committing, I see this in the log:
> {code}
> ...
> 2008-09-03 18:37:03,446 DEBUG org.apache.hadoop.hbase.regionserver.HRegion: 
> Flush requested on TestTable,0029377106,1220466998108
> 2008-09-03 18:37:03,446 DEBUG org.apache.hadoop.hbase.regionserver.HRegion: 
> Started memcache flush for region TestTable,0029377106,1220466998108. Current 
> region memcache size 64.0m
> 2008-09-03 18:37:03,446 INFO org.apache.hadoop.hbase.regionserver.HRegion: 
> Blocking updates for 'IPC Server handler 1 on 60020' on region 
> TestTable,0029377106,1220466998108: Memcache size 64.0m is >= than blocking 
> 64.0m size
> 2008-09-03 18:37:13,450 INFO org.apache.hadoop.hbase.regionserver.HRegion: 
> Unblocking updates for region TestTable,0029377106,1220466998108 'IPC Server 
> handler 1 on 60020'
> 2008-09-03 18:37:16,089 INFO org.apache.hadoop.hbase.regionserver.HRegion: 
> Blocking updates for 'IPC Server handler 16 on 60020' on region 
> TestTable,0029377106,1220466998108: Memcache size 64.0m is >= than blocking 
> 64.0m size
> 2008-09-03 18:37:16,090 INFO org.apache.hadoop.hbase.regionserver.HRegion: 
> Blocking updates for 'IPC Server handler 1 on 60020' on region 
> TestTable,0029377106,1220466998108: Memcache size 64.0m is >= than blocking 
> 64.0m size
> 2008-09-03 18:37:16,090 INFO org.apache.hadoop.hbase.regionserver.HRegion: 
> Blocking updates for 'IPC Server handler 4 on 60020' on region 
> TestTable,0029377106,1220466998108: Memcache size 64.0m is >= than blocking 
> 64.0m size
> 2008-09-03 18:37:16,090 INFO org.apache.hadoop.hbase.regionserver.HRegion: 
> Blocking updates for 'IPC Server handler 6 on 60020' on region 
> TestTable,0029377106,1220466998108: Memcache size 64.0m is >= than blocking 
> 64.0m size
> 2008-09-03 18:37:16,090 INFO org.apache.hadoop.hbase.regionserver.HRegion: 
> Blocking updates for 'IPC Server handler 2 on 60020' on region 
> TestTable,0029377106,1220466998108: Memcache size 64.0m is >= than blocking 
> 64.0m size
> 2008-09-03 18:37:16,090 INFO org.apache.hadoop.hbase.regionserver.HRegion: 
> Blocking updates for 'IPC Server handler 12 on 60020' on region 
> TestTable,0029377106,1220466998108: Memcache size 64.0m is >= than blocking 
> 64.0m size
> 2008-09-03 18:37:16,090 INFO org.apache.hadoop.hbase.regionserver.HRegion: 
> Blocking updates for 'IPC Server handler 9 on 60020' on region 
> TestTable,0029377106,1220466998108: Memcache size 64.0m is >= than blocking 
> 64.0m size
> 2008-09-03 18:37:16,091 INFO org.apache.hadoop.hbase.regionserver.HRegion: 
> Blocking updates for 'IPC Server handler 7 on 60020' on region 
> TestTable,0029377106,1220466998108: Memcache size 64.0m is >= than blocking 
> 64.0m size
> 2008-09-03 18:37:21,984 DEBUG org.apache.hadoop.hbase.regionserver.HRegion: 
> Finished memcache flush for region TestTable,0029377106,1220466998108 in 
> 18538ms, sequence id=2852547, compaction requested=false
> 2008-09-03 18:47:06,241 DEBUG org.apache.hadoop.hbase.regionserver.HRegion: 
> Started memcache flush for region TestTable,0029377106,1220466998108. Current 
> region memcache size 64.0m
> 2008-09-03 18:47:10,031 DEBUG org.apache.hadoop.hbase.regionserver.HRegion: 
> Finished memcache flush for region TestTable,0029377106,1220466998108 in 
> 3790ms, sequence id=2919208, compaction requested=true
> 2008-09-03 18:47:10,031 INFO org.apache.hadoop.hbase.regionserver.HRegion: 
> Unblocking updates for region TestTable,0029377106,1220466998108 'IPC Server 
> handler 9 on 60020'
> 2008-09-03 18:47:10,031 DEBUG 
> org.apache.hadoop.hbase.regionserver.CompactSplitThread: Compaction requested 
> for region: TestTable,0029377106,1220466998108
> 2008-09-03 18:47:10,031 INFO org.apache.hadoop.hbase.regionserver.HRegion: 
> Unblocking updates for region TestTable,0029377106,1220466998108 'IPC Server 
> handler 12 on 60020'
> 2008-09-03 18:47:10,032 INFO org.apache.hadoop.hbase.regionserver.HRegion: 
> starting compaction on region TestTable,0029377106,1220466998108
> 2008-09-03 18:47:10,032 INFO org.apache.hadoop.hbase.regionserver.HRegion: 
> Unblocking updates for region TestTable,0029377106,1220466998108 'IPC Server 
> handler 7 on 60020'
> 2008-09-03 18:47:10,035 INFO org.apache.hadoop.hbase.regionserver.HRegion: 
> Unblocking updates for region TestTable,0029377106,1220466998108 'IPC Server 
> handler 6 on 60020'
> 2008-09-03 18:47:10,035 INFO org.apache.hadoop.hbase.regionserver.HRegion: 
> Unblocking updates for region TestTable,0029377106,1220466998108 'IPC Server 
> handler 4 on 60020'
> 2008-09-03 18:47:10,035 INFO org.apache.hadoop.hbase.regionserver.HRegion: 
> Unblocking updates for region TestTable,0029377106,1220466998108 'IPC Server 
> handler 2 on 60020'
> 2008-09-03 18:47:10,037 INFO org.apache.hadoop.hbase.regionserver.HRegion: 
> Unblocking updates for region TestTable,0029377106,1220466998108 'IPC Server 
> handler 16 on 60020'
> 2008-09-03 18:47:10,043 INFO org.apache.hadoop.hbase.regionserver.HRegion: 
> Unblocking updates for region TestTable,0029377106,1220466998108 'IPC Server 
> handler 1 on 60020'
> 2008-09-03 18:47:18,403 INFO org.apache.hadoop.hbase.regionserver.HRegion: 
> compaction completed on region TestTable,0029377106,1220466998108 in 8sec
> ...
> {code}
> Notice how we're blocked for ten minutes until new flush runs.  My guess is 
> that the flush that is going on concurrent with the blocking is clearing the 
> flag 

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to