Blocking for ten minutes at a time
----------------------------------
Key: HBASE-866
URL: https://issues.apache.org/jira/browse/HBASE-866
Project: Hadoop HBase
Issue Type: Bug
Reporter: stack
I've been testing running biggish MR jobs uploading into hbase. My jobs
consistently fail with child task timing out its ten minute period. Adding
logging, was able to see that we're actual stuck in a commit. Following the
thread of the row we're committing, I see this in the log:
{code}
...
2008-09-03 18:37:03,446 DEBUG org.apache.hadoop.hbase.regionserver.HRegion:
Flush requested on TestTable,0029377106,1220466998108
2008-09-03 18:37:03,446 DEBUG org.apache.hadoop.hbase.regionserver.HRegion:
Started memcache flush for region TestTable,0029377106,1220466998108. Current
region memcache size 64.0m
2008-09-03 18:37:03,446 INFO org.apache.hadoop.hbase.regionserver.HRegion:
Blocking updates for 'IPC Server handler 1 on 60020' on region
TestTable,0029377106,1220466998108: Memcache size 64.0m is >= than blocking
64.0m size
2008-09-03 18:37:13,450 INFO org.apache.hadoop.hbase.regionserver.HRegion:
Unblocking updates for region TestTable,0029377106,1220466998108 'IPC Server
handler 1 on 60020'
2008-09-03 18:37:16,089 INFO org.apache.hadoop.hbase.regionserver.HRegion:
Blocking updates for 'IPC Server handler 16 on 60020' on region
TestTable,0029377106,1220466998108: Memcache size 64.0m is >= than blocking
64.0m size
2008-09-03 18:37:16,090 INFO org.apache.hadoop.hbase.regionserver.HRegion:
Blocking updates for 'IPC Server handler 1 on 60020' on region
TestTable,0029377106,1220466998108: Memcache size 64.0m is >= than blocking
64.0m size
2008-09-03 18:37:16,090 INFO org.apache.hadoop.hbase.regionserver.HRegion:
Blocking updates for 'IPC Server handler 4 on 60020' on region
TestTable,0029377106,1220466998108: Memcache size 64.0m is >= than blocking
64.0m size
2008-09-03 18:37:16,090 INFO org.apache.hadoop.hbase.regionserver.HRegion:
Blocking updates for 'IPC Server handler 6 on 60020' on region
TestTable,0029377106,1220466998108: Memcache size 64.0m is >= than blocking
64.0m size
2008-09-03 18:37:16,090 INFO org.apache.hadoop.hbase.regionserver.HRegion:
Blocking updates for 'IPC Server handler 2 on 60020' on region
TestTable,0029377106,1220466998108: Memcache size 64.0m is >= than blocking
64.0m size
2008-09-03 18:37:16,090 INFO org.apache.hadoop.hbase.regionserver.HRegion:
Blocking updates for 'IPC Server handler 12 on 60020' on region
TestTable,0029377106,1220466998108: Memcache size 64.0m is >= than blocking
64.0m size
2008-09-03 18:37:16,090 INFO org.apache.hadoop.hbase.regionserver.HRegion:
Blocking updates for 'IPC Server handler 9 on 60020' on region
TestTable,0029377106,1220466998108: Memcache size 64.0m is >= than blocking
64.0m size
2008-09-03 18:37:16,091 INFO org.apache.hadoop.hbase.regionserver.HRegion:
Blocking updates for 'IPC Server handler 7 on 60020' on region
TestTable,0029377106,1220466998108: Memcache size 64.0m is >= than blocking
64.0m size
2008-09-03 18:37:21,984 DEBUG org.apache.hadoop.hbase.regionserver.HRegion:
Finished memcache flush for region TestTable,0029377106,1220466998108 in
18538ms, sequence id=2852547, compaction requested=false
2008-09-03 18:47:06,241 DEBUG org.apache.hadoop.hbase.regionserver.HRegion:
Started memcache flush for region TestTable,0029377106,1220466998108. Current
region memcache size 64.0m
2008-09-03 18:47:10,031 DEBUG org.apache.hadoop.hbase.regionserver.HRegion:
Finished memcache flush for region TestTable,0029377106,1220466998108 in
3790ms, sequence id=2919208, compaction requested=true
2008-09-03 18:47:10,031 INFO org.apache.hadoop.hbase.regionserver.HRegion:
Unblocking updates for region TestTable,0029377106,1220466998108 'IPC Server
handler 9 on 60020'
2008-09-03 18:47:10,031 DEBUG
org.apache.hadoop.hbase.regionserver.CompactSplitThread: Compaction requested
for region: TestTable,0029377106,1220466998108
2008-09-03 18:47:10,031 INFO org.apache.hadoop.hbase.regionserver.HRegion:
Unblocking updates for region TestTable,0029377106,1220466998108 'IPC Server
handler 12 on 60020'
2008-09-03 18:47:10,032 INFO org.apache.hadoop.hbase.regionserver.HRegion:
starting compaction on region TestTable,0029377106,1220466998108
2008-09-03 18:47:10,032 INFO org.apache.hadoop.hbase.regionserver.HRegion:
Unblocking updates for region TestTable,0029377106,1220466998108 'IPC Server
handler 7 on 60020'
2008-09-03 18:47:10,035 INFO org.apache.hadoop.hbase.regionserver.HRegion:
Unblocking updates for region TestTable,0029377106,1220466998108 'IPC Server
handler 6 on 60020'
2008-09-03 18:47:10,035 INFO org.apache.hadoop.hbase.regionserver.HRegion:
Unblocking updates for region TestTable,0029377106,1220466998108 'IPC Server
handler 4 on 60020'
2008-09-03 18:47:10,035 INFO org.apache.hadoop.hbase.regionserver.HRegion:
Unblocking updates for region TestTable,0029377106,1220466998108 'IPC Server
handler 2 on 60020'
2008-09-03 18:47:10,037 INFO org.apache.hadoop.hbase.regionserver.HRegion:
Unblocking updates for region TestTable,0029377106,1220466998108 'IPC Server
handler 16 on 60020'
2008-09-03 18:47:10,043 INFO org.apache.hadoop.hbase.regionserver.HRegion:
Unblocking updates for region TestTable,0029377106,1220466998108 'IPC Server
handler 1 on 60020'
2008-09-03 18:47:18,403 INFO org.apache.hadoop.hbase.regionserver.HRegion:
compaction completed on region TestTable,0029377106,1220466998108 in 8sec
...
{code}
Notice how we're blocked for ten minutes until new flush runs. My guess is
that the flush that is going on concurrent with the blocking is clearing the
flag
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.