+1, especially the locking evaluation and de-entanglement done by Jim and J-D 
as part of 810.

We might be seeing regionserver deadlocks with 0.2.0. See attached partial log, 
including stack trace requested from the UI, from a regionserver that reports 
to the master but does not handle requests from clients, hanging them. There is 
no hint that anything is amiss in the log. All of the IPC handlers are blocked. 
Also I'm not sure what to make of the high counts on 
CompactSplitThread.compactionQueue :

  Thread 9 (regionserver/0:0:0:0:0:0:0:0:60020.compactor):
    State: TIMED_WAITING
    Blocked count: 217143
    Waited count: 393898
    Stack:
      sun.misc.Unsafe.park(Native Method)
      java.util.concurrent.locks.LockSupport.parkNanos (LockSupport.java:198)
      java.util.concurrent.locks.AbstractQueuedSynchronizer 
$ConditionObject.awaitNanos(AbstractQueuedSynchronizer.java:1963)
      java.util.concurrent.LinkedBlockingQueue.poll 
(LinkedBlockingQueue.java:395)
      org.apache.hadoop.hbase.regionserver.CompactSplitThread.run 
(CompactSplitThread.java:76)

but do not think I have enough solid information to file a JIRA and 
investigate/fix this yet. 

I'm hoping we just won't see this with 0.2.1. :-)

  -  Andy

On Sun, Aug 31, 2008 at 1:28 AM, stack wrote:
 
> All known bugs filed against 0.2.1 have been resolved.
> If no objections, I'll cut a release candidate in the
> next day or so.  For those interested, here's the list
> of 37 issues fixed since 0.2.0: http://tinyurl.com/6qhehe.
> Thanks.
> St.Ack
> 


      

Reply via email to