On Fri, Apr 1, 2011 at 9:01 AM, 陈加俊 <[email protected]> wrote: > 2011-04-01 19:13:40,413 WARN org.apache.hadoop.hbase.regionserver.Store: > Failed open of hdfs:// > master.uc.uuwatch.com:9000/hbase/cjjHTML/1494733632/page/5173469199902346167.1864097884; > presumption is that file was corrupted at flush and lost edits picked up by > commit log replay. Verify! > java.io.IOException: Cannot open filename > /hbase/cjjHTML/1864097884/page/5173469199902346167 > ...... >
This is a case where a daughter region is unable to open its parent regions storefile (The daughter refers to parent storefiles for a period of time after initial open). Look at what happened to the parent region. Was it prematurely removed? > 2011-04-01 19:17:22,716 INFO > org.apache.hadoop.hbase.regionserver.HRegionServer: MSG_REGION_CLOSE: > cjjHTML,http://news.ifeng.com/gundong/detail_2011_03/15/515 > 4913_0.shtml,1300245193111: Overloaded > 2011-04-01 19:17:22,716 INFO This we've discussed. > 2011-04-01 22:01:49,212 WARN org.apache.zookeeper.ClientCnxn: Exception > closing session 0x942f0f7ae13d0000 to sun.nio.ch.SelectionKeyImpl@34819c89 > java.io.IOException: TIMED OUT > at This looks like straight session timeout against ZK. Long GC pause? St.Ack
