Sorry, I found some logs indicating lost connection to zookeeper, and be treated as dead node, far away from the previous logs. I'll check further, please ignore this thread.
Mao Xu-Feng On Tue, Mar 22, 2011 at 4:52 PM, 茅旭峰 <m9s...@gmail.com> wrote: > Hi, > > I saw logs at regionserver like > > ====== > 2011-03-22 15:58:26,890 DEBUG > org.apache.hadoop.hbase.regionserver.handler.CloseRegionHandler: Processing > close of > table1,j-sLRcrI_bZ1NbfoRH-0fz-g9m0=,1300726276228.b13159778f4e190b0ee99c655de6d928. > 2011-03-22 15:58:26,890 DEBUG org.apache.hadoop.hbase.regionserver.HRegion: > Closing > table1,j-sLRcrI_bZ1NbfoRH-0fz-g9m0=,1300726276228.b13159778f4e190b0ee99c655de6d928.: > disabling compactions & flushes > 2011-03-22 15:58:26,890 DEBUG org.apache.hadoop.hbase.regionserver.HRegion: > Updates disabled for region > table1,j-sLRcrI_bZ1NbfoRH-0fz-g9m0=,1300726276228.b13159778f4e190b0ee99c655de6d928. > 2011-03-22 15:58:26,890 DEBUG org.apache.hadoop.hbase.regionserver.Store: > closed cfEStore > 2011-03-22 15:58:26,890 INFO org.apache.hadoop.hbase.regionserver.HRegion: > Closed > table1,-CAbYuU0AjfZ8MIM6fo7Q7i8Qhg=,1300583933922.41d9d50874bc4c905286dfd38fd02ead. > 2011-03-22 15:58:26,890 DEBUG > org.apache.hadoop.hbase.regionserver.handler.CloseRegionHandler: Closed > region > table1,-CAbYuU0AjfZ8MIM6fo7Q7i8Qhg=,1300583933922.41d9d50874bc4c905286dfd38fd02ead. > 2011-03-22 15:58:26,891 DEBUG org.apache.hadoop.hbase.regionserver.Store: > closed cfEStore > 2011-03-22 15:58:26,891 INFO org.apache.hadoop.hbase.regionserver.HRegion: > Closed > table1,j-sLRcrI_bZ1NbfoRH-0fz-g9m0=,1300726276228.b13159778f4e190b0ee99c655de6d928. > 2011-03-22 15:58:26,891 DEBUG > org.apache.hadoop.hbase.regionserver.handler.CloseRegionHandler: Closed > region > table1,j-sLRcrI_bZ1NbfoRH-0fz-g9m0=,1300726276228.b13159778f4e190b0ee99c655de6d928. > 2011-03-22 15:58:27,054 INFO > org.apache.hadoop.hbase.regionserver.HRegionServer: Waiting on 1 regions to > close > 2011-03-22 15:58:27,055 DEBUG > org.apache.hadoop.hbase.regionserver.HRegionServer: > {af002166e6bd14c7627f0db8286a809f=table1,h1LJj7DVYthsxlwnjy061YnKX0k=,1300710576102.af002166e6bd14c7627f0db8286a809f.} > 2011-03-22 15:58:27,060 INFO org.apache.hadoop.hbase.regionserver.HRegion: > compaction interrupted by user: > java.io.InterruptedIOException: Aborting compaction of store cfEStore in > region > table1,h1LJj7DVYthsxlwnjy061YnKX0k=,1300710576102.af002166e6bd14c7627f0db8286a809f. > because user requested stop. > at > org.apache.hadoop.hbase.regionserver.Store.compact(Store.java:945) > at > org.apache.hadoop.hbase.regionserver.Store.compact(Store.java:733) > at > org.apache.hadoop.hbase.regionserver.HRegion.compactStores(HRegion.java:769) > at > org.apache.hadoop.hbase.regionserver.HRegion.compactStores(HRegion.java:714) > at > org.apache.hadoop.hbase.regionserver.CompactSplitThread.run(CompactSplitThread.java:81) > 2011-03-22 15:58:27,060 INFO org.apache.hadoop.hbase.regionserver.HRegion: > aborted compaction on region > table1,h1LJj7DVYthsxlwnjy061YnKX0k=,1300710576102.af002166e6bd14c7627f0db8286a809f. > after 4mins, 34sec > 2011-03-22 15:58:27,061 DEBUG org.apache.hadoop.hbase.regionserver.HRegion: > Updates disabled for region > table1,h1LJj7DVYthsxlwnjy061YnKX0k=,1300710576102.af002166e6bd14c7627f0db8286a809f. > 2011-03-22 15:58:27,061 INFO > org.apache.hadoop.hbase.regionserver.CompactSplitThread: > regionserver60020.compactor exiting > 2011-03-22 15:58:27,065 DEBUG org.apache.hadoop.hbase.regionserver.Store: > closed cfEStore > 2011-03-22 15:58:27,065 INFO org.apache.hadoop.hbase.regionserver.HRegion: > Closed > table1,h1LJj7DVYthsxlwnjy061YnKX0k=,1300710576102.af002166e6bd14c7627f0db8286a809f. > 2011-03-22 15:58:27,066 DEBUG > org.apache.hadoop.hbase.regionserver.handler.CloseRegionHandler: Closed > region > table1,h1LJj7DVYthsxlwnjy061YnKX0k=,1300710576102.af002166e6bd14c7627f0db8286a809f. > 2011-03-22 15:58:28,055 INFO org.apache.hadoop.hbase.regionserver.Leases: > regionserver60020 closing leases > 2011-03-22 15:58:28,055 INFO org.apache.hadoop.hbase.regionserver.Leases: > regionserver60020 closed leases > 2011-03-22 15:58:28,102 INFO > org.apache.hadoop.hbase.client.HConnectionManager$HConnectionImplementation: > Closed zookeeper sessionid=0x12ec6cedd6e0026 > 2011-03-22 15:58:28,121 INFO org.apache.zookeeper.ZooKeeper: Session: > 0x12ec6cedd6e0026 closed > 2011-03-22 15:58:28,121 INFO org.apache.zookeeper.ClientCnxn: EventThread > shut down > 2011-03-22 15:58:28,124 INFO > org.apache.hadoop.hbase.regionserver.HRegionServer: regionserver60020 > exiting > 2011-03-22 15:58:28,174 INFO > org.apache.hadoop.hbase.regionserver.ShutdownHook: Shutdown hook starting; > hbase.shutdown.hook=true; fsShutdownHook=Thread[Thread-14,5,main] > 2011-03-22 15:58:28,174 INFO > org.apache.hadoop.hbase.regionserver.HRegionServer: STOPPED: Shutdown hook > 2011-03-22 15:58:28,174 INFO > org.apache.hadoop.hbase.regionserver.ShutdownHook: Starting fs shutdown hook > thread. > 2011-03-22 15:58:28,286 INFO > org.apache.hadoop.hbase.regionserver.ShutdownHook: Shutdown hook finished. > ====== > > It saids "compaction interrupted by user:", but we did not do like this. > > Any comments? Thanks! > > Mao Xu-Feng >