You should have as many loggers as you have tablet servers. Your log recovery is failing because the loggers are not running.
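
The "Connection refused" errors in the master log quoted below mean nothing is accepting connections on the logger port (11224 in that log). As a rough way to see which hosts are affected, here is a minimal Python sketch. It is not part of Accumulo; the function names are made up for illustration, and it only tests whether a TCP connection can be opened.

```python
import socket


def port_is_open(host, port, timeout=2.0):
    """Return True if a TCP connection to host:port can be opened."""
    try:
        # create_connection raises OSError (e.g. ConnectionRefusedError,
        # or a timeout) when nothing accepts the connection.
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        return False


def find_unreachable(hosts, port, timeout=2.0):
    """Return the hosts (in input order) that refuse or time out on port."""
    return [h for h in hosts if not port_is_open(h, port, timeout)]
```

Calling `find_unreachable` with the list of your tablet server hosts and port 11224 (assuming your loggers use the port shown in the log) returns the hosts where no logger is reachable. An open port only shows that something is listening there, not that the logger is healthy.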
Please start all your loggers, and/or determine why they are going down. Then restart the master and the system should recover.

-Eric

On Thu, Jul 19, 2012 at 12:39 PM, Perko, Ralph J <[email protected]> wrote:
> From the master log file at startup:
>
> 9 08:38:40,612 [master.CoordinateRecoveryTask] WARN : Unable to recover 192.168.1.244:11224/65911601-d684-43e8-94b3-cdf959590298(java.io.IOException: org.apache.thrift.transport.TTransportException: java.net.ConnectException: Connection refused)
> java.io.IOException: org.apache.thrift.transport.TTransportException: java.net.ConnectException: Connection refused
>     at org.apache.accumulo.server.tabletserver.log.RemoteLogger.<init>(RemoteLogger.java:99)
>     at org.apache.accumulo.server.master.CoordinateRecoveryTask$RecoveryJob.startCopy(CoordinateRecoveryTask.java:132)
>     at org.apache.accumulo.server.master.CoordinateRecoveryTask$RecoveryJob.access$400(CoordinateRecoveryTask.java:114)
>     at org.apache.accumulo.server.master.CoordinateRecoveryTask.recover(CoordinateRecoveryTask.java:289)
>     at org.apache.accumulo.server.master.Master$TabletGroupWatcher.run(Master.java:1351)
> Caused by: org.apache.thrift.transport.TTransportException: java.net.ConnectException: Connection refused
>     at org.apache.accumulo.core.client.impl.ThriftTransportPool.createNewTransport(ThriftTransportPool.java:475)
>     at org.apache.accumulo.core.client.impl.ThriftTransportPool.getTransport(ThriftTransportPool.java:464)
>     at org.apache.accumulo.core.client.impl.ThriftTransportPool.getTransport(ThriftTransportPool.java:441)
>     at org.apache.accumulo.core.util.ThriftUtil.getClient(ThriftUtil.java:67)
>     at org.apache.accumulo.server.tabletserver.log.RemoteLogger.<init>(RemoteLogger.java:96)
>     ... 4 more
> Caused by: java.net.ConnectException: Connection refused
>     at sun.nio.ch.Net.connect(Native Method)
>     at sun.nio.ch.SocketChannelImpl.connect(SocketChannelImpl.java:500)
>     at sun.nio.ch.SocketAdaptor.connect(SocketAdaptor.java:81)
>     at sun.nio.ch.SocketAdaptor.connect(SocketAdaptor.java:65)
>     at org.apache.accumulo.core.util.TTimeoutTransport.create(TTimeoutTransport.java:39)
>     at org.apache.accumulo.core.client.impl.ThriftTransportPool.createNewTransport(ThriftTransportPool.java:473)
>     ... 8 more
> 19 08:38:40,652 [master.CoordinateRecoveryTask] WARN : Recovery of 192.168.1.244:11224:65911601-d684-43e8-94b3-cdf959590298 failed
> 19 08:38:45,071 [master.CoordinateRecoveryTask] INFO : Deleting recovery directory org.apache.hadoop.fs.FileStatus@75641fd
> 19 09:08:40,848 [master.CoordinateRecoveryTask] WARN : Recovery taking too long, giving up
> 19 09:08:40,848 [master.EventCoordinator] INFO : Log recovery 192.168.1.244:11224/65911601-d684-43e8-94b3-cdf959590298 complete
>
> On 7/19/12 9:34 AM, "Keith Turner" <[email protected]> wrote:
>
>> What you are describing sounds like ZooKeeper is up and running (this
>> is where table config info is stored, so that's why you can list
>> tables), but no tablets are assigned to tablet servers. Need to
>> determine why no tablets are assigned. Look in the master log for
>> anything suspicious related to tablet assignment.
>>
>> On Thu, Jul 19, 2012 at 12:28 PM, Perko, Ralph J <[email protected]> wrote:
>>> Hi,
>>>
>>> I restarted my cluster and now the Accumulo Overview page says there
>>> are 0 tables. However, when I go to the Table List page, all my tables
>>> are listed with a status of "ONLINE" but nothing else. From the
>>> Accumulo shell I cannot access any of my tables but I can list them,
>>> like the web site. Hadoop is up and healthy. The tablet servers are up
>>> but each states 0 for Hosted Tablets. Do you know what is causing this
>>> and how to fix it?
>>>
>>> Thanks,
>>> Ralph
