Can you give us a bit more information? I assume the HBase config on this server is the same as on the other region servers.

Please share the region server log preceding the 'Failed to close file' exception. I looked for 'Caused by' but didn't find any.

Is this a secure cluster?

W.r.t.
/apps/hbase/data/WALs/ip-10-0-1-74.ec2.internal,60020,1431910460262/ip-10-0-1-74.ec2.internal%2C60020%2C1431910460262.1431910462148

Can you look for the above file in the HDFS audit log to see when it was deleted / renamed?

Thanks

On Sun, May 17, 2015 at 6:33 PM, Billy Watson <[email protected]> wrote:

> I added a region server process to an existing cluster node that didn't
> previously have a region server process and I'm getting a weird error (see
> stack traces below).
>
> I'm using HDP 2.1, HBase 0.98.0.2.1.7.0-784-hadoop2,
> hadoop 2.4.0.2.1.7.0-784. All the other 6 region servers are up without
> issue.
>
> I've tried starting the server multiple times, an HDFS balance (that's
> still going after 24+ hours) an HBase master restart.
>
> Thanks for any help!
>
> 2015-05-18 00:54:22,586 ERROR [Thread-20] hdfs.DFSClient: Failed to close file /apps/hbase/data/WALs/ip-10-0-1-74.ec2.internal,60020,1431910460262/ip-10-0-1-74.ec2.internal%2C60020%2C1431910460262.1431910462148
> org.apache.hadoop.ipc.RemoteException(org.apache.hadoop.hdfs.server.namenode.LeaseExpiredException): No lease on /apps/hbase/data/WALs/ip-10-0-1-74.ec2.internal,60020,1431910460262/ip-10-0-1-74.ec2.internal%2C60020%2C1431910460262.1431910462148: File does not exist. Holder DFSClient_hb_rs_ip-10-0-1-74.ec2.internal,60020,1431910460262_-1181395688_33 does not have any open files.
>     at org.apache.hadoop.hdfs.server.namenode.FSNamesystem.checkLease(FSNamesystem.java:2946)
>     at org.apache.hadoop.hdfs.server.namenode.FSNamesystem.completeFileInternal(FSNamesystem.java:3016)
>     at org.apache.hadoop.hdfs.server.namenode.FSNamesystem.completeFile(FSNamesystem.java:2996)
>     at org.apache.hadoop.hdfs.server.namenode.NameNodeRpcServer.complete(NameNodeRpcServer.java:647)
>     at org.apache.hadoop.hdfs.protocolPB.ClientNamenodeProtocolServerSideTranslatorPB.complete(ClientNamenodeProtocolServerSideTranslatorPB.java:484)
>     at org.apache.hadoop.hdfs.protocol.proto.ClientNamenodeProtocolProtos$ClientNamenodeProtocol$2.callBlockingMethod(ClientNamenodeProtocolProtos.java)
>     at org.apache.hadoop.ipc.ProtobufRpcEngine$Server$ProtoBufRpcInvoker.call(ProtobufRpcEngine.java:585)
>     at org.apache.hadoop.ipc.RPC$Server.call(RPC.java:928)
>     at org.apache.hadoop.ipc.Server$Handler$1.run(Server.java:2013)
>     at org.apache.hadoop.ipc.Server$Handler$1.run(Server.java:2009)
>     at java.security.AccessController.doPrivileged(Native Method)
>     at javax.security.auth.Subject.doAs(Subject.java:415)
>     at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1594)
>     at org.apache.hadoop.ipc.Server$Handler.run(Server.java:2007)
>
>     at org.apache.hadoop.ipc.Client.call(Client.java:1410)
>     at org.apache.hadoop.ipc.Client.call(Client.java:1363)
>     at org.apache.hadoop.ipc.ProtobufRpcEngine$Invoker.invoke(ProtobufRpcEngine.java:206)
>     at com.sun.proxy.$Proxy18.complete(Unknown Source)
>     at org.apache.hadoop.hdfs.protocolPB.ClientNamenodeProtocolTranslatorPB.complete(ClientNamenodeProtocolTranslatorPB.java:404)
>     at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
>     at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
>     at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
>     at java.lang.reflect.Method.invoke(Method.java:606)
>     at org.apache.hadoop.io.retry.RetryInvocationHandler.invokeMethod(RetryInvocationHandler.java:190)
>     at org.apache.hadoop.io.retry.RetryInvocationHandler.invoke(RetryInvocationHandler.java:103)
>     at com.sun.proxy.$Proxy19.complete(Unknown Source)
>     at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
>     at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
>     at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
>     at java.lang.reflect.Method.invoke(Method.java:606)
>     at org.apache.hadoop.hbase.fs.HFileSystem$1.invoke(HFileSystem.java:272)
>     at com.sun.proxy.$Proxy20.complete(Unknown Source)
>     at org.apache.hadoop.hdfs.DFSOutputStream.completeFile(DFSOutputStream.java:2116)
>     at org.apache.hadoop.hdfs.DFSOutputStream.close(DFSOutputStream.java:2100)
>     at org.apache.hadoop.hdfs.DFSClient.closeAllFilesBeingWritten(DFSClient.java:850)
>     at org.apache.hadoop.hdfs.DFSClient.close(DFSClient.java:867)
>     at org.apache.hadoop.hdfs.DistributedFileSystem.close(DistributedFileSystem.java:857)
>     at org.apache.hadoop.fs.FileSystem$Cache.closeAll(FileSystem.java:2488)
>     at org.apache.hadoop.fs.FileSystem$Cache$ClientFinalizer.run(FileSystem.java:2505)
>     at java.lang.Thread.run(Thread.java:745)
> 2015-05-18 00:54:22,587 INFO [Thread-11] regionserver.ShutdownHook: Shutdown hook finished.
>
> William Watson
> Software Engineer
> (904) 705-7056 PCS
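
For the audit log lookup asked about above, here is a rough sketch of one way to filter the NameNode audit log for delete / rename operations touching the WAL directory. It assumes the audit log uses the usual key=value layout (allowed=, ugi=, ip=, cmd=, src=, dst=, perm=) and that each line starts with a "yyyy-MM-dd HH:mm:ss,SSS" timestamp; both depend on your log4j configuration. The function name find_wal_events is made up for illustration.

```python
import re

# Audit entries are key=value pairs; cmd, src and dst are the fields we need.
FIELD = re.compile(r"(\w+)=(\S+)")


def find_wal_events(lines, path_fragment, cmds=("delete", "rename")):
    """Return (timestamp, cmd, src, dst) for audit entries that match.

    path_fragment is matched against both src and dst, so a rename of the
    parent WAL directory is reported as well as a delete of the file itself.
    """
    events = []
    for line in lines:
        fields = dict(FIELD.findall(line))
        cmd = fields.get("cmd")
        src = fields.get("src", "")
        dst = fields.get("dst", "")
        if cmd in cmds and (path_fragment in src or path_fragment in dst):
            # Assumed timestamp prefix: "yyyy-MM-dd HH:mm:ss,SSS" (23 chars).
            timestamp = line[:23]
            events.append((timestamp, cmd, src, dst))
    return events
```

Passing the open audit log file as lines and "ip-10-0-1-74.ec2.internal,60020,1431910460262" as path_fragment should list any delete or rename of that WAL directory or of the files under it, which can then be compared against the 00:54:22 timestamp of the exception.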
