[ https://issues.apache.org/jira/browse/HBASE-9563?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13788685#comment-13788685 ]
stack commented on HBASE-9563:
------------------------------
Just ran into this. It looks like this:
{code}
2013-10-07 05:07:00,009 INFO [main] zookeeper.ZooKeeper: Client environment:java.compiler=<NA>
2013-10-07 05:07:00,009 INFO [main] zookeeper.ZooKeeper: Client environment:os.name=Linux
2013-10-07 05:07:00,009 INFO [main] zookeeper.ZooKeeper: Client environment:os.arch=amd64
2013-10-07 05:07:00,009 INFO [main] zookeeper.ZooKeeper: Client environment:os.version=3.2.0-43-generic
2013-10-07 05:07:00,010 INFO [main] zookeeper.ZooKeeper: Client environment:user.name=hbase
2013-10-07 05:07:00,010 INFO [main] zookeeper.ZooKeeper: Client environment:user.home=/home/hbase
2013-10-07 05:07:00,010 INFO [main] zookeeper.ZooKeeper: Client environment:user.dir=/home/hbase
2013-10-07 05:07:00,011 INFO [main] zookeeper.ZooKeeper: Initiating client connection, connectString=a1805.halxg.cloudera.com:2181 sessionTimeout=90000 watcher=clean znode for master
2013-10-07 05:07:00,042 INFO [main] zookeeper.RecoverableZooKeeper: Process identifier=clean znode for master connecting to ZooKeeper ensemble=a1805.halxg.cloudera.com:2181
2013-10-07 05:07:00,043 WARN [main] hbase.ZNodeClearer: Can't read the content of the znode file
java.io.FileNotFoundException: /tmp/hbase-hbase-master.znode (No such file or directory)
    at java.io.FileInputStream.open(Native Method)
    at java.io.FileInputStream.<init>(FileInputStream.java:138)
    at java.io.FileInputStream.<init>(FileInputStream.java:97)
    at java.io.FileReader.<init>(FileReader.java:58)
    at org.apache.hadoop.hbase.ZNodeClearer.readMyEphemeralNodeOnDisk(ZNodeClearer.java:95)
    at org.apache.hadoop.hbase.ZNodeClearer.clear(ZNodeClearer.java:143)
    at org.apache.hadoop.hbase.master.HMasterCommandLine.run(HMasterCommandLine.java:138)
    at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:70)
    at org.apache.hadoop.hbase.util.ServerCommandLine.doMain(ServerCommandLine.java:126)
    at org.apache.hadoop.hbase.master.HMaster.main(HMaster.java:2787)
2013-10-07 05:07:00,046 INFO [main-SendThread(a1805.halxg.cloudera.com:2181)] zookeeper.ClientCnxn: Opening socket connection to server a1805.halxg.cloudera.com/10.20.200.105:2181. Will not attempt to authenticate using SASL (unknown error)
Mon Oct 7 10:54:01 PDT 2013 Starting master on a1805.halxg.cloudera.com
core file size (blocks, -c) 0
data seg size (kbytes, -d) unlimited
scheduling priority (-e) 0
file size (blocks, -f) unlimited
pending signals (-i) 386225
max locked memory (kbytes, -l) 64
max memory size (kbytes, -m) unlimited
{code}
> Autorestart doesn't work if zkcleaner fails
> -------------------------------------------
>
> Key: HBASE-9563
> URL: https://issues.apache.org/jira/browse/HBASE-9563
> Project: HBase
> Issue Type: Bug
> Reporter: Elliott Clark
>
> I've seen this several times where a master didn't autorestart because zk
> cleaner failed. We should still restart the daemon even if it's not possible
> to clean the zk nodes.
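
The behavior asked for in the description could be sketched roughly as below. This is only an illustration, not the actual HBase restart script: `run_cleaner`, `start_daemon`, and `restart_with_best_effort_clean` are hypothetical names, and the placeholder cleaner always fails so that the failure path is exercised. The point is that a cleanup failure is logged as a warning and the restart proceeds regardless.

```shell
#!/usr/bin/env bash
# Hypothetical sketch; these function names are not taken from HBase's scripts.

# Placeholder for the znode cleanup step. It always fails here to
# simulate the missing znode file seen in the log.
run_cleaner() {
  return 1
}

# Placeholder for whatever actually launches the daemon.
start_daemon() {
  echo "starting master"
}

# Attempt the cleanup, but never let its failure block the restart:
# warn on stderr and start the daemon anyway.
restart_with_best_effort_clean() {
  if ! run_cleaner; then
    echo "WARN: znode cleanup failed, restarting anyway" >&2
  fi
  start_daemon
}
```

Calling `restart_with_best_effort_clean` prints the warning on stderr and then starts the daemon. In a real setup the two placeholders would wrap the actual cleanup and start commands.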