Found it. I had upgraded from 0.2 to 0.3 and both times never changed the default_zk.cfg which puts the data in /tmp/zk_data. The old layout didn't match the new so it died. We may want something in the release notes when 0.3 comes out in case someone doesn't start fresh with a new zookeeper.
Thanks for your help. On Sat, Nov 2, 2013 at 7:31 AM, Vikrant Navalgund < [email protected]> wrote: > Hi Chris, > I forgot to add one more point. > Is it something in the console testing that you are facing this issue? > Let me hazard a guess here, maybe your blur-console and the blur-core are > out of sync. > > Regards, > Vikrant > > > On Sat, Nov 2, 2013 at 10:14 PM, Vikrant Navalgund < > [email protected]> wrote: > > > Hello Chris, > > I have couple of questions. Looks like your lib jars are out of sync. > > Have you rebuilt your lib jars with the latest master ? The code that > > creates that ZK node is in the Controller startup code.(blur-core). > > > > Also is there any specific action which triggered the issue? > > > > I would recommend that you rebuild the project and deploy all the latest > > lib jars, or at least deploy the blur-core latest jar. > > Let me know how it goes. > > > > Regards, > > Vikrant > > > > > > > > > > On Sat, Nov 2, 2013 at 9:56 PM, Chris Rohr <[email protected]> wrote: > > > >> ok, here is a better error for you: > >> > >> ERROR 20131102_06:53:52:010_EDT [main] server.ZooKeeperServerMain: > >> Unexpected exception, exiting abnormally > >> java.io.IOException: Failed to process transaction type: 1 error: > >> KeeperErrorCode = NoNode for /blur/online-controller-nodes > >> at > >> > >> > org.apache.zookeeper.server.persistence.FileTxnSnapLog.restore(FileTxnSnapLog.java:153) > >> at > >> org.apache.zookeeper.server.ZKDatabase.loadDataBase(ZKDatabase.java:223) > >> at > >> > >> > org.apache.zookeeper.server.ZooKeeperServer.loadData(ZooKeeperServer.java:250) > >> at > >> > >> > org.apache.zookeeper.server.ZooKeeperServer.startdata(ZooKeeperServer.java:377) > >> at > >> > >> > org.apache.zookeeper.server.NIOServerCnxnFactory.startup(NIOServerCnxnFactory.java:122) > >> at > >> > >> > org.apache.zookeeper.server.ZooKeeperServerMain.runFromConfig(ZooKeeperServerMain.java:112) > >> at > >> > >> > org.apache.zookeeper.server.ZooKeeperServerMain.initializeAndRun(ZooKeeperServerMain.java:86) > >> at > >> > >> > org.apache.zookeeper.server.ZooKeeperServerMain.main(ZooKeeperServerMain.java:52) > >> at > >> > >> > org.apache.zookeeper.server.quorum.QuorumPeerMain.initializeAndRun(QuorumPeerMain.java:116) > >> at > >> > >> > org.apache.zookeeper.server.quorum.QuorumPeerMain.main(QuorumPeerMain.java:78) > >> Caused by: org.apache.zookeeper.KeeperException$NoNodeException: > >> KeeperErrorCode = NoNode for /blur/online-controller-nodes > >> at > >> > >> > org.apache.zookeeper.server.persistence.FileTxnSnapLog.processTransaction(FileTxnSnapLog.java:211) > >> at > >> > >> > org.apache.zookeeper.server.persistence.FileTxnSnapLog.restore(FileTxnSnapLog.java:151) > >> ... 9 more > >> > >> This is in the ZK log when I start it and then it shuts down. > >> > >> Thanks > >> Chris > >> > >> > >> On Fri, Nov 1, 2013 at 9:35 AM, Aaron McCurry <[email protected]> > wrote: > >> > >> > I haven't tested 0.3 in awhile, I have been focusing on getting 0.2.1 > >> out > >> > the door. Hopefully we will call a vote today on it and assuming all > >> goes > >> > well we can focus on 0.3. > >> > > >> > Aaron > >> > > >> > > >> > On Fri, Nov 1, 2013 at 8:51 AM, Chris Rohr <[email protected]> > >> wrote: > >> > > >> > > Aaron, > >> > > > >> > > Yes sorry I am on master but ~week old. If you want I can try to > >> update > >> > > tonight and see if its still an issue. > >> > > > >> > > Chris > >> > > > >> > > > >> > > On Fri, Nov 1, 2013 at 6:43 AM, Aaron McCurry <[email protected]> > >> > wrote: > >> > > > >> > > > I assume you are on master? I will take a look today. > >> > > > > >> > > > Aaron > >> > > > > >> > > > > >> > > > On Fri, Nov 1, 2013 at 6:30 AM, Chris Rohr <[email protected]> > >> > wrote: > >> > > > > >> > > > > I am trying to run the quick start setup for Blur (to use for > >> testing > >> > > the > >> > > > > console) and I am having some connection problems. The 3 > >> processes > >> > > start > >> > > > > but very soon after the controller dies and then shortly after > >> that > >> > the > >> > > > > shard dies. Here is what I have noticed: > >> > > > > > >> > > > > - blur-site.properties has default.zookeeper set to 127.0.0.1 > >> > > > > - both controller and shard fail with connection refused to > >> > localhost/ > >> > > > > 127.0.0.1 > >> > > > > - zookeeper log states that it is binding on 0.0.0.0 > >> > > > > > >> > > > > I'm not sure where zookeeper is getting that IP. Any ideas? > >> > > > > > >> > > > > Thanks, > >> > > > > Chris > >> > > > > > >> > > > > >> > > > >> > > >> > > > > > > > > -- > > *Regards*, > > *Vikrant Navalgund* > > > > > > -- > *Regards*, > *Vikrant Navalgund* >
