Reading znodes directly from snapshot and log files

2010-10-21 Thread Vishal K
Hi, Is it possible to read znodes directly from snapshot and log files instead of usign ZooKeeper API. In case a ZK ensemble is not available, can I login to all available nodes and run a utility that will dump all znodes? Thanks. -Vishal

Re: Retrying sequential znode creation

2010-10-13 Thread Vishal K
the state of play there. > > On Tue, Oct 12, 2010 at 12:11 PM, Vishal K wrote: > > > Hi, > > > > What is the best approach to have an idempotent create() operation for a > > sequential node? > > > > Suppose a client is trying to create a sequential node and it

Retrying sequential znode creation

2010-10-12 Thread Vishal K
Hi, What is the best approach to have an idempotent create() operation for a sequential node? Suppose a client is trying to create a sequential node and it gets a ConnectionLoss KeeperException, it cannot know for sure whether the request succeeded or not. If in the meantime, the client's session

Re: znode inconsistencies across ZooKeeper servers

2010-10-07 Thread Vishal K
from all servers during the time of the incident? If you could run the > servers in DEBUG level logging during the time you reproduce the issue that > would probably help: > https://issues.apache.org/jira/browse/ZOOKEEPER > > Thanks! > > Patrick > > > On Wed, Oct 6, 20

Re: znode inconsistencies across ZooKeeper servers

2010-10-06 Thread Vishal K
riod), in which case the ephemeral znodes _should_ > reappear when A is restarted and successfully rejoins the cluster. (at > least > until the session timeout is exceeded) > > Patrick > > On Tue, Oct 5, 2010 at 11:04 AM, Vishal K wrote: > > > Hi, > > > > I have a 3

Re: Understanding ZooKeeper data file management and LogFormatter

2010-09-08 Thread Vishal K
> cluster. > > I keep a few around just for backup purposes. > > On Wed, Sep 8, 2010 at 12:01 PM, Vishal K wrote: > > > Hi All, > > > > Can you please share your experience regarding ZK snapshot retention and > > recovery policies? > > > > We ha

Understanding ZooKeeper data file management and LogFormatter

2010-09-08 Thread Vishal K
Hi All, Can you please share your experience regarding ZK snapshot retention and recovery policies? We have an application where we never need to rollback (i.e., revert back to a previous state by using old snapshots). Given this, I am trying to understand under what circumstances would we ever n

Re: Session expiration caused by time change

2010-08-19 Thread Vishal K
ow, wouldn't it be ok for the server to just send a ping request to see if the client is really dead? > ben > > > On 08/19/2010 10:17 AM, Vishal K wrote: > >> Hi Ted, >> >> I haven't give it a serious thought yet, but I don't think it is >> neccess

Re: Session expiration caused by time change

2010-08-19 Thread Vishal K
not help. > > On Thu, Aug 19, 2010 at 7:51 AM, Vishal K wrote: > > > Hi, > > > > I remember Ben had opened a jira for clock jumps earlier: > > https://issues.apache.org/jira/browse/ZOOKEEPER-366. It is not uncommon > to > > have clocks jump forward in virtual

Re: Session expiration caused by time change

2010-08-19 Thread Vishal K
Hi, I remember Ben had opened a jira for clock jumps earlier: https://issues.apache.org/jira/browse/ZOOKEEPER-366. It is not uncommon to have clocks jump forward in virtualized environments. It is desirable to modify ZooKeeper to handle this situation (as much as possible) internally. It would ne

Re: Weird ephemeral node issue

2010-08-17 Thread Vishal K
Hi Qing, Can you list the znodes from the monitor and from the node that the monitor is restarting (run zkCli.sh on both machines). I am curious to see if the node that did not receive the SESSION_EXPIRED event still has the znode in its database. Also can you describe your setiup? Can you send ou

Re: How to handle "Node does not exist" error?

2010-08-16 Thread Vishal K
In my case, I am pretty sure that the configuration was right. I will reproduce it and post more info later. Thanks. On Mon, Aug 16, 2010 at 1:08 PM, Patrick Hunt wrote: > Try using the logs, stat command or JMX to verify that each ZK server is > indeed a leader/follower as expected. You should

Re: How to handle "Node does not exist" error?

2010-08-16 Thread Vishal K
Hi Dr Hao, If you think this is not a configuration issue, then it would be a good idea to open a jira. Thanks. On Thu, Aug 12, 2010 at 8:42 PM, Ted Dunning wrote: > On Thu, Aug 12, 2010 at 4:57 PM, Dr Hao He wrote: > > > hi, Ted, > > > > I am a little bit confused here. So, is the node incon

Re: How to handle "Node does not exist" error?

2010-08-12 Thread Vishal K
ination > layer in the code being coordinated is a bad idea. > > On Thu, Aug 12, 2010 at 6:33 AM, Vishal K wrote: > > > Hi Ted, > > > > Can you explain why running ZK in embedded mode can cause znode > > inconsistencies? > > Thanks. > > > > -Vish

Re: How to handle "Node does not exist" error?

2010-08-12 Thread Vishal K
Hi Ted, Can you explain why running ZK in embedded mode can cause znode inconsistencies? Thanks. -Vishal On Thu, Aug 12, 2010 at 12:01 AM, Ted Dunning wrote: > Try running the server in non-embedded mode. > > Also, you are assuming that you know everything about how to configure the > quorumPe

Too many "KeeperErrorCode = Session moved" messages

2010-08-05 Thread Vishal K
Hi All, I am seeing a lot of these messages in our application. I would like to know if I am doing something wrong or this is a ZK bug. Setup: - Server environment:zookeeper.version=3.3.0-925362 - 3 node cluster - Each node has few clients that connect to the local server using 127.0.0.1 as the h

cleanup ZK takes 40-60 seconds

2010-07-16 Thread Vishal K
Hi, We have embedded ZK server in our application. We start a thread in our application and call QuorumPeerMain.InitializeArgs(). When cleaning-up ZK we call QuorumPeerMain.shutdown() and wait for the thread that is calling InitializeArgs() to finish. These two steps are taking around 60 seconds.

Re: zookeeper crash

2010-06-16 Thread Vishal K
Hi, We are running into this bug very often (almost 60-75% hit rate) while testing our newly developed application over ZK. This is almost a blocker for us. Will the fix be simplified if backward compatibility was not an issue? Considering that this bug is rarely reported, I am wondering why we a

Securing ZooKeeper connections

2010-05-25 Thread Vishal K
Hi All, Since ZooKeeper does not support secure network connections yet, I thought I would poll and see what people are doing to address this problem. Is anyone running ZooKeeper over secure channels (client - server and server- server authentication/encryption)? If yes, can you please elaborate h

Re: Embedding ZK in another application

2010-04-29 Thread Vishal K
Hi, Well looks like FastLeaderElection.shutdown() is not invoked. This has been in 3.3.0. Should have checked on that earlier :-) On Thu, Apr 29, 2010 at 10:13 AM, Vishal K wrote: > Hi Ted, > > We want the application that embeds the ZK server to be running even after > the

Re: Embedding ZK in another application

2010-04-29 Thread Vishal K
mon to do a rolling restart on a ZK cluster. Just restart one > server at a time. This is often used during system upgrades. > > On Wed, Apr 28, 2010 at 8:22 PM, Vishal K wrote: > > > > > What is a good way to restart a ZK server (standalone and quorum) without > &g

Re: Embedding ZK in another application

2010-04-28 Thread Vishal K
how I can restart without restarting the application? Thanks. On Sun, Apr 25, 2010 at 2:52 PM, Vishal K wrote: > Hi Mahadev, Ted, > > Thanks for the feedback. > > > On Fri, Apr 23, 2010 at 3:02 PM, Ted Dunning wrote: > >> It is, of course, your decision, but a key

Re: Embedding ZK in another application

2010-04-25 Thread Vishal K
Hi Mahadev, Ted, Thanks for the feedback. On Fri, Apr 23, 2010 at 3:02 PM, Ted Dunning wrote: > It is, of course, your decision, but a key coordination function is to > determine whether your application is up or not. That is very hard to do > if > Zookeeper is inside your application. > > On

Re: Embedding ZK in another application

2010-04-23 Thread Vishal K
Hi, Good question. We are planning to do something similar as well and it will great to know if there are any issues with embedding ZK server into an app. We simply use QourumPeerMain and QourumPeer from our app to start/stop the ZK server. Is this not a good way to do it? On Fri, Apr 23, 2010 at

Re: Recovery issue - how to debug?

2010-04-19 Thread Vishal K
Hi Hao, How are you determining whether a ZK server has received the writes or not? Regards, -Vishal On Mon, Apr 19, 2010 at 1:54 AM, Dr Hao He wrote: > I have zookeeper cluster E1 with 3 nodes A,B, and C. > > I stopped C and did some writes on E1. Both A and B received the writes. > I then

Re: Errors while running sytest

2010-04-07 Thread Vishal K
Hi, It works for me now. Just for the record, I had to copy junit*.jar to buil/lib because fat.jar expects it to be there. Then, I had to rebuild fatjar.jar. On Wed, Apr 7, 2010 at 12:10 AM, Vishal K wrote: > Hi, > > I am trying to run systest on a 3 node cluster ( > http://sv

Errors while running sytest

2010-04-06 Thread Vishal K
Hi, I am trying to run systest on a 3 node cluster ( http://svn.apache.org/repos/asf/hadoop/zookeeper/trunk/src/java/systest/README.txt ). When I reach the 4th step which is to actually run the test I get exception shown below. Exception in thread "main" java.lang.NoClassDefFoundError: junit/fra