Roadmap for ZooKeeper 3.0

2008-09-16 Thread Patrick Hunt
The ZooKeeper JIRA is up to date and a tentative date of 9/22/2008 for the 3.0 release has been set. You can see the roadmap here: https://issues.apache.org/jira/browse/ZOOKEEPER?report=com.atlassian.jira.plugin.system.project:roadmap-panel 3.0 will be our first Apache release. As such the

Proposal to require Java6 in 3.1.0

2008-10-27 Thread Patrick Hunt
I've entered a JIRA targeted for ZooKeeper 3.1.0 that will add Java 6 requirement to ZooKeeper (we will drop java5 support). If you have any feedback (pos or neg) please add comments to the issue: https://issues.apache.org/jira/browse/ZOOKEEPER-210 Regards, Patrick

Re: ZooKeeper Roadmap - 3.1.0 and beyond.

2008-11-05 Thread Patrick Hunt
, 2008 at 10:35 PM, Patrick Hunt [EMAIL PROTECTED] wrote: Our first official Apache release has shipped and I'm already looking forward to 3.1.0. ;-) In particular I believe we should look at the following for 3.1.0: 1) there are a number of issues that we're targeted to 3.1.0 during the 3.0.0

Re: Exists Watch Triggered by Delete

2008-11-12 Thread Patrick Hunt
Patrick -Original Message- From: Patrick Hunt [EMAIL PROTECTED] Sent: Wednesday, November 12, 2008 2:11pm To: zookeeper-user@hadoop.apache.org Subject: Re: Exists Watch Triggered by Delete Hi Stu, The zk server maintains 2 lists of watches, data and child watches: http

ZooKeeper 3.0 Fix Release slated for end of this week.

2008-11-18 Thread Patrick Hunt
I've slated the 3.0.1 fix release of ZooKeeper for the end of this week. https://issues.apache.org/jira/browse/ZOOKEEPER?report=com.atlassian.jira.plugin.system.project:roadmap-panel Of particular interest are exists() NPE in ZOOKEEPER-226, and some perf issues ZOOKEEPER-212 ZOOKEEPER-223 If

Re: ActiveMQ is now using ZooKeeper

2008-11-24 Thread Patrick Hunt
That's great, very cool! Can you create ZOOKEEPER JIRAs for these items that you've identified? First look it seems like we should be able to include these in 3.1.0, perhaps even 3.0.2. Regards, Patrick Hiram Chirino wrote: FYI: ActiveMQ has now started using ZooKeeper to do master

Re: RPM?

2008-11-24 Thread Patrick Hunt
I'm not aware of any. Patrick Garth Patil wrote: Hi, Has anyone created an RPM or a SPEC file for Zookeeper? I thought I'd ask before I embarked on creating one. Thanks, Garth

Re: ZooKeeper 3.0 Fix Release slated for end of this week.

2008-12-02 Thread Patrick Hunt
. On 11/24/08 5:50 PM, Patrick Hunt [EMAIL PROTECTED] wrote: From: Patrick Hunt [EMAIL PROTECTED] Date: November 24, 2008 5:50:24 PM PST To: [EMAIL PROTECTED] Subject: [VOTE] Release ZooKeeper 3.0.1 (candidate 1) I've created a second candidate build for ZooKeeper 3.0.1. *** Please download, test

Re: ZooKeeper (not) on Wikipedia

2008-12-03 Thread Patrick Hunt
broad. So I added them both, which most probably is the real nature of the beast anyway ;o) Cheers k/ |-Original Message- |From: Patrick Hunt [mailto:[EMAIL PROTECTED] |Sent: Tuesday, December 02, 2008 11:13 PM |To: zookeeper-user@hadoop.apache.org; Krishna Sankar (ksankar) |Cc: [EMAIL

Re: Missing statement in DataMonitor example ?

2008-12-11 Thread Patrick Hunt
Hi Hanson, I think that the code is OK in this situation, however the comment is misleading. In ZooKeeper version 3 watches are re-registered when the client reconnects to a server. See this section of the docs:

Re: Reconnecting to another host on failure but before session expires...

2009-01-07 Thread Patrick Hunt
There's also been interest in having a chroot type capability as part of the connect string: host:port/app/abc,... where the client's session would be rooted at /app/abc rather than / This is very useful in multi-tenant situations (more than 1 app sharing a zk cluster). Patrick Benjamin

Re: Simpler ZooKeeper event interface....

2009-01-07 Thread Patrick Hunt
Kevin Burton wrote: 3) it's possible for your code to get notified of a change, but never process the change. This might happen if: a) a node changed watch fires b) your client code runs an async getData c) you are disconnected from the server Also, this seems very confusing... If I run

Re: Simpler ZooKeeper event interface....

2009-01-07 Thread Patrick Hunt
To say that it will never return is not correct. The client will be notified of connectionloss in the callback, however the client will not know if the operation was successful (from point of view of the server) or not. Patrick Kevin Burton wrote: On Wed, Jan 7, 2009 at 11:12 AM, Mahadev

Re: Does session expiration only happen during total ensemble failure or network split?

2009-01-07 Thread Patrick Hunt
Mahadev Konar wrote: Why would you want the session to expire if all the servers are down (which should not happen unless you kill all the nodes or the datacenter is down) ? A more likely case is that the client port on the switch dies and the client is partitioned from the servers...

Re: ouch, zookeeper infinite loop

2009-01-07 Thread Patrick Hunt
with Jute a bit. Kevin On Wed, Jan 7, 2009 at 10:07 AM, Patrick Hunt ph...@apache.org wrote: Thanks for the report, entered as: https://issues.apache.org/jira/browse/ZOOKEEPER-268 For the time being you can work around this by setting the threshold to INFO for that class (in log4j.properties

Re: Simpler ZooKeeper event interface....

2009-01-07 Thread Patrick Hunt
Kevin Burton wrote: Here's a good reason for each client to know it's session status (connected/disconnected/expired). Depending on the application, if L does not have a connected session to the ensemble it may need to be careful how it acts. connected/disconnected events are given out in

Re: Simpler ZooKeeper event interface....

2009-01-07 Thread Patrick Hunt
Vinod Johnson wrote: I guess then I don't follow the leader election recipe. Is the following scenario possible in the leader election recipe: 1) Leader L is partitioned from the ensemble. 2) ZK servers expire its session. 3) Some other follower F now becomes a leader. 4) L and F form a

Re: Updated NodeWatcher...

2009-01-09 Thread Patrick Hunt
Ben this is great, thanks! Do you want to close out this one and point to the faq? https://issues.apache.org/jira/browse/ZOOKEEPER-264 Although IMO this should be moved to the forrest docs. Patrick Benjamin Reed wrote: I'm really bad a creating figures, but i've put up something that

Re: Standard redistributable set of primitives?

2009-01-16 Thread Patrick Hunt
There's been some interest to see this in 3.2.0, which I think is a good idea. We'll be firming up 3.2 plans once 3.1 is out the door. There's open JIRA on this btw: https://issues.apache.org/jira/browse/ZOOKEEPER-78 Feel free to work on this if you'd like, we certainly see the benefit of

Re: Delaying 3.1 release by 2 to 3 weeks?

2009-01-16 Thread Patrick Hunt
Mahadev, can you complete quotas in 2 weeks? This includes completing the code itself, documentation, tests, and incorporating review feedback? Parick Benjamin Reed wrote: we should delay. it would be good to try out quotas for a bit before we do the release. quotas are also a key part of the

ZooKeeper 3.1 release process starting today.

2009-02-03 Thread Patrick Hunt
All 3.1 issues have been resolved, I'll be starting the release process today, detailed here: http://wiki.apache.org/hadoop/ZooKeeper/HowToRelease If voting is timely successful an official release should be available early/mid next week. You can follow more closely on the zookeeper-dev list.

Re: ZooKeeper 3.1 and C API/ABI

2009-02-05 Thread Patrick Hunt
Chris, that's unfortunate re the version number (config.h), but I think I see why that is -- config.h should only really be visible in the implementation, not exposed through the includes. I've created a JIRA for this: https://issues.apache.org/jira/browse/ZOOKEEPER-293 We'll hold 3.1 for

Re: ZooKeeper 3.1 and C API/ABI

2009-02-05 Thread Patrick Hunt
contributions from anyone. ;-) Patrick Hunt wrote: Chris, that's unfortunate re the version number (config.h), but I think I see why that is -- config.h should only really be visible in the implementation, not exposed through the includes. I've created a JIRA for this: https://issues.apache.org/jira

Re: Testing Zookeeper

2009-02-10 Thread Patrick Hunt
Yes, please create a new JIRA! I'd also encourage you to subscribe to the zookeeper-dev list (please followup/discussion there). All of our technical discussion goes on there - zookeeper-user is more for support issues on released products. Thanks, Patrick Joshua Tuberville wrote: Nitay,

Re: Dealing with session expired

2009-02-12 Thread Patrick Hunt
Ephemerals and watches are maintained across disconnect/reconnect btw the client and server however session expiration (or closing the session explicitly) will trigger deletion of ephemeral nodes associated with the session. Right - once the session is expired the id is invalid. You need to

Re: Dealing with session expired

2009-02-12 Thread Patrick Hunt
Regardless of frequency Tom's code still has to handle this situation. I would suggest that the two classes Tom is referring to in his mail, the ones that use ZK client object, should either be able to reinitialize with a new zk session, or they themselves should be discarded and new

Re: Dealing with session expired

2009-02-12 Thread Patrick Hunt
have to throw the KeeperException as a fatal exception rather than letting that client try to re-elect. Or maybe add in some logic to say if I can't re-elect, _then_ throw an exception and consider it fatal. Thanks guys. -Tom On Thu, Feb 12, 2009 at 2:39 PM, Patrick Hunt ph...@apache.org wrote

Re: Watcher guarantees

2009-02-13 Thread Patrick Hunt
Tom White wrote: If client sets a watcher on a znode by doing a getData operation is it guaranteed to get the next change after the value it read, or can a change be missed? In other words if the value it read had zxid z1 and the next update of the znode has zxid z2, will the watcher always

Re: Watcher guarantees

2009-02-13 Thread Patrick Hunt
Tom White wrote: Thanks for your answers. I understand that you have to call getData() to find the value of the znode, and that it may have been updated between the watcher being called and the getData() call. So in that sense you can miss updates. But watches have strong enough guarantees to

[ANNOUNCE] Apache ZooKeeper 3.1.0

2009-02-13 Thread Patrick Hunt
The Apache ZooKeeper team is proud to announce Apache ZooKeeper version 3.1.0. ZooKeeper is a high-performance coordination service for distributed applications. It exposes common services - such as naming, configuration management, synchronization, and group services - in a simple interface

What are you using ZooKeeper for?

2009-02-17 Thread Patrick Hunt
If you are using ZK and can publicly share this information please update the wiki PoweredBy page: http://wiki.apache.org/hadoop/ZooKeeper/PoweredBy Patrick

Re: Recommended session timeout

2009-02-23 Thread Patrick Hunt
The latest docs (3.1.0 has some updates to that section) can be found here: http://hadoop.apache.org/zookeeper/docs/r3.1.0/zookeeperProgrammers.html#ch_zkSessions Patrick Mahadev Konar wrote: Hi Joey, here is a link to information on session timeouts.

Re: Recommended session timeout

2009-02-24 Thread Patrick Hunt
throughput of the ensemble. Patrick Is this a bug, or desirable behavior? Thanks, -Joey On Mon, Feb 23, 2009 at 10:37 PM, Patrick Hunt ph...@apache.org wrote: The latest docs (3.1.0 has some updates to that section) can be found here: http://hadoop.apache.org/zookeeper/docs/r3.1.0

Re: Recommended session timeout

2009-02-26 Thread Patrick Hunt
That's very interesting results, a good job sleuthing. You might try the concurrent collector? http://java.sun.com/javase/technologies/hotspot/gc/gc_tuning_6.html#available_collectors.selecting specifically item 4 -XX:+UseConcMarkSweepGC I've never used this before myself but it's supposed to

Re: Contrib section (nee Re: A modest proposal for simplifying zookeeper :)

2009-02-26 Thread Patrick Hunt
So far we've stayed with the process used by core as this minimizes the amount of work we need to do re process/build/release, etc... we just copy the process/build/release etc... used in core, we get all that for free. I'm hesitant to diverge as this will increase the amount of work we need

Re: Contrib section (nee Re: A modest proposal for simplifying zookeeper :)

2009-02-27 Thread Patrick Hunt
section? On Thu, Feb 26, 2009 at 10:00 PM, Patrick Hunt ph...@apache.org wrote: So far we've stayed with the process used by core as this minimizes the amount of work we need to do re process/build/release, etc... we just copy the process/build/release etc... used in core, we get all

Re: Contrib section (nee Re: A modest proposal for simplifying zookeeper :)

2009-02-27 Thread Patrick Hunt
Ben, you might want to look at buildr, it recently graduated from the apache incubator: http://buildr.apache.org/ Buildr is a build system for Java applications. We wanted something that’s simple and intuitive to use, so we only need to tell it what to do, and it takes care of the rest. But

Re: Adding a server to a running ensemble?

2009-02-27 Thread Patrick Hunt
we do have an open issue to do this more on the fly without having to do the bounce, but it is behind other priorities in the work queue. This is the JIRA: https://issues.apache.org/jira/browse/ZOOKEEPER-107 in case someone would like to work on this.

Re: Semantics of ConnectionLoss exception

2009-03-25 Thread Patrick Hunt
Mahadev Konar wrote: Hi Nitay, - Does this event happening mean my ephemeral nodes will go away? No. the client will try connecting to other servers and if its not able to reconnect to the servers within the remaining session timeout. If the client is not able to connect within the

[ANNOUNCE] Apache ZooKeeper 3.1.1

2009-03-27 Thread Patrick Hunt
The Apache ZooKeeper team is proud to announce Apache ZooKeeper version 3.1.1. ZooKeeper is a high-performance coordination service for distributed applications. It exposes common services - such as naming, configuration management, synchronization, and group services - in a simple interface

Re: ZooKeeper Perl module

2009-04-02 Thread Patrick Hunt
Hey Chris this is really great! Thanks for making it available to the community, very cool. Patrick Chris Darroch wrote: Hi -- The http://wiki.apache.org/hadoop/ZooKeeper page includes the comment that someday we hope to get Python, Perl, and REST interfaces. I hope I can help with one

Re: starting replicated ZK server

2009-04-13 Thread Patrick Hunt
Jun Rao wrote: From the ZK web site, it's not clear how to set up a multi-node ZK service. It seems that one has to add the server entries in the conf file and create myid files on each node. Then, how should I start the ZK nodes? I tried issuing zkServer start from each node and that didn't

Re: problems on EC2?

2009-04-16 Thread Patrick Hunt
Take a look at this section to start: http://hadoop.apache.org/zookeeper/docs/current/zookeeperAdmin.html#sc_commonProblems What type of monitoring are you doing on your cluster? You could monitor at both the host and at the java (jmx) level. That will give you some insight on where to look;

Re: problems on EC2?

2009-04-16 Thread Patrick Hunt
the server side timeout is sufficiently long. Thanks again. On Thu, Apr 16, 2009 at 10:57 AM, Patrick Hunt ph...@apache.org wrote: lots of stuff about monitoring ... jmx ... packet loss ... vm latencies ... timeout details. ... Hope this helps. Patrick

Re: problems on EC2?

2009-04-16 Thread Patrick Hunt
ps. please consider presenting your experiences running ZK inside EC2 at an upcoming Hadoop social or even at the summit. I know I'd really be interested to hear your experiences and I think it would be useful for both new and existing ZK users. Patrick Patrick Hunt wrote: Well that's good

Re: problems on EC2?

2009-04-16 Thread Patrick Hunt
Ted Dunning wrote: On a related note, what is best practice for handling session expiration? Just deal with it as if it is a new start? See this re handling the errors ZK can throw at you: http://wiki.apache.org/hadoop/ZooKeeper/ErrorHandling Patrick

New ZooKeeper wiki page on Troubleshooting

2009-04-16 Thread Patrick Hunt
I added a new ZK page detailing how to troubleshoot the operating environment. A FAQ of sorts, it has a number of examples of situations we've seen and how to determine/resolve the issue. We'll expand on the content as new issues are seen.

Re: Some one send me some demo of programming with C client API for Zookeeper

2009-04-20 Thread Patrick Hunt
time? It would really help me much, 3x~ On Fri, Apr 17, 2009 at 1:20 AM, Patrick Hunt ph...@apache.org wrote: You can generate the doxygen C API docs using make doxygen-doc (see the README). Mahadev Konar wrote: Please take a look at src/c/src/cli.c for some examples on zookeeper c client

Re: NodeChildrenChanged WatchedEvent

2009-05-08 Thread Patrick Hunt
Javier, also note that the subsequent getChildren you mention in your original email is usually not entirely superfluous given that you generally want to watch the parent node for further changes, and a getChildren is required to set that watch. Patrick Benjamin Reed wrote: i'm adding a faq

Apache CXF integrating ZooKeeper

2009-05-08 Thread Patrick Hunt
I noticed this today, sounds interesting: http://cxf.apache.org/ Apache CXF is an open source services framework. CXF helps you build and develop services using frontend programming APIs, like JAX-WS. These services can speak a variety of protocols such as SOAP, XML/HTTP, RESTful HTTP, or

Re: ZooKeeper heavy CPU utilisation

2009-06-02 Thread Patrick Hunt
According to your trace I see you are using jvm 1.6.0_03-b05 One of the bugs fixed in: http://java.sun.com/javase/6/webnotes/6u4.html specifically: http://bugs.sun.com/bugdatabase/view_bug.do?bug_id=6403933 seems to have a description very close to what you are seeing. Perhaps you can try

Re: ConnectionLoss (node too big?)

2009-06-03 Thread Patrick Hunt
Agree, created a new JIRA for this: https://issues.apache.org/jira/browse/ZOOKEEPER-430 See the following JIRA for one example why not to do this: https://issues.apache.org/jira/browse/ZOOKEEPER-327 In general you don't want to create large node sizes since all of the data/nodes are stored in

Re: ConnectionLoss (node too big?)

2009-06-03 Thread Patrick Hunt
Would love to see it - the wiki might be a better choice in terms of visibility, I created this page fairly recently: http://wiki.apache.org/hadoop/ZooKeeper/Troubleshooting A section on data size impact would be great (incl general information on the cluster configuration (ie 3 vs 5 servers

Re: ZooKeeper heavy CPU utilisation

2009-06-03 Thread Patrick Hunt
solution is to restart the server that experiences this problem. Patrick Patrick Hunt wrote: According to your trace I see you are using jvm 1.6.0_03-b05 One of the bugs fixed in: http://java.sun.com/javase/6/webnotes/6u4.html specifically: http://bugs.sun.com/bugdatabase/view_bug.do?bug_id

Re: Win32 as a production platform

2009-06-04 Thread Patrick Hunt
Agree. I developed on a windows box for about 9months w/o any windows specific issues being found. If someone were willing to do the work and take on responsibility for long term dev/test/support I don't see why we wouldn't add it. Patrick ps - notice that the dev level use of windows

Show your ZooKeeper pride!

2009-06-08 Thread Patrick Hunt
The Hadoop summit is Wednesday. If you're attending please feel free to say hi -- Mahadev is presenting @4, Ben and I will be attending as well. Also, regardless of whether you're attending or not we'd appreciate any updates to the powered by page, if you're too busy to update it yourself

Re: ZK quota

2009-06-29 Thread Patrick Hunt
Do we have a JIRA for this? If not we should add one for 3.3. Patrick Mahadev Konar wrote: Hi Raghu, We do have plans to enforce quota in future. Enforcing requires some more work then just reporting. Reporting is a good enough tool for operations to manage a zookeeper cluster but we would

Re: zookeeper on ec2

2009-07-06 Thread Patrick Hunt
Ted thanks for the info. I've created a wiki page http://wiki.apache.org/hadoop/ZooKeeper/ZooKeeperOnEC2 to capture details of running ZK on EC2. If you or anyone else would like to update it with information please do so. Regards, Patrick Ted Dunning wrote: I disagree with the original

Re: zookeeper on ec2

2009-07-07 Thread Patrick Hunt
Henry Robinson wrote: Effectively, EC2 does not introduce any new failure modes but potentially exacerbates some existing ones. If a majority of EC2 nodes fail (in the sense that their hard drive images cannot be recovered), there is no way to restart the cluster, and persistence is lost. As you

[ANNOUNCE] Apache ZooKeeper 3.2.0

2009-07-08 Thread Patrick Hunt
The Apache ZooKeeper team is proud to announce Apache ZooKeeper version 3.2.0. ZooKeeper is a high-performance coordination service for distributed applications. It exposes common services - such as naming, configuration management, synchronization, and group services - in a simple interface

Re: Question about the sequential flag on create.

2009-07-14 Thread Patrick Hunt
Nodes are maintained un-ordered on the server. A node can store any subnodes, not exclusively sequential nodes. If we added an ordering guarantee then then server would have to store the children sorted for every parent node. This is a problem for a few reasons; 1) in many cases you don't care

Re: Read-your-writes consistency?

2009-07-14 Thread Patrick Hunt
Yes, this is a strong guarantee: http://hadoop.apache.org/zookeeper/docs/current/zookeeperProgrammers.html#ch_zkGuarantees Sync is only necessary if client A makes a change, then client B wishes to read that change with guarantee that it will see the successfully applied change previously made

Re: queue with limit to number of simultaneous tasks

2009-07-14 Thread Patrick Hunt
It's hard to say, there are a number of variables. Some things to think about: Are the tasks idempotent? do they have leases (like SQS)? Is one process responsible for processing the tasks or will you have many vying for the jobs? Are the tasks ordered by creation date, or weighted by some

Re: Instantiating HashSet for DataNode?

2009-07-14 Thread Patrick Hunt
Erik, if you'd like enter a JIRA and take a whack at it go ahead. Perhaps a subclass of DataNode specific for ephemerals? That way it can handle any particulars - and should also minimize the number of if(children==null) type checks that would be needed. (don't neg. impact performance or b/w

Re: Queue code

2009-07-17 Thread Patrick Hunt
Thanks for the report, looks like something we need to address, would you mind going the extra step and adding a JIRA on this? https://issues.apache.org/jira/browse/ZOOKEEPER Thanks, Patrick kishore g wrote: Hi All, Zookeeper recipe queue code has a bug. byte[] b = zk.getData(root +

Re: Zookeeper WAN Configuration

2009-07-28 Thread Patrick Hunt
Flavio, please enter a doc jira for this if there are no docs, it should be in forrest, not twiki btw. It would be good if you could review the current quorum docs (any type) and create a jira/patch that addresses any/all shortfall. Patrick Flavio Junqueira wrote: Todd, Some more answers.

Re: test failures in branch-3.2

2009-07-30 Thread Patrick Hunt
Todd Greenwood wrote: The build succeeds, but not the all of the tests. In previous test runs, I noticed an error in org.apache.zookeeper.test.FLETest. It was not able to bind to a port or something. Now, after a machine reboot, I'm getting different failures. address in use? That's a problem

Re: test failures in branch-3.2

2009-07-30 Thread Patrick Hunt
it always fail?) I've entered a jira to address this: https://issues.apache.org/jira/browse/ZOOKEEPER-492 Patrick Patrick Hunt wrote: Todd Greenwood wrote: The build succeeds, but not the all of the tests. In previous test runs, I noticed an error in org.apache.zookeeper.test.FLETest

Re: test failures in branch-3.2

2009-07-30 Thread Patrick Hunt
well try running these two tests individually and see if they always fail or just occassionally. that will be a good start (and the env detail). Patrick Todd Greenwood wrote: No edits to conf/log4j.properties. -Original Message- From: Patrick Hunt [mailto:ph...@apache.org] Sent

Re: test failures in branch-3.2

2009-07-30 Thread Patrick Hunt
Todd Greenwood wrote: [Todd] Yes, I believe address in use was the problem w/ FLETest. I assumed it was a timing issue w/ respect to test A not fully releasing resources before test B started. Might be, but actually I think it's related to this:

Re: test failures in branch-3.2

2009-07-30 Thread Patrick Hunt
until the VM exit. -Todd -Original Message- From: Patrick Hunt [mailto:ph...@apache.org] Sent: Thursday, July 30, 2009 10:13 PM To: zookeeper-user@hadoop.apache.org Subject: Re: test failures in branch-3.2 Todd Greenwood wrote: [Todd] Yes, I believe address in use was the problem

Re: test failures in branch-3.2

2009-07-31 Thread Patrick Hunt
if you can wait a week or so... Regards, Patrick Todd Greenwood wrote: Inline. -Original Message- From: Patrick Hunt [mailto:ph...@apache.org] Sent: Thursday, July 30, 2009 10:57 PM To: zookeeper-user@hadoop.apache.org Subject: Re: test failures in branch-3.2 Todd Greenwood wrote

Re: test failures in branch-3.2

2009-07-31 Thread Patrick Hunt
with the manual patches to branch-3.2, as I am under some time constraints to get our infrastructure deployed such that QA can start playing with it. However, I'll switch to 3.2.1 as soon as I can. Understood. Patrick -Original Message- From: Patrick Hunt [mailto:ph...@apache.org] Sent

Re: c client error message with chroot

2009-08-03 Thread Patrick Hunt
Please do enter a JIRA. Looking at the source it seems that we log and error, but the calling code continues. I think this is happening because the chroot c lib code is not handling znode watches separate from state change notifications. The calling code just continues after logging an

Re: Errors when run zookeeper in windows ?

2009-08-18 Thread Patrick Hunt
you are using java 1.6 right? more detail on the class not found would be useful (is that missing or just not included in your email?) Also the command line you're using to start the app would be interesting. Patrick Mahadev Konar wrote: Hi Zhang, Are you using cygwin? mahadev On 8/17/09

Re: Errors when run zookeeper in windows ?

2009-08-19 Thread Patrick Hunt
-1.2.15.jar. Program will exit. $ Thank you Jeff zhang On Tue, Aug 18, 2009 at 12:53 PM, Patrick Hunt ph...@apache.org wrote: you are using java 1.6 right? more detail on the class not found would be useful (is that missing or just not included in your email?) Also the command line you're

Re: Errors when run zookeeper in windows ?

2009-08-19 Thread Patrick Hunt
On Tue, Aug 18, 2009 at 12:53 PM, Patrick Hunt ph...@apache.org wrote: you are using java 1.6 right? more detail on the class not found would be useful (is that missing or just not included in your email?) Also the command line you're using to start the app would be interesting. Patrick

Re: Errors when run zookeeper in windows ?

2009-08-19 Thread Patrick Hunt
One more thing, please enter a jira on this so that we can track/fix it. https://issues.apache.org/jira/browse/ZOOKEEPER Thanks, Patrick Patrick Hunt wrote: I suspect it has to do with the classpath - specifically having spaces in the directory name. Notice that one of the lines you included

Re: A question about Connection timed out and operation timeout

2009-08-20 Thread Patrick Hunt
Hi Qian, it would good if you could create a jira for this: https://issues.apache.org/jira/browse/ZOOKEEPER include both the client logs and the server logs (for overlapping client/server time period where you see the problem). also the server config if you're using a quorum vs standalone. If

Re: configuring Zookeeper in HBase with IP addresses only

2009-08-24 Thread Patrick Hunt
Hey Pythonner, thanks for the followup! if I understand correctly, this error message is generated by hbase and not zk? Can you tell at all? (seems likely to me as the configuration you re referring to is hbase specific... but would like to verify). Can anyone confirm, is this an issue with

Re: configuring Zookeeper in HBase with IP addresses only

2009-08-24 Thread Patrick Hunt
Nice! Jean-Daniel Cryans wrote: Added here http://wiki.apache.org/hadoop/Hbase/Troubleshooting#12 J-D On Mon, Aug 24, 2009 at 5:20 PM, Patrick Huntph...@apache.org wrote: No worries. The details are actually interesting/useful, you might consider adding to your docs in case another user runs

Re: Creating ephemeral nodes: First time returns ZNODEEXISTS

2009-08-31 Thread Patrick Hunt
Hi Leonard, Between 00:43:23,035 and 00:43:23,157 I see client session 0x123730dbe6e0001 get 15 node exists exceptions in a row. Are you expecting this? (ie are you attempting to create this node 15 times in a row or is this unexpected? I can't tell from the client snippet you included)

Re: zookeeper on ec2

2009-09-01 Thread Patrick Hunt
What is your client timeout? It may be too low. also see this section on handling recoverable errors: http://wiki.apache.org/hadoop/ZooKeeper/ErrorHandling connection loss in particular needs special care since: When a ZooKeeper client loses a connection to the ZooKeeper server there may be

Re: zookeeper on ec2

2009-09-01 Thread Patrick Hunt
are not swapping (see gc pressure), etc...) Patrick Satish Bhatti wrote: Session timeout is 30 seconds. On Tue, Sep 1, 2009 at 4:26 PM, Patrick Hunt ph...@apache.org wrote: What is your client timeout? It may be too low. also see this section on handling recoverable errors: http

Re: zookeeper on ec2

2009-09-01 Thread Patrick Hunt
has one server. Not sure if this is exacerbating the problem? I will check out the trouble shooting link you sent me. On Tue, Sep 1, 2009 at 5:01 PM, Patrick Hunt ph...@apache.org wrote: I'm not very familiar with ec2 environment, are you doing any monitoring? In particular network connectivity

Re: ACL question w/ Zookeeper 3.1.1

2009-09-18 Thread Patrick Hunt
Todd, there were other responses as well. Are you seeing other traffic from the lists? (perhaps a spam filtering issue?) Patrick Mahadev Konar wrote: HI todd, We did respond on zookeeper-user. Here is my response in case you didn't see it... HI todd, From what I understand, you are

Re: ACL question w/ Zookeeper 3.1.1

2009-09-21 Thread Patrick Hunt
Greenwood [mailto:to...@audiencescience.com] Sent: Friday, September 18, 2009 11:27 AM To: Patrick Hunt; zookeeper-...@hadoop.apache.org; zookeeper- u...@hadoop.apache.org Subject: RE: ACL question w/ Zookeeper 3.1.1 Patrick / Mahadev, Thanks for the heads-up! Apparently I *am* receiving email from

Re: ACL question w/ Zookeeper 3.1.1

2009-09-21 Thread Patrick Hunt
= {org.apache.zookeeper.proto.createrespo...@1360}'/ACLTest\n r = {org.apache.zookeeper.proto.replyhea...@1389}2,2,0\n -Todd -Original Message- From: Patrick Hunt [mailto:ph...@apache.org] Sent: Monday, September 21, 2009 4:14 PM To: zookeeper-user@hadoop.apache.org; Todd Greenwood

Re: The idea behind 'myid'

2009-09-26 Thread Patrick Hunt
Not sure if you'll find this interesting but my zk configuration generator is available on github: http://github.com/phunt/zkconf zkconf.py will generate all of the configuration needed to run a ZooKeeper ensemble. I mainly use this tool for localhost based testing, but it can generate

Re: How to reconnect server when the session expired?

2009-09-29 Thread Patrick Hunt
When the session expires your only option is to create a new session and re-register the watches, re-create ephemerals, etc... Patrick yeqian@gmail.com wrote: Hi all: When used Zookeeper in my project, I noticed a problem about the session. When a session expired, the io thread of the

Re: The idea behind 'myid'

2009-09-29 Thread Patrick Hunt
Jason Venner wrote: I do find having to have a custom file in each zk root somewhat awkward, as I like to rsync my configuration files around. I also would prefer not to have to have all of my zk nodes listed in the configuration file by id. I think I would prefer it if there was a mechanism

Re: feedback zkclient

2009-09-30 Thread Patrick Hunt
Hi Stefan, two suggestions off the bat: 1) fill in something in the README, doesn't have to be final or polished, but give some insight into the what/why/how/where/goals/etc... to get things moving quickly for reviewers new users. 2) you should really discuss on the dev list. It's up to you

Re: feedback zkclient

2009-10-01 Thread Patrick Hunt
) in particular that I should look at to see how zkclient is used, and the benefits incurred? Regards, Patrick Patrick Hunt wrote: Hi Stefan, two suggestions off the bat: 1) fill in something in the README, doesn't have to be final or polished, but give some insight into the what/why/how/where/goals

Re: feedback zkclient

2009-10-01 Thread Patrick Hunt
Peter Voss wrote: On 01.10.2009, at 08:57, Patrick Hunt wrote: 2) what purpose does ZkEventThread serve? ZkClient updates it's connection state from the ZooKeeper events. Based on these it notifies listeners, updates it's connection state or reconnects to ZooKeeper. ZkClient has its own

Re: feedback zkclient

2009-10-01 Thread Patrick Hunt
Not to harp on this ;-) but this sounds like something that would be a very helpful addition to the README. Ted Dunning wrote: I think that another way to say this is that zkClient is going a bit for the Spring philosophy that if the caller can't (or won't) be handling the situation, then they

Re: feedback zkclient

2009-10-01 Thread Patrick Hunt
Ted Dunning wrote: You may be able to tell if the file is yours be examining the content and ownership, but this is pretty implementation dependent. In particular, it makes queues very difficult to implement correctly. If this happens during the creation of an ephemeral file, the only option

Re: feedback zkclient

2009-10-01 Thread Patrick Hunt
Ted Dunning wrote: Judging by history and that fact that only 40/127 issues are resolved, 3.3 is probably 3-6 months away. Is that a fair assessment? Yes, that's fair. Patrick On Thu, Oct 1, 2009 at 11:13 AM, Patrick Hunt ph...@apache.org wrote: One nice thing about ephemeral

Re: How do we find the Server the client is connected to?

2009-10-01 Thread Patrick Hunt
That detail is purposefully not exposed through the client api, however it is output to the log on connection establishment. Why would your client code need to know which server in the ensemble it is connected to? Patrick Rob Baccus wrote: How do I determine the server the client is

Re: How do we find the Server the client is connected to?

2009-10-01 Thread Patrick Hunt
().toString(); } Feel free to add a JIRA, I think we could make this a protected method on ZooKeeper to make testing easier (and not expose internals). Regards, Patrick Todd Greenwood wrote: Failover testing. -Original Message- From: Patrick Hunt [mailto:ph...@apache.org] Sent

Re: How do we find the Server the client is connected to?

2009-10-01 Thread Patrick Hunt
to...@audiencescience.comwrote: Failover testing. -Original Message- From: Patrick Hunt [mailto:ph...@apache.org] Sent: Thursday, October 01, 2009 3:44 PM To: zookeeper-user@hadoop.apache.org; Rob Baccus Subject: Re: How do we find the Server the client is connected to? That detail is purposefully

  1   2   3   4   >