Mahadev, Some quick questions:
1. Version I see that the CHANGES.txt calls this 3.2.1, but the build.xml is still calling this 3.2.0. Should this be rev'd, and am I correct in calling this release 3.2.1? 2. Build targets The package target fails b/c the create-cppunit-configure target fails due to various problems w/ respect to autoconf. Are these dependencies documented somewhere ? I'd like to have a fully building system. create-cppunit-configure: [exec] Can't exec "libtoolize": No such file or directory at /usr/bin/autoreconf line 188. [exec] Use of uninitialized value $libtoolize in pattern match (m//) at /usr/bin/autoreconf line 188. [exec] configure.ac:33: warning: macro `AM_PATH_CPPUNIT' not found in library [exec] configure.ac:33: error: possibly undefined macro: AM_PATH_CPPUNIT [exec] If this token and others are legitimate, please use m4_pattern_allow. [exec] See the Autoconf documentation. [exec] configure.ac:53: error: possibly undefined macro: AC_PROG_LIBTOOL [exec] autoreconf: /usr/bin/autoconf failed with exit status: 1 3. Sync failure: This is still failing. svn: URL 'http://svn.apache.org/repos/asf/hadoop/common/nightly/test-patch' doesn't exist -Todd > -----Original Message----- > From: Todd Greenwood > Sent: Tuesday, August 04, 2009 11:26 AM > To: 'zookeeper-u...@hadoop.apache.org' > Subject: RE: Unending Leader Elections in WAN deploy > > Great news. Thank you Mahadev. I'll report our findings later today. > -Todd > > > -----Original Message----- > > From: Mahadev Konar [mailto:maha...@yahoo-inc.com] > > Sent: Tuesday, August 04, 2009 11:20 AM > > To: zookeeper-u...@hadoop.apache.org > > Subject: Re: Unending Leader Elections in WAN deploy > > > > Hi Todd, > > I just committed 480 and 491. You can checkout the 3.2 branch now. > > > > Thanks > > mahadev > > > > > > On 8/3/09 4:29 PM, "Todd Greenwood" <to...@audiencescience.com> wrote: > > > > > That'd be perfect. Thanks! > > > > > >> -----Original Message----- > > >> From: Mahadev Konar [mailto:maha...@yahoo-inc.com] > > >> Sent: Monday, August 03, 2009 4:24 PM > > >> To: zookeeper-u...@hadoop.apache.org > > >> Subject: Re: Unending Leader Elections in WAN deploy > > >> > > >> Hi Todd, > > >> Most of the patches that you mention should be in the branch 3.2 by > > > tomm > > >> or so. 481, 479 are already in. 480 and 491 should be in by tomm. > > > Would > > >> that > > >> suffice for you? > > >> > > >> Thanks > > >> mahadev > > >> > > >> > > >> On 8/3/09 4:21 PM, "Todd Greenwood" <to...@audiencescience.com> > wrote: > > >> > > >>> Another problem...I've reverted to the latest versions of the > > > patches > > >>> that are not specific to branch-3.2, and I'm getting two compilation > > >>> errors: > > >>> > > >>> build-generated: > > >>> [javac] Compiling 44 source files to > > >>> > > > > /home/toddg/asi/workspaces/main/Main/RSI/etc/holmes/main/zookeeper/src/p > > >>> atched/branch-3.2/build/classes > > >>> > > >>> compile-main: > > >>> [javac] Compiling 2 source files to > > >>> > > > > /home/toddg/asi/workspaces/main/Main/RSI/etc/holmes/main/zookeeper/src/p > > >>> atched/branch-3.2/build/classes > > >>> [javac] > > >>> > > > > /home/toddg/asi/workspaces/main/Main/RSI/etc/holmes/main/zookeeper/src/p > > >>> > > > atched/branch- > 3.2/src/java/main/org/apache/zookeeper/server/quorum/Quoru > > >>> mStats.java:30: name clash: getQuorumPeers() and getQuorumPeers() > > > have > > >>> the same erasure > > >>> [javac] public String[] getQuorumPeers(); > > >>> [javac] ^ > > >>> [javac] > > >>> > > > > /home/toddg/asi/workspaces/main/Main/RSI/etc/holmes/main/zookeeper/src/p > > >>> > > > atched/branch- > 3.2/src/java/main/org/apache/zookeeper/server/quorum/Quoru > > >>> mStats.java:31: name clash: getServerState() and getServerState() > > > have > > >>> the same erasure > > >>> [javac] public String getServerState(); > > >>> [javac] ^ > > >>> [javac] 2 errors > > >>> > > >>> My build process is pretty simple: > > >>> > > >>> 1. copy the branch-3.2 source to a temp directory > > >>> (src/patched/branch-3.2) > > >>> 2. apply the ZOOKEEPER patches in my patches directory > > >>> 3. build zookeeper in the temp directory > > >>> > > >>> -Todd > > >>>> -----Original Message----- > > >>>> From: Todd Greenwood [mailto:to...@audiencescience.com] > > >>>> Sent: Monday, August 03, 2009 4:09 PM > > >>>> To: zookeeper-u...@hadoop.apache.org > > >>>> Subject: RE: Unending Leader Elections in WAN deploy > > >>>> > > >>>> Flavio, > > >>>> I notice that you've updated the patches referenced for the WAN > > >>>> deployment. There appears to be an order dependency w/ respect to > > >>> these > > >>>> four patches... > > >>>> > > >>>> ZOOKEEPER-473.patch ZOOKEEPER-479-branch3.2.patch > > >>>> ZOOKEEPER-481-branch3.2.patch ZOOKEEPER-491.patch > > >>>> > > >>>> 473 -> 479 (479 fails) > > >>>> > > >>>> > > >>> > > > > to...@toddg01lt:~/asi/workspaces/main/Main/RSI/etc/holmes/main/zookeeper > > >>>> /src/patched/branch-3.2$ patch -p0 < > > >>>> ../patches/ZOOKEEPER-479-branch3.2.patch > > >>>> patching file > > >>>> > > >>> > > > > src/java/main/org/apache/zookeeper/server/quorum/flexible/QuorumHierarch > > >>>> ical.java > > >>>> patching file > > >>>> > > >>> > > > > src/java/main/org/apache/zookeeper/server/quorum/flexible/QuorumMaj.java > > >>>> patching file > > >>>> > > >>> > > > > src/java/main/org/apache/zookeeper/server/quorum/flexible/QuorumVerifier > > >>>> .java > > >>>> patching file > > >>>> src/java/test/org/apache/zookeeper/test/HierarchicalQuorumTest.java > > >>>> Hunk #1 FAILED at 93. > > >>>> Hunk #2 FAILED at 145. > > >>>> 2 out of 2 hunks FAILED -- saving rejects to file > > >>>> > > >>> > > > > src/java/test/org/apache/zookeeper/test/HierarchicalQuorumTest.java.rej > > >>>> > > >>> > > > > to...@toddg01lt:~/asi/workspaces/main/Main/RSI/etc/holmes/main/zookeeper > > >>>> /src/patched/branch-3.2$ h ../patches/ > > >>>> > > >>>> Could you advise as to which patches I need to apply, and in what > > >>> order? > > >>>> > > >>>> -Todd > > >>>> > > >>>>> -----Original Message----- > > >>>>> From: Flavio Junqueira [mailto:f...@yahoo-inc.com] > > >>>>> Sent: Friday, July 31, 2009 9:51 PM > > >>>>> To: zookeeper-u...@hadoop.apache.org > > >>>>> Subject: Re: Unending Leader Elections in WAN deploy > > >>>>> > > >>>>> Perfect! Thanks for the update, Todd. > > >>>>> > > >>>>> -Flavio > > >>>>> > > >>>>> On Jul 31, 2009, at 8:17 PM, Todd Greenwood wrote: > > >>>>> > > >>>>>> Thanks. You were right, I had a stale version of 479. Compilation > > >>>>>> succeeds and all tests pass on branch-3.2 with the latest patches > > >>>> 473, > > >>>>>> 479, 481, and 491. > > >>>>>> > > >>>>>> -Todd > > >>>>>> > > >>>>>>> -----Original Message----- > > >>>>>>> From: Flavio Junqueira [mailto:f...@yahoo-inc.com] > > >>>>>>> Sent: Friday, July 31, 2009 7:48 PM > > >>>>>>> To: zookeeper-u...@hadoop.apache.org > > >>>>>>> Subject: Re: Unending Leader Elections in WAN deploy > > >>>>>>> > > >>>>>>> It should be in 479. Perhaps you have a stale version of the > > >>> patch. > > >>>>>>> > > >>>>>>> -Flavio > > >>>>>>> > > >>>>>>> On Jul 31, 2009, at 7:46 PM, Todd Greenwood wrote: > > >>>>>>> > > >>>>>>>> Flavio, > > >>>>>>>> > > >>>>>>>> I'm getting a compilation error for patch 491: > > >>>>>>>> > > >>>>>>>> compile-main: > > >>>>>>>> [javac] Compiling 1 source file to > > >>>>>>>> > > >>>> /home/toddg/asi/workspaces/main/Main/RSI/etc/holmes/main/zookeeper/ > > >>>>>>>> src/p > > >>>>>>>> atched/branch-3.2/build/classes > > >>>>>>>> [javac] > > >>>>>>>> > > >>>> /home/toddg/asi/workspaces/main/Main/RSI/etc/holmes/main/zookeeper/ > > >>>>>>>> src/p > > >>>>>>>> > > >>>> atched/branch-3.2/src/java/main/org/apache/zookeeper/server/quorum/ > > >>>>>>>> FastL > > >>>>>>>> eaderElection.java:601: cannot find symbol > > >>>>>>>> [javac] symbol : method getWeight(long) > > >>>>>>>> [javac] location: interface > > >>>>>>>> org.apache.zookeeper.server.quorum.flexible.QuorumVerifier > > >>>>>>>> [javac] > > >>>>>>>> if(self.getQuorumVerifier().getWeight(n.sid) != 0) > > >>>>>>>> [javac] ^ > > >>>>>>>> [javac] 1 error > > >>>>>>>> > > >>>>>>>> I see a reference to getWeight in both FastLeaderElection.java > > >>> in > > >>>>>>>> patch > > >>>>>>>> 491: > > >>>>>>>> > > >>>>>>>> patches/ZOOKEEPER-491.patch:+ > > >>>>>>>> if(self.getQuorumVerifier().getWeight(n.sid) != 0) > > >>>>>>>> src/java/main/org/apache/zookeeper/server/quorum/ > > >>>>>>>> FastLeaderElection.java > > >>>>>>>> : > > >>>>>>>> if(self.getQuorumVerifier().getWeight(n.sid) != > > >>>>>>>> 0) > > >>>>>>>> > > >>>>>>>> However, I don't see a reference to this method in patches 473, > > >>>> 479, > > >>>>>>>> or > > >>>>>>>> 481. I also don't see a reference to this method in the > > > trunk... > > >>>>>>>> > > >>>>>>>> -Todd > > >>>>>>>> > > >>>>>>>>> -----Original Message----- > > >>>>>>>>> From: Todd Greenwood [mailto:to...@audiencescience.com] > > >>>>>>>>> Sent: Friday, July 31, 2009 7:30 PM > > >>>>>>>>> To: zookeeper-u...@hadoop.apache.org > > >>>>>>>>> Subject: RE: Unending Leader Elections in WAN deploy > > >>>>>>>>> > > >>>>>>>>> Ok, I'll apply that patch and report back. > > >>>>>>>>> -Todd > > >>>>>>>>> > > >>>>>>>>>> -----Original Message----- > > >>>>>>>>>> From: Flavio Junqueira [mailto:f...@yahoo-inc.com] > > >>>>>>>>>> Sent: Friday, July 31, 2009 7:18 PM > > >>>>>>>>>> To: zookeeper-u...@hadoop.apache.org > > >>>>>>>>>> Subject: Re: Unending Leader Elections in WAN deploy > > >>>>>>>>>> > > >>>>>>>>>> You're missing 491 from your set of patches. > > >>>>>>>>>> > > >>>>>>>>>> -Flavio > > >>>>>>>>>> > > >>>>>>>>>> On Jul 31, 2009, at 7:15 PM, Todd Greenwood wrote: > > >>>>>>>>>> > > >>>>>>>>>>> This repro's in both branch-3.2, and branch-3.2+patches(473, > > >>>> 479, > > >>>>>>>>>>> 481). > > >>>>>>>>>>> > > >>>>>>>>>>> Basically, it seems like the nodes are electing pd4-zook02 > > > to > > >>>> be > > >>>>>>>> the > > >>>>>>>>>>> leader. However, pd4-zook02 seems to realize it's not > > >>> supposed > > >>>> to > > >>>>>>>> be > > >>>>>>>>>>> and > > >>>>>>>>>>> then disconnects everyone. Then they re-elect it again, and > > >>> it > > >>>>>>>> loops > > >>>>>>>>>>> over and over. > > >>>>>>>>>>> > > >>>>>>>>>>> ------------- > > >>>>>>>>>>> Server config > > >>>>>>>>>>> ------------- > > >>>>>>>>>>> > > >>>>>>>>>>> server.1=dc1-zook01.dc01.revsci.net:2888:3888 > > >>>>>>>>>>> server.2=dc1-zook02.dc01.revsci.net:2888:3888 > > >>>>>>>>>>> server.3=dc1-zook03.dc01.revsci.net:2888:3888 > > >>>>>>>>>>> server.4=dc1-zook04.dc01.revsci.net:2888:3888 > > >>>>>>>>>>> server.5=dc1-zook05.dc01.revsci.net:2888:3888 > > >>>>>>>>>>> server.6=pd1-zook01.pd01.revsci.net:2888:3888 > > >>>>>>>>>>> server.7=pd1-zook02.pd01.revsci.net:2888:3888 > > >>>>>>>>>>> server.8=pd4-zook01.iad1.audsci.net:2888:3888 > > >>>>>>>>>>> server.9=pd4-zook02.iad1.audsci.net:2888:3888 > > >>>>>>>>>>> > > >>>>>>>>>>> group.1:1:2:3:4:5 > > >>>>>>>>>>> weight.1=1 > > >>>>>>>>>>> weight.2=1 > > >>>>>>>>>>> weight.3=1 > > >>>>>>>>>>> weight.4=1 > > >>>>>>>>>>> weight.5=1 > > >>>>>>>>>>> > > >>>>>>>>>>> group.2:6:7:8:9 > > >>>>>>>>>>> weight.6=0 > > >>>>>>>>>>> weight.7=0 > > >>>>>>>>>>> weight.8=0 > > >>>>>>>>>>> weight.9=0 > > >>>>>>>>>>> > > >>>>>>>>>>> Note that we have 2 groups, composed of machines in 3 > > >>> different > > >>>>>>>>>>> locations (dc1, pd1, and pd4). The idea is that only > > > machines > > >>>> in > > >>>>>>>> dc1 > > >>>>>>>>>>> have voting rights, and the ability to become a leader. The > > >>>>>>>> machines > > >>>>>>>>>>> in > > >>>>>>>>>>> the pods all have a weight of zero, and are not expected to > > >>>>>> become > > >>>>>>>>>>> leaders, or to vote on transactions. > > >>>>>>>>>>> > > >>>>>>>>>>> Let me know what I can do to help resolve this issue. > > >>>>>>>>>>> > > >>>>>>>>>>> -Todd > > >>>>>>>> > > >>>>>> > > >>> > > >