Build failed in Hudson: ZooKeeper-trunk #269

2009-04-02 Thread Apache Hudson Server
See http://hudson.zones.apache.org/hudson/job/ZooKeeper-trunk/269/changes

Changes:

[mahadev] ZOOKEEPER-305.  Replace timers with semaphores in FLENewEpochTest. 
(flavio via mahadev)

[mahadev] ZOOKEEPER-288. Cleanup and fixes to BookKeeper (flavio via mahadev)

--
[...truncated 54918 lines...]
[junit] 2009-04-02 12:16:00,828 - INFO  [main:finalrequestproces...@268] - 
shutdown of request processor complete
[junit] 2009-04-02 12:16:00,828 - INFO  
[SyncThread:0:syncrequestproces...@119] - SyncRequestProcessor exited!
[junit] 2009-04-02 12:16:00,828 - INFO  
[ProcessThread:-1:preprequestproces...@111] - PrepRequestProcessor exited loop!
[junit] 2009-04-02 12:16:00,928 - INFO  [main:clientb...@306] - STARTING 
server
[junit] 2009-04-02 12:16:00,928 - INFO  [main:zookeeperser...@160] - 
Created server
[junit] 2009-04-02 12:16:00,929 - INFO  [main:files...@71] - Reading 
snapshot 
http://hudson.zones.apache.org/hudson/job/ZooKeeper-trunk/ws/trunk/build/test/tmp/test7123064128084596242.junit.dir/version-2/snapshot.0
 
[junit] 2009-04-02 12:16:00,930 - INFO  [main:filetxnsnap...@198] - 
Snapshotting: 3
[junit] 2009-04-02 12:16:00,932 - INFO  
[NIOServerCxn.Factory:33221:nioserverc...@635] - Processing stat command from 
/127.0.0.1:51705
[junit] 2009-04-02 12:16:00,933 - WARN  
[NIOServerCxn.Factory:33221:nioserverc...@431] - Exception causing close of 
session 0x0 due to java.io.IOException: Responded to info probe
[junit] 2009-04-02 12:16:00,934 - INFO  
[NIOServerCxn.Factory:33221:nioserverc...@766] - closing session:0x0 
NIOServerCnxn: java.nio.channels.SocketChannel[connected local=/127.0.0.1:33221 
remote=/127.0.0.1:51705]
[junit] 2009-04-02 12:16:02,224 - INFO  
[main-SendThread:clientcnxn$sendthr...@800] - Attempting connection to server 
/127.0.0.1:33221
[junit] 2009-04-02 12:16:02,224 - INFO  
[main-SendThread:clientcnxn$sendthr...@716] - Priming connection to 
java.nio.channels.SocketChannel[connected local=/127.0.0.1:51706 
remote=/127.0.0.1:33221]
[junit] 2009-04-02 12:16:02,225 - INFO  
[main-SendThread:clientcnxn$sendthr...@868] - Server connection successful
[junit] 2009-04-02 12:16:02,225 - INFO  
[NIOServerCxn.Factory:33221:nioserverc...@517] - Connected to /127.0.0.1:51706 
lastZxid 3
[junit] 2009-04-02 12:16:02,226 - INFO  
[NIOServerCxn.Factory:33221:nioserverc...@895] - Finished init of 
0x12066c1d72c valid:true
[junit] 2009-04-02 12:16:02,226 - INFO  
[NIOServerCxn.Factory:33221:nioserverc...@545] - Renewing session 
0x12066c1d72c
[junit] 2009-04-02 12:16:03,000 - INFO  
[SessionTracker:sessiontrackeri...@142] - SessionTrackerImpl exited loop!
[junit] 2009-04-02 12:16:03,000 - INFO  
[SessionTracker:sessiontrackeri...@142] - SessionTrackerImpl exited loop!
[junit] 2009-04-02 12:16:36,231 - INFO  [main:clientb...@300] - STOPPING 
server
[junit] 2009-04-02 12:16:36,232 - INFO  [main:nioserverc...@766] - closing 
session:0x12066c1d72c NIOServerCnxn: 
java.nio.channels.SocketChannel[connected local=/127.0.0.1:33221 
remote=/127.0.0.1:51706]
[junit] 2009-04-02 12:16:36,232 - WARN  
[main-SendThread:clientcnxn$sendthr...@898] - Exception closing session 
0x12066c1d72c to sun.nio.ch.selectionkeyi...@93df2c
[junit] java.io.IOException: Read error rc = -1 
java.nio.DirectByteBuffer[pos=0 lim=4 cap=4]
[junit] at 
org.apache.zookeeper.ClientCnxn$SendThread.doIO(ClientCnxn.java:632)
[junit] at 
org.apache.zookeeper.ClientCnxn$SendThread.run(ClientCnxn.java:876)
[junit] 2009-04-02 12:16:36,233 - INFO  
[NIOServerCxn.Factory:33221:nioservercnxn$fact...@177] - NIOServerCnxn factory 
exited run method
[junit] 2009-04-02 12:16:36,233 - INFO  [main:finalrequestproces...@268] - 
shutdown of request processor complete
[junit] 2009-04-02 12:16:36,233 - INFO  
[ProcessThread:-1:preprequestproces...@111] - PrepRequestProcessor exited loop!
[junit] 2009-04-02 12:16:36,233 - INFO  
[SyncThread:0:syncrequestproces...@119] - SyncRequestProcessor exited!
[junit] 2009-04-02 12:16:36,333 - INFO  [main:clientb...@306] - STARTING 
server
[junit] 2009-04-02 12:16:36,333 - INFO  [main:zookeeperser...@160] - 
Created server
[junit] 2009-04-02 12:16:36,334 - INFO  [main:files...@71] - Reading 
snapshot 
http://hudson.zones.apache.org/hudson/job/ZooKeeper-trunk/ws/trunk/build/test/tmp/test7123064128084596242.junit.dir/version-2/snapshot.3
 
[junit] 2009-04-02 12:16:36,336 - INFO  [main:filetxnsnap...@198] - 
Snapshotting: 5
[junit] 2009-04-02 12:16:36,338 - INFO  
[NIOServerCxn.Factory:33221:nioserverc...@635] - Processing stat command from 
/127.0.0.1:51708
[junit] 2009-04-02 12:16:36,339 - WARN  
[NIOServerCxn.Factory:33221:nioserverc...@431] - Exception causing close of 
session 0x0 due to java.io.IOException: Responded to info probe
[junit] 2009-04-02 12:16:36,348 - INFO  
[NIOServerCxn.Factory:33221:nioserverc...@766] - 

[jira] Created: (ZOOKEEPER-360) WeakHashMap in Bookie.java causes NPE

2009-04-02 Thread Flavio Paiva Junqueira (JIRA)
WeakHashMap in Bookie.java causes NPE
-

 Key: ZOOKEEPER-360
 URL: https://issues.apache.org/jira/browse/ZOOKEEPER-360
 Project: Zookeeper
  Issue Type: Bug
  Components: contrib-bookkeeper
Affects Versions: 3.1.1
Reporter: Flavio Paiva Junqueira
Assignee: Flavio Paiva Junqueira
 Fix For: 3.2.0


We need a strong reference to prevent a key in masterKeys on Bookie.java to be 
garbage collected.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Updated: (ZOOKEEPER-360) WeakHashMap in Bookie.java causes NPE

2009-04-02 Thread Flavio Paiva Junqueira (JIRA)

 [ 
https://issues.apache.org/jira/browse/ZOOKEEPER-360?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Flavio Paiva Junqueira updated ZOOKEEPER-360:
-

Attachment: ZOOKEEPER-BOOKKEEPER-360.patch

This patch gets rid of the masterKeys WeakHashMap in Bookie.java and adds an 
attribute to LedgerDescriptor to keep it.

 WeakHashMap in Bookie.java causes NPE
 -

 Key: ZOOKEEPER-360
 URL: https://issues.apache.org/jira/browse/ZOOKEEPER-360
 Project: Zookeeper
  Issue Type: Bug
  Components: contrib-bookkeeper
Affects Versions: 3.1.1
Reporter: Flavio Paiva Junqueira
Assignee: Flavio Paiva Junqueira
 Fix For: 3.2.0

 Attachments: ZOOKEEPER-BOOKKEEPER-360.patch


 We need a strong reference to prevent a key in masterKeys on Bookie.java to 
 be garbage collected.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Commented: (ZOOKEEPER-360) WeakHashMap in Bookie.java causes NPE

2009-04-02 Thread Benjamin Reed (JIRA)

[ 
https://issues.apache.org/jira/browse/ZOOKEEPER-360?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12695056#action_12695056
 ] 

Benjamin Reed commented on ZOOKEEPER-360:
-

+1 looks good

 WeakHashMap in Bookie.java causes NPE
 -

 Key: ZOOKEEPER-360
 URL: https://issues.apache.org/jira/browse/ZOOKEEPER-360
 Project: Zookeeper
  Issue Type: Bug
  Components: contrib-bookkeeper
Affects Versions: 3.1.1
Reporter: Flavio Paiva Junqueira
Assignee: Flavio Paiva Junqueira
 Fix For: 3.2.0

 Attachments: ZOOKEEPER-BOOKKEEPER-360.patch


 We need a strong reference to prevent a key in masterKeys on Bookie.java to 
 be garbage collected.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



Re: ZooKeeper Perl module

2009-04-02 Thread Patrick Hunt
Hey Chris this is really great! Thanks for making it available to the 
community, very cool.


Patrick

Chris Darroch wrote:

Hi --

  The http://wiki.apache.org/hadoop/ZooKeeper page includes the
comment that someday we hope to get Python, Perl, and REST interfaces.
I hope I can help with one item from that list now, at least.  I recently
put together a Perl module named Net::ZooKeeper which is now available
on CPAN:

http://cpan.org/modules/by-category/05_Networking_Devices_IPC/Net/Net-ZooKeeper-0.32.tar.gz 


http://search.cpan.org/~cdarroch/Net-ZooKeeper-0.32/ZooKeeper.pm

  Modelled on the DBI module, it provides an interface to ZooKeeper
through the synchronous C API functions, e.g.:

   my $zkh = Net::ZooKeeper-new('localhost:7000');

   my $ret = $zkh-set('/foo', 'baz');

  Net::ZooKeeper currently requires ZooKeeper 3.1.1 (or at least
that version of the C API code) and Perl 5.8.8 or up, including 5.10.x.

  The test suite is reasonably complete, I think, and covers a
fair bit of ground.  I've found it useful for testing the ZooKeeper
C API as well as learning more than I wanted to know about
XS programming.

  I've licensed the module under the Apache license 2.0 so it should
be compatible with ZooKeeper itself if there's interest in including
it under src/contribs.

  For those who ask why Perl 5 and not Rakudo/Ruby/Lua/Python/
[insert cool new dynamic language here], the answer is just that I
needed an old-style Perl module first.  (As a thought experiment,
though, I wonder if one could write a Parrot extension that communicated
directly with ZooKeeper, handled the ping requests internally via
a Parrot scheduler/thread/whatever, and didn't need the C API at all.
You could support any language running on Parrot with that.  Well,
maybe in a few years, anyway.  :-)

  In the meantime, please report any suggestions or bugs to
me -- thanks!

Chris.



[Fwd: [GSoC 2009] Apache is officially participating in Google Summer of Code 2009]

2009-04-02 Thread Patrick Hunt

FYI. If anyone has ideas for ZooKeeper that might fit GSoC.

Patrick
---BeginMessage---
Dear PMC,

It's now official, Google has announced that The Apache Software
Foundation was selected as one of the participating organization in
the GSoC 2009 program. This is an excellent opportunity for Apache, as
it allows projects to build/increase its community, it also help
students to learn about open source, about Apache Software Foundation
and how to do things the Apache Way.

Please start advertising the program and discussing project ideas with
your project community. Then add your project ideas to the ASF GSoC
2009 Official Project Ideas page [1], as this is going to be the most
visible place for students. Per GSoC 2009 timeline, Students should
officially start applying for these ideas on March 23.

ASF Members and committers can volunteer to be mentors in the GSoC
2009 program (see [2] for more details about being a mentor). We
invite all mentors and GSoC interested parties to subscribe to the
code-awa...@a.o mailing list [3] to coordinate work between mentors,
ask questions and get updates related to GSoC 2009 program.

Please, feel free to forward this announce to any appropriate dev@ or
users@ lists so your larger community can hear about the GSoC 2009
program.

[1] http://wiki.apache.org/general/SummerOfCode2009
[2] http://wiki.apache.org/general/SummerOfCodeMentor
[3] mailto:code-awards-subscr...@apache.org


-- 
Luciano Resende
http://people.apache.org/~lresende
http://lresende.blogspot.com/
---End Message---


[jira] Commented: (ZOOKEEPER-360) WeakHashMap in Bookie.java causes NPE

2009-04-02 Thread Hadoop QA (JIRA)

[ 
https://issues.apache.org/jira/browse/ZOOKEEPER-360?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12695134#action_12695134
 ] 

Hadoop QA commented on ZOOKEEPER-360:
-

-1 overall.  Here are the results of testing the latest attachment 
  
http://issues.apache.org/jira/secure/attachment/12404469/ZOOKEEPER-BOOKKEEPER-360.patch
  against trunk revision 761126.

+1 @author.  The patch does not contain any @author tags.

-1 tests included.  The patch doesn't appear to include any new or modified 
tests.
Please justify why no tests are needed for this patch.

+1 javadoc.  The javadoc tool did not generate any warning messages.

+1 javac.  The applied patch does not increase the total number of javac 
compiler warnings.

+1 findbugs.  The patch does not introduce any new Findbugs warnings.

+1 release audit.  The applied patch does not increase the total number of 
release audit warnings.

+1 core tests.  The patch passed core unit tests.

+1 contrib tests.  The patch passed contrib unit tests.

Test results: 
http://hudson.zones.apache.org/hudson/job/Zookeeper-Patch-vesta.apache.org/11/testReport/
Findbugs warnings: 
http://hudson.zones.apache.org/hudson/job/Zookeeper-Patch-vesta.apache.org/11/artifact/trunk/build/test/findbugs/newPatchFindbugsWarnings.html
Console output: 
http://hudson.zones.apache.org/hudson/job/Zookeeper-Patch-vesta.apache.org/11/console

This message is automatically generated.

 WeakHashMap in Bookie.java causes NPE
 -

 Key: ZOOKEEPER-360
 URL: https://issues.apache.org/jira/browse/ZOOKEEPER-360
 Project: Zookeeper
  Issue Type: Bug
  Components: contrib-bookkeeper
Affects Versions: 3.1.1
Reporter: Flavio Paiva Junqueira
Assignee: Flavio Paiva Junqueira
 Fix For: 3.2.0

 Attachments: ZOOKEEPER-BOOKKEEPER-360.patch


 We need a strong reference to prevent a key in masterKeys on Bookie.java to 
 be garbage collected.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Commented: (ZOOKEEPER-344) doIO in NioServerCnxn: Exception causing close of session : cause is read error

2009-04-02 Thread bryan thompson (JIRA)

[ 
https://issues.apache.org/jira/browse/ZOOKEEPER-344?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12695136#action_12695136
 ] 

bryan thompson commented on ZOOKEEPER-344:
--

I am not sure how to boil this down into a problem which can be run on a single 
machine.  This is a distributed database benchmark.  The problem shows up when 
the cluster is under load.  How would I go about isolating that further outside 
of writing stress tests for zookeeper?

If this is indeed a zookeeper bug and you have some idea of the possible issues 
involved, then perhaps you can suggest some additional instrumentation of 
zookeeper and I could run against a version with more instrumentation which 
might reveal something?

Thanks,

-bryan


 doIO in NioServerCnxn: Exception causing close of session : cause is read 
 error
 -

 Key: ZOOKEEPER-344
 URL: https://issues.apache.org/jira/browse/ZOOKEEPER-344
 Project: Zookeeper
  Issue Type: Bug
  Components: java client, server
Affects Versions: 3.1.0
 Environment: jdk1.6.0_07
 Linux blade2 2.6.27.7-134.fc10.x86_64 #1 SMP Mon Dec 1 22:21:35 EST 2008 
 x86_64 x86_64 x86_64 GNU/Linux
Reporter: bryan thompson
 Fix For: 3.2.0


 I have been having a problem with zookeeper 3.0.1 and now with 3.1.0 where I 
 see a lot of expired sessions.  I am using a 16 node cluster which is all on 
 the same local network.  There is a single zookeeper instance (these are 
 benchmarking runs).
 The problem appears to be correlated with either run time or system load.\
 Personally I think that it is system load because I have session session 
 expired events under a Windows platform running zookeeper and the application 
 (i.e., everthing is local) when the application load suddenly spikes.  To me 
 this suggests that the client is not able to renew (ping) the zookeeper 
 service in a timely manner and is expired.  But the log messages below with 
 the read error suggest that maybe there is something else going on?
 Zookeeper Configuration
 #Wed Mar 18 12:41:05 GMT-05:00 2009
 clientPort=2181
 dataDir=/var/bigdata/benchmark/zookeeper/1
 syncLimit=2
 dataLogDir=/var/bigdata/benchmark/zookeeper/1
 tickTime=2000
 Some representative log messages are below.
 Client side messages (from our app)
 ERROR [main-EventThread] 
 com.bigdata.zookeeper.ZLockImpl$ZLockWatcher.process(ZLockImpl.java:400) 
 2009-03-18 13:35:40,335 - Session expired: WatchedEvent: Server state change. 
 New state: Expired : 
 zpath=/benchmark/jobs/com.bigdata.service.jini.benchmark.ThroughputMaster/test_1/client1160/locknode
 ERROR [main-EventThread] 
 com.bigdata.zookeeper.ZLockImpl$ZLockWatcher.process(ZLockImpl.java:400) 
 2009-03-18 13:35:40,335 - Session expired: WatchedEvent: Server state change. 
 New state: Expired : 
 zpath=/benchmark/jobs/com.bigdata.service.jini.benchmark.ThroughputMaster/test_1/client1356/locknode
 Server side messages:
  WARN [NIOServerCxn.Factory:2181] 
 org.apache.zookeeper.server.NIOServerCnxn.doIO(NIOServerCnxn.java:417) 
 2009-03-18 13:06:57,252 - Exception causing close of session 
 0x1201aac14300022 due to java.io.IOException: Read error
  WARN [NIOServerCxn.Factory:2181] 
 org.apache.zookeeper.server.NIOServerCnxn.doIO(NIOServerCnxn.java:417) 
 2009-03-18 13:06:58,198 - Exception causing close of session 
 0x1201aac143f due to java.io.IOException: Read error

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Commented: (ZOOKEEPER-360) WeakHashMap in Bookie.java causes NPE

2009-04-02 Thread Flavio Paiva Junqueira (JIRA)

[ 
https://issues.apache.org/jira/browse/ZOOKEEPER-360?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12695138#action_12695138
 ] 

Flavio Paiva Junqueira commented on ZOOKEEPER-360:
--

The part of the bookie code changed with this patch was already exercised 
before on tests, and the reason why they didn't catch the bug is because the 
error is caused by garbage collected entries in masterKeys (WeakHashMap). I 
have removed this data structure in this patch, so that there is no more such 
garbarge collected entries. The code path is still exercised by unit tests, 
though. 

 WeakHashMap in Bookie.java causes NPE
 -

 Key: ZOOKEEPER-360
 URL: https://issues.apache.org/jira/browse/ZOOKEEPER-360
 Project: Zookeeper
  Issue Type: Bug
  Components: contrib-bookkeeper
Affects Versions: 3.1.1
Reporter: Flavio Paiva Junqueira
Assignee: Flavio Paiva Junqueira
 Fix For: 3.2.0

 Attachments: ZOOKEEPER-BOOKKEEPER-360.patch


 We need a strong reference to prevent a key in masterKeys on Bookie.java to 
 be garbage collected.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Updated: (ZOOKEEPER-361) integrate cppunit testing as part of hudson patch process.

2009-04-02 Thread Mahadev konar (JIRA)

 [ 
https://issues.apache.org/jira/browse/ZOOKEEPER-361?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Mahadev konar updated ZOOKEEPER-361:


Component/s: build

 integrate cppunit testing as part of hudson patch process.
 --

 Key: ZOOKEEPER-361
 URL: https://issues.apache.org/jira/browse/ZOOKEEPER-361
 Project: Zookeeper
  Issue Type: New Feature
  Components: build
Affects Versions: 3.0.0, 3.0.1, 3.1.0, 3.1.1
Reporter: Mahadev konar
Assignee: Giridharan Kesavan

 we need to test the c tests as part of our hudson patch testing process.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Commented: (ZOOKEEPER-60) Get cppunit tests running as part of Hudson CI

2009-04-02 Thread Mahadev konar (JIRA)

[ 
https://issues.apache.org/jira/browse/ZOOKEEPER-60?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12695142#action_12695142
 ] 

Mahadev konar commented on ZOOKEEPER-60:


I opened another jira for intergrating this patch into the hudson patch testing 
for all the patches ZOOKEEPER-361.

 Get cppunit tests running as part of Hudson CI
 --

 Key: ZOOKEEPER-60
 URL: https://issues.apache.org/jira/browse/ZOOKEEPER-60
 Project: Zookeeper
  Issue Type: Improvement
  Components: build
Affects Versions: 3.0.0, 3.0.1, 3.1.0, 3.1.1
Reporter: Patrick Hunt
Assignee: Giridharan Kesavan
 Fix For: 3.2.0

 Attachments: ZK-60.patch, ZK-60.patch, ZK-60.patch, ZOOKEEPER-60.patch


 Investigate if it is possible to run cppunit tests as part of Hudson.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Updated: (ZOOKEEPER-60) Get cppunit tests running as part of Hudson CI

2009-04-02 Thread Mahadev konar (JIRA)

 [ 
https://issues.apache.org/jira/browse/ZOOKEEPER-60?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Mahadev konar updated ZOOKEEPER-60:
---

  Resolution: Fixed
Release Note: Get cppunit tests running from ant.
Hadoop Flags: [Reviewed]
  Status: Resolved  (was: Patch Available)

+1 for the patch ... I just committed this. thanks giri.

 Get cppunit tests running as part of Hudson CI
 --

 Key: ZOOKEEPER-60
 URL: https://issues.apache.org/jira/browse/ZOOKEEPER-60
 Project: Zookeeper
  Issue Type: Improvement
  Components: build
Affects Versions: 3.0.0, 3.0.1, 3.1.0, 3.1.1
Reporter: Patrick Hunt
Assignee: Giridharan Kesavan
 Fix For: 3.2.0

 Attachments: ZK-60.patch, ZK-60.patch, ZK-60.patch, ZOOKEEPER-60.patch


 Investigate if it is possible to run cppunit tests as part of Hudson.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Commented: (ZOOKEEPER-344) doIO in NioServerCnxn: Exception causing close of session : cause is read error

2009-04-02 Thread Mahadev konar (JIRA)

[ 
https://issues.apache.org/jira/browse/ZOOKEEPER-344?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12695148#action_12695148
 ] 

Mahadev konar commented on ZOOKEEPER-344:
-

brayn,
 the one thing you can do is run with tracefile option in the config. Please 
take a look at
http://hadoop.apache.org/zookeeper/docs/r3.1.1/zookeeperAdmin.html to see how 
to set up a tracefile. The tracefile has logs of all the transactions that go 
though the server and all 
the components of the server and helps in debugging problems such as these, 
where you can point out which transaction got delayed and at what time and 
sometimes can point out
the reason why... 

 doIO in NioServerCnxn: Exception causing close of session : cause is read 
 error
 -

 Key: ZOOKEEPER-344
 URL: https://issues.apache.org/jira/browse/ZOOKEEPER-344
 Project: Zookeeper
  Issue Type: Bug
  Components: java client, server
Affects Versions: 3.1.0
 Environment: jdk1.6.0_07
 Linux blade2 2.6.27.7-134.fc10.x86_64 #1 SMP Mon Dec 1 22:21:35 EST 2008 
 x86_64 x86_64 x86_64 GNU/Linux
Reporter: bryan thompson
 Fix For: 3.2.0


 I have been having a problem with zookeeper 3.0.1 and now with 3.1.0 where I 
 see a lot of expired sessions.  I am using a 16 node cluster which is all on 
 the same local network.  There is a single zookeeper instance (these are 
 benchmarking runs).
 The problem appears to be correlated with either run time or system load.\
 Personally I think that it is system load because I have session session 
 expired events under a Windows platform running zookeeper and the application 
 (i.e., everthing is local) when the application load suddenly spikes.  To me 
 this suggests that the client is not able to renew (ping) the zookeeper 
 service in a timely manner and is expired.  But the log messages below with 
 the read error suggest that maybe there is something else going on?
 Zookeeper Configuration
 #Wed Mar 18 12:41:05 GMT-05:00 2009
 clientPort=2181
 dataDir=/var/bigdata/benchmark/zookeeper/1
 syncLimit=2
 dataLogDir=/var/bigdata/benchmark/zookeeper/1
 tickTime=2000
 Some representative log messages are below.
 Client side messages (from our app)
 ERROR [main-EventThread] 
 com.bigdata.zookeeper.ZLockImpl$ZLockWatcher.process(ZLockImpl.java:400) 
 2009-03-18 13:35:40,335 - Session expired: WatchedEvent: Server state change. 
 New state: Expired : 
 zpath=/benchmark/jobs/com.bigdata.service.jini.benchmark.ThroughputMaster/test_1/client1160/locknode
 ERROR [main-EventThread] 
 com.bigdata.zookeeper.ZLockImpl$ZLockWatcher.process(ZLockImpl.java:400) 
 2009-03-18 13:35:40,335 - Session expired: WatchedEvent: Server state change. 
 New state: Expired : 
 zpath=/benchmark/jobs/com.bigdata.service.jini.benchmark.ThroughputMaster/test_1/client1356/locknode
 Server side messages:
  WARN [NIOServerCxn.Factory:2181] 
 org.apache.zookeeper.server.NIOServerCnxn.doIO(NIOServerCnxn.java:417) 
 2009-03-18 13:06:57,252 - Exception causing close of session 
 0x1201aac14300022 due to java.io.IOException: Read error
  WARN [NIOServerCxn.Factory:2181] 
 org.apache.zookeeper.server.NIOServerCnxn.doIO(NIOServerCnxn.java:417) 
 2009-03-18 13:06:58,198 - Exception causing close of session 
 0x1201aac143f due to java.io.IOException: Read error

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Commented: (ZOOKEEPER-60) Get cppunit tests running as part of Hudson CI

2009-04-02 Thread Giridharan Kesavan (JIRA)

[ 
https://issues.apache.org/jira/browse/ZOOKEEPER-60?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12695205#action_12695205
 ] 

Giridharan Kesavan commented on ZOOKEEPER-60:
-

Thanks again for you help on this patch.


 Get cppunit tests running as part of Hudson CI
 --

 Key: ZOOKEEPER-60
 URL: https://issues.apache.org/jira/browse/ZOOKEEPER-60
 Project: Zookeeper
  Issue Type: Improvement
  Components: build
Affects Versions: 3.0.0, 3.0.1, 3.1.0, 3.1.1
Reporter: Patrick Hunt
Assignee: Giridharan Kesavan
 Fix For: 3.2.0

 Attachments: ZK-60.patch, ZK-60.patch, ZK-60.patch, ZOOKEEPER-60.patch


 Investigate if it is possible to run cppunit tests as part of Hudson.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.