ZooKeeper_branch34_solaris - Build # 753 - Failure

2013-12-30 Thread Apache Jenkins Server
See https://builds.apache.org/job/ZooKeeper_branch34_solaris/753/

###
## LAST 60 LINES OF THE CONSOLE 
###
[...truncated 146749 lines...]
[junit] 2013-12-30 08:09:06,121 [myid:] - INFO  [main:JMXEnv@105] - 
expect:InMemoryDataTree
[junit] 2013-12-30 08:09:06,121 [myid:] - INFO  [main:JMXEnv@108] - 
found:InMemoryDataTree 
org.apache.ZooKeeperService:name0=StandaloneServer_port-1,name1=InMemoryDataTree
[junit] 2013-12-30 08:09:06,121 [myid:] - INFO  [main:JMXEnv@105] - 
expect:StandaloneServer_port
[junit] 2013-12-30 08:09:06,121 [myid:] - INFO  [main:JMXEnv@108] - 
found:StandaloneServer_port 
org.apache.ZooKeeperService:name0=StandaloneServer_port-1
[junit] 2013-12-30 08:09:06,121 [myid:] - INFO  [main:ClientBase@420] - 
STOPPING server
[junit] 2013-12-30 08:09:06,122 [myid:] - INFO  [main:ZooKeeperServer@441] 
- shutting down
[junit] 2013-12-30 08:09:06,122 [myid:] - INFO  
[main:SessionTrackerImpl@225] - Shutting down
[junit] 2013-12-30 08:09:06,122 [myid:] - INFO  
[main:PrepRequestProcessor@761] - Shutting down
[junit] 2013-12-30 08:09:06,122 [myid:] - INFO  
[main:SyncRequestProcessor@209] - Shutting down
[junit] 2013-12-30 08:09:06,122 [myid:] - INFO  [ProcessThread(sid:0 
cport:-1)::PrepRequestProcessor@143] - PrepRequestProcessor exited loop!
[junit] 2013-12-30 08:09:06,122 [myid:] - INFO  
[SyncThread:0:SyncRequestProcessor@187] - SyncRequestProcessor exited!
[junit] 2013-12-30 08:09:06,123 [myid:] - INFO  
[main:FinalRequestProcessor@415] - shutdown of request processor complete
[junit] 2013-12-30 08:09:06,123 [myid:] - INFO  
[main:FourLetterWordMain@43] - connecting to 127.0.0.1 11221
[junit] 2013-12-30 08:09:06,124 [myid:] - INFO  [main:JMXEnv@135] - 
ensureOnly:[]
[junit] 2013-12-30 08:09:06,125 [myid:] - INFO  [main:ClientBase@413] - 
STARTING server
[junit] 2013-12-30 08:09:06,125 [myid:] - INFO  [main:ZooKeeperServer@162] 
- Created server with tickTime 3000 minSessionTimeout 6000 maxSessionTimeout 
6 datadir 
/zonestorage/hudson_solaris/home/hudson/hudson-slave/workspace/ZooKeeper_branch34_solaris/trunk/build/test/tmp/test4318699480841147001.junit.dir/version-2
 snapdir 
/zonestorage/hudson_solaris/home/hudson/hudson-slave/workspace/ZooKeeper_branch34_solaris/trunk/build/test/tmp/test4318699480841147001.junit.dir/version-2
[junit] 2013-12-30 08:09:06,126 [myid:] - INFO  
[main:NIOServerCnxnFactory@94] - binding to port 0.0.0.0/0.0.0.0:11221
[junit] 2013-12-30 08:09:06,129 [myid:] - INFO  
[main:FourLetterWordMain@43] - connecting to 127.0.0.1 11221
[junit] 2013-12-30 08:09:06,130 [myid:] - INFO  
[NIOServerCxn.Factory:0.0.0.0/0.0.0.0:11221:NIOServerCnxnFactory@197] - 
Accepted socket connection from /127.0.0.1:53975
[junit] 2013-12-30 08:09:06,130 [myid:] - INFO  
[NIOServerCxn.Factory:0.0.0.0/0.0.0.0:11221:NIOServerCnxn@827] - Processing 
stat command from /127.0.0.1:53975
[junit] 2013-12-30 08:09:06,130 [myid:] - INFO  
[Thread-5:NIOServerCnxn$StatCommand@663] - Stat command output
[junit] 2013-12-30 08:09:06,131 [myid:] - INFO  
[Thread-5:NIOServerCnxn@1007] - Closed socket connection for client 
/127.0.0.1:53975 (no session established for client)
[junit] 2013-12-30 08:09:06,131 [myid:] - INFO  [main:JMXEnv@135] - 
ensureOnly:[InMemoryDataTree, StandaloneServer_port]
[junit] 2013-12-30 08:09:06,132 [myid:] - INFO  [main:JMXEnv@105] - 
expect:InMemoryDataTree
[junit] 2013-12-30 08:09:06,132 [myid:] - INFO  [main:JMXEnv@108] - 
found:InMemoryDataTree 
org.apache.ZooKeeperService:name0=StandaloneServer_port-1,name1=InMemoryDataTree
[junit] 2013-12-30 08:09:06,133 [myid:] - INFO  [main:JMXEnv@105] - 
expect:StandaloneServer_port
[junit] 2013-12-30 08:09:06,133 [myid:] - INFO  [main:JMXEnv@108] - 
found:StandaloneServer_port 
org.apache.ZooKeeperService:name0=StandaloneServer_port-1
[junit] 2013-12-30 08:09:06,135 [myid:] - INFO  
[main:JUnit4ZKTestRunner$LoggedInvokeMethod@55] - Memory used 8563
[junit] 2013-12-30 08:09:06,135 [myid:] - INFO  
[main:JUnit4ZKTestRunner$LoggedInvokeMethod@60] - Number of threads 20
[junit] 2013-12-30 08:09:06,135 [myid:] - INFO  
[main:JUnit4ZKTestRunner$LoggedInvokeMethod@65] - FINISHED TEST METHOD testQuota
[junit] 2013-12-30 08:09:06,135 [myid:] - INFO  [main:ClientBase@450] - 
tearDown starting
[junit] 2013-12-30 08:09:06,210 [myid:] - INFO  [main:ZooKeeper@684] - 
Session: 0x143428c94b2 closed
[junit] 2013-12-30 08:09:06,210 [myid:] - INFO  [main:ClientBase@420] - 
STOPPING server
[junit] 2013-12-30 08:09:06,210 [myid:] - INFO  
[main-EventThread:ClientCnxn$EventThread@509] - EventThread shut down
[junit] 2013-12-30 08:09:06,212 [myid:] - INFO  [main:ZooKeeperServer@441] 
- shutting down
[junit] 2013-12-30 08:09:06,212 [myid:] - INFO  
[main:SessionTrackerImpl@225] - 

ZooKeeper-trunk-solaris - Build # 776 - Still Failing

2013-12-30 Thread Apache Jenkins Server
See https://builds.apache.org/job/ZooKeeper-trunk-solaris/776/

###
## LAST 60 LINES OF THE CONSOLE 
###
[...truncated 208478 lines...]
[junit] 2013-12-30 09:08:54,555 [myid:] - INFO  
[main:SessionTrackerImpl@183] - Shutting down
[junit] 2013-12-30 09:08:54,555 [myid:] - INFO  
[main:PrepRequestProcessor@972] - Shutting down
[junit] 2013-12-30 09:08:54,555 [myid:] - INFO  
[main:SyncRequestProcessor@190] - Shutting down
[junit] 2013-12-30 09:08:54,555 [myid:] - INFO  [ProcessThread(sid:0 
cport:-1)::PrepRequestProcessor@156] - PrepRequestProcessor exited loop!
[junit] 2013-12-30 09:08:54,555 [myid:] - INFO  
[SyncThread:0:SyncRequestProcessor@168] - SyncRequestProcessor exited!
[junit] 2013-12-30 09:08:54,555 [myid:] - INFO  
[main:FinalRequestProcessor@442] - shutdown of request processor complete
[junit] 2013-12-30 09:08:54,556 [myid:] - INFO  
[main:FourLetterWordMain@43] - connecting to 127.0.0.1 11221
[junit] 2013-12-30 09:08:54,556 [myid:] - INFO  [main:JMXEnv@133] - 
ensureOnly:[]
[junit] 2013-12-30 09:08:54,557 [myid:] - INFO  [main:ClientBase@414] - 
STARTING server
[junit] 2013-12-30 09:08:54,558 [myid:] - INFO  [main:ZooKeeperServer@149] 
- Created server with tickTime 3000 minSessionTimeout 6000 maxSessionTimeout 
6 datadir 
/zonestorage/hudson_solaris/home/hudson/hudson-slave/workspace/ZooKeeper-trunk-solaris/trunk/build/test/tmp/test2032500173245020069.junit.dir/version-2
 snapdir 
/zonestorage/hudson_solaris/home/hudson/hudson-slave/workspace/ZooKeeper-trunk-solaris/trunk/build/test/tmp/test2032500173245020069.junit.dir/version-2
[junit] 2013-12-30 09:08:54,558 [myid:] - INFO  
[main:NIOServerCnxnFactory@670] - Configuring NIO connection handler with 10s 
sessionless connection timeout, 2 selector thread(s), 16 worker threads, and 64 
kB direct buffers.
[junit] 2013-12-30 09:08:54,559 [myid:] - INFO  
[main:NIOServerCnxnFactory@683] - binding to port 0.0.0.0/0.0.0.0:11221
[junit] 2013-12-30 09:08:54,559 [myid:] - INFO  [main:FileSnap@83] - 
Reading snapshot 
/zonestorage/hudson_solaris/home/hudson/hudson-slave/workspace/ZooKeeper-trunk-solaris/trunk/build/test/tmp/test2032500173245020069.junit.dir/version-2/snapshot.b
[junit] 2013-12-30 09:08:54,562 [myid:] - INFO  [main:FileTxnSnapLog@297] - 
Snapshotting: 0xb to 
/zonestorage/hudson_solaris/home/hudson/hudson-slave/workspace/ZooKeeper-trunk-solaris/trunk/build/test/tmp/test2032500173245020069.junit.dir/version-2/snapshot.b
[junit] 2013-12-30 09:08:54,563 [myid:] - INFO  
[main:FourLetterWordMain@43] - connecting to 127.0.0.1 11221
[junit] 2013-12-30 09:08:54,564 [myid:] - INFO  
[NIOServerCxnFactory.AcceptThread:0.0.0.0/0.0.0.0:11221:NIOServerCnxnFactory$AcceptThread@296]
 - Accepted socket connection from /127.0.0.1:65165
[junit] 2013-12-30 09:08:54,564 [myid:] - INFO  
[NIOWorkerThread-1:NIOServerCnxn@835] - Processing stat command from 
/127.0.0.1:65165
[junit] 2013-12-30 09:08:54,564 [myid:] - INFO  
[NIOWorkerThread-1:NIOServerCnxn$StatCommand@684] - Stat command output
[junit] 2013-12-30 09:08:54,565 [myid:] - INFO  
[NIOWorkerThread-1:NIOServerCnxn@1006] - Closed socket connection for client 
/127.0.0.1:65165 (no session established for client)
[junit] 2013-12-30 09:08:54,565 [myid:] - INFO  [main:JMXEnv@133] - 
ensureOnly:[InMemoryDataTree, StandaloneServer_port]
[junit] 2013-12-30 09:08:54,566 [myid:] - INFO  [main:JMXEnv@105] - 
expect:InMemoryDataTree
[junit] 2013-12-30 09:08:54,566 [myid:] - INFO  [main:JMXEnv@108] - 
found:InMemoryDataTree 
org.apache.ZooKeeperService:name0=StandaloneServer_port-1,name1=InMemoryDataTree
[junit] 2013-12-30 09:08:54,566 [myid:] - INFO  [main:JMXEnv@105] - 
expect:StandaloneServer_port
[junit] 2013-12-30 09:08:54,567 [myid:] - INFO  [main:JMXEnv@108] - 
found:StandaloneServer_port 
org.apache.ZooKeeperService:name0=StandaloneServer_port-1
[junit] 2013-12-30 09:08:54,567 [myid:] - INFO  
[main:JUnit4ZKTestRunner$LoggedInvokeMethod@55] - Memory used 13412
[junit] 2013-12-30 09:08:54,567 [myid:] - INFO  
[main:JUnit4ZKTestRunner$LoggedInvokeMethod@60] - Number of threads 24
[junit] 2013-12-30 09:08:54,567 [myid:] - INFO  
[main:JUnit4ZKTestRunner$LoggedInvokeMethod@65] - FINISHED TEST METHOD testQuota
[junit] 2013-12-30 09:08:54,567 [myid:] - INFO  [main:ClientBase@451] - 
tearDown starting
[junit] 2013-12-30 09:08:54,641 [myid:] - INFO  [main:ZooKeeper@777] - 
Session: 0x14342c3550f closed
[junit] 2013-12-30 09:08:54,641 [myid:] - INFO  
[main-EventThread:ClientCnxn$EventThread@513] - EventThread shut down
[junit] 2013-12-30 09:08:54,641 [myid:] - INFO  [main:ClientBase@421] - 
STOPPING server
[junit] 2013-12-30 09:08:54,642 [myid:] - INFO  
[NIOServerCxnFactory.AcceptThread:0.0.0.0/0.0.0.0:11221:NIOServerCnxnFactory$AcceptThread@219]
 - 

ZooKeeper-3.4-WinVS2008_java - Build # 393 - Still Failing

2013-12-30 Thread Apache Jenkins Server
See https://builds.apache.org/job/ZooKeeper-3.4-WinVS2008_java/393/

###
## LAST 60 LINES OF THE CONSOLE 
###
[...truncated 179891 lines...]
[junit] 2013-12-30 09:30:15,486 [myid:] - INFO  
[main:FinalRequestProcessor@415] - shutdown of request processor complete
[junit] 2013-12-30 09:30:15,487 [myid:] - INFO  
[main:FourLetterWordMain@43] - connecting to 127.0.0.1 11221
[junit] 2013-12-30 09:30:15,728 [myid:] - INFO  
[main-SendThread(127.0.0.1:11221):ClientCnxn$SendThread@968] - Opening socket 
connection to server 127.0.0.1/127.0.0.1:11221. Will not attempt to 
authenticate using SASL (unknown error)
[junit] 2013-12-30 09:30:16,476 [myid:] - INFO  [main:JMXEnv@135] - 
ensureOnly:[]
[junit] 2013-12-30 09:30:16,477 [myid:] - INFO  [main:ClientBase@413] - 
STARTING server
[junit] 2013-12-30 09:30:16,477 [myid:] - INFO  [main:ZooKeeperServer@162] 
- Created server with tickTime 3000 minSessionTimeout 6000 maxSessionTimeout 
6 datadir 
f:\hudson\hudson-slave\workspace\ZooKeeper-3.4-WinVS2008_java\branch-3.4\build\test\tmp\test173821282786237.junit.dir\version-2
 snapdir 
f:\hudson\hudson-slave\workspace\ZooKeeper-3.4-WinVS2008_java\branch-3.4\build\test\tmp\test173821282786237.junit.dir\version-2
[junit] 2013-12-30 09:30:16,479 [myid:] - INFO  
[main:NIOServerCnxnFactory@94] - binding to port 0.0.0.0/0.0.0.0:11221
[junit] 2013-12-30 09:30:16,482 [myid:] - INFO  
[main:FourLetterWordMain@43] - connecting to 127.0.0.1 11221
[junit] 2013-12-30 09:30:16,483 [myid:] - INFO  
[NIOServerCxn.Factory:0.0.0.0/0.0.0.0:11221:NIOServerCnxnFactory@197] - 
Accepted socket connection from /127.0.0.1:64262
[junit] 2013-12-30 09:30:16,483 [myid:] - INFO  
[NIOServerCxn.Factory:0.0.0.0/0.0.0.0:11221:NIOServerCnxn@827] - Processing 
stat command from /127.0.0.1:64262
[junit] 2013-12-30 09:30:16,483 [myid:] - INFO  
[Thread-4:NIOServerCnxn$StatCommand@663] - Stat command output
[junit] 2013-12-30 09:30:16,484 [myid:] - INFO  
[Thread-4:NIOServerCnxn@1007] - Closed socket connection for client 
/127.0.0.1:64262 (no session established for client)
[junit] 2013-12-30 09:30:16,484 [myid:] - INFO  [main:JMXEnv@135] - 
ensureOnly:[InMemoryDataTree, StandaloneServer_port]
[junit] 2013-12-30 09:30:16,485 [myid:] - INFO  [main:JMXEnv@105] - 
expect:InMemoryDataTree
[junit] 2013-12-30 09:30:16,485 [myid:] - INFO  [main:JMXEnv@108] - 
found:InMemoryDataTree 
org.apache.ZooKeeperService:name0=StandaloneServer_port-1,name1=InMemoryDataTree
[junit] 2013-12-30 09:30:16,485 [myid:] - INFO  [main:JMXEnv@105] - 
expect:StandaloneServer_port
[junit] 2013-12-30 09:30:16,486 [myid:] - INFO  [main:JMXEnv@108] - 
found:StandaloneServer_port 
org.apache.ZooKeeperService:name0=StandaloneServer_port-1
[junit] 2013-12-30 09:30:16,488 [myid:] - INFO  
[main:JUnit4ZKTestRunner$LoggedInvokeMethod@55] - Memory used 10761
[junit] 2013-12-30 09:30:16,488 [myid:] - INFO  
[main:JUnit4ZKTestRunner$LoggedInvokeMethod@60] - Number of threads 20
[junit] 2013-12-30 09:30:16,489 [myid:] - INFO  
[main:JUnit4ZKTestRunner$LoggedInvokeMethod@65] - FINISHED TEST METHOD testQuota
[junit] 2013-12-30 09:30:16,489 [myid:] - INFO  [main:ClientBase@450] - 
tearDown starting
[junit] 2013-12-30 09:30:16,723 [myid:] - INFO  
[main-SendThread(127.0.0.1:11221):ClientCnxn$SendThread@849] - Socket 
connection established to 127.0.0.1/127.0.0.1:11221, initiating session
[junit] 2013-12-30 09:30:16,723 [myid:] - INFO  
[NIOServerCxn.Factory:0.0.0.0/0.0.0.0:11221:NIOServerCnxnFactory@197] - 
Accepted socket connection from /127.0.0.1:64258
[junit] 2013-12-30 09:30:16,723 [myid:] - INFO  
[NIOServerCxn.Factory:0.0.0.0/0.0.0.0:11221:ZooKeeperServer@861] - Client 
attempting to renew session 0x14342d6dd23 at /127.0.0.1:64258
[junit] 2013-12-30 09:30:16,724 [myid:] - INFO  
[NIOServerCxn.Factory:0.0.0.0/0.0.0.0:11221:ZooKeeperServer@617] - Established 
session 0x14342d6dd23 with negotiated timeout 3 for client 
/127.0.0.1:64258
[junit] 2013-12-30 09:30:16,724 [myid:] - INFO  
[main-SendThread(127.0.0.1:11221):ClientCnxn$SendThread@1228] - Session 
establishment complete on server 127.0.0.1/127.0.0.1:11221, sessionid = 
0x14342d6dd23, negotiated timeout = 3
[junit] 2013-12-30 09:30:16,725 [myid:] - INFO  [ProcessThread(sid:0 
cport:-1)::PrepRequestProcessor@494] - Processed session termination for 
sessionid: 0x14342d6dd23
[junit] 2013-12-30 09:30:16,725 [myid:] - INFO  
[SyncThread:0:FileTxnLog@199] - Creating new log file: log.c
[junit] 2013-12-30 09:30:16,735 [myid:] - INFO  [main:ZooKeeper@684] - 
Session: 0x14342d6dd23 closed
[junit] 2013-12-30 09:30:16,735 [myid:] - INFO  [main:ClientBase@420] - 
STOPPING server
[junit] 2013-12-30 09:30:16,735 [myid:] - WARN  

[jira] [Commented] (ZOOKEEPER-102) Need to replace Jute with supported code

2013-12-30 Thread Leader Ni (JIRA)

[ 
https://issues.apache.org/jira/browse/ZOOKEEPER-102?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13858679#comment-13858679
 ] 

Leader Ni commented on ZOOKEEPER-102:
-

So, how about avro now in hadoop,and any paln to replace jute?

 Need to replace Jute with supported code
 

 Key: ZOOKEEPER-102
 URL: https://issues.apache.org/jira/browse/ZOOKEEPER-102
 Project: ZooKeeper
  Issue Type: Improvement
Reporter: Benjamin Reed

 ZooKeeper currently uses Jute to serialize objects to put on the wire and on 
 disk. We pulled Jute out of Hadoop and added a C binding. Both versions of 
 Jute have evolved (although Hadoop still doesn't have a C binding). It would 
 be nice to use a more standard serialization library. Some options include 
 Thrift or Google's protocol buffers.
 Our main requirements would be Java and C bindings and good performance. (For 
 example, serializing to XML would give us incredibly bad performance and 
 would not be acceptible!)



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)


[jira] [Commented] (ZOOKEEPER-1833) fix windows build

2013-12-30 Thread Flavio Junqueira (JIRA)

[ 
https://issues.apache.org/jira/browse/ZOOKEEPER-1833?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13858680#comment-13858680
 ] 

Flavio Junqueira commented on ZOOKEEPER-1833:
-

Makes sense, thanks for pointing it out! The tests that failed in build #393 
are these, in the case anyone is interested in having a look and helping out:

{noformat}
TEST METHOD FAILED testHighestZxidJoinLate
TEST METHOD FAILED testUpdatingEpoch
TEST METHOD FAILED testAcls
TEST METHOD FAILED testHammerBasic
TEST METHOD FAILED testFourLetterWords
TEST METHOD FAILED testQuotaWithQuorum
{noformat}

 fix windows build
 -

 Key: ZOOKEEPER-1833
 URL: https://issues.apache.org/jira/browse/ZOOKEEPER-1833
 Project: ZooKeeper
  Issue Type: Bug
Affects Versions: 3.4.5
Reporter: Michi Mutsuzaki
Assignee: Michi Mutsuzaki
Priority: Blocker
 Fix For: 3.4.6

 Attachments: TEST-org.apache.zookeeper.test.QuorumTest.zip, 
 ZOOKEEPER-1833-b3.4.patch, ZOOKEEPER-1833.patch, ZOOKEEPER-1833.patch


 A bunch of 3.4 tests are failing on windows.
 {noformat}
 [junit] 2013-12-06 08:40:59,692 [myid:] - INFO  [main:ZKTestCase$1@65] - 
 FAILED testEarlyLeaderAbandonment
 [junit] 2013-12-06 08:41:10,472 [myid:] - INFO  [main:ZKTestCase$1@65] - 
 FAILED testHighestZxidJoinLate
 [junit] 2013-12-06 08:45:31,085 [myid:] - INFO  [main:ZKTestCase$1@65] - 
 FAILED testUpdatingEpoch
 [junit] 2013-12-06 08:55:34,630 [myid:] - INFO  [main:ZKTestCase$1@65] - 
 FAILED testObserversHammer
 [junit] 2013-12-06 08:55:59,889 [myid:] - INFO  [main:ZKTestCase$1@65] - 
 FAILED testAsyncExistsFailure_NoNode
 [junit] 2013-12-06 08:56:00,571 [myid:] - INFO  [main:ZKTestCase$1@65] - 
 FAILED testAsyncGetACL
 [junit] 2013-12-06 08:56:02,626 [myid:] - INFO  [main:ZKTestCase$1@65] - 
 FAILED testAsyncGetChildrenEmpty
 [junit] 2013-12-06 08:56:03,491 [myid:] - INFO  [main:ZKTestCase$1@65] - 
 FAILED testAsyncGetChildrenSingle
 [junit] 2013-12-06 08:56:11,276 [myid:] - INFO  [main:ZKTestCase$1@65] - 
 FAILED testAsyncGetChildrenTwo
 [junit] 2013-12-06 08:56:13,878 [myid:] - INFO  [main:ZKTestCase$1@65] - 
 FAILED testAsyncGetChildrenFailure_NoNode
 [junit] 2013-12-06 08:56:16,294 [myid:] - INFO  [main:ZKTestCase$1@65] - 
 FAILED testAsyncGetChildren2Empty
 [junit] 2013-12-06 08:56:18,622 [myid:] - INFO  [main:ZKTestCase$1@65] - 
 FAILED testAsyncGetChildren2Single
 [junit] 2013-12-06 08:56:21,224 [myid:] - INFO  [main:ZKTestCase$1@65] - 
 FAILED testAsyncGetChildren2Two
 [junit] 2013-12-06 08:56:23,738 [myid:] - INFO  [main:ZKTestCase$1@65] - 
 FAILED testAsyncGetChildren2Failure_NoNode
 [junit] 2013-12-06 08:56:26,058 [myid:] - INFO  [main:ZKTestCase$1@65] - 
 FAILED testAsyncGetData
 [junit] 2013-12-06 08:56:28,482 [myid:] - INFO  [main:ZKTestCase$1@65] - 
 FAILED testAsyncGetDataFailure_NoNode
 [junit] 2013-12-06 08:57:35,527 [myid:] - INFO  [main:ZKTestCase$1@65] - 
 FAILED testStartupFailureCreate
 [junit] 2013-12-06 08:57:38,645 [myid:] - INFO  [main:ZKTestCase$1@65] - 
 FAILED testStartupFailureSet
 [junit] 2013-12-06 08:57:41,261 [myid:] - INFO  [main:ZKTestCase$1@65] - 
 FAILED testStartupFailureSnapshot
 [junit] 2013-12-06 08:59:22,222 [myid:] - INFO  [main:ZKTestCase$1@65] - 
 FAILED testClientWithWatcherObj
 [junit] 2013-12-06 09:00:05,592 [myid:] - INFO  [main:ZKTestCase$1@65] - 
 FAILED testClientCleanup
 [junit] 2013-12-06 09:01:24,113 [myid:] - INFO  [main:ZKTestCase$1@65] - 
 FAILED testBindByAddress
 [junit] 2013-12-06 09:02:14,123 [myid:] - INFO  [main:ZKTestCase$1@65] - 
 FAILED testClientwithoutWatcherObj
 [junit] 2013-12-06 09:05:56,461 [myid:] - INFO  [main:ZKTestCase$1@65] - 
 FAILED testZeroWeightQuorum
 [junit] 2013-12-06 09:08:18,747 [myid:] - INFO  [main:ZKTestCase$1@65] - 
 FAILED testResyncByDiffAfterFollowerCrashes
 [junit] 2013-12-06 09:09:42,271 [myid:] - INFO  [main:ZKTestCase$1@65] - 
 FAILED testFourLetterWords
 [junit] 2013-12-06 09:14:03,770 [myid:] - INFO  [main:ZKTestCase$1@65] - 
 FAILED testLE
 [junit] 2013-12-06 09:46:30,002 [myid:] - INFO  [main:ZKTestCase$1@65] - 
 FAILED testHierarchicalQuorum
 [junit] 2013-12-06 09:50:26,912 [myid:] - INFO  [main:ZKTestCase$1@65] - 
 FAILED testHammerBasic
 [junit] 2013-12-06 09:51:07,604 [myid:] - INFO  [main:ZKTestCase$1@65] - 
 FAILED testQuotaWithQuorum
 [junit] 2013-12-06 09:52:41,515 [myid:] - INFO  [main:ZKTestCase$1@65] - 
 FAILED testNull
 [junit] 2013-12-06 09:53:22,648 [myid:] - INFO  [main:ZKTestCase$1@65] - 
 FAILED testDeleteWithChildren
 [junit] 2013-12-06 09:56:49,061 [myid:] - INFO  [main:ZKTestCase$1@65] - 
 FAILED testClientwithoutWatcherObj
 [junit] 2013-12-06 09:58:27,705 [myid:] - INFO  [main:ZKTestCase$1@65] - 
 FAILED testGetView
 

[jira] [Created] (ZOOKEEPER-1854) ClientBase ZooKeeper server clean-up

2013-12-30 Thread JIRA
Germán Blanco created ZOOKEEPER-1854:


 Summary: ClientBase ZooKeeper server clean-up
 Key: ZOOKEEPER-1854
 URL: https://issues.apache.org/jira/browse/ZOOKEEPER-1854
 Project: ZooKeeper
  Issue Type: Sub-task
  Components: tests
Affects Versions: 3.4.5, 3.5.0
 Environment: Windows 7, Java 1.7
Reporter: Germán Blanco
Assignee: Germán Blanco
 Fix For: 3.4.6, 3.5.0


The ClientBase utility for tests provides methods for creating a ZooKeeper 
server, however the close up methods don't seem to shutdown that ZooKeeper 
server.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)


ZooKeeper-trunk-WinVS2008_java - Build # 639 - Still Failing

2013-12-30 Thread Apache Jenkins Server
See https://builds.apache.org/job/ZooKeeper-trunk-WinVS2008_java/639/

###
## LAST 60 LINES OF THE CONSOLE 
###
[...truncated 250254 lines...]
[junit] 2013-12-30 09:55:33,000 [myid:] - INFO  
[SessionTracker:SessionTrackerImpl@134] - SessionTrackerImpl exited loop!
[junit] 2013-12-30 09:55:33,001 [myid:] - INFO  
[SessionTracker:SessionTrackerImpl@134] - SessionTrackerImpl exited loop!
[junit] 2013-12-30 09:55:33,074 [myid:] - INFO  
[main-SendThread(127.0.0.1:11221):ClientCnxn$SendThread@1008] - Opening socket 
connection to server 127.0.0.1/127.0.0.1:11221. Will not attempt to 
authenticate using SASL (unknown error)
[junit] 2013-12-30 09:55:33,208 [myid:] - INFO  [main:JMXEnv@133] - 
ensureOnly:[]
[junit] 2013-12-30 09:55:33,209 [myid:] - INFO  [main:ClientBase@414] - 
STARTING server
[junit] 2013-12-30 09:55:33,209 [myid:] - INFO  [main:ZooKeeperServer@149] 
- Created server with tickTime 3000 minSessionTimeout 6000 maxSessionTimeout 
6 datadir 
f:\hudson\hudson-slave\workspace\ZooKeeper-trunk-WinVS2008_java\trunk\build\test\tmp\test625099667631270729.junit.dir\version-2
 snapdir 
f:\hudson\hudson-slave\workspace\ZooKeeper-trunk-WinVS2008_java\trunk\build\test\tmp\test625099667631270729.junit.dir\version-2
[junit] 2013-12-30 09:55:33,210 [myid:] - INFO  
[main:NIOServerCnxnFactory@670] - Configuring NIO connection handler with 10s 
sessionless connection timeout, 1 selector thread(s), 4 worker threads, and 64 
kB direct buffers.
[junit] 2013-12-30 09:55:33,211 [myid:] - INFO  
[main:NIOServerCnxnFactory@683] - binding to port 0.0.0.0/0.0.0.0:11221
[junit] 2013-12-30 09:55:33,212 [myid:] - INFO  [main:FileSnap@83] - 
Reading snapshot 
f:\hudson\hudson-slave\workspace\ZooKeeper-trunk-WinVS2008_java\trunk\build\test\tmp\test625099667631270729.junit.dir\version-2\snapshot.b
[junit] 2013-12-30 09:55:33,214 [myid:] - INFO  [main:FileTxnSnapLog@297] - 
Snapshotting: 0xb to 
f:\hudson\hudson-slave\workspace\ZooKeeper-trunk-WinVS2008_java\trunk\build\test\tmp\test625099667631270729.junit.dir\version-2\snapshot.b
[junit] 2013-12-30 09:55:33,216 [myid:] - INFO  
[main:FourLetterWordMain@43] - connecting to 127.0.0.1 11221
[junit] 2013-12-30 09:55:33,217 [myid:] - INFO  
[NIOServerCxnFactory.AcceptThread:0.0.0.0/0.0.0.0:11221:NIOServerCnxnFactory$AcceptThread@296]
 - Accepted socket connection from /127.0.0.1:50990
[junit] 2013-12-30 09:55:33,217 [myid:] - INFO  
[NIOWorkerThread-1:NIOServerCnxn@835] - Processing stat command from 
/127.0.0.1:50990
[junit] 2013-12-30 09:55:33,218 [myid:] - INFO  
[NIOWorkerThread-1:NIOServerCnxn$StatCommand@684] - Stat command output
[junit] 2013-12-30 09:55:33,218 [myid:] - INFO  
[NIOWorkerThread-1:NIOServerCnxn@1006] - Closed socket connection for client 
/127.0.0.1:50990 (no session established for client)
[junit] 2013-12-30 09:55:33,218 [myid:] - INFO  [main:JMXEnv@133] - 
ensureOnly:[InMemoryDataTree, StandaloneServer_port]
[junit] 2013-12-30 09:55:33,220 [myid:] - INFO  [main:JMXEnv@105] - 
expect:InMemoryDataTree
[junit] 2013-12-30 09:55:33,220 [myid:] - INFO  [main:JMXEnv@108] - 
found:InMemoryDataTree 
org.apache.ZooKeeperService:name0=StandaloneServer_port-1,name1=InMemoryDataTree
[junit] 2013-12-30 09:55:33,220 [myid:] - INFO  [main:JMXEnv@105] - 
expect:StandaloneServer_port
[junit] 2013-12-30 09:55:33,220 [myid:] - INFO  [main:JMXEnv@108] - 
found:StandaloneServer_port 
org.apache.ZooKeeperService:name0=StandaloneServer_port-1
[junit] 2013-12-30 09:55:33,220 [myid:] - INFO  
[main:JUnit4ZKTestRunner$LoggedInvokeMethod@55] - Memory used 13054
[junit] 2013-12-30 09:55:33,221 [myid:] - INFO  
[main:JUnit4ZKTestRunner$LoggedInvokeMethod@60] - Number of threads 22
[junit] 2013-12-30 09:55:33,221 [myid:] - INFO  
[main:JUnit4ZKTestRunner$LoggedInvokeMethod@65] - FINISHED TEST METHOD testQuota
[junit] 2013-12-30 09:55:33,221 [myid:] - INFO  [main:ClientBase@451] - 
tearDown starting
[junit] 2013-12-30 09:55:33,563 [myid:] - INFO  
[main-SendThread(127.0.0.1:11221):ClientCnxn$SendThread@882] - Socket 
connection established to 127.0.0.1/127.0.0.1:11221, initiating session
[junit] 2013-12-30 09:55:33,563 [myid:] - INFO  
[NIOServerCxnFactory.AcceptThread:0.0.0.0/0.0.0.0:11221:NIOServerCnxnFactory$AcceptThread@296]
 - Accepted socket connection from /127.0.0.1:50985
[junit] 2013-12-30 09:55:33,564 [myid:] - INFO  
[NIOWorkerThread-2:ZooKeeperServer@858] - Client attempting to renew session 
0x14342ee01cd at /127.0.0.1:50985
[junit] 2013-12-30 09:55:33,565 [myid:] - INFO  
[NIOWorkerThread-2:ZooKeeperServer@604] - Established session 0x14342ee01cd 
with negotiated timeout 3 for client /127.0.0.1:50985
[junit] 2013-12-30 09:55:33,565 [myid:] - INFO  
[main-SendThread(127.0.0.1:11221):ClientCnxn$SendThread@1261] - Session 

[jira] [Updated] (ZOOKEEPER-1854) ClientBase ZooKeeper server clean-up

2013-12-30 Thread JIRA

 [ 
https://issues.apache.org/jira/browse/ZOOKEEPER-1854?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Germán Blanco updated ZOOKEEPER-1854:
-

Attachment: ZOOKEEPER-1854-b3.4.patch

 ClientBase ZooKeeper server clean-up
 

 Key: ZOOKEEPER-1854
 URL: https://issues.apache.org/jira/browse/ZOOKEEPER-1854
 Project: ZooKeeper
  Issue Type: Sub-task
  Components: tests
Affects Versions: 3.4.5, 3.5.0
 Environment: Windows 7, Java 1.7
Reporter: Germán Blanco
Assignee: Germán Blanco
 Fix For: 3.4.6, 3.5.0

 Attachments: ZOOKEEPER-1854-b3.4.patch


 The ClientBase utility for tests provides methods for creating a ZooKeeper 
 server, however the close up methods don't seem to shutdown that ZooKeeper 
 server.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)


[jira] [Updated] (ZOOKEEPER-1854) ClientBase ZooKeeper server clean-up

2013-12-30 Thread JIRA

 [ 
https://issues.apache.org/jira/browse/ZOOKEEPER-1854?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Germán Blanco updated ZOOKEEPER-1854:
-

Attachment: ZOOKEEPER-1854.patch

Please correct me if I am wrong, but I don't see this server being shutdown 
anywhere. I see the database being closed, and I don't know why that is done, 
but that is not the only thing that needs to be closed, right?

 ClientBase ZooKeeper server clean-up
 

 Key: ZOOKEEPER-1854
 URL: https://issues.apache.org/jira/browse/ZOOKEEPER-1854
 Project: ZooKeeper
  Issue Type: Sub-task
  Components: tests
Affects Versions: 3.4.5, 3.5.0
 Environment: Windows 7, Java 1.7
Reporter: Germán Blanco
Assignee: Germán Blanco
 Fix For: 3.4.6, 3.5.0

 Attachments: ZOOKEEPER-1854-b3.4.patch, ZOOKEEPER-1854.patch


 The ClientBase utility for tests provides methods for creating a ZooKeeper 
 server, however the close up methods don't seem to shutdown that ZooKeeper 
 server.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)


ZooKeeper-trunk-jdk7 - Build # 741 - Failure

2013-12-30 Thread Apache Jenkins Server
See https://builds.apache.org/job/ZooKeeper-trunk-jdk7/741/

###
## LAST 60 LINES OF THE CONSOLE 
###
[...truncated 206963 lines...]
[junit] 2013-12-30 10:18:14,340 [myid:] - INFO  [ProcessThread(sid:0 
cport:-1)::PrepRequestProcessor@156] - PrepRequestProcessor exited loop!
[junit] 2013-12-30 10:18:14,340 [myid:] - INFO  
[SyncThread:0:SyncRequestProcessor@168] - SyncRequestProcessor exited!
[junit] 2013-12-30 10:18:14,341 [myid:] - INFO  
[main:FinalRequestProcessor@442] - shutdown of request processor complete
[junit] 2013-12-30 10:18:14,341 [myid:] - INFO  
[main:FourLetterWordMain@43] - connecting to 127.0.0.1 11221
[junit] 2013-12-30 10:18:14,342 [myid:] - INFO  [main:JMXEnv@133] - 
ensureOnly:[]
[junit] 2013-12-30 10:18:14,344 [myid:] - INFO  [main:ClientBase@414] - 
STARTING server
[junit] 2013-12-30 10:18:14,344 [myid:] - INFO  [main:ZooKeeperServer@149] 
- Created server with tickTime 3000 minSessionTimeout 6000 maxSessionTimeout 
6 datadir 
/x1/jenkins/jenkins-slave/workspace/ZooKeeper-trunk-jdk7/trunk/build/test/tmp/test6757438650396793087.junit.dir/version-2
 snapdir 
/x1/jenkins/jenkins-slave/workspace/ZooKeeper-trunk-jdk7/trunk/build/test/tmp/test6757438650396793087.junit.dir/version-2
[junit] 2013-12-30 10:18:14,344 [myid:] - INFO  
[main:NIOServerCnxnFactory@670] - Configuring NIO connection handler with 10s 
sessionless connection timeout, 3 selector thread(s), 48 worker threads, and 64 
kB direct buffers.
[junit] 2013-12-30 10:18:14,345 [myid:] - INFO  
[main:NIOServerCnxnFactory@683] - binding to port 0.0.0.0/0.0.0.0:11221
[junit] 2013-12-30 10:18:14,346 [myid:] - INFO  [main:FileSnap@83] - 
Reading snapshot 
/x1/jenkins/jenkins-slave/workspace/ZooKeeper-trunk-jdk7/trunk/build/test/tmp/test6757438650396793087.junit.dir/version-2/snapshot.b
[junit] 2013-12-30 10:18:14,349 [myid:] - INFO  [main:FileTxnSnapLog@297] - 
Snapshotting: 0xb to 
/x1/jenkins/jenkins-slave/workspace/ZooKeeper-trunk-jdk7/trunk/build/test/tmp/test6757438650396793087.junit.dir/version-2/snapshot.b
[junit] 2013-12-30 10:18:14,351 [myid:] - INFO  
[main:FourLetterWordMain@43] - connecting to 127.0.0.1 11221
[junit] 2013-12-30 10:18:14,352 [myid:] - INFO  
[NIOServerCxnFactory.AcceptThread:0.0.0.0/0.0.0.0:11221:NIOServerCnxnFactory$AcceptThread@296]
 - Accepted socket connection from /127.0.0.1:45340
[junit] 2013-12-30 10:18:14,353 [myid:] - INFO  
[NIOWorkerThread-1:NIOServerCnxn@835] - Processing stat command from 
/127.0.0.1:45340
[junit] 2013-12-30 10:18:14,353 [myid:] - INFO  
[NIOWorkerThread-1:NIOServerCnxn$StatCommand@684] - Stat command output
[junit] 2013-12-30 10:18:14,354 [myid:] - INFO  
[NIOWorkerThread-1:NIOServerCnxn@1006] - Closed socket connection for client 
/127.0.0.1:45340 (no session established for client)
[junit] 2013-12-30 10:18:14,354 [myid:] - INFO  [main:JMXEnv@133] - 
ensureOnly:[InMemoryDataTree, StandaloneServer_port]
[junit] 2013-12-30 10:18:14,356 [myid:] - INFO  [main:JMXEnv@105] - 
expect:InMemoryDataTree
[junit] 2013-12-30 10:18:14,356 [myid:] - INFO  [main:JMXEnv@108] - 
found:InMemoryDataTree 
org.apache.ZooKeeperService:name0=StandaloneServer_port-1,name1=InMemoryDataTree
[junit] 2013-12-30 10:18:14,356 [myid:] - INFO  [main:JMXEnv@105] - 
expect:StandaloneServer_port
[junit] 2013-12-30 10:18:14,357 [myid:] - INFO  [main:JMXEnv@108] - 
found:StandaloneServer_port 
org.apache.ZooKeeperService:name0=StandaloneServer_port-1
[junit] 2013-12-30 10:18:14,357 [myid:] - INFO  
[main:JUnit4ZKTestRunner$LoggedInvokeMethod@55] - Memory used 18198
[junit] 2013-12-30 10:18:14,357 [myid:] - INFO  
[main:JUnit4ZKTestRunner$LoggedInvokeMethod@60] - Number of threads 25
[junit] 2013-12-30 10:18:14,358 [myid:] - INFO  
[main:JUnit4ZKTestRunner$LoggedInvokeMethod@65] - FINISHED TEST METHOD testQuota
[junit] 2013-12-30 10:18:14,358 [myid:] - INFO  [main:ClientBase@451] - 
tearDown starting
[junit] 2013-12-30 10:18:14,410 [myid:] - INFO  [main:ZooKeeper@777] - 
Session: 0x1434302cd95 closed
[junit] 2013-12-30 10:18:14,411 [myid:] - INFO  [main:ClientBase@421] - 
STOPPING server
[junit] 2013-12-30 10:18:14,410 [myid:] - INFO  
[main-EventThread:ClientCnxn$EventThread@513] - EventThread shut down
[junit] 2013-12-30 10:18:14,411 [myid:] - INFO  
[ConnnectionExpirer:NIOServerCnxnFactory$ConnectionExpirerThread@583] - 
ConnnectionExpirerThread interrupted
[junit] 2013-12-30 10:18:14,411 [myid:] - INFO  
[NIOServerCxnFactory.AcceptThread:0.0.0.0/0.0.0.0:11221:NIOServerCnxnFactory$AcceptThread@219]
 - accept thread exitted run method
[junit] 2013-12-30 10:18:14,421 [myid:] - INFO  
[NIOServerCxnFactory.SelectorThread-2:NIOServerCnxnFactory$SelectorThread@420] 
- selector thread exitted run method
[junit] 2013-12-30 10:18:14,411 [myid:] - INFO  

ZooKeeper_branch34_jdk7 - Build # 430 - Failure

2013-12-30 Thread Apache Jenkins Server
See https://builds.apache.org/job/ZooKeeper_branch34_jdk7/430/

###
## LAST 60 LINES OF THE CONSOLE 
###
[...truncated 410 lines...]
at java.util.concurrent.FutureTask$Sync.innerRun(FutureTask.java:334)
at java.util.concurrent.FutureTask.run(FutureTask.java:166)
at 
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1146)
at 
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
at java.lang.Thread.run(Thread.java:679)
Caused by: org.tmatesoft.svn.core.SVNException: svn: E175002: OPTIONS 
/repos/asf/zookeeper/branches/branch-3.4 failed
at 
org.tmatesoft.svn.core.internal.io.dav.http.HTTPConnection.request(HTTPConnection.java:388)
at 
org.tmatesoft.svn.core.internal.io.dav.http.HTTPConnection.request(HTTPConnection.java:373)
at 
org.tmatesoft.svn.core.internal.io.dav.http.HTTPConnection.request(HTTPConnection.java:361)
at 
org.tmatesoft.svn.core.internal.io.dav.DAVConnection.performHttpRequest(DAVConnection.java:707)
at 
org.tmatesoft.svn.core.internal.io.dav.DAVConnection.exchangeCapabilities(DAVConnection.java:627)
at 
org.tmatesoft.svn.core.internal.io.dav.DAVConnection.open(DAVConnection.java:102)
at 
org.tmatesoft.svn.core.internal.io.dav.DAVRepository.openConnection(DAVRepository.java:1020)
at 
org.tmatesoft.svn.core.internal.io.dav.DAVRepository.getLatestRevision(DAVRepository.java:180)
at 
org.tmatesoft.svn.core.internal.wc16.SVNBasicDelegate.getRevisionNumber(SVNBasicDelegate.java:480)
at 
org.tmatesoft.svn.core.internal.wc16.SVNBasicDelegate.getLocations(SVNBasicDelegate.java:833)
at 
org.tmatesoft.svn.core.internal.wc16.SVNBasicDelegate.createRepository(SVNBasicDelegate.java:527)
at 
org.tmatesoft.svn.core.internal.wc16.SVNUpdateClient16.doCheckout(SVNUpdateClient16.java:875)
at 
org.tmatesoft.svn.core.internal.wc2.old.SvnOldCheckout.run(SvnOldCheckout.java:19)
at 
org.tmatesoft.svn.core.internal.wc2.old.SvnOldCheckout.run(SvnOldCheckout.java:8)
at 
org.tmatesoft.svn.core.internal.wc2.SvnOperationRunner.run(SvnOperationRunner.java:20)
at 
org.tmatesoft.svn.core.wc2.SvnOperationFactory.run(SvnOperationFactory.java:1238)
at org.tmatesoft.svn.core.wc2.SvnOperation.run(SvnOperation.java:294)
at 
org.tmatesoft.svn.core.wc.SVNUpdateClient.doCheckout(SVNUpdateClient.java:777)
at 
hudson.scm.subversion.CheckoutUpdater$1.perform(CheckoutUpdater.java:99)
... 17 more
Caused by: svn: E175002: OPTIONS /repos/asf/zookeeper/branches/branch-3.4 failed
at 
org.tmatesoft.svn.core.SVNErrorMessage.create(SVNErrorMessage.java:208)
at 
org.tmatesoft.svn.core.SVNErrorMessage.create(SVNErrorMessage.java:154)
at 
org.tmatesoft.svn.core.SVNErrorMessage.create(SVNErrorMessage.java:97)
... 36 more
Caused by: org.tmatesoft.svn.core.SVNException: svn: E175002: OPTIONS request 
failed on '/repos/asf/zookeeper/branches/branch-3.4'
svn: E175002: timed out waiting for server
at 
org.tmatesoft.svn.core.internal.wc.SVNErrorManager.error(SVNErrorManager.java:64)
at 
org.tmatesoft.svn.core.internal.wc.SVNErrorManager.error(SVNErrorManager.java:51)
at 
org.tmatesoft.svn.core.internal.io.dav.http.HTTPConnection._request(HTTPConnection.java:777)
at 
org.tmatesoft.svn.core.internal.io.dav.http.HTTPConnection.request(HTTPConnection.java:382)
... 35 more
Caused by: svn: E175002: OPTIONS request failed on 
'/repos/asf/zookeeper/branches/branch-3.4'
at 
org.tmatesoft.svn.core.SVNErrorMessage.create(SVNErrorMessage.java:208)
at 
org.tmatesoft.svn.core.internal.io.dav.http.HTTPConnection._request(HTTPConnection.java:775)
... 36 more
Caused by: svn: E175002: timed out waiting for server
at 
org.tmatesoft.svn.core.SVNErrorMessage.create(SVNErrorMessage.java:208)
at 
org.tmatesoft.svn.core.internal.io.dav.http.HTTPConnection._request(HTTPConnection.java:514)
... 36 more
Caused by: java.net.SocketTimeoutException: connect timed out
at java.net.PlainSocketImpl.socketConnect(Native Method)
at 
java.net.AbstractPlainSocketImpl.doConnect(AbstractPlainSocketImpl.java:327)
at 
java.net.AbstractPlainSocketImpl.connectToAddress(AbstractPlainSocketImpl.java:193)
at 
java.net.AbstractPlainSocketImpl.connect(AbstractPlainSocketImpl.java:180)
at java.net.SocksSocketImpl.connect(SocksSocketImpl.java:385)
at java.net.Socket.connect(Socket.java:546)
at 
org.tmatesoft.svn.core.internal.util.SVNSocketConnection.run(SVNSocketConnection.java:57)
at 
java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:471)
... 5 more
Recording test results
Email was triggered for: Failure
Sending email for 

ZooKeeper_branch34_openjdk7 - Build # 420 - Failure

2013-12-30 Thread Apache Jenkins Server
See https://builds.apache.org/job/ZooKeeper_branch34_openjdk7/420/

###
## LAST 60 LINES OF THE CONSOLE 
###
[...truncated 410 lines...]
at java.util.concurrent.FutureTask$Sync.innerRun(FutureTask.java:334)
at java.util.concurrent.FutureTask.run(FutureTask.java:166)
at 
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1146)
at 
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
at java.lang.Thread.run(Thread.java:679)
Caused by: org.tmatesoft.svn.core.SVNException: svn: E175002: OPTIONS 
/repos/asf/zookeeper/branches/branch-3.4 failed
at 
org.tmatesoft.svn.core.internal.io.dav.http.HTTPConnection.request(HTTPConnection.java:388)
at 
org.tmatesoft.svn.core.internal.io.dav.http.HTTPConnection.request(HTTPConnection.java:373)
at 
org.tmatesoft.svn.core.internal.io.dav.http.HTTPConnection.request(HTTPConnection.java:361)
at 
org.tmatesoft.svn.core.internal.io.dav.DAVConnection.performHttpRequest(DAVConnection.java:707)
at 
org.tmatesoft.svn.core.internal.io.dav.DAVConnection.exchangeCapabilities(DAVConnection.java:627)
at 
org.tmatesoft.svn.core.internal.io.dav.DAVConnection.open(DAVConnection.java:102)
at 
org.tmatesoft.svn.core.internal.io.dav.DAVRepository.openConnection(DAVRepository.java:1020)
at 
org.tmatesoft.svn.core.internal.io.dav.DAVRepository.getLatestRevision(DAVRepository.java:180)
at 
org.tmatesoft.svn.core.internal.wc16.SVNBasicDelegate.getRevisionNumber(SVNBasicDelegate.java:480)
at 
org.tmatesoft.svn.core.internal.wc16.SVNBasicDelegate.getLocations(SVNBasicDelegate.java:833)
at 
org.tmatesoft.svn.core.internal.wc16.SVNBasicDelegate.createRepository(SVNBasicDelegate.java:527)
at 
org.tmatesoft.svn.core.internal.wc16.SVNUpdateClient16.doCheckout(SVNUpdateClient16.java:875)
at 
org.tmatesoft.svn.core.internal.wc2.old.SvnOldCheckout.run(SvnOldCheckout.java:19)
at 
org.tmatesoft.svn.core.internal.wc2.old.SvnOldCheckout.run(SvnOldCheckout.java:8)
at 
org.tmatesoft.svn.core.internal.wc2.SvnOperationRunner.run(SvnOperationRunner.java:20)
at 
org.tmatesoft.svn.core.wc2.SvnOperationFactory.run(SvnOperationFactory.java:1238)
at org.tmatesoft.svn.core.wc2.SvnOperation.run(SvnOperation.java:294)
at 
org.tmatesoft.svn.core.wc.SVNUpdateClient.doCheckout(SVNUpdateClient.java:777)
at 
hudson.scm.subversion.CheckoutUpdater$1.perform(CheckoutUpdater.java:99)
... 17 more
Caused by: svn: E175002: OPTIONS /repos/asf/zookeeper/branches/branch-3.4 failed
at 
org.tmatesoft.svn.core.SVNErrorMessage.create(SVNErrorMessage.java:208)
at 
org.tmatesoft.svn.core.SVNErrorMessage.create(SVNErrorMessage.java:154)
at 
org.tmatesoft.svn.core.SVNErrorMessage.create(SVNErrorMessage.java:97)
... 36 more
Caused by: org.tmatesoft.svn.core.SVNException: svn: E175002: OPTIONS request 
failed on '/repos/asf/zookeeper/branches/branch-3.4'
svn: E175002: timed out waiting for server
at 
org.tmatesoft.svn.core.internal.wc.SVNErrorManager.error(SVNErrorManager.java:64)
at 
org.tmatesoft.svn.core.internal.wc.SVNErrorManager.error(SVNErrorManager.java:51)
at 
org.tmatesoft.svn.core.internal.io.dav.http.HTTPConnection._request(HTTPConnection.java:777)
at 
org.tmatesoft.svn.core.internal.io.dav.http.HTTPConnection.request(HTTPConnection.java:382)
... 35 more
Caused by: svn: E175002: OPTIONS request failed on 
'/repos/asf/zookeeper/branches/branch-3.4'
at 
org.tmatesoft.svn.core.SVNErrorMessage.create(SVNErrorMessage.java:208)
at 
org.tmatesoft.svn.core.internal.io.dav.http.HTTPConnection._request(HTTPConnection.java:775)
... 36 more
Caused by: svn: E175002: timed out waiting for server
at 
org.tmatesoft.svn.core.SVNErrorMessage.create(SVNErrorMessage.java:208)
at 
org.tmatesoft.svn.core.internal.io.dav.http.HTTPConnection._request(HTTPConnection.java:514)
... 36 more
Caused by: java.net.SocketTimeoutException: connect timed out
at java.net.PlainSocketImpl.socketConnect(Native Method)
at 
java.net.AbstractPlainSocketImpl.doConnect(AbstractPlainSocketImpl.java:327)
at 
java.net.AbstractPlainSocketImpl.connectToAddress(AbstractPlainSocketImpl.java:193)
at 
java.net.AbstractPlainSocketImpl.connect(AbstractPlainSocketImpl.java:180)
at java.net.SocksSocketImpl.connect(SocksSocketImpl.java:385)
at java.net.Socket.connect(Socket.java:546)
at 
org.tmatesoft.svn.core.internal.util.SVNSocketConnection.run(SVNSocketConnection.java:57)
at 
java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:471)
... 5 more
Recording test results
Email was triggered for: Failure
Sending email for 

[jira] [Commented] (ZOOKEEPER-1854) ClientBase ZooKeeper server clean-up

2013-12-30 Thread Rakesh R (JIRA)

[ 
https://issues.apache.org/jira/browse/ZOOKEEPER-1854?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13858696#comment-13858696
 ] 

Rakesh R commented on ZOOKEEPER-1854:
-

Hi German,  Please see factory#shutdown logic, here I could see 
zkServer#shutdown and I feel there is no need of explicit zks#shutdown. Am I 
missing anything?. It would be great if you can look at ZOOKEEPER-1852, there I 
mentioned one case of missing server shutdown and will affect other test cases.

 ClientBase ZooKeeper server clean-up
 

 Key: ZOOKEEPER-1854
 URL: https://issues.apache.org/jira/browse/ZOOKEEPER-1854
 Project: ZooKeeper
  Issue Type: Sub-task
  Components: tests
Affects Versions: 3.4.5, 3.5.0
 Environment: Windows 7, Java 1.7
Reporter: Germán Blanco
Assignee: Germán Blanco
 Fix For: 3.4.6, 3.5.0

 Attachments: ZOOKEEPER-1854-b3.4.patch, ZOOKEEPER-1854.patch


 The ClientBase utility for tests provides methods for creating a ZooKeeper 
 server, however the close up methods don't seem to shutdown that ZooKeeper 
 server.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)


[jira] [Commented] (ZOOKEEPER-1854) ClientBase ZooKeeper server clean-up

2013-12-30 Thread Hadoop QA (JIRA)

[ 
https://issues.apache.org/jira/browse/ZOOKEEPER-1854?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13858701#comment-13858701
 ] 

Hadoop QA commented on ZOOKEEPER-1854:
--

-1 overall.  Here are the results of testing the latest attachment 
  http://issues.apache.org/jira/secure/attachment/12620825/ZOOKEEPER-1854.patch
  against trunk revision 1553693.

+1 @author.  The patch does not contain any @author tags.

+1 tests included.  The patch appears to include 3 new or modified tests.

+1 javadoc.  The javadoc tool did not generate any warning messages.

+1 javac.  The applied patch does not increase the total number of javac 
compiler warnings.

+1 findbugs.  The patch does not introduce any new Findbugs (version 1.3.9) 
warnings.

+1 release audit.  The applied patch does not increase the total number of 
release audit warnings.

-1 core tests.  The patch failed core unit tests.

+1 contrib tests.  The patch passed contrib unit tests.

Test results: 
https://builds.apache.org/job/PreCommit-ZOOKEEPER-Build/1868//testReport/
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-ZOOKEEPER-Build/1868//artifact/trunk/build/test/findbugs/newPatchFindbugsWarnings.html
Console output: 
https://builds.apache.org/job/PreCommit-ZOOKEEPER-Build/1868//console

This message is automatically generated.

 ClientBase ZooKeeper server clean-up
 

 Key: ZOOKEEPER-1854
 URL: https://issues.apache.org/jira/browse/ZOOKEEPER-1854
 Project: ZooKeeper
  Issue Type: Sub-task
  Components: tests
Affects Versions: 3.4.5, 3.5.0
 Environment: Windows 7, Java 1.7
Reporter: Germán Blanco
Assignee: Germán Blanco
 Fix For: 3.4.6, 3.5.0

 Attachments: ZOOKEEPER-1854-b3.4.patch, ZOOKEEPER-1854.patch


 The ClientBase utility for tests provides methods for creating a ZooKeeper 
 server, however the close up methods don't seem to shutdown that ZooKeeper 
 server.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)


Failed: ZOOKEEPER-1854 PreCommit Build #1868

2013-12-30 Thread Apache Jenkins Server
Jira: https://issues.apache.org/jira/browse/ZOOKEEPER-1854
Build: https://builds.apache.org/job/PreCommit-ZOOKEEPER-Build/1868/

###
## LAST 60 LINES OF THE CONSOLE 
###
[...truncated 237494 lines...]
 [exec] 
 [exec] 
 [exec] 
 [exec] -1 overall.  Here are the results of testing the latest attachment 
 [exec]   
http://issues.apache.org/jira/secure/attachment/12620825/ZOOKEEPER-1854.patch
 [exec]   against trunk revision 1553693.
 [exec] 
 [exec] +1 @author.  The patch does not contain any @author tags.
 [exec] 
 [exec] +1 tests included.  The patch appears to include 3 new or 
modified tests.
 [exec] 
 [exec] +1 javadoc.  The javadoc tool did not generate any warning 
messages.
 [exec] 
 [exec] +1 javac.  The applied patch does not increase the total number 
of javac compiler warnings.
 [exec] 
 [exec] +1 findbugs.  The patch does not introduce any new Findbugs 
(version 1.3.9) warnings.
 [exec] 
 [exec] +1 release audit.  The applied patch does not increase the 
total number of release audit warnings.
 [exec] 
 [exec] -1 core tests.  The patch failed core unit tests.
 [exec] 
 [exec] +1 contrib tests.  The patch passed contrib unit tests.
 [exec] 
 [exec] Test results: 
https://builds.apache.org/job/PreCommit-ZOOKEEPER-Build/1868//testReport/
 [exec] Findbugs warnings: 
https://builds.apache.org/job/PreCommit-ZOOKEEPER-Build/1868//artifact/trunk/build/test/findbugs/newPatchFindbugsWarnings.html
 [exec] Console output: 
https://builds.apache.org/job/PreCommit-ZOOKEEPER-Build/1868//console
 [exec] 
 [exec] This message is automatically generated.
 [exec] 
 [exec] 
 [exec] 
==
 [exec] 
==
 [exec] Adding comment to Jira.
 [exec] 
==
 [exec] 
==
 [exec] 
 [exec] 
 [exec] Comment added.
 [exec] 539338496e6cb757fd3508d1d47ac95f49354137 logged out
 [exec] 
 [exec] 
 [exec] 
==
 [exec] 
==
 [exec] Finished build.
 [exec] 
==
 [exec] 
==
 [exec] 
 [exec] 

BUILD FAILED
/home/jenkins/jenkins-slave/workspace/PreCommit-ZOOKEEPER-Build/trunk/build.xml:1674:
 exec returned: 1

Total time: 33 minutes 3 seconds
Build step 'Execute shell' marked build as failure
Archiving artifacts
Recording test results
Description set: ZOOKEEPER-1854
Email was triggered for: Failure
Sending email for trigger: Failure



###
## FAILED TESTS (if any) 
##
All tests passed

ZooKeeper-trunk - Build # 2170 - Failure

2013-12-30 Thread Apache Jenkins Server
See https://builds.apache.org/job/ZooKeeper-trunk/2170/

###
## LAST 60 LINES OF THE CONSOLE 
###
[...truncated 293567 lines...]
 [exec] Log Message Received: [2013-12-30 
11:04:35,130:23636(0x2b5dda82eda0):ZOO_INFO@log_env@922: Client 
environment:os.version=#71-Ubuntu SMP Wed Jul 20 17:42:25 UTC 2011]
 [exec] Log Message Received: [2013-12-30 
11:04:35,130:23636(0x2b5dda82eda0):ZOO_INFO@log_env@930: Client 
environment:user.name=(null)]
 [exec] Log Message Received: [2013-12-30 
11:04:35,130:23636(0x2b5dda82eda0):ZOO_INFO@log_env@938: Client 
environment:user.home=/home/jenkins]
 [exec] Log Message Received: [2013-12-30 
11:04:35,130:23636(0x2b5dda82eda0):ZOO_INFO@log_env@950: Client 
environment:user.dir=/home/jenkins/jenkins-slave/workspace/ZooKeeper-trunk/trunk/build/test/test-cppunit]
 [exec] Log Message Received: [2013-12-30 
11:04:35,130:23636(0x2b5dda82eda0):ZOO_INFO@zookeeper_init_internal@993: 
Initiating client connection, host=127.0.0.1:22181 sessionTimeout=1 
watcher=0x4422e0 sessionId=0 sessionPasswd=null context=0x7fff8173d740 
flags=0]
 [exec] Log Message Received: [2013-12-30 
11:04:35,131:23636(0x2b5ddbc8e700):ZOO_INFO@check_events@2090: initiated 
connection to server [127.0.0.1:22181]]
 [exec] Log Message Received: [2013-12-30 
11:04:35,140:23636(0x2b5ddbc8e700):ZOO_INFO@check_events@2138: session 
establishment complete on server [127.0.0.1:22181], 
sessionId=0x143432c93dd000e, negotiated timeout=1]
 [exec]  : elapsed 1001 : OK
 [exec] Zookeeper_simpleSystem::testAsyncWatcherAutoReset ZooKeeper server 
started : elapsed 9757 : OK
 [exec] Zookeeper_simpleSystem::testDeserializeString : elapsed 0 : OK
 [exec] Zookeeper_simpleSystem::testNullData : elapsed 1027 : OK
 [exec] Zookeeper_simpleSystem::testIPV6 : elapsed 1007 : OK
 [exec] Zookeeper_simpleSystem::testCreate : elapsed 1017 : OK
 [exec] Zookeeper_simpleSystem::testPath : elapsed 1020 : OK
 [exec] Zookeeper_simpleSystem::testPathValidation : elapsed 1035 : OK
 [exec] Zookeeper_simpleSystem::testPing : elapsed 17450 : OK
 [exec] Zookeeper_simpleSystem::testAcl : elapsed 1036 : OK
 [exec] Zookeeper_simpleSystem::testChroot : elapsed 4042 : OK
 [exec] Zookeeper_simpleSystem::testAuth ZooKeeper server started ZooKeeper 
server started : elapsed 29565 : OK
 [exec] Zookeeper_simpleSystem::testHangingClient : elapsed 1032 : OK
 [exec] Zookeeper_simpleSystem::testWatcherAutoResetWithGlobal ZooKeeper 
server started ZooKeeper server started ZooKeeper server started : elapsed 
13428 : OK
 [exec] Zookeeper_simpleSystem::testWatcherAutoResetWithLocal ZooKeeper 
server started ZooKeeper server started ZooKeeper server started : elapsed 
14494 : OK
 [exec] Zookeeper_simpleSystem::testGetChildren2 : elapsed 1055 : OK
 [exec] Zookeeper_simpleSystem::testLastZxid : elapsed 2519 : OK
 [exec] Zookeeper_watchers::testDefaultSessionWatcher1 : elapsed 52 : OK
 [exec] Zookeeper_watchers::testDefaultSessionWatcher2 : elapsed 4 : OK
 [exec] Zookeeper_watchers::testObjectSessionWatcher1 : elapsed 54 : OK
 [exec] Zookeeper_watchers::testObjectSessionWatcher2 : elapsed 56 : OK
 [exec] Zookeeper_watchers::testNodeWatcher1 : elapsed 56 : OK
 [exec] Zookeeper_watchers::testChildWatcher1 : elapsed 104 : OK
 [exec] Zookeeper_watchers::testChildWatcher2 : elapsed 54 : OK
 [exec] 
/home/jenkins/jenkins-slave/workspace/ZooKeeper-trunk/trunk/src/c/tests/TestReconfig.cc:183:
 Assertion: equality assertion failed [Expected: 1, Actual  : 0]
 [exec] 
/home/jenkins/jenkins-slave/workspace/ZooKeeper-trunk/trunk/src/c/tests/TestReconfig.cc:473:
 Assertion: assertion failed [Expression: found != string::npos]
 [exec] Failures !!!
 [exec] Run: 69   Failure total: 2   Failures: 2   Errors: 0
 [exec] FAIL: zktest-mt
 [exec] ==
 [exec] 1 of 2 tests failed
 [exec] Please report to u...@zookeeper.apache.org
 [exec] ==
 [exec] make[1]: *** [check-TESTS] Error 1
 [exec] make[1]: Leaving directory 
`/home/jenkins/jenkins-slave/workspace/ZooKeeper-trunk/trunk/build/test/test-cppunit'
 [exec] make: *** [check-am] Error 2

BUILD FAILED
/home/jenkins/jenkins-slave/workspace/ZooKeeper-trunk/trunk/build.xml:1404: The 
following error occurred while executing this line:
/home/jenkins/jenkins-slave/workspace/ZooKeeper-trunk/trunk/build.xml:1364: The 
following error occurred while executing this line:
/home/jenkins/jenkins-slave/workspace/ZooKeeper-trunk/trunk/build.xml:1374: 
exec returned: 2

Total time: 31 minutes 48 seconds
Build step 'Execute shell' marked build as failure
[FINDBUGS] Skipping publisher since build result is FAILURE
[WARNINGS] Skipping publisher since build result is FAILURE
Archiving 

[jira] [Commented] (ZOOKEEPER-1854) ClientBase ZooKeeper server clean-up

2013-12-30 Thread Flavio Junqueira (JIRA)

[ 
https://issues.apache.org/jira/browse/ZOOKEEPER-1854?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13858722#comment-13858722
 ] 

Flavio Junqueira commented on ZOOKEEPER-1854:
-

When some question is raised, please cancel the patch so that we don't end up 
committing patches before reaching agreement. The patch proposer isn't going to 
be offended (or at least shouldn't be), it is just a way of signalling that 
something is still not clear.

 ClientBase ZooKeeper server clean-up
 

 Key: ZOOKEEPER-1854
 URL: https://issues.apache.org/jira/browse/ZOOKEEPER-1854
 Project: ZooKeeper
  Issue Type: Sub-task
  Components: tests
Affects Versions: 3.4.5, 3.5.0
 Environment: Windows 7, Java 1.7
Reporter: Germán Blanco
Assignee: Germán Blanco
 Fix For: 3.4.6, 3.5.0

 Attachments: ZOOKEEPER-1854-b3.4.patch, ZOOKEEPER-1854.patch


 The ClientBase utility for tests provides methods for creating a ZooKeeper 
 server, however the close up methods don't seem to shutdown that ZooKeeper 
 server.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)


[jira] [Resolved] (ZOOKEEPER-1854) ClientBase ZooKeeper server clean-up

2013-12-30 Thread JIRA

 [ 
https://issues.apache.org/jira/browse/ZOOKEEPER-1854?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Germán Blanco resolved ZOOKEEPER-1854.
--

Resolution: Invalid

I wonder how I missed that.
This JIRA doesn't make sense.

 ClientBase ZooKeeper server clean-up
 

 Key: ZOOKEEPER-1854
 URL: https://issues.apache.org/jira/browse/ZOOKEEPER-1854
 Project: ZooKeeper
  Issue Type: Sub-task
  Components: tests
Affects Versions: 3.4.5, 3.5.0
 Environment: Windows 7, Java 1.7
Reporter: Germán Blanco
Assignee: Germán Blanco
 Fix For: 3.4.6, 3.5.0

 Attachments: ZOOKEEPER-1854-b3.4.patch, ZOOKEEPER-1854.patch


 The ClientBase utility for tests provides methods for creating a ZooKeeper 
 server, however the close up methods don't seem to shutdown that ZooKeeper 
 server.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)


[jira] [Commented] (ZOOKEEPER-1854) ClientBase ZooKeeper server clean-up

2013-12-30 Thread Flavio Junqueira (JIRA)

[ 
https://issues.apache.org/jira/browse/ZOOKEEPER-1854?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13858730#comment-13858730
 ] 

Flavio Junqueira commented on ZOOKEEPER-1854:
-

Perhaps my logic is backwards but it seems to me that the server should be 
shutting down the cnxn server, not the other way around. This is just a 
digression, such a refactoring is out of the scope of this jira.

 ClientBase ZooKeeper server clean-up
 

 Key: ZOOKEEPER-1854
 URL: https://issues.apache.org/jira/browse/ZOOKEEPER-1854
 Project: ZooKeeper
  Issue Type: Sub-task
  Components: tests
Affects Versions: 3.4.5, 3.5.0
 Environment: Windows 7, Java 1.7
Reporter: Germán Blanco
Assignee: Germán Blanco
 Fix For: 3.4.6, 3.5.0

 Attachments: ZOOKEEPER-1854-b3.4.patch, ZOOKEEPER-1854.patch


 The ClientBase utility for tests provides methods for creating a ZooKeeper 
 server, however the close up methods don't seem to shutdown that ZooKeeper 
 server.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)


[jira] [Commented] (ZOOKEEPER-1852) ServerCnxnFactory instance is not properly cleanedup

2013-12-30 Thread JIRA

[ 
https://issues.apache.org/jira/browse/ZOOKEEPER-1852?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13858731#comment-13858731
 ] 

Germán Blanco commented on ZOOKEEPER-1852:
--

So what happens now when createNewServerInstance fails and null is returned?
Apparently the startServerInstance is called and it fails with 
NullPointerException, is that intended?

 ServerCnxnFactory instance is not properly cleanedup
 

 Key: ZOOKEEPER-1852
 URL: https://issues.apache.org/jira/browse/ZOOKEEPER-1852
 Project: ZooKeeper
  Issue Type: Sub-task
  Components: tests
Reporter: Rakesh R
Assignee: Rakesh R
 Fix For: 3.4.6, 3.5.0

 Attachments: ZOOKEEPER-1852-br-3.4.patch, 
 ZOOKEEPER-1852-br-3.4.patch, ZOOKEEPER-1852.patch, ZOOKEEPER-1852.patch


 ClientBase#createNewServerInstance() - Say the startup of the server fails, 
 this will not initialize 'serverFactory' and will be null. When the flow 
 comes to teardown/shutdown, it will bypass stopping of this server instance 
 due to the following check. This will affect other test case verifications 
 like, jmx check 'JMXEnv#ensureOnly'.  
 ClientBase#shutdownServerInstance
 {code}
 static void shutdownServerInstance(ServerCnxnFactory factory,
 String hostPort)
 {
 if (factory != null) {
//...shutdown logic
 }
 {code}



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)


[jira] [Commented] (ZOOKEEPER-1852) ServerCnxnFactory instance is not properly cleanedup

2013-12-30 Thread Rakesh R (JIRA)

[ 
https://issues.apache.org/jira/browse/ZOOKEEPER-1852?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13858742#comment-13858742
 ] 

Rakesh R commented on ZOOKEEPER-1852:
-

Hi German,
{quote}So what happens now when createNewServerInstance fails and null is 
returned?
Apparently the startServerInstance is called and it fails with 
NullPointerException, is that intended?{quote}
In my patch createNewServerInstance method call,  will just instantiate 
'factory' and return it. I think there wont be any case of returning null 
silently from the following code, if there is an exception it will throw back 
to the caller, isn't it?
{code}
factory = ServerCnxnFactory.createFactory(PORT, maxCnxns);
{code}

 ServerCnxnFactory instance is not properly cleanedup
 

 Key: ZOOKEEPER-1852
 URL: https://issues.apache.org/jira/browse/ZOOKEEPER-1852
 Project: ZooKeeper
  Issue Type: Sub-task
  Components: tests
Reporter: Rakesh R
Assignee: Rakesh R
 Fix For: 3.4.6, 3.5.0

 Attachments: ZOOKEEPER-1852-br-3.4.patch, 
 ZOOKEEPER-1852-br-3.4.patch, ZOOKEEPER-1852.patch, ZOOKEEPER-1852.patch


 ClientBase#createNewServerInstance() - Say the startup of the server fails, 
 this will not initialize 'serverFactory' and will be null. When the flow 
 comes to teardown/shutdown, it will bypass stopping of this server instance 
 due to the following check. This will affect other test case verifications 
 like, jmx check 'JMXEnv#ensureOnly'.  
 ClientBase#shutdownServerInstance
 {code}
 static void shutdownServerInstance(ServerCnxnFactory factory,
 String hostPort)
 {
 if (factory != null) {
//...shutdown logic
 }
 {code}



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)


[jira] [Commented] (ZOOKEEPER-1852) ServerCnxnFactory instance is not properly cleanedup

2013-12-30 Thread Rakesh R (JIRA)

[ 
https://issues.apache.org/jira/browse/ZOOKEEPER-1852?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13858750#comment-13858750
 ] 

Rakesh R commented on ZOOKEEPER-1852:
-

For more understanding about the scenario, I'm adding few more information. 
Please see it.

The case what I have observed in the build is, say one of the test case failed 
at waiting for server up as shown below. Now in this case 
'ClientBase.serverFactory' reference will not be initialized and teardown will 
not be able to shutdown this server. This will leave server running and 
affecting other test cases. Example log pattern has been taken from 
https://builds.apache.org/job/ZooKeeper-3.4-WinVS2008_java/390/

{code}
[junit] 2013-12-25 09:08:00,837 [myid:] - INFO  
[main:JUnit4ZKTestRunner$LoggedInvokeMethod@62] - TEST METHOD FAILED 
testWatchAutoResetWithPending
[junit] java.lang.AssertionError: waiting for server up
[junit] at org.junit.Assert.fail(Assert.java:91)
[junit] at org.junit.Assert.assertTrue(Assert.java:43)
[junit] at 
org.apache.zookeeper.test.ClientBase.createNewServerInstance(ClientBase.java:337)
[junit] at 
org.apache.zookeeper.test.ClientBase.startServer(ClientBase.java:414)
[junit] at 
org.apache.zookeeper.test.WatcherTest.testWatchAutoResetWithPending(WatcherTest.java:208)
[junit] at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
[junit] at 
sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
{code}

Following log showing the test case is failing by seeing unexpected beans which 
was left open in previous test case. 
{code}
[junit] 2013-12-25 09:08:06,018 [myid:] - INFO  [main:JMXEnv@142] - 
unexpected:org.apache.ZooKeeperService:name0=StandaloneServer_port-1,name1=Connections,name2=127.0.0.1,name3=0x1432900e409
[junit] 2013-12-25 09:08:06,018 [myid:] - INFO  [main:JMXEnv@142] - 
unexpected:org.apache.ZooKeeperService:name0=StandaloneServer_port-1,name1=InMemoryDataTree
[junit] 2013-12-25 09:08:06,018 [myid:] - INFO  [main:JMXEnv@142] - 
unexpected:org.apache.ZooKeeperService:name0=StandaloneServer_port-1,name1=Connections,name2=127.0.0.1,name3=0x1432900e4090002
[junit] 2013-12-25 09:08:06,018 [myid:] - INFO  [main:JMXEnv@142] - 
unexpected:org.apache.ZooKeeperService:name0=StandaloneServer_port-1
[junit] 2013-12-25 09:08:06,018 [myid:] - INFO  [main:ZKTestCase$1@65] - 
FAILED testWatchAutoResetWithPending
[junit] org.junit.internal.runners.model.MultipleFailureException
[junit] at 
org.junit.internal.runners.model.MultipleFailureException.assertEmpty(MultipleFailureException.java:23)
[junit] at 
org.junit.internal.runners.statements.RunAfters.evaluate(RunAfters.java:42)
[junit] at org.junit.rules.TestWatchman$1.evaluate(TestWatchman.java:48)
[junit] at 
org.junit.runners.BlockJUnit4ClassRunner.runChild(BlockJUnit4ClassRunner.java:76)
{code}


 ServerCnxnFactory instance is not properly cleanedup
 

 Key: ZOOKEEPER-1852
 URL: https://issues.apache.org/jira/browse/ZOOKEEPER-1852
 Project: ZooKeeper
  Issue Type: Sub-task
  Components: tests
Reporter: Rakesh R
Assignee: Rakesh R
 Fix For: 3.4.6, 3.5.0

 Attachments: ZOOKEEPER-1852-br-3.4.patch, 
 ZOOKEEPER-1852-br-3.4.patch, ZOOKEEPER-1852.patch, ZOOKEEPER-1852.patch


 ClientBase#createNewServerInstance() - Say the startup of the server fails, 
 this will not initialize 'serverFactory' and will be null. When the flow 
 comes to teardown/shutdown, it will bypass stopping of this server instance 
 due to the following check. This will affect other test case verifications 
 like, jmx check 'JMXEnv#ensureOnly'.  
 ClientBase#shutdownServerInstance
 {code}
 static void shutdownServerInstance(ServerCnxnFactory factory,
 String hostPort)
 {
 if (factory != null) {
//...shutdown logic
 }
 {code}



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)


[jira] [Commented] (ZOOKEEPER-1852) ServerCnxnFactory instance is not properly cleanedup

2013-12-30 Thread JIRA

[ 
https://issues.apache.org/jira/browse/ZOOKEEPER-1852?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13858861#comment-13858861
 ] 

Germán Blanco commented on ZOOKEEPER-1852:
--

So this is covering a problem when TruncateTest fails and that leaves a running 
server that affects additional test cases later on, right?
I see now the goal, thank you for explaining.
+1
Would you mind taking a look at the patch (wrongly) in ZOOKEEPER-1833?
I didn't put it in a subtask by mistake, and I don't know if it makes sense to 
do it now.
Anyway, I believe it solves a problem of a run-away server like this one, but 
one that runs away in all cases (failure or success).

 ServerCnxnFactory instance is not properly cleanedup
 

 Key: ZOOKEEPER-1852
 URL: https://issues.apache.org/jira/browse/ZOOKEEPER-1852
 Project: ZooKeeper
  Issue Type: Sub-task
  Components: tests
Reporter: Rakesh R
Assignee: Rakesh R
 Fix For: 3.4.6, 3.5.0

 Attachments: ZOOKEEPER-1852-br-3.4.patch, 
 ZOOKEEPER-1852-br-3.4.patch, ZOOKEEPER-1852.patch, ZOOKEEPER-1852.patch


 ClientBase#createNewServerInstance() - Say the startup of the server fails, 
 this will not initialize 'serverFactory' and will be null. When the flow 
 comes to teardown/shutdown, it will bypass stopping of this server instance 
 due to the following check. This will affect other test case verifications 
 like, jmx check 'JMXEnv#ensureOnly'.  
 ClientBase#shutdownServerInstance
 {code}
 static void shutdownServerInstance(ServerCnxnFactory factory,
 String hostPort)
 {
 if (factory != null) {
//...shutdown logic
 }
 {code}



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)


[jira] [Commented] (ZOOKEEPER-102) Need to replace Jute with supported code

2013-12-30 Thread JIRA

[ 
https://issues.apache.org/jira/browse/ZOOKEEPER-102?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13858886#comment-13858886
 ] 

Germán Blanco commented on ZOOKEEPER-102:
-

Ooops, should have read the entire thread ... so maybe there could be ways to 
enable the upgrade after all.

 Need to replace Jute with supported code
 

 Key: ZOOKEEPER-102
 URL: https://issues.apache.org/jira/browse/ZOOKEEPER-102
 Project: ZooKeeper
  Issue Type: Improvement
Reporter: Benjamin Reed

 ZooKeeper currently uses Jute to serialize objects to put on the wire and on 
 disk. We pulled Jute out of Hadoop and added a C binding. Both versions of 
 Jute have evolved (although Hadoop still doesn't have a C binding). It would 
 be nice to use a more standard serialization library. Some options include 
 Thrift or Google's protocol buffers.
 Our main requirements would be Java and C bindings and good performance. (For 
 example, serializing to XML would give us incredibly bad performance and 
 would not be acceptible!)



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)


[jira] [Commented] (ZOOKEEPER-1852) ServerCnxnFactory instance is not properly cleanedup

2013-12-30 Thread Rakesh R (JIRA)

[ 
https://issues.apache.org/jira/browse/ZOOKEEPER-1852?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13858961#comment-13858961
 ] 

Rakesh R commented on ZOOKEEPER-1852:
-

Yeah, +1 creating a subtask for PrepRequestProcessorTest cleanup and good to 
correct it, as run-away server may affect other testcases.

 ServerCnxnFactory instance is not properly cleanedup
 

 Key: ZOOKEEPER-1852
 URL: https://issues.apache.org/jira/browse/ZOOKEEPER-1852
 Project: ZooKeeper
  Issue Type: Sub-task
  Components: tests
Reporter: Rakesh R
Assignee: Rakesh R
 Fix For: 3.4.6, 3.5.0

 Attachments: ZOOKEEPER-1852-br-3.4.patch, 
 ZOOKEEPER-1852-br-3.4.patch, ZOOKEEPER-1852.patch, ZOOKEEPER-1852.patch


 ClientBase#createNewServerInstance() - Say the startup of the server fails, 
 this will not initialize 'serverFactory' and will be null. When the flow 
 comes to teardown/shutdown, it will bypass stopping of this server instance 
 due to the following check. This will affect other test case verifications 
 like, jmx check 'JMXEnv#ensureOnly'.  
 ClientBase#shutdownServerInstance
 {code}
 static void shutdownServerInstance(ServerCnxnFactory factory,
 String hostPort)
 {
 if (factory != null) {
//...shutdown logic
 }
 {code}



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)


ZooKeeper_branch33_solaris - Build # 753 - Still Failing

2013-12-30 Thread Apache Jenkins Server
See https://builds.apache.org/job/ZooKeeper_branch33_solaris/753/

###
## LAST 60 LINES OF THE CONSOLE 
###
[...truncated 98966 lines...]
[junit] 2013-12-31 07:04:15,791 - INFO  [main:ZooKeeperServer@154] - 
Created server with tickTime 3000 minSessionTimeout 6000 maxSessionTimeout 
6 datadir 
/zonestorage/hudson_solaris/home/hudson/hudson-slave/workspace/ZooKeeper_branch33_solaris/trunk/build/test/tmp/test7698282616389788280.junit.dir/version-2
 snapdir 
/zonestorage/hudson_solaris/home/hudson/hudson-slave/workspace/ZooKeeper_branch33_solaris/trunk/build/test/tmp/test7698282616389788280.junit.dir/version-2
[junit] 2013-12-31 07:04:15,792 - INFO  [main:NIOServerCnxn$Factory@143] - 
binding to port 0.0.0.0/0.0.0.0:11221
[junit] 2013-12-31 07:04:15,794 - INFO  [main:FileSnap@82] - Reading 
snapshot 
/zonestorage/hudson_solaris/home/hudson/hudson-slave/workspace/ZooKeeper_branch33_solaris/trunk/build/test/tmp/test7698282616389788280.junit.dir/version-2/snapshot.0
[junit] 2013-12-31 07:04:15,797 - INFO  [main:FileTxnSnapLog@256] - 
Snapshotting: b
[junit] 2013-12-31 07:04:15,800 - INFO  [main:FourLetterWordMain@43] - 
connecting to 127.0.0.1 11221
[junit] 2013-12-31 07:04:15,801 - INFO  
[NIOServerCxn.Factory:0.0.0.0/0.0.0.0:11221:NIOServerCnxn$Factory@251] - 
Accepted socket connection from /127.0.0.1:41191
[junit] 2013-12-31 07:04:15,802 - INFO  
[NIOServerCxn.Factory:0.0.0.0/0.0.0.0:11221:NIOServerCnxn@1237] - Processing 
stat command from /127.0.0.1:41191
[junit] 2013-12-31 07:04:15,803 - INFO  
[Thread-4:NIOServerCnxn$StatCommand@1153] - Stat command output
[junit] 2013-12-31 07:04:15,803 - INFO  [Thread-4:NIOServerCnxn@1435] - 
Closed socket connection for client /127.0.0.1:41191 (no session established 
for client)
[junit] ensureOnly:[InMemoryDataTree, StandaloneServer_port]
[junit] expect:InMemoryDataTree
[junit] found:InMemoryDataTree 
org.apache.ZooKeeperService:name0=StandaloneServer_port-1,name1=InMemoryDataTree
[junit] expect:StandaloneServer_port
[junit] found:StandaloneServer_port 
org.apache.ZooKeeperService:name0=StandaloneServer_port-1
[junit] 2013-12-31 07:04:15,805 - INFO  [main:ClientBase@408] - STOPPING 
server
[junit] 2013-12-31 07:04:15,807 - INFO  
[ProcessThread:-1:PrepRequestProcessor@128] - PrepRequestProcessor exited loop!
[junit] 2013-12-31 07:04:15,807 - INFO  
[SyncThread:0:SyncRequestProcessor@151] - SyncRequestProcessor exited!
[junit] 2013-12-31 07:04:15,807 - INFO  [main:FinalRequestProcessor@370] - 
shutdown of request processor complete
[junit] 2013-12-31 07:04:15,809 - INFO  [main:FourLetterWordMain@43] - 
connecting to 127.0.0.1 11221
[junit] ensureOnly:[]
[junit] 2013-12-31 07:04:15,810 - INFO  [main:ClientBase@401] - STARTING 
server
[junit] 2013-12-31 07:04:15,811 - INFO  [main:ZooKeeperServer@154] - 
Created server with tickTime 3000 minSessionTimeout 6000 maxSessionTimeout 
6 datadir 
/zonestorage/hudson_solaris/home/hudson/hudson-slave/workspace/ZooKeeper_branch33_solaris/trunk/build/test/tmp/test7698282616389788280.junit.dir/version-2
 snapdir 
/zonestorage/hudson_solaris/home/hudson/hudson-slave/workspace/ZooKeeper_branch33_solaris/trunk/build/test/tmp/test7698282616389788280.junit.dir/version-2
[junit] 2013-12-31 07:04:15,812 - INFO  [main:NIOServerCnxn$Factory@143] - 
binding to port 0.0.0.0/0.0.0.0:11221
[junit] 2013-12-31 07:04:15,813 - INFO  [main:FileSnap@82] - Reading 
snapshot 
/zonestorage/hudson_solaris/home/hudson/hudson-slave/workspace/ZooKeeper_branch33_solaris/trunk/build/test/tmp/test7698282616389788280.junit.dir/version-2/snapshot.b
[junit] 2013-12-31 07:04:15,815 - INFO  [main:FileTxnSnapLog@256] - 
Snapshotting: b
[junit] 2013-12-31 07:04:15,817 - INFO  [main:FourLetterWordMain@43] - 
connecting to 127.0.0.1 11221
[junit] 2013-12-31 07:04:15,818 - INFO  
[NIOServerCxn.Factory:0.0.0.0/0.0.0.0:11221:NIOServerCnxn$Factory@251] - 
Accepted socket connection from /127.0.0.1:41193
[junit] 2013-12-31 07:04:15,819 - INFO  
[NIOServerCxn.Factory:0.0.0.0/0.0.0.0:11221:NIOServerCnxn@1237] - Processing 
stat command from /127.0.0.1:41193
[junit] 2013-12-31 07:04:15,820 - INFO  
[Thread-5:NIOServerCnxn$StatCommand@1153] - Stat command output
[junit] 2013-12-31 07:04:15,820 - INFO  [Thread-5:NIOServerCnxn@1435] - 
Closed socket connection for client /127.0.0.1:41193 (no session established 
for client)
[junit] ensureOnly:[InMemoryDataTree, StandaloneServer_port]
[junit] expect:InMemoryDataTree
[junit] found:InMemoryDataTree 
org.apache.ZooKeeperService:name0=StandaloneServer_port-1,name1=InMemoryDataTree
[junit] expect:StandaloneServer_port
[junit] found:StandaloneServer_port 
org.apache.ZooKeeperService:name0=StandaloneServer_port-1
[junit] 2013-12-31 07:04:15,823 - INFO  

Build failed in Jenkins: bookkeeper-trunk #489

2013-12-30 Thread Apache Jenkins Server
See https://builds.apache.org/job/bookkeeper-trunk/489/

--
[...truncated 436 lines...]
[INFO] Changes detected - recompiling the module!
[INFO] Compiling 6 source files to 
https://builds.apache.org/job/bookkeeper-trunk/ws/hedwig-protocol/target/classes
[INFO] 
[INFO] --- maven-resources-plugin:2.4.3:testResources (default-testResources) @ 
hedwig-protocol ---
[INFO] Using 'UTF-8' encoding to copy filtered resources.
[INFO] skip non existing resourceDirectory 
https://builds.apache.org/job/bookkeeper-trunk/ws/hedwig-protocol/src/test/resources
[INFO] Copying 3 resources
[INFO] 
[INFO] --- maven-compiler-plugin:3.0:testCompile (default-testCompile) @ 
hedwig-protocol ---
[INFO] No sources to compile
[INFO] 
[INFO] --- maven-surefire-plugin:2.9:test (default-test) @ hedwig-protocol ---
[INFO] Surefire report directory: 
https://builds.apache.org/job/bookkeeper-trunk/ws/hedwig-protocol/target/surefire-reports

---
 T E S T S
---

---
 T E S T S
---

Results :

Tests run: 0, Failures: 0, Errors: 0, Skipped: 0

[INFO] 
[INFO] --- maven-jar-plugin:2.3.1:jar (default-jar) @ hedwig-protocol ---
[INFO] Building jar: 
https://builds.apache.org/job/bookkeeper-trunk/ws/hedwig-protocol/target/hedwig-protocol-4.3.0-SNAPSHOT.jar
[INFO] 
[INFO]  findbugs-maven-plugin:2.5.2:check (default-cli) @ hedwig-protocol 
[INFO] 
[INFO] --- findbugs-maven-plugin:2.5.2:findbugs (findbugs) @ hedwig-protocol ---
[INFO] Fork Value is true
[INFO] Done FindBugs Analysis
[INFO] 
[INFO]  findbugs-maven-plugin:2.5.2:check (default-cli) @ hedwig-protocol 
[INFO] 
[INFO] --- findbugs-maven-plugin:2.5.2:check (default-cli) @ hedwig-protocol ---
[INFO] BugInstance size is 0
[INFO] Error size is 0
[INFO] No errors/warnings found
[INFO] 
[INFO] 
[INFO] Building bookkeeper-server 4.3.0-SNAPSHOT
[INFO] 
[INFO] 
[INFO] --- maven-clean-plugin:2.5:clean (default-clean) @ bookkeeper-server ---
[INFO] 
[INFO] --- apache-rat-plugin:0.7:check (default-cli) @ bookkeeper-server ---
[INFO] Exclude: **/DataFormats.java
[INFO] 
[INFO] --- maven-remote-resources-plugin:1.1:process (default) @ 
bookkeeper-server ---
[INFO] 
[INFO] --- maven-resources-plugin:2.4.3:resources (default-resources) @ 
bookkeeper-server ---
[INFO] Using 'UTF-8' encoding to copy filtered resources.
[INFO] Copying 3 resources
[INFO] Copying 3 resources
[INFO] 
[INFO] --- maven-compiler-plugin:3.0:compile (default-compile) @ 
bookkeeper-server ---
[INFO] Changes detected - recompiling the module!
[INFO] Compiling 159 source files to 
https://builds.apache.org/job/bookkeeper-trunk/ws/bookkeeper-server/target/classes
[INFO] 
[INFO] --- maven-resources-plugin:2.4.3:testResources (default-testResources) @ 
bookkeeper-server ---
[INFO] Using 'UTF-8' encoding to copy filtered resources.
[INFO] Copying 1 resource
[INFO] Copying 3 resources
[INFO] 
[INFO] --- maven-compiler-plugin:3.0:testCompile (default-testCompile) @ 
bookkeeper-server ---
[INFO] Changes detected - recompiling the module!
[INFO] Compiling 79 source files to 
https://builds.apache.org/job/bookkeeper-trunk/ws/bookkeeper-server/target/test-classes
[INFO] 
[INFO] --- maven-surefire-plugin:2.9:test (default-test) @ bookkeeper-server ---
[INFO] Surefire report directory: 
https://builds.apache.org/job/bookkeeper-trunk/ws/bookkeeper-server/target/surefire-reports

---
 T E S T S
---

---
 T E S T S
---
Running org.apache.bookkeeper.replication.AutoRecoveryMainTest
Tests run: 3, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 4.3 sec
Running org.apache.bookkeeper.replication.BookieAutoRecoveryTest
Tests run: 12, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 28.838 sec
Running org.apache.bookkeeper.replication.BookieLedgerIndexTest
Tests run: 9, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 8.092 sec
Running org.apache.bookkeeper.replication.AuditorPeriodicCheckTest
Tests run: 4, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 17.831 sec
Running org.apache.bookkeeper.replication.AuditorLedgerCheckerTest
Tests run: 18, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 24.397 sec
Running org.apache.bookkeeper.replication.TestLedgerUnderreplicationManager
Tests run: 13, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 18.102 sec
Running org.apache.bookkeeper.replication.AuditorBookieTest
Tests run: 5, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 7.495 sec

Fwd: Build failed in Jenkins: bookkeeper-trunk #489

2013-12-30 Thread Flavio Junqueira
I was wondering if there is a jira open for the test that failed below, does 
anyone know?

-Flavio

Begin forwarded message:

 Tests in error: 
  
 testWithMultipleBookieFailuresInLastEnsemble[2](org.apache.bookkeeper.client.BookieWriteLedgerTest)



RE: Build failed in Jenkins: bookkeeper-trunk #489

2013-12-30 Thread Rakesh R
Hi Flavio,

As test case name says, it is testing multiple bookie failures. 

On bookiefailure, when doing the ensemble reformation, unfortunately it is 
failing to connect to the Bookie-15039. But it should suppose to get connected 
and continue write operation. This is the reason for the test case failure. 
Please see the following log pattern:

2013-12-30 12:29:36,731 - INFO  - 
[BookKeeperClientWorker-0-0:PerChannelBookieClient@167] - Connecting to bookie: 
/67.195.138.30:15039
2013-12-30 12:29:36,732 - ERROR - [New I/O client boss 
#5-1:PerChannelBookieClient$1@203] - Could not connect to bookie: [id: 
0x019a639b, /229.27.250.246:46509 : /67.195.138.30:15039], current state 
CONNECTING
2013-12-30 12:29:36,732 - WARN  - [BookKeeperClientWorker-0-0:PendingAddOp@158] 
- Write did not succeed: L0 E100 on /67.195.138.30:15039
2013-12-30 12:29:36,733 - INFO  - [BookKeeperClientWorker-0-0:LedgerHandle@659] 
- Handling failure of bookie: /67.195.138.30:15039 index: 2
2013-12-30 12:29:36,733 - WARN  - 
[BookKeeperClientWorker-0-0:RackawareEnsemblePlacementPolicy@491] - Failed to 
choose a bookie from /default-rack : excluded [Bookie:67.195.138.30:15036, 
Bookie:67.195.138.30:15038, Bookie:67.195.138.30:15039, 
Bookie:67.195.138.30:15040, Bookie:67.195.138.30:15035], fallback to choose 
bookie randomly from the cluster.


I'm thinking, there could be chance of small network fluctuations or slow 
machine and resulting in connection failure. 
To handle this IMHO, we should have netty client connection timeout in place 
and should retry for few intervals. Let me do a try with 
bootstrap.setOption(connectTimeoutMillis, timeoutvalue);
Shall I raise a JIRA to discuss about these concerns and will reach to a 
conclusion. Whats your opinion?

-Rakesh

-Original Message-
From: Flavio Junqueira [mailto:fpjunque...@yahoo.com] 
Sent: 31 December 2013 01:51
To: bookkeeper-dev@zookeeper.apache.org
Subject: Fwd: Build failed in Jenkins: bookkeeper-trunk #489

I was wondering if there is a jira open for the test that failed below, does 
anyone know?

-Flavio

Begin forwarded message:

 Tests in error: 
  
 testWithMultipleBookieFailuresInLastEnsemble[2](org.apache.bookkeeper.client.BookieWriteLedgerTest)



Re: Build failed in Jenkins: bookkeeper-trunk #489

2013-12-30 Thread Sijie Guo
I don't think its connect timeout setting issue. as by default, netty
channel connect timeout is 10 sec (
https://github.com/netty/netty/blob/3.2/src/main/java/org/jboss/netty/channel/DefaultChannelConfig.java#L38).
If you checked the log, the log statements show that the connect
operation is in same second.

2013-12-30 12:29:36,731 - INFO  -
[BookKeeperClientWorker-0-0:PerChannelBookieClient@167] - Connecting
to bookie: /67.195.138.30:15039
2013-12-30 12:29:36,732 - ERROR - [New I/O client boss
#5-1:PerChannelBookieClient$1@203] - Could not connect to bookie: [id:
0x019a639b, /229.27.250.246:46509 : /67.195.138.30:15039], current
state CONNECTING




On Mon, Dec 30, 2013 at 9:31 PM, Rakesh R rake...@huawei.com wrote:

 Hi Flavio,

 As test case name says, it is testing multiple bookie failures.

 On bookiefailure, when doing the ensemble reformation, unfortunately it is
 failing to connect to the Bookie-15039. But it should suppose to get
 connected and continue write operation. This is the reason for the test
 case failure. Please see the following log pattern:

 2013-12-30 12:29:36,731 - INFO  -
 [BookKeeperClientWorker-0-0:PerChannelBookieClient@167] - Connecting to
 bookie: /67.195.138.30:15039
 2013-12-30 12:29:36,732 - ERROR - [New I/O client boss
 #5-1:PerChannelBookieClient$1@203] - Could not connect to bookie: [id:
 0x019a639b, /229.27.250.246:46509 : /67.195.138.30:15039], current state
 CONNECTING
 2013-12-30 12:29:36,732 - WARN  -
 [BookKeeperClientWorker-0-0:PendingAddOp@158] - Write did not succeed: L0
 E100 on /67.195.138.30:15039
 2013-12-30 12:29:36,733 - INFO  -
 [BookKeeperClientWorker-0-0:LedgerHandle@659] - Handling failure of
 bookie: /67.195.138.30:15039 index: 2
 2013-12-30 12:29:36,733 - WARN  -
 [BookKeeperClientWorker-0-0:RackawareEnsemblePlacementPolicy@491] -
 Failed to choose a bookie from /default-rack : excluded [Bookie:
 67.195.138.30:15036, Bookie:67.195.138.30:15038, Bookie:
 67.195.138.30:15039, Bookie:67.195.138.30:15040, Bookie:
 67.195.138.30:15035], fallback to choose bookie randomly from the
 cluster.


 I'm thinking, there could be chance of small network fluctuations or slow
 machine and resulting in connection failure.
 To handle this IMHO, we should have netty client connection timeout in
 place and should retry for few intervals. Let me do a try with
 bootstrap.setOption(connectTimeoutMillis, timeoutvalue);
 Shall I raise a JIRA to discuss about these concerns and will reach to a
 conclusion. Whats your opinion?

 -Rakesh

 -Original Message-
 From: Flavio Junqueira [mailto:fpjunque...@yahoo.com]
 Sent: 31 December 2013 01:51
 To: bookkeeper-dev@zookeeper.apache.org
 Subject: Fwd: Build failed in Jenkins: bookkeeper-trunk #489

 I was wondering if there is a jira open for the test that failed below,
 does anyone know?

 -Flavio

 Begin forwarded message:

  Tests in error:
 
  
 testWithMultipleBookieFailuresInLastEnsemble[2](org.apache.bookkeeper.client.BookieWriteLedgerTest)




RE: Build failed in Jenkins: bookkeeper-trunk #489

2013-12-30 Thread Rakesh R
Hi Sijie,

But I didn't understand why the connection failure is immediately sending 
without waiting for the timeout. In general, client should wait for the 
connection timeout(10secs) and internally retries before throwing failure 
message. Am I correct?

Do we need to have an explicit retry mechanism in netty?

-Rakesh
-Original Message-
From: Sijie Guo [mailto:guosi...@gmail.com] 
Sent: 31 December 2013 11:59
To: bookkeeper-dev@zookeeper.apache.org
Subject: Re: Build failed in Jenkins: bookkeeper-trunk #489

I don't think its connect timeout setting issue. as by default, netty channel 
connect timeout is 10 sec ( 
https://github.com/netty/netty/blob/3.2/src/main/java/org/jboss/netty/channel/DefaultChannelConfig.java#L38).
If you checked the log, the log statements show that the connect operation is 
in same second.

2013-12-30 12:29:36,731 - INFO  -
[BookKeeperClientWorker-0-0:PerChannelBookieClient@167] - Connecting to bookie: 
/67.195.138.30:15039
2013-12-30 12:29:36,732 - ERROR - [New I/O client boss 
#5-1:PerChannelBookieClient$1@203] - Could not connect to bookie: [id:
0x019a639b, /229.27.250.246:46509 : /67.195.138.30:15039], current state 
CONNECTING




On Mon, Dec 30, 2013 at 9:31 PM, Rakesh R rake...@huawei.com wrote:

 Hi Flavio,

 As test case name says, it is testing multiple bookie failures.

 On bookiefailure, when doing the ensemble reformation, unfortunately 
 it is failing to connect to the Bookie-15039. But it should suppose to 
 get connected and continue write operation. This is the reason for the 
 test case failure. Please see the following log pattern:

 2013-12-30 12:29:36,731 - INFO  -
 [BookKeeperClientWorker-0-0:PerChannelBookieClient@167] - Connecting 
 to
 bookie: /67.195.138.30:15039
 2013-12-30 12:29:36,732 - ERROR - [New I/O client boss 
 #5-1:PerChannelBookieClient$1@203] - Could not connect to bookie: [id:
 0x019a639b, /229.27.250.246:46509 : /67.195.138.30:15039], current 
 state CONNECTING
 2013-12-30 12:29:36,732 - WARN  -
 [BookKeeperClientWorker-0-0:PendingAddOp@158] - Write did not succeed: 
 L0
 E100 on /67.195.138.30:15039
 2013-12-30 12:29:36,733 - INFO  -
 [BookKeeperClientWorker-0-0:LedgerHandle@659] - Handling failure of
 bookie: /67.195.138.30:15039 index: 2
 2013-12-30 12:29:36,733 - WARN  -
 [BookKeeperClientWorker-0-0:RackawareEnsemblePlacementPolicy@491] - 
 Failed to choose a bookie from /default-rack : excluded [Bookie:
 67.195.138.30:15036, Bookie:67.195.138.30:15038, Bookie:
 67.195.138.30:15039, Bookie:67.195.138.30:15040, Bookie:
 67.195.138.30:15035], fallback to choose bookie randomly from the 
 cluster.


 I'm thinking, there could be chance of small network fluctuations or 
 slow machine and resulting in connection failure.
 To handle this IMHO, we should have netty client connection timeout in 
 place and should retry for few intervals. Let me do a try with 
 bootstrap.setOption(connectTimeoutMillis, timeoutvalue); Shall I 
 raise a JIRA to discuss about these concerns and will reach to a 
 conclusion. Whats your opinion?

 -Rakesh

 -Original Message-
 From: Flavio Junqueira [mailto:fpjunque...@yahoo.com]
 Sent: 31 December 2013 01:51
 To: bookkeeper-dev@zookeeper.apache.org
 Subject: Fwd: Build failed in Jenkins: bookkeeper-trunk #489

 I was wondering if there is a jira open for the test that failed 
 below, does anyone know?

 -Flavio

 Begin forwarded message:

  Tests in error:
 
  
 testWithMultipleBookieFailuresInLastEnsemble[2](org.apache.bookkeeper.
 client.BookieWriteLedgerTest)




Re: Build failed in Jenkins: bookkeeper-trunk #489

2013-12-30 Thread Sijie Guo
On Mon, Dec 30, 2013 at 10:45 PM, Rakesh R rake...@huawei.com wrote:

 Hi Sijie,

 But I didn't understand why the connection failure is immediately sending
 without waiting for the timeout.

In general, client should wait for the connection timeout(10secs) and
 internally retries before throwing failure message. Am I correct?


No idea. from the log, there is less information to tell what was going on
at that time. I think the better solution is to add  logs about the failure
so we could catch the details of what's wrong inside.



 Do we need to have an explicit retry mechanism in netty?


I don't think we need retry connect in netty. As 1) we already have retry
mechanism in bookie client ; 2) if connect failed on any bookie, we should
let netty notify bookkeeper immediately. as connect failure means bookie
down in most of case, we should change bookie immediately to avoid high
latency.

- Sijie



 -Rakesh
 -Original Message-
 From: Sijie Guo [mailto:guosi...@gmail.com]
 Sent: 31 December 2013 11:59
 To: bookkeeper-dev@zookeeper.apache.org
 Subject: Re: Build failed in Jenkins: bookkeeper-trunk #489

 I don't think its connect timeout setting issue. as by default, netty
 channel connect timeout is 10 sec (
 https://github.com/netty/netty/blob/3.2/src/main/java/org/jboss/netty/channel/DefaultChannelConfig.java#L38
 ).
 If you checked the log, the log statements show that the connect operation
 is in same second.

 2013-12-30 12:29:36,731 - INFO  -
 [BookKeeperClientWorker-0-0:PerChannelBookieClient@167] - Connecting to
 bookie: /67.195.138.30:15039
 2013-12-30 12:29:36,732 - ERROR - [New I/O client boss
 #5-1:PerChannelBookieClient$1@203] - Could not connect to bookie: [id:
 0x019a639b, /229.27.250.246:46509 : /67.195.138.30:15039], current state
 CONNECTING




 On Mon, Dec 30, 2013 at 9:31 PM, Rakesh R rake...@huawei.com wrote:

  Hi Flavio,
 
  As test case name says, it is testing multiple bookie failures.
 
  On bookiefailure, when doing the ensemble reformation, unfortunately
  it is failing to connect to the Bookie-15039. But it should suppose to
  get connected and continue write operation. This is the reason for the
  test case failure. Please see the following log pattern:
 
  2013-12-30 12:29:36,731 - INFO  -
  [BookKeeperClientWorker-0-0:PerChannelBookieClient@167] - Connecting
  to
  bookie: /67.195.138.30:15039
  2013-12-30 12:29:36,732 - ERROR - [New I/O client boss
  #5-1:PerChannelBookieClient$1@203] - Could not connect to bookie: [id:
  0x019a639b, /229.27.250.246:46509 : /67.195.138.30:15039], current
  state CONNECTING
  2013-12-30 12:29:36,732 - WARN  -
  [BookKeeperClientWorker-0-0:PendingAddOp@158] - Write did not succeed:
  L0
  E100 on /67.195.138.30:15039
  2013-12-30 12:29:36,733 - INFO  -
  [BookKeeperClientWorker-0-0:LedgerHandle@659] - Handling failure of
  bookie: /67.195.138.30:15039 index: 2
  2013-12-30 12:29:36,733 - WARN  -
  [BookKeeperClientWorker-0-0:RackawareEnsemblePlacementPolicy@491] -
  Failed to choose a bookie from /default-rack : excluded [Bookie:
  67.195.138.30:15036, Bookie:67.195.138.30:15038, Bookie:
  67.195.138.30:15039, Bookie:67.195.138.30:15040, Bookie:
  67.195.138.30:15035], fallback to choose bookie randomly from the
  cluster.
 
 
  I'm thinking, there could be chance of small network fluctuations or
  slow machine and resulting in connection failure.
  To handle this IMHO, we should have netty client connection timeout in
  place and should retry for few intervals. Let me do a try with
  bootstrap.setOption(connectTimeoutMillis, timeoutvalue); Shall I
  raise a JIRA to discuss about these concerns and will reach to a
  conclusion. Whats your opinion?
 
  -Rakesh
 
  -Original Message-
  From: Flavio Junqueira [mailto:fpjunque...@yahoo.com]
  Sent: 31 December 2013 01:51
  To: bookkeeper-dev@zookeeper.apache.org
  Subject: Fwd: Build failed in Jenkins: bookkeeper-trunk #489
 
  I was wondering if there is a jira open for the test that failed
  below, does anyone know?
 
  -Flavio
 
  Begin forwarded message:
 
   Tests in error:
  
 
  testWithMultipleBookieFailuresInLastEnsemble[2](org.apache.bookkeeper.
  client.BookieWriteLedgerTest)
 
 



[jira] [Created] (BOOKKEEPER-714) Logging channel exceptions in PerChannelBookieClient

2013-12-30 Thread Sijie Guo (JIRA)
Sijie Guo created BOOKKEEPER-714:


 Summary: Logging channel exceptions in PerChannelBookieClient
 Key: BOOKKEEPER-714
 URL: https://issues.apache.org/jira/browse/BOOKKEEPER-714
 Project: Bookkeeper
  Issue Type: Bug
  Components: bookkeeper-client
Reporter: Sijie Guo
Assignee: Sijie Guo
 Fix For: 4.3.0


Logging channel exceptions in PerChannelBookieClient to identify channel 
connect issue.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)


RE: Build failed in Jenkins: bookkeeper-trunk #489

2013-12-30 Thread Rakesh R
I've tried one simple test case:

Just before connecting 'bootstrap.connect(addr)', I have killed the 
Bookieserver. What I have observed is immediately returning the call with 
failure.

Any thoughts?

-Original Message-
From: Sijie Guo [mailto:guosi...@gmail.com] 
Sent: 31 December 2013 12:33
To: bookkeeper-dev@zookeeper.apache.org
Subject: Re: Build failed in Jenkins: bookkeeper-trunk #489

On Mon, Dec 30, 2013 at 10:45 PM, Rakesh R rake...@huawei.com wrote:

 Hi Sijie,

 But I didn't understand why the connection failure is immediately 
 sending without waiting for the timeout.

In general, client should wait for the connection timeout(10secs) and
 internally retries before throwing failure message. Am I correct?


No idea. from the log, there is less information to tell what was going on at 
that time. I think the better solution is to add  logs about the failure so we 
could catch the details of what's wrong inside.



 Do we need to have an explicit retry mechanism in netty?


I don't think we need retry connect in netty. As 1) we already have retry 
mechanism in bookie client ; 2) if connect failed on any bookie, we should let 
netty notify bookkeeper immediately. as connect failure means bookie down in 
most of case, we should change bookie immediately to avoid high latency.

- Sijie



 -Rakesh
 -Original Message-
 From: Sijie Guo [mailto:guosi...@gmail.com]
 Sent: 31 December 2013 11:59
 To: bookkeeper-dev@zookeeper.apache.org
 Subject: Re: Build failed in Jenkins: bookkeeper-trunk #489

 I don't think its connect timeout setting issue. as by default, netty 
 channel connect timeout is 10 sec (
 https://github.com/netty/netty/blob/3.2/src/main/java/org/jboss/netty/
 channel/DefaultChannelConfig.java#L38
 ).
 If you checked the log, the log statements show that the connect 
 operation is in same second.

 2013-12-30 12:29:36,731 - INFO  -
 [BookKeeperClientWorker-0-0:PerChannelBookieClient@167] - Connecting 
 to
 bookie: /67.195.138.30:15039
 2013-12-30 12:29:36,732 - ERROR - [New I/O client boss 
 #5-1:PerChannelBookieClient$1@203] - Could not connect to bookie: [id:
 0x019a639b, /229.27.250.246:46509 : /67.195.138.30:15039], current 
 state CONNECTING




 On Mon, Dec 30, 2013 at 9:31 PM, Rakesh R rake...@huawei.com wrote:

  Hi Flavio,
 
  As test case name says, it is testing multiple bookie failures.
 
  On bookiefailure, when doing the ensemble reformation, unfortunately 
  it is failing to connect to the Bookie-15039. But it should suppose 
  to get connected and continue write operation. This is the reason 
  for the test case failure. Please see the following log pattern:
 
  2013-12-30 12:29:36,731 - INFO  -
  [BookKeeperClientWorker-0-0:PerChannelBookieClient@167] - Connecting 
  to
  bookie: /67.195.138.30:15039
  2013-12-30 12:29:36,732 - ERROR - [New I/O client boss 
  #5-1:PerChannelBookieClient$1@203] - Could not connect to bookie: [id:
  0x019a639b, /229.27.250.246:46509 : /67.195.138.30:15039], current 
  state CONNECTING
  2013-12-30 12:29:36,732 - WARN  -
  [BookKeeperClientWorker-0-0:PendingAddOp@158] - Write did not succeed:
  L0
  E100 on /67.195.138.30:15039
  2013-12-30 12:29:36,733 - INFO  -
  [BookKeeperClientWorker-0-0:LedgerHandle@659] - Handling failure of
  bookie: /67.195.138.30:15039 index: 2
  2013-12-30 12:29:36,733 - WARN  -
  [BookKeeperClientWorker-0-0:RackawareEnsemblePlacementPolicy@491] - 
  Failed to choose a bookie from /default-rack : excluded [Bookie:
  67.195.138.30:15036, Bookie:67.195.138.30:15038, Bookie:
  67.195.138.30:15039, Bookie:67.195.138.30:15040, Bookie:
  67.195.138.30:15035], fallback to choose bookie randomly from the 
  cluster.
 
 
  I'm thinking, there could be chance of small network fluctuations or 
  slow machine and resulting in connection failure.
  To handle this IMHO, we should have netty client connection timeout 
  in place and should retry for few intervals. Let me do a try with 
  bootstrap.setOption(connectTimeoutMillis, timeoutvalue); Shall I 
  raise a JIRA to discuss about these concerns and will reach to a 
  conclusion. Whats your opinion?
 
  -Rakesh
 
  -Original Message-
  From: Flavio Junqueira [mailto:fpjunque...@yahoo.com]
  Sent: 31 December 2013 01:51
  To: bookkeeper-dev@zookeeper.apache.org
  Subject: Fwd: Build failed in Jenkins: bookkeeper-trunk #489
 
  I was wondering if there is a jira open for the test that failed 
  below, does anyone know?
 
  -Flavio
 
  Begin forwarded message:
 
   Tests in error:
  
 
  testWithMultipleBookieFailuresInLastEnsemble[2](org.apache.bookkeeper.
  client.BookieWriteLedgerTest)
 
 



Re: Build failed in Jenkins: bookkeeper-trunk #489

2013-12-30 Thread Sijie Guo
On Mon, Dec 30, 2013 at 11:16 PM, Rakesh R rake...@huawei.com wrote:

 I've tried one simple test case:

 Just before connecting 'bootstrap.connect(addr)', I have killed the
 Bookieserver. What I have observed is immediately returning the call with
 failure.


Isn't that expected? as there is not server listened on given port. this is
how TCP works, no?



 Any thoughts?

 -Original Message-
 From: Sijie Guo [mailto:guosi...@gmail.com]
 Sent: 31 December 2013 12:33
 To: bookkeeper-dev@zookeeper.apache.org
 Subject: Re: Build failed in Jenkins: bookkeeper-trunk #489

 On Mon, Dec 30, 2013 at 10:45 PM, Rakesh R rake...@huawei.com wrote:

  Hi Sijie,
 
  But I didn't understand why the connection failure is immediately
  sending without waiting for the timeout.

 In general, client should wait for the connection timeout(10secs) and
  internally retries before throwing failure message. Am I correct?
 

 No idea. from the log, there is less information to tell what was going on
 at that time. I think the better solution is to add  logs about the failure
 so we could catch the details of what's wrong inside.


 
  Do we need to have an explicit retry mechanism in netty?
 

 I don't think we need retry connect in netty. As 1) we already have retry
 mechanism in bookie client ; 2) if connect failed on any bookie, we should
 let netty notify bookkeeper immediately. as connect failure means bookie
 down in most of case, we should change bookie immediately to avoid high
 latency.

 - Sijie


 
  -Rakesh
  -Original Message-
  From: Sijie Guo [mailto:guosi...@gmail.com]
  Sent: 31 December 2013 11:59
  To: bookkeeper-dev@zookeeper.apache.org
  Subject: Re: Build failed in Jenkins: bookkeeper-trunk #489
 
  I don't think its connect timeout setting issue. as by default, netty
  channel connect timeout is 10 sec (
  https://github.com/netty/netty/blob/3.2/src/main/java/org/jboss/netty/
  channel/DefaultChannelConfig.java#L38
  ).
  If you checked the log, the log statements show that the connect
  operation is in same second.
 
  2013-12-30 12:29:36,731 - INFO  -
  [BookKeeperClientWorker-0-0:PerChannelBookieClient@167] - Connecting
  to
  bookie: /67.195.138.30:15039
  2013-12-30 12:29:36,732 - ERROR - [New I/O client boss
  #5-1:PerChannelBookieClient$1@203] - Could not connect to bookie: [id:
  0x019a639b, /229.27.250.246:46509 : /67.195.138.30:15039], current
  state CONNECTING
 
 
 
 
  On Mon, Dec 30, 2013 at 9:31 PM, Rakesh R rake...@huawei.com wrote:
 
   Hi Flavio,
  
   As test case name says, it is testing multiple bookie failures.
  
   On bookiefailure, when doing the ensemble reformation, unfortunately
   it is failing to connect to the Bookie-15039. But it should suppose
   to get connected and continue write operation. This is the reason
   for the test case failure. Please see the following log pattern:
  
   2013-12-30 12:29:36,731 - INFO  -
   [BookKeeperClientWorker-0-0:PerChannelBookieClient@167] - Connecting
   to
   bookie: /67.195.138.30:15039
   2013-12-30 12:29:36,732 - ERROR - [New I/O client boss
   #5-1:PerChannelBookieClient$1@203] - Could not connect to bookie: [id:
   0x019a639b, /229.27.250.246:46509 : /67.195.138.30:15039], current
   state CONNECTING
   2013-12-30 12:29:36,732 - WARN  -
   [BookKeeperClientWorker-0-0:PendingAddOp@158] - Write did not succeed:
   L0
   E100 on /67.195.138.30:15039
   2013-12-30 12:29:36,733 - INFO  -
   [BookKeeperClientWorker-0-0:LedgerHandle@659] - Handling failure of
   bookie: /67.195.138.30:15039 index: 2
   2013-12-30 12:29:36,733 - WARN  -
   [BookKeeperClientWorker-0-0:RackawareEnsemblePlacementPolicy@491] -
   Failed to choose a bookie from /default-rack : excluded [Bookie:
   67.195.138.30:15036, Bookie:67.195.138.30:15038, Bookie:
   67.195.138.30:15039, Bookie:67.195.138.30:15040, Bookie:
   67.195.138.30:15035], fallback to choose bookie randomly from the
   cluster.
  
  
   I'm thinking, there could be chance of small network fluctuations or
   slow machine and resulting in connection failure.
   To handle this IMHO, we should have netty client connection timeout
   in place and should retry for few intervals. Let me do a try with
   bootstrap.setOption(connectTimeoutMillis, timeoutvalue); Shall I
   raise a JIRA to discuss about these concerns and will reach to a
   conclusion. Whats your opinion?
  
   -Rakesh
  
   -Original Message-
   From: Flavio Junqueira [mailto:fpjunque...@yahoo.com]
   Sent: 31 December 2013 01:51
   To: bookkeeper-dev@zookeeper.apache.org
   Subject: Fwd: Build failed in Jenkins: bookkeeper-trunk #489
  
   I was wondering if there is a jira open for the test that failed
   below, does anyone know?
  
   -Flavio
  
   Begin forwarded message:
  
Tests in error:
   
  
   testWithMultipleBookieFailuresInLastEnsemble[2](org.apache.bookkeeper.
   client.BookieWriteLedgerTest)
  
  
 



Re: Build failed in Jenkins: bookkeeper-trunk #489

2013-12-30 Thread Sijie Guo
On Mon, Dec 30, 2013 at 11:02 PM, Sijie Guo guosi...@gmail.com wrote:


 On Mon, Dec 30, 2013 at 10:45 PM, Rakesh R rake...@huawei.com wrote:

 Hi Sijie,

 But I didn't understand why the connection failure is immediately sending
 without waiting for the timeout.

 In general, client should wait for the connection timeout(10secs) and
 internally retries before throwing failure message. Am I correct?


 No idea. from the log, there is less information to tell what was going on
 at that time. I think the better solution is to add  logs about the failure
 so we could catch the details of what's wrong inside.


Created https://issues.apache.org/jira/browse/BOOKKEEPER-714 to add
exception in logging, so we could catch more details when this test case
failed.





 Do we need to have an explicit retry mechanism in netty?


 I don't think we need retry connect in netty. As 1) we already have retry
 mechanism in bookie client ; 2) if connect failed on any bookie, we should
 let netty notify bookkeeper immediately. as connect failure means bookie
 down in most of case, we should change bookie immediately to avoid high
 latency.

 - Sijie



 -Rakesh
 -Original Message-
 From: Sijie Guo [mailto:guosi...@gmail.com]
 Sent: 31 December 2013 11:59
 To: bookkeeper-dev@zookeeper.apache.org
 Subject: Re: Build failed in Jenkins: bookkeeper-trunk #489

 I don't think its connect timeout setting issue. as by default, netty
 channel connect timeout is 10 sec (
 https://github.com/netty/netty/blob/3.2/src/main/java/org/jboss/netty/channel/DefaultChannelConfig.java#L38
 ).
 If you checked the log, the log statements show that the connect
 operation is in same second.

 2013-12-30 12:29:36,731 - INFO  -
 [BookKeeperClientWorker-0-0:PerChannelBookieClient@167] - Connecting to
 bookie: /67.195.138.30:15039
 2013-12-30 12:29:36,732 - ERROR - [New I/O client boss
 #5-1:PerChannelBookieClient$1@203] - Could not connect to bookie: [id:
 0x019a639b, /229.27.250.246:46509 : /67.195.138.30:15039], current
 state CONNECTING




 On Mon, Dec 30, 2013 at 9:31 PM, Rakesh R rake...@huawei.com wrote:

  Hi Flavio,
 
  As test case name says, it is testing multiple bookie failures.
 
  On bookiefailure, when doing the ensemble reformation, unfortunately
  it is failing to connect to the Bookie-15039. But it should suppose to
  get connected and continue write operation. This is the reason for the
  test case failure. Please see the following log pattern:
 
  2013-12-30 12:29:36,731 - INFO  -
  [BookKeeperClientWorker-0-0:PerChannelBookieClient@167] - Connecting
  to
  bookie: /67.195.138.30:15039
  2013-12-30 12:29:36,732 - ERROR - [New I/O client boss
  #5-1:PerChannelBookieClient$1@203] - Could not connect to bookie: [id:
  0x019a639b, /229.27.250.246:46509 : /67.195.138.30:15039], current
  state CONNECTING
  2013-12-30 12:29:36,732 - WARN  -
  [BookKeeperClientWorker-0-0:PendingAddOp@158] - Write did not succeed:
  L0
  E100 on /67.195.138.30:15039
  2013-12-30 12:29:36,733 - INFO  -
  [BookKeeperClientWorker-0-0:LedgerHandle@659] - Handling failure of
  bookie: /67.195.138.30:15039 index: 2
  2013-12-30 12:29:36,733 - WARN  -
  [BookKeeperClientWorker-0-0:RackawareEnsemblePlacementPolicy@491] -
  Failed to choose a bookie from /default-rack : excluded [Bookie:
  67.195.138.30:15036, Bookie:67.195.138.30:15038, Bookie:
  67.195.138.30:15039, Bookie:67.195.138.30:15040, Bookie:
  67.195.138.30:15035], fallback to choose bookie randomly from the
  cluster.
 
 
  I'm thinking, there could be chance of small network fluctuations or
  slow machine and resulting in connection failure.
  To handle this IMHO, we should have netty client connection timeout in
  place and should retry for few intervals. Let me do a try with
  bootstrap.setOption(connectTimeoutMillis, timeoutvalue); Shall I
  raise a JIRA to discuss about these concerns and will reach to a
  conclusion. Whats your opinion?
 
  -Rakesh
 
  -Original Message-
  From: Flavio Junqueira [mailto:fpjunque...@yahoo.com]
  Sent: 31 December 2013 01:51
  To: bookkeeper-dev@zookeeper.apache.org
  Subject: Fwd: Build failed in Jenkins: bookkeeper-trunk #489
 
  I was wondering if there is a jira open for the test that failed
  below, does anyone know?
 
  -Flavio
 
  Begin forwarded message:
 
   Tests in error:
  
 
  testWithMultipleBookieFailuresInLastEnsemble[2](org.apache.bookkeeper.
  client.BookieWriteLedgerTest)
 
 





[jira] [Updated] (BOOKKEEPER-714) Logging channel exceptions in PerChannelBookieClient

2013-12-30 Thread Sijie Guo (JIRA)

 [ 
https://issues.apache.org/jira/browse/BOOKKEEPER-714?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Sijie Guo updated BOOKKEEPER-714:
-

Attachment: BOOKKEEPER-714.patch

attach a patch to log channel exceptions, which could help investigating the 
test case failure.

 Logging channel exceptions in PerChannelBookieClient
 

 Key: BOOKKEEPER-714
 URL: https://issues.apache.org/jira/browse/BOOKKEEPER-714
 Project: Bookkeeper
  Issue Type: Bug
  Components: bookkeeper-client
Reporter: Sijie Guo
Assignee: Sijie Guo
 Fix For: 4.3.0

 Attachments: BOOKKEEPER-714.patch


 Logging channel exceptions in PerChannelBookieClient to identify channel 
 connect issue.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)


RE: Build failed in Jenkins: bookkeeper-trunk #489

2013-12-30 Thread Rakesh R
oopz...you are right.

-Original Message-
From: Sijie Guo [mailto:guosi...@gmail.com] 
Sent: 31 December 2013 12:55
To: bookkeeper-dev@zookeeper.apache.org
Subject: Re: Build failed in Jenkins: bookkeeper-trunk #489

On Mon, Dec 30, 2013 at 11:16 PM, Rakesh R rake...@huawei.com wrote:

 I've tried one simple test case:

 Just before connecting 'bootstrap.connect(addr)', I have killed the 
 Bookieserver. What I have observed is immediately returning the call 
 with failure.


Isn't that expected? as there is not server listened on given port. this is how 
TCP works, no?



 Any thoughts?

 -Original Message-
 From: Sijie Guo [mailto:guosi...@gmail.com]
 Sent: 31 December 2013 12:33
 To: bookkeeper-dev@zookeeper.apache.org
 Subject: Re: Build failed in Jenkins: bookkeeper-trunk #489

 On Mon, Dec 30, 2013 at 10:45 PM, Rakesh R rake...@huawei.com wrote:

  Hi Sijie,
 
  But I didn't understand why the connection failure is immediately 
  sending without waiting for the timeout.

 In general, client should wait for the connection timeout(10secs) and
  internally retries before throwing failure message. Am I correct?
 

 No idea. from the log, there is less information to tell what was 
 going on at that time. I think the better solution is to add  logs 
 about the failure so we could catch the details of what's wrong inside.


 
  Do we need to have an explicit retry mechanism in netty?
 

 I don't think we need retry connect in netty. As 1) we already have 
 retry mechanism in bookie client ; 2) if connect failed on any bookie, 
 we should let netty notify bookkeeper immediately. as connect failure 
 means bookie down in most of case, we should change bookie immediately 
 to avoid high latency.

 - Sijie


 
  -Rakesh
  -Original Message-
  From: Sijie Guo [mailto:guosi...@gmail.com]
  Sent: 31 December 2013 11:59
  To: bookkeeper-dev@zookeeper.apache.org
  Subject: Re: Build failed in Jenkins: bookkeeper-trunk #489
 
  I don't think its connect timeout setting issue. as by default, 
  netty channel connect timeout is 10 sec ( 
  https://github.com/netty/netty/blob/3.2/src/main/java/org/jboss/nett
  y/
  channel/DefaultChannelConfig.java#L38
  ).
  If you checked the log, the log statements show that the connect 
  operation is in same second.
 
  2013-12-30 12:29:36,731 - INFO  -
  [BookKeeperClientWorker-0-0:PerChannelBookieClient@167] - Connecting 
  to
  bookie: /67.195.138.30:15039
  2013-12-30 12:29:36,732 - ERROR - [New I/O client boss 
  #5-1:PerChannelBookieClient$1@203] - Could not connect to bookie: [id:
  0x019a639b, /229.27.250.246:46509 : /67.195.138.30:15039], current 
  state CONNECTING
 
 
 
 
  On Mon, Dec 30, 2013 at 9:31 PM, Rakesh R rake...@huawei.com wrote:
 
   Hi Flavio,
  
   As test case name says, it is testing multiple bookie failures.
  
   On bookiefailure, when doing the ensemble reformation, 
   unfortunately it is failing to connect to the Bookie-15039. But it 
   should suppose to get connected and continue write operation. This 
   is the reason for the test case failure. Please see the following log 
   pattern:
  
   2013-12-30 12:29:36,731 - INFO  -
   [BookKeeperClientWorker-0-0:PerChannelBookieClient@167] - 
   Connecting to
   bookie: /67.195.138.30:15039
   2013-12-30 12:29:36,732 - ERROR - [New I/O client boss 
   #5-1:PerChannelBookieClient$1@203] - Could not connect to bookie: [id:
   0x019a639b, /229.27.250.246:46509 : /67.195.138.30:15039], 
   current state CONNECTING
   2013-12-30 12:29:36,732 - WARN  -
   [BookKeeperClientWorker-0-0:PendingAddOp@158] - Write did not succeed:
   L0
   E100 on /67.195.138.30:15039
   2013-12-30 12:29:36,733 - INFO  -
   [BookKeeperClientWorker-0-0:LedgerHandle@659] - Handling failure 
   of
   bookie: /67.195.138.30:15039 index: 2
   2013-12-30 12:29:36,733 - WARN  -
   [BookKeeperClientWorker-0-0:RackawareEnsemblePlacementPolicy@491] 
   - Failed to choose a bookie from /default-rack : excluded [Bookie:
   67.195.138.30:15036, Bookie:67.195.138.30:15038, Bookie:
   67.195.138.30:15039, Bookie:67.195.138.30:15040, Bookie:
   67.195.138.30:15035], fallback to choose bookie randomly from the 
   cluster.
  
  
   I'm thinking, there could be chance of small network fluctuations 
   or slow machine and resulting in connection failure.
   To handle this IMHO, we should have netty client connection 
   timeout in place and should retry for few intervals. Let me do a 
   try with bootstrap.setOption(connectTimeoutMillis, 
   timeoutvalue); Shall I raise a JIRA to discuss about these 
   concerns and will reach to a conclusion. Whats your opinion?
  
   -Rakesh
  
   -Original Message-
   From: Flavio Junqueira [mailto:fpjunque...@yahoo.com]
   Sent: 31 December 2013 01:51
   To: bookkeeper-dev@zookeeper.apache.org
   Subject: Fwd: Build failed in Jenkins: bookkeeper-trunk #489
  
   I was wondering if there is a jira open for the test that failed 
   below, does anyone know?
  
   -Flavio
  
   Begin 

[jira] [Created] (BOOKKEEPER-715) bookie: delay dropping journal cached pages

2013-12-30 Thread Sijie Guo (JIRA)
Sijie Guo created BOOKKEEPER-715:


 Summary: bookie: delay dropping journal cached pages
 Key: BOOKKEEPER-715
 URL: https://issues.apache.org/jira/browse/BOOKKEEPER-715
 Project: Bookkeeper
  Issue Type: Sub-task
  Components: bookkeeper-server
Reporter: Sijie Guo
Assignee: Sijie Guo
 Fix For: 4.3.0


As journal writes aren't aligned to sector size, if bookie drops cached pages 
immediately, journal has to read those pages again, which would impact journal 
write performance.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)