ZooKeeper_branch33_solaris - Build # 644 - Still Failing

2013-09-12 Thread Apache Jenkins Server
See https://builds.apache.org/job/ZooKeeper_branch33_solaris/644/

###
## LAST 60 LINES OF THE CONSOLE 
###
[...truncated 108173 lines...]
[junit] 2013-09-12 07:09:29,441 - INFO  [main:ZooKeeperServer@154] - 
Created server with tickTime 3000 minSessionTimeout 6000 maxSessionTimeout 
6 datadir 
/zonestorage/hudson_solaris/home/hudson/hudson-slave/workspace/ZooKeeper_branch33_solaris/trunk/build/test/tmp/test3231459883631206712.junit.dir/version-2
 snapdir 
/zonestorage/hudson_solaris/home/hudson/hudson-slave/workspace/ZooKeeper_branch33_solaris/trunk/build/test/tmp/test3231459883631206712.junit.dir/version-2
[junit] 2013-09-12 07:09:29,442 - INFO  [main:NIOServerCnxn$Factory@143] - 
binding to port 0.0.0.0/0.0.0.0:11221
[junit] 2013-09-12 07:09:29,443 - INFO  [main:FileSnap@82] - Reading 
snapshot 
/zonestorage/hudson_solaris/home/hudson/hudson-slave/workspace/ZooKeeper_branch33_solaris/trunk/build/test/tmp/test3231459883631206712.junit.dir/version-2/snapshot.0
[junit] 2013-09-12 07:09:29,447 - INFO  [main:FileTxnSnapLog@256] - 
Snapshotting: b
[junit] 2013-09-12 07:09:29,449 - INFO  [main:FourLetterWordMain@43] - 
connecting to 127.0.0.1 11221
[junit] 2013-09-12 07:09:29,450 - INFO  
[NIOServerCxn.Factory:0.0.0.0/0.0.0.0:11221:NIOServerCnxn$Factory@251] - 
Accepted socket connection from /127.0.0.1:56724
[junit] 2013-09-12 07:09:29,451 - INFO  
[NIOServerCxn.Factory:0.0.0.0/0.0.0.0:11221:NIOServerCnxn@1237] - Processing 
stat command from /127.0.0.1:56724
[junit] 2013-09-12 07:09:29,451 - INFO  
[Thread-4:NIOServerCnxn$StatCommand@1153] - Stat command output
[junit] 2013-09-12 07:09:29,452 - INFO  [Thread-4:NIOServerCnxn@1435] - 
Closed socket connection for client /127.0.0.1:56724 (no session established 
for client)
[junit] ensureOnly:[InMemoryDataTree, StandaloneServer_port]
[junit] expect:InMemoryDataTree
[junit] found:InMemoryDataTree 
org.apache.ZooKeeperService:name0=StandaloneServer_port-1,name1=InMemoryDataTree
[junit] expect:StandaloneServer_port
[junit] found:StandaloneServer_port 
org.apache.ZooKeeperService:name0=StandaloneServer_port-1
[junit] 2013-09-12 07:09:29,459 - INFO  [main:ClientBase@408] - STOPPING 
server
[junit] 2013-09-12 07:09:29,461 - INFO  
[SyncThread:0:SyncRequestProcessor@151] - SyncRequestProcessor exited!
[junit] 2013-09-12 07:09:29,461 - INFO  
[ProcessThread:-1:PrepRequestProcessor@128] - PrepRequestProcessor exited loop!
[junit] 2013-09-12 07:09:29,461 - INFO  [main:FinalRequestProcessor@370] - 
shutdown of request processor complete
[junit] 2013-09-12 07:09:29,463 - INFO  [main:FourLetterWordMain@43] - 
connecting to 127.0.0.1 11221
[junit] ensureOnly:[]
[junit] 2013-09-12 07:09:29,464 - INFO  [main:ClientBase@401] - STARTING 
server
[junit] 2013-09-12 07:09:29,465 - INFO  [main:ZooKeeperServer@154] - 
Created server with tickTime 3000 minSessionTimeout 6000 maxSessionTimeout 
6 datadir 
/zonestorage/hudson_solaris/home/hudson/hudson-slave/workspace/ZooKeeper_branch33_solaris/trunk/build/test/tmp/test3231459883631206712.junit.dir/version-2
 snapdir 
/zonestorage/hudson_solaris/home/hudson/hudson-slave/workspace/ZooKeeper_branch33_solaris/trunk/build/test/tmp/test3231459883631206712.junit.dir/version-2
[junit] 2013-09-12 07:09:29,466 - INFO  [main:NIOServerCnxn$Factory@143] - 
binding to port 0.0.0.0/0.0.0.0:11221
[junit] 2013-09-12 07:09:29,467 - INFO  [main:FileSnap@82] - Reading 
snapshot 
/zonestorage/hudson_solaris/home/hudson/hudson-slave/workspace/ZooKeeper_branch33_solaris/trunk/build/test/tmp/test3231459883631206712.junit.dir/version-2/snapshot.b
[junit] 2013-09-12 07:09:29,470 - INFO  [main:FileTxnSnapLog@256] - 
Snapshotting: b
[junit] 2013-09-12 07:09:29,472 - INFO  [main:FourLetterWordMain@43] - 
connecting to 127.0.0.1 11221
[junit] 2013-09-12 07:09:29,473 - INFO  
[NIOServerCxn.Factory:0.0.0.0/0.0.0.0:11221:NIOServerCnxn$Factory@251] - 
Accepted socket connection from /127.0.0.1:56726
[junit] 2013-09-12 07:09:29,473 - INFO  
[NIOServerCxn.Factory:0.0.0.0/0.0.0.0:11221:NIOServerCnxn@1237] - Processing 
stat command from /127.0.0.1:56726
[junit] 2013-09-12 07:09:29,474 - INFO  
[Thread-5:NIOServerCnxn$StatCommand@1153] - Stat command output
[junit] 2013-09-12 07:09:29,475 - INFO  [Thread-5:NIOServerCnxn@1435] - 
Closed socket connection for client /127.0.0.1:56726 (no session established 
for client)
[junit] ensureOnly:[InMemoryDataTree, StandaloneServer_port]
[junit] expect:InMemoryDataTree
[junit] found:InMemoryDataTree 
org.apache.ZooKeeperService:name0=StandaloneServer_port-1,name1=InMemoryDataTree
[junit] expect:StandaloneServer_port
[junit] found:StandaloneServer_port 
org.apache.ZooKeeperService:name0=StandaloneServer_port-1
[junit] 2013-09-12 07:09:29,480 - INFO  

ZooKeeper-trunk-solaris - Build # 668 - Still Failing

2013-09-12 Thread Apache Jenkins Server
See https://builds.apache.org/job/ZooKeeper-trunk-solaris/668/

###
## LAST 60 LINES OF THE CONSOLE 
###
Started by timer
FATAL: null
java.lang.NullPointerException
at hudson.model.Slave.createLauncher(Slave.java:347)
at 
hudson.model.AbstractBuild$AbstractBuildExecution.createLauncher(AbstractBuild.java:612)
at 
hudson.model.AbstractBuild$AbstractBuildExecution.run(AbstractBuild.java:543)
at hudson.model.Run.execute(Run.java:1603)
at hudson.model.FreeStyleBuild.run(FreeStyleBuild.java:46)
at hudson.model.ResourceController.execute(ResourceController.java:88)
at hudson.model.Executor.run(Executor.java:246)



###
## FAILED TESTS (if any) 
##
No tests ran.

[jira] [Commented] (ZOOKEEPER-1718) Support JLine 2

2013-09-12 Thread Flavio Junqueira (JIRA)

[ 
https://issues.apache.org/jira/browse/ZOOKEEPER-1718?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13765303#comment-13765303
 ] 

Flavio Junqueira commented on ZOOKEEPER-1718:
-

I think this issue is related to ZOOKEEPER-1655, no?

 Support JLine 2
 ---

 Key: ZOOKEEPER-1718
 URL: https://issues.apache.org/jira/browse/ZOOKEEPER-1718
 Project: ZooKeeper
  Issue Type: Test
Reporter: Christopher Tubbs
Priority: Critical
 Fix For: 3.5.0

 Attachments: ZOOKEEPER-1718.patch


 not fixed

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (ZOOKEEPER-1750) Race condition producing NPE in NIOServerCnxn.toString

2013-09-12 Thread Hudson (JIRA)

[ 
https://issues.apache.org/jira/browse/ZOOKEEPER-1750?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13765340#comment-13765340
 ] 

Hudson commented on ZOOKEEPER-1750:
---

SUCCESS: Integrated in ZooKeeper-trunk #2053 (See 
[https://builds.apache.org/job/ZooKeeper-trunk/2053/])
ZOOKEEPER-1750 Handle NIOServerCnxn#toString returning null (Rakesh R via 
michim) (michim: 
http://svn.apache.org/viewcvs.cgi/?root=Apache-SVNview=revrev=1522079)
* 
/zookeeper/trunk/src/java/test/org/apache/zookeeper/server/NIOServerCnxnTest.java


 Race condition producing NPE in NIOServerCnxn.toString
 --

 Key: ZOOKEEPER-1750
 URL: https://issues.apache.org/jira/browse/ZOOKEEPER-1750
 Project: ZooKeeper
  Issue Type: Bug
  Components: server
Affects Versions: 3.5.0
Reporter: Helen Hastings
Assignee: Rakesh R
Priority: Minor
 Fix For: 3.5.0

 Attachments: 0001-ZOOKEEPER-1750-trunk-version.patch, 
 0002-ZOOKEEPER-1750-trunk-version.patch, 
 0003-ZOOKEEPER-1750-branch-3_4.patch, 
 0004-ZOOKEEPER-1750-trunk-version-modified-test-case-assertion.patch


 The socket is closed and the variable sock is set to null for normal 
 reasons, but the toString method is called before sock can be set again, 
 producing a NullPointerException.
 Stack trace: 
 2013-08-29 01:49:19,991 6277 [CommitProcWorkThread-3] WARN 
 org.apache.zookeeper.server.WorkerService  - Unexpected exception
 java.lang.NullPointerException
 at 
 org.apach.zookeeper.server.NIOServerCnxn.toString(NIOServerCnxn.java:961)
 at java.lang.String.valueOf(String.java:2854)
 at java.lang.StringBuilder.append(StringBuilder.java:128)
 at 
 org.apache.zookeeper.server.NIOServerCnxn.process(NIOServerCnxn.java:1104)
 at 
 org.apache.zookeeper.server.WatchManager.triggerWatch(WatchManager.java:120)
 at 
 org.apache.zookeeper.server.WatchManager.triggerWatch(WatchManager.java:92)
 at org.apache.zookeeper.server.DataTree.createNode(DataTree.java:544)
 at org.apache.zookeeper.server.DataTree.processTxn(DataTree.java:805)
 at org.apache.zookeeper.server.ZKDatabase.processTxn(ZKDatabase.java:319)
 at 
 org.apache.zookeeper.server.ZooKeeperServer.processTxn(ZooKeeperServer.java:967)
 at 
 org.apache.zookeeper.server.FinalRequestProcessor.processRequest(FinalRequestProcessor.java:115)
 at 
 org.apache.zookeeper.server.quorum.Leader$ToBeAppliedRequestProcessor.processRequest(Leader.java:859)
 at 
 org.apache.zookeeper.server.quorum.CommitProcessor$CommitWorkRequest.doWork(CommitProcessor.java:271)
 at 
 org.apache.zookeeper.server.WorkerService$ScheduledWorkRequest.run(WorkerService.java:152)
 at 
 java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
 at 
 java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
 at java.lang.Thread.run(Thread.java:722)

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (ZOOKEEPER-1747) Zookeeper server fails to start if transaction log file is corrupted

2013-09-12 Thread JIRA

[ 
https://issues.apache.org/jira/browse/ZOOKEEPER-1747?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13765392#comment-13765392
 ] 

Germán Blanco commented on ZOOKEEPER-1747:
--

We have seen this issue also with inconsistent state between acceptedEpoch, 
currentEpoch and the transaction log. In that case the error is:
{noformat}
2013-09-12 12:30:51,586 [myid:10] - ERROR [main:QuorumPeer@453] - Unable to 
load database on disk
java.io.IOException: The current epoch, 6, is older than the last zxid, 
34359738487
at 
org.apache.zookeeper.server.quorum.QuorumPeer.loadDataBase(QuorumPeer.java:435)
at 
org.apache.zookeeper.server.quorum.QuorumPeer.start(QuorumPeer.java:409)
at 
org.apache.zookeeper.server.quorum.QuorumPeerMain.runFromConfig(QuorumPeerMain.java:151)
at 
org.apache.zookeeper.server.quorum.QuorumPeerMain.initializeAndRun(QuorumPeerMain.java:111)
at 
org.apache.zookeeper.server.quorum.QuorumPeerMain.main(QuorumPeerMain.java:78)
2013-09-12 12:30:51,587 [myid:10] - ERROR [main:QuorumPeerMain@89] - Unexpected 
exception, exiting abnormally
java.lang.RuntimeException: Unable to run quorum server
at 
org.apache.zookeeper.server.quorum.QuorumPeer.loadDataBase(QuorumPeer.java:454)
at 
org.apache.zookeeper.server.quorum.QuorumPeer.start(QuorumPeer.java:409)
at 
org.apache.zookeeper.server.quorum.QuorumPeerMain.runFromConfig(QuorumPeerMain.java:151)
at 
org.apache.zookeeper.server.quorum.QuorumPeerMain.initializeAndRun(QuorumPeerMain.java:111)
at 
org.apache.zookeeper.server.quorum.QuorumPeerMain.main(QuorumPeerMain.java:78)
Caused by: java.io.IOException: The current epoch, 6, is older than the last 
zxid, 34359738487
at 
org.apache.zookeeper.server.quorum.QuorumPeer.loadDataBase(QuorumPeer.java:435)
... 4 more
{noformat}
I guess force-ignore means that the server just ignores whatever is in the 
disk and starts with zxid=0, or?

 Zookeeper server fails to start if transaction log file is corrupted
 

 Key: ZOOKEEPER-1747
 URL: https://issues.apache.org/jira/browse/ZOOKEEPER-1747
 Project: ZooKeeper
  Issue Type: Bug
  Components: server
Affects Versions: 3.4.5
 Environment: Solaris10/x86, Java 1.6
Reporter: Sergey Maslyakov

 On multiple occasions when ZK was not able to write out a transaction log or 
 a snapshot file, the consequent attempt to restart the server fails. Usually 
 it happens when the underlying file system filled up; thus, preventing ZK 
 server from writing out consistent data file.
 Upon start-up, the server reads in the snapshot and the transaction log. If 
 the deserializer fails and throws an exception, server terminates. Please see 
 the stack trace below.
 Server not coming up for whatever reason is often an undesirable condition. 
 It would be nice to have an option to force-ignore parsing errors, 
 especially, in the transaction log. A check sum on the data could be a 
 possible solution to ensure the integrity and parsability.
 Another robustness enhancement could be via proper handling of the condition 
 when snapshot or transaction log cannot be completely written to disk. 
 Basically, better handling of write errors.
 {noformat}
 2013-08-28 12:05:30,732 ERROR [ZooKeeperServerMain] Unexpected exception, 
 exiting abnormally
 java.io.EOFException
 at java.io.DataInputStream.readInt(DataInputStream.java:375)
 at 
 org.apache.jute.BinaryInputArchive.readInt(BinaryInputArchive.java:63)
 at 
 org.apache.zookeeper.server.persistence.FileHeader.deserialize(FileHeader.java:64)
 at 
 org.apache.zookeeper.server.persistence.FileTxnLog$FileTxnIterator.inStreamCreated(FileTxnLog.java:558)
 at 
 org.apache.zookeeper.server.persistence.FileTxnLog$FileTxnIterator.createInputArchive(FileTxnLog.java:577)
 at 
 org.apache.zookeeper.server.persistence.FileTxnLog$FileTxnIterator.goToNextLog(FileTxnLog.java:543)
 at 
 org.apache.zookeeper.server.persistence.FileTxnLog$FileTxnIterator.next(FileTxnLog.java:625)
 at 
 org.apache.zookeeper.server.persistence.FileTxnSnapLog.restore(FileTxnSnapLog.java:160)
 at 
 org.apache.zookeeper.server.ZKDatabase.loadDataBase(ZKDatabase.java:223)
 at 
 org.apache.zookeeper.server.ZooKeeperServer.loadData(ZooKeeperServer.java:250)
 at 
 org.apache.zookeeper.server.ZooKeeperServer.startdata(ZooKeeperServer.java:383)
 at 
 org.apache.zookeeper.server.NIOServerCnxnFactory.startup(NIOServerCnxnFactory.java:122)
 at 
 org.apache.zookeeper.server.ZooKeeperServerMain.runFromConfig(ZooKeeperServerMain.java:112)
 at 
 org.apache.zookeeper.server.ZooKeeperServerMain.initializeAndRun(ZooKeeperServerMain.java:86)
 at 

[jira] [Commented] (ZOOKEEPER-1747) Zookeeper server fails to start if transaction log file is corrupted

2013-09-12 Thread Sergey Maslyakov (JIRA)

[ 
https://issues.apache.org/jira/browse/ZOOKEEPER-1747?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13765472#comment-13765472
 ] 

Sergey Maslyakov commented on ZOOKEEPER-1747:
-

Let me take my original statement about force-ignoring errors back. I think 
Zookeeper server should handle data consistency issues gracefully. This means, 
it needs to handle this type of errors as opposed to terminating. The reaction 
to an error can be controlled by the user.

# For fatal errors, such as missing {{myid}} file, ZK server server shall exit.
# For non-fatal data consistency errors (empty log, missing epoch files, etc), 
ZK can be instructed to:
## Come up empty
## Make best effort in restoring DataTree. If no data can be restored 
consistently, ZK can be instructed to:
### Come up empty
### Exit

This way, a system operator, who is not a ZK expert, can be given a set of work 
instruction on how to recover a failing ZK service.

 Zookeeper server fails to start if transaction log file is corrupted
 

 Key: ZOOKEEPER-1747
 URL: https://issues.apache.org/jira/browse/ZOOKEEPER-1747
 Project: ZooKeeper
  Issue Type: Bug
  Components: server
Affects Versions: 3.4.5
 Environment: Solaris10/x86, Java 1.6
Reporter: Sergey Maslyakov

 On multiple occasions when ZK was not able to write out a transaction log or 
 a snapshot file, the consequent attempt to restart the server fails. Usually 
 it happens when the underlying file system filled up; thus, preventing ZK 
 server from writing out consistent data file.
 Upon start-up, the server reads in the snapshot and the transaction log. If 
 the deserializer fails and throws an exception, server terminates. Please see 
 the stack trace below.
 Server not coming up for whatever reason is often an undesirable condition. 
 It would be nice to have an option to force-ignore parsing errors, 
 especially, in the transaction log. A check sum on the data could be a 
 possible solution to ensure the integrity and parsability.
 Another robustness enhancement could be via proper handling of the condition 
 when snapshot or transaction log cannot be completely written to disk. 
 Basically, better handling of write errors.
 {noformat}
 2013-08-28 12:05:30,732 ERROR [ZooKeeperServerMain] Unexpected exception, 
 exiting abnormally
 java.io.EOFException
 at java.io.DataInputStream.readInt(DataInputStream.java:375)
 at 
 org.apache.jute.BinaryInputArchive.readInt(BinaryInputArchive.java:63)
 at 
 org.apache.zookeeper.server.persistence.FileHeader.deserialize(FileHeader.java:64)
 at 
 org.apache.zookeeper.server.persistence.FileTxnLog$FileTxnIterator.inStreamCreated(FileTxnLog.java:558)
 at 
 org.apache.zookeeper.server.persistence.FileTxnLog$FileTxnIterator.createInputArchive(FileTxnLog.java:577)
 at 
 org.apache.zookeeper.server.persistence.FileTxnLog$FileTxnIterator.goToNextLog(FileTxnLog.java:543)
 at 
 org.apache.zookeeper.server.persistence.FileTxnLog$FileTxnIterator.next(FileTxnLog.java:625)
 at 
 org.apache.zookeeper.server.persistence.FileTxnSnapLog.restore(FileTxnSnapLog.java:160)
 at 
 org.apache.zookeeper.server.ZKDatabase.loadDataBase(ZKDatabase.java:223)
 at 
 org.apache.zookeeper.server.ZooKeeperServer.loadData(ZooKeeperServer.java:250)
 at 
 org.apache.zookeeper.server.ZooKeeperServer.startdata(ZooKeeperServer.java:383)
 at 
 org.apache.zookeeper.server.NIOServerCnxnFactory.startup(NIOServerCnxnFactory.java:122)
 at 
 org.apache.zookeeper.server.ZooKeeperServerMain.runFromConfig(ZooKeeperServerMain.java:112)
 at 
 org.apache.zookeeper.server.ZooKeeperServerMain.initializeAndRun(ZooKeeperServerMain.java:86)
 at 
 org.apache.zookeeper.server.ZooKeeperServerMain.main(ZooKeeperServerMain.java:52)
 at 
 org.apache.zookeeper.server.quorum.QuorumPeerMain.initializeAndRun(QuorumPeerMain.java:129)
 at 
 org.apache.zookeeper.server.quorum.QuorumPeerMain.main(QuorumPeerMain.java:78)
 {noformat}

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Resolved] (ZOOKEEPER-1747) Zookeeper server fails to start if transaction log file is corrupted

2013-09-12 Thread Sergey Maslyakov (JIRA)

 [ 
https://issues.apache.org/jira/browse/ZOOKEEPER-1747?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Sergey Maslyakov resolved ZOOKEEPER-1747.
-

Resolution: Duplicate

 Zookeeper server fails to start if transaction log file is corrupted
 

 Key: ZOOKEEPER-1747
 URL: https://issues.apache.org/jira/browse/ZOOKEEPER-1747
 Project: ZooKeeper
  Issue Type: Bug
  Components: server
Affects Versions: 3.4.5
 Environment: Solaris10/x86, Java 1.6
Reporter: Sergey Maslyakov

 On multiple occasions when ZK was not able to write out a transaction log or 
 a snapshot file, the consequent attempt to restart the server fails. Usually 
 it happens when the underlying file system filled up; thus, preventing ZK 
 server from writing out consistent data file.
 Upon start-up, the server reads in the snapshot and the transaction log. If 
 the deserializer fails and throws an exception, server terminates. Please see 
 the stack trace below.
 Server not coming up for whatever reason is often an undesirable condition. 
 It would be nice to have an option to force-ignore parsing errors, 
 especially, in the transaction log. A check sum on the data could be a 
 possible solution to ensure the integrity and parsability.
 Another robustness enhancement could be via proper handling of the condition 
 when snapshot or transaction log cannot be completely written to disk. 
 Basically, better handling of write errors.
 {noformat}
 2013-08-28 12:05:30,732 ERROR [ZooKeeperServerMain] Unexpected exception, 
 exiting abnormally
 java.io.EOFException
 at java.io.DataInputStream.readInt(DataInputStream.java:375)
 at 
 org.apache.jute.BinaryInputArchive.readInt(BinaryInputArchive.java:63)
 at 
 org.apache.zookeeper.server.persistence.FileHeader.deserialize(FileHeader.java:64)
 at 
 org.apache.zookeeper.server.persistence.FileTxnLog$FileTxnIterator.inStreamCreated(FileTxnLog.java:558)
 at 
 org.apache.zookeeper.server.persistence.FileTxnLog$FileTxnIterator.createInputArchive(FileTxnLog.java:577)
 at 
 org.apache.zookeeper.server.persistence.FileTxnLog$FileTxnIterator.goToNextLog(FileTxnLog.java:543)
 at 
 org.apache.zookeeper.server.persistence.FileTxnLog$FileTxnIterator.next(FileTxnLog.java:625)
 at 
 org.apache.zookeeper.server.persistence.FileTxnSnapLog.restore(FileTxnSnapLog.java:160)
 at 
 org.apache.zookeeper.server.ZKDatabase.loadDataBase(ZKDatabase.java:223)
 at 
 org.apache.zookeeper.server.ZooKeeperServer.loadData(ZooKeeperServer.java:250)
 at 
 org.apache.zookeeper.server.ZooKeeperServer.startdata(ZooKeeperServer.java:383)
 at 
 org.apache.zookeeper.server.NIOServerCnxnFactory.startup(NIOServerCnxnFactory.java:122)
 at 
 org.apache.zookeeper.server.ZooKeeperServerMain.runFromConfig(ZooKeeperServerMain.java:112)
 at 
 org.apache.zookeeper.server.ZooKeeperServerMain.initializeAndRun(ZooKeeperServerMain.java:86)
 at 
 org.apache.zookeeper.server.ZooKeeperServerMain.main(ZooKeeperServerMain.java:52)
 at 
 org.apache.zookeeper.server.quorum.QuorumPeerMain.initializeAndRun(QuorumPeerMain.java:129)
 at 
 org.apache.zookeeper.server.quorum.QuorumPeerMain.main(QuorumPeerMain.java:78)
 {noformat}

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (ZOOKEEPER-1747) Zookeeper server fails to start if transaction log file is corrupted

2013-09-12 Thread Sergey Maslyakov (JIRA)

[ 
https://issues.apache.org/jira/browse/ZOOKEEPER-1747?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13765509#comment-13765509
 ] 

Sergey Maslyakov commented on ZOOKEEPER-1747:
-

Yes, it is. Thank you for pointing this out. I marked this entry as a duplicate 
of 1621.

 Zookeeper server fails to start if transaction log file is corrupted
 

 Key: ZOOKEEPER-1747
 URL: https://issues.apache.org/jira/browse/ZOOKEEPER-1747
 Project: ZooKeeper
  Issue Type: Bug
  Components: server
Affects Versions: 3.4.5
 Environment: Solaris10/x86, Java 1.6
Reporter: Sergey Maslyakov

 On multiple occasions when ZK was not able to write out a transaction log or 
 a snapshot file, the consequent attempt to restart the server fails. Usually 
 it happens when the underlying file system filled up; thus, preventing ZK 
 server from writing out consistent data file.
 Upon start-up, the server reads in the snapshot and the transaction log. If 
 the deserializer fails and throws an exception, server terminates. Please see 
 the stack trace below.
 Server not coming up for whatever reason is often an undesirable condition. 
 It would be nice to have an option to force-ignore parsing errors, 
 especially, in the transaction log. A check sum on the data could be a 
 possible solution to ensure the integrity and parsability.
 Another robustness enhancement could be via proper handling of the condition 
 when snapshot or transaction log cannot be completely written to disk. 
 Basically, better handling of write errors.
 {noformat}
 2013-08-28 12:05:30,732 ERROR [ZooKeeperServerMain] Unexpected exception, 
 exiting abnormally
 java.io.EOFException
 at java.io.DataInputStream.readInt(DataInputStream.java:375)
 at 
 org.apache.jute.BinaryInputArchive.readInt(BinaryInputArchive.java:63)
 at 
 org.apache.zookeeper.server.persistence.FileHeader.deserialize(FileHeader.java:64)
 at 
 org.apache.zookeeper.server.persistence.FileTxnLog$FileTxnIterator.inStreamCreated(FileTxnLog.java:558)
 at 
 org.apache.zookeeper.server.persistence.FileTxnLog$FileTxnIterator.createInputArchive(FileTxnLog.java:577)
 at 
 org.apache.zookeeper.server.persistence.FileTxnLog$FileTxnIterator.goToNextLog(FileTxnLog.java:543)
 at 
 org.apache.zookeeper.server.persistence.FileTxnLog$FileTxnIterator.next(FileTxnLog.java:625)
 at 
 org.apache.zookeeper.server.persistence.FileTxnSnapLog.restore(FileTxnSnapLog.java:160)
 at 
 org.apache.zookeeper.server.ZKDatabase.loadDataBase(ZKDatabase.java:223)
 at 
 org.apache.zookeeper.server.ZooKeeperServer.loadData(ZooKeeperServer.java:250)
 at 
 org.apache.zookeeper.server.ZooKeeperServer.startdata(ZooKeeperServer.java:383)
 at 
 org.apache.zookeeper.server.NIOServerCnxnFactory.startup(NIOServerCnxnFactory.java:122)
 at 
 org.apache.zookeeper.server.ZooKeeperServerMain.runFromConfig(ZooKeeperServerMain.java:112)
 at 
 org.apache.zookeeper.server.ZooKeeperServerMain.initializeAndRun(ZooKeeperServerMain.java:86)
 at 
 org.apache.zookeeper.server.ZooKeeperServerMain.main(ZooKeeperServerMain.java:52)
 at 
 org.apache.zookeeper.server.quorum.QuorumPeerMain.initializeAndRun(QuorumPeerMain.java:129)
 at 
 org.apache.zookeeper.server.quorum.QuorumPeerMain.main(QuorumPeerMain.java:78)
 {noformat}

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (ZOOKEEPER-1718) Support JLine 2

2013-09-12 Thread Michi Mutsuzaki (JIRA)

[ 
https://issues.apache.org/jira/browse/ZOOKEEPER-1718?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13765679#comment-13765679
 ] 

Michi Mutsuzaki commented on ZOOKEEPER-1718:


Yes, it is. Thank you for pointing out, I wasn't aware of  ZOOKEEPER-1655. I 
think I can apply ZOOKEEPER-1718 first and then merge ZOOKEEPER-1655 (i.e. keep 
the newer jline version from ZOOKEEPER-1718 and pick up the optional 
configuration stuff from ZOOKEEPER-1655).

 Support JLine 2
 ---

 Key: ZOOKEEPER-1718
 URL: https://issues.apache.org/jira/browse/ZOOKEEPER-1718
 Project: ZooKeeper
  Issue Type: Test
Reporter: Christopher Tubbs
Priority: Critical
 Fix For: 3.5.0

 Attachments: ZOOKEEPER-1718.patch


 not fixed

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (ZOOKEEPER-1096) Leader communication should listen on specified IP, not wildcard address

2013-09-12 Thread Flavio Junqueira (JIRA)

[ 
https://issues.apache.org/jira/browse/ZOOKEEPER-1096?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13765843#comment-13765843
 ] 

Flavio Junqueira commented on ZOOKEEPER-1096:
-

Thanks for the patch. Here are a few comments:

# Please add documentation.
# Why are we using system properties and not the config file to set this up?
# Please with parenthesis with if/else blocks, even if there is a single 
statement currently.
# I don't find it very elegant to catch an exception to determine that we need 
to return false. Also, I'd rather not catch a generic exception, but instead 
catch the precise exception we are expecting in the case the property is not 
there.

 Leader communication should listen on specified IP, not wildcard address
 

 Key: ZOOKEEPER-1096
 URL: https://issues.apache.org/jira/browse/ZOOKEEPER-1096
 Project: ZooKeeper
  Issue Type: Improvement
  Components: server
Affects Versions: 3.3.3, 3.4.0
Reporter: Jared Cantwell
Assignee: Jared Cantwell
Priority: Minor
 Fix For: 3.5.0, 3.4.6

 Attachments: ZOOKEEPER-1096_branch3.4.patch, ZOOKEEPER-1096.patch, 
 ZOOKEEPER-1096.patch, ZOOKEEPER-1096.patch


 Server should specify the local address that is used for leader communication 
 and leader election (and not use the default of listening on all interfaces). 
  This is similar to the clientPortAddress parameter that was added a year 
 ago.  After reviewing the code, we can't think of a reason why only the port 
 would be used with the wildcard interface, when servers are already 
 connecting specifically to that interface anyway.
 I have submitted a patch, but it does not account for all leader election 
 algorithms.
 Probably should have an option to toggle this, for backwards compatibility, 
 although it seems like it would be a bug if this change broke things.
 There is some more information about making it an option here:
 http://mail-archives.apache.org/mod_mbox/hadoop-zookeeper-dev/201008.mbox/%3CAANLkTikkT97Djqt3CU=h2+7gnj_4p28hgcxjh345h...@mail.gmail.com%3E

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (ZOOKEEPER-1375) SendThread is exiting after OOMError

2013-09-12 Thread Keith Turner (JIRA)

[ 
https://issues.apache.org/jira/browse/ZOOKEEPER-1375?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13765965#comment-13765965
 ] 

Keith Turner commented on ZOOKEEPER-1375:
-

In ACCUMULO-1708, I am trying to work out a way to make Accumulo server 
processes die if any thread throws an Error.  Accumulo uses Zookeeper and HDFS. 
 One problem I have run into is that zookeeper and HDFS create threads that 
could possibly throw OOME.  I thought I was onto something with 
ThreadGroup.uncaughtException(), but since zookeeper and hdfs threads catch 
Throwable its a dead end.  If interested, I attached an example called 
ThreadTest.java to ACCUMULO-1708 that shows an experiment trying to use a 
thread group.

If zookeeper client side threads always rethrew Errors, then this would ideal 
for my purposes.   Zookeeper code could still try to take some action in catch. 
  I suppose this might look like the following.


{code:java}
try{
//...
} catch (Throwable e)
{
   try{
 //..
 cleanup();
 if(state.isAlive()){
eventThread.queueEvent(
new WatchedEvent(Event.EventType.None, Event.KeeperState.Disconnected, 
null) )
 }
 //
   }catch (Throwable e) {
 //failure while trying to process failure
 e.printStackTrace();
   }finally{
  throw e;
   }
}
{code}


 SendThread is exiting after OOMError
 

 Key: ZOOKEEPER-1375
 URL: https://issues.apache.org/jira/browse/ZOOKEEPER-1375
 Project: ZooKeeper
  Issue Type: Bug
Affects Versions: 3.4.0
Reporter: Rakesh R

 After reviewing the ClientCnxn code, there is still chances of exiting the 
 SendThread without intimating the users. Say if client throws OOMError and 
 entered into the throwable block. Here again while sending the Disconnected 
 event, its creating new WatchedEvent() object.This will throw OOMError and 
 leads to exit the SendThread without any Disconnected event notification.
 {noformat}
 try{
 //...
 } catch (Throwable e)
 {
 //..
 cleanup();
if(state.isAlive()){
 eventThread.queueEvent(
 new WatchedEvent(Event.EventType.None, 
 Event.KeeperState.Disconnected, null) )
}
//
 }
 {noformat}

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Created] (ZOOKEEPER-1757) Adler32 may not be sufficient to protect aginst data corruption

2013-09-12 Thread Thawan Kooburat (JIRA)
Thawan Kooburat created ZOOKEEPER-1757:
--

 Summary: Adler32 may not be sufficient to protect aginst data 
corruption
 Key: ZOOKEEPER-1757
 URL: https://issues.apache.org/jira/browse/ZOOKEEPER-1757
 Project: ZooKeeper
  Issue Type: Bug
  Components: server
 Environment: Linux.  Oracle JDK6/7
Reporter: Thawan Kooburat


I was investigating data inconsistency bug in our internal branch. One possible 
area is snapshot/txnlog corruption. So I wrote a more robust corruption test 
and found that it is easy to break our checksum algorithm which is Adler32.

When this happen, it is more likely that corrupted data will fail other sanity 
check during deserialization phase, but it is still scary that it can pass the 
checksum.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (ZOOKEEPER-1757) Adler32 may not be sufficient to protect aginst data corruption

2013-09-12 Thread Thawan Kooburat (JIRA)

 [ 
https://issues.apache.org/jira/browse/ZOOKEEPER-1757?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Thawan Kooburat updated ZOOKEEPER-1757:
---

Attachment: TEST-org.apache.zookeeper.server.persistence.FileSnapTest.txt
ZOOKEEPER.1757.patch

Attached a test case that show the example of corrupted snapshot passed Adler32 
check. 

I hard coded a seek value trigger this condition. On other environment, the 
value may have to change by using the random seek. 

In my case, I only need to do a few run to trigger a case that can bypass 
checksum.

Also attached a sample log file from this unit test run.

 Adler32 may not be sufficient to protect aginst data corruption
 ---

 Key: ZOOKEEPER-1757
 URL: https://issues.apache.org/jira/browse/ZOOKEEPER-1757
 Project: ZooKeeper
  Issue Type: Bug
  Components: server
 Environment: Linux.  Oracle JDK6/7
Reporter: Thawan Kooburat
 Attachments: 
 TEST-org.apache.zookeeper.server.persistence.FileSnapTest.txt, 
 ZOOKEEPER.1757.patch


 I was investigating data inconsistency bug in our internal branch. One 
 possible area is snapshot/txnlog corruption. So I wrote a more robust 
 corruption test and found that it is easy to break our checksum algorithm 
 which is Adler32.
 When this happen, it is more likely that corrupted data will fail other 
 sanity check during deserialization phase, but it is still scary that it can 
 pass the checksum.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (ZOOKEEPER-1757) Adler32 may not be sufficient to protect aginst data corruption

2013-09-12 Thread Hadoop QA (JIRA)

[ 
https://issues.apache.org/jira/browse/ZOOKEEPER-1757?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13766058#comment-13766058
 ] 

Hadoop QA commented on ZOOKEEPER-1757:
--

-1 overall.  Here are the results of testing the latest attachment 
  
http://issues.apache.org/jira/secure/attachment/12602915/TEST-org.apache.zookeeper.server.persistence.FileSnapTest.txt
  against trunk revision 1522079.

+1 @author.  The patch does not contain any @author tags.

+1 tests included.  The patch appears to include 2 new or modified tests.

-1 patch.  The patch command could not apply the patch.

Console output: 
https://builds.apache.org/job/PreCommit-ZOOKEEPER-Build/1575//console

This message is automatically generated.

 Adler32 may not be sufficient to protect aginst data corruption
 ---

 Key: ZOOKEEPER-1757
 URL: https://issues.apache.org/jira/browse/ZOOKEEPER-1757
 Project: ZooKeeper
  Issue Type: Bug
  Components: server
 Environment: Linux.  Oracle JDK6/7
Reporter: Thawan Kooburat
 Attachments: 
 TEST-org.apache.zookeeper.server.persistence.FileSnapTest.txt, 
 ZOOKEEPER.1757.patch


 I was investigating data inconsistency bug in our internal branch. One 
 possible area is snapshot/txnlog corruption. So I wrote a more robust 
 corruption test and found that it is easy to break our checksum algorithm 
 which is Adler32.
 When this happen, it is more likely that corrupted data will fail other 
 sanity check during deserialization phase, but it is still scary that it can 
 pass the checksum.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


Failed: ZOOKEEPER-1757 PreCommit Build #1575

2013-09-12 Thread Apache Jenkins Server
Jira: https://issues.apache.org/jira/browse/ZOOKEEPER-1757
Build: https://builds.apache.org/job/PreCommit-ZOOKEEPER-Build/1575/

###
## LAST 60 LINES OF THE CONSOLE 
###
[...truncated 60 lines...]
 [exec] 
 [exec] 
 [exec] 
==
 [exec] 
==
 [exec] Applying patch.
 [exec] 
==
 [exec] 
==
 [exec] 
 [exec] 
 [exec] /usr/bin/patch:  Only garbage was found in the patch input.
 [exec] PATCH APPLICATION FAILED
 [exec] 
 [exec] 
 [exec] 
 [exec] 
 [exec] -1 overall.  Here are the results of testing the latest attachment 
 [exec]   
http://issues.apache.org/jira/secure/attachment/12602915/TEST-org.apache.zookeeper.server.persistence.FileSnapTest.txt
 [exec]   against trunk revision 1522079.
 [exec] 
 [exec] +1 @author.  The patch does not contain any @author tags.
 [exec] 
 [exec] +1 tests included.  The patch appears to include 2 new or 
modified tests.
 [exec] 
 [exec] -1 patch.  The patch command could not apply the patch.
 [exec] 
 [exec] Console output: 
https://builds.apache.org/job/PreCommit-ZOOKEEPER-Build/1575//console
 [exec] 
 [exec] This message is automatically generated.
 [exec] 
 [exec] 
 [exec] 
==
 [exec] 
==
 [exec] Adding comment to Jira.
 [exec] 
==
 [exec] 
==
 [exec] 
 [exec] 
 [exec] Comment added.
 [exec] 35533e76c862fe87caba08a1adfc89e9bdcf865b logged out
 [exec] 
 [exec] 
 [exec] 
==
 [exec] 
==
 [exec] Finished build.
 [exec] 
==
 [exec] 
==
 [exec] 
 [exec] 

BUILD FAILED
/home/jenkins/jenkins-slave/workspace/PreCommit-ZOOKEEPER-Build/trunk/build.xml:1623:
 exec returned: 1

Total time: 42 seconds
Build step 'Execute shell' marked build as failure
Archiving artifacts
Recording test results
Description set: ZOOKEEPER-1757
Email was triggered for: Failure
Sending email for trigger: Failure



###
## FAILED TESTS (if any) 
##
No tests ran.

[jira] [Updated] (ZOOKEEPER-1757) Adler32 may not be sufficient to protect aginst data corruption

2013-09-12 Thread Thawan Kooburat (JIRA)

 [ 
https://issues.apache.org/jira/browse/ZOOKEEPER-1757?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Thawan Kooburat updated ZOOKEEPER-1757:
---

Attachment: (was: 
TEST-org.apache.zookeeper.server.persistence.FileSnapTest.txt)

 Adler32 may not be sufficient to protect aginst data corruption
 ---

 Key: ZOOKEEPER-1757
 URL: https://issues.apache.org/jira/browse/ZOOKEEPER-1757
 Project: ZooKeeper
  Issue Type: Bug
  Components: server
 Environment: Linux.  Oracle JDK6/7
Reporter: Thawan Kooburat
 Attachments: ZOOKEEPER.1757.patch, ZOOKEEPER.1757.patch


 I was investigating data inconsistency bug in our internal branch. One 
 possible area is snapshot/txnlog corruption. So I wrote a more robust 
 corruption test and found that it is easy to break our checksum algorithm 
 which is Adler32.
 When this happen, it is more likely that corrupted data will fail other 
 sanity check during deserialization phase, but it is still scary that it can 
 pass the checksum.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (ZOOKEEPER-1757) Adler32 may not be sufficient to protect aginst data corruption

2013-09-12 Thread Thawan Kooburat (JIRA)

 [ 
https://issues.apache.org/jira/browse/ZOOKEEPER-1757?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Thawan Kooburat updated ZOOKEEPER-1757:
---

Attachment: ZOOKEEPER.1757.patch

Re-upload the test case

 Adler32 may not be sufficient to protect aginst data corruption
 ---

 Key: ZOOKEEPER-1757
 URL: https://issues.apache.org/jira/browse/ZOOKEEPER-1757
 Project: ZooKeeper
  Issue Type: Bug
  Components: server
 Environment: Linux.  Oracle JDK6/7
Reporter: Thawan Kooburat
 Attachments: ZOOKEEPER.1757.patch, ZOOKEEPER.1757.patch


 I was investigating data inconsistency bug in our internal branch. One 
 possible area is snapshot/txnlog corruption. So I wrote a more robust 
 corruption test and found that it is easy to break our checksum algorithm 
 which is Adler32.
 When this happen, it is more likely that corrupted data will fail other 
 sanity check during deserialization phase, but it is still scary that it can 
 pass the checksum.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


Re: [VOTE] Bookkeeper 4.2.2 release candidate 0

2013-09-12 Thread Ivan Kelly
Hi Jiannan,

Could you send me the logs for this. The argLine in the pom has no
affect here because that only applies to the surefire child
process. This OOM is in the maven process itself, because the test is
producing too many logs. I'll try reducing my heap to see if I can get
the same to happen.

-Ivan


On Thu, Sep 12, 2013 at 03:31:19AM +, Jiannan Wang wrote:
 +1 for this release.
 
 All test cases pass except org.apache.bookkeeper.meta.GcLedgersTest, which
 throws following exception
 --
 Exception in thread ThreadedStreamConsumer java.lang.OutOfMemoryError:
 Java heap space
   at java.util.Arrays.copyOf(Arrays.java:2882)
   at 
 java.lang.AbstractStringBuilder.expandCapacity(AbstractStringBuilder.java:1
 00)
   at 
 java.lang.AbstractStringBuilder.append(AbstractStringBuilder.java:390)
   at java.lang.StringBuffer.append(StringBuffer.java:224)
   at 
 org.apache.maven.surefire.report.ConsoleOutputFileReporter.writeMessage(Con
 soleOutputFileReporter.java:115)
   at 
 org.apache.maven.surefire.report.MulticastingReporter.writeMessage(Multicas
 tingReporter.java:101)
   at 
 org.apache.maven.surefire.report.TestSetRunListener.writeTestOutput(TestSet
 RunListener.java:99)
   at 
 org.apache.maven.plugin.surefire.booterclient.output.ForkClient.consumeLine
 (ForkClient.java:132)
   at 
 org.apache.maven.plugin.surefire.booterclient.output.ThreadedStreamConsumer
 $Pumper.run(ThreadedStreamConsumer.java:67)
   at java.lang.Thread.run(Thread.java:680)
 --
 
 I need to execute 'export MAVEN_OPTS=-Xmx1G' before running the test.
 (So the argLine-Xmx1G -Djava.net.preferIPv4Stack=true/argLine in
 pom.xml takes no effect? My maven version is Apache Maven 3.0.3 (r1075438;
 2011-03-01 01:31:09+0800))
 
 It's not a problem for the release, but I think maybe we can reduce
 annoying zk logs for this test to get around this problem.
 
 
 Best,
 Jiannan
 
 
 
 
 On 9/11/13 6:52 AM, Ivan Kelly iv...@apache.org wrote:
 
 This is the first release candidate for Apache Bookkeeper, version
 4.2.2.
 
 This is a bugfix release for 4.2.1. There are some minor API
 improvements. Notably, it is now possible to check whether a ledger is
 closed without opening it, and it is now possible to get a list of
 ledgers available in the cluster.
 
 The full release notes is available at:
 
 https://issues.apache.org/jira/secure/ReleaseNote.jspa?version=12324601st
 yleName=TextprojectId=12311293
 
 *** Please download, test and vote by September 16th 2013, 10:00 UTC+0.
 ***
 
 Note that we are voting upon the source (tag), binaries are provided for
 convenience.
 
 Source and binary files:
 http://people.apache.org/~ivank/bookkeeper-4.2.2-candidate-0/
 
 Maven staging repo:
 https://repository.apache.org/content/repositories/orgapachebookkeeper-025
 /
 
 The tag to be voted upon:
 https://svn.apache.org/repos/asf/zookeeper/bookkeeper/tags/release-4.2.2
 
 Bookkeeper's KEYS file containing PGP keys we use to sign the release:
 http://svn.apache.org/repos/asf/zookeeper/bookkeeper/dist/KEYS
 
 Please download the the source package, and follow the README to build
 and run a bookkeeper and hedwig service.
 
 -Ivan
 


Re: [VOTE] Bookkeeper 4.2.2 release candidate 0

2013-09-12 Thread Ivan Kelly
I managed to repro with 
 MAVEN_OPTS=-Xmx128M mvn test -Dtest=GcLedgersTest

The test in question creates 3 ledgers, using asyncCreateLedgers,
which causes a lot of zk INFO level log messages.

This isn't an issue worth blocking the release for.

-Ivan

On Thu, Sep 12, 2013 at 09:42:49AM +, Ivan Kelly wrote:
 Hi Jiannan,
 
 Could you send me the logs for this. The argLine in the pom has no
 affect here because that only applies to the surefire child
 process. This OOM is in the maven process itself, because the test is
 producing too many logs. I'll try reducing my heap to see if I can get
 the same to happen.
 
 -Ivan
 
 
 On Thu, Sep 12, 2013 at 03:31:19AM +, Jiannan Wang wrote:
  +1 for this release.
  
  All test cases pass except org.apache.bookkeeper.meta.GcLedgersTest, which
  throws following exception
  --
  Exception in thread ThreadedStreamConsumer java.lang.OutOfMemoryError:
  Java heap space
  at java.util.Arrays.copyOf(Arrays.java:2882)
  at 
  java.lang.AbstractStringBuilder.expandCapacity(AbstractStringBuilder.java:1
  00)
  at 
  java.lang.AbstractStringBuilder.append(AbstractStringBuilder.java:390)
  at java.lang.StringBuffer.append(StringBuffer.java:224)
  at 
  org.apache.maven.surefire.report.ConsoleOutputFileReporter.writeMessage(Con
  soleOutputFileReporter.java:115)
  at 
  org.apache.maven.surefire.report.MulticastingReporter.writeMessage(Multicas
  tingReporter.java:101)
  at 
  org.apache.maven.surefire.report.TestSetRunListener.writeTestOutput(TestSet
  RunListener.java:99)
  at 
  org.apache.maven.plugin.surefire.booterclient.output.ForkClient.consumeLine
  (ForkClient.java:132)
  at 
  org.apache.maven.plugin.surefire.booterclient.output.ThreadedStreamConsumer
  $Pumper.run(ThreadedStreamConsumer.java:67)
  at java.lang.Thread.run(Thread.java:680)
  --
  
  I need to execute 'export MAVEN_OPTS=-Xmx1G' before running the test.
  (So the argLine-Xmx1G -Djava.net.preferIPv4Stack=true/argLine in
  pom.xml takes no effect? My maven version is Apache Maven 3.0.3 (r1075438;
  2011-03-01 01:31:09+0800))
  
  It's not a problem for the release, but I think maybe we can reduce
  annoying zk logs for this test to get around this problem.
  
  
  Best,
  Jiannan
  
  
  
  
  On 9/11/13 6:52 AM, Ivan Kelly iv...@apache.org wrote:
  
  This is the first release candidate for Apache Bookkeeper, version
  4.2.2.
  
  This is a bugfix release for 4.2.1. There are some minor API
  improvements. Notably, it is now possible to check whether a ledger is
  closed without opening it, and it is now possible to get a list of
  ledgers available in the cluster.
  
  The full release notes is available at:
  
  https://issues.apache.org/jira/secure/ReleaseNote.jspa?version=12324601st
  yleName=TextprojectId=12311293
  
  *** Please download, test and vote by September 16th 2013, 10:00 UTC+0.
  ***
  
  Note that we are voting upon the source (tag), binaries are provided for
  convenience.
  
  Source and binary files:
  http://people.apache.org/~ivank/bookkeeper-4.2.2-candidate-0/
  
  Maven staging repo:
  https://repository.apache.org/content/repositories/orgapachebookkeeper-025
  /
  
  The tag to be voted upon:
  https://svn.apache.org/repos/asf/zookeeper/bookkeeper/tags/release-4.2.2
  
  Bookkeeper's KEYS file containing PGP keys we use to sign the release:
  http://svn.apache.org/repos/asf/zookeeper/bookkeeper/dist/KEYS
  
  Please download the the source package, and follow the README to build
  and run a bookkeeper and hedwig service.
  
  -Ivan
  


Re: [VOTE] Bookkeeper 4.2.2 release candidate 0

2013-09-12 Thread Jiannan Wang
Hi Ivan,
   The log has been sent to your personal yahoo mail, please check it.
   Your explanation makes sense, thanks,

Best,
Jiannan

On 9/12/13 5:42 PM, Ivan Kelly iv...@apache.org wrote:

Hi Jiannan,

Could you send me the logs for this. The argLine in the pom has no
affect here because that only applies to the surefire child
process. This OOM is in the maven process itself, because the test is
producing too many logs. I'll try reducing my heap to see if I can get
the same to happen.

-Ivan


On Thu, Sep 12, 2013 at 03:31:19AM +, Jiannan Wang wrote:
 +1 for this release.
 
 All test cases pass except org.apache.bookkeeper.meta.GcLedgersTest,
which
 throws following exception
 --
 Exception in thread ThreadedStreamConsumer java.lang.OutOfMemoryError:
 Java heap space
  at java.util.Arrays.copyOf(Arrays.java:2882)
  at 
 
java.lang.AbstractStringBuilder.expandCapacity(AbstractStringBuilder.java
:1
 00)
  at 
java.lang.AbstractStringBuilder.append(AbstractStringBuilder.java:390)
  at java.lang.StringBuffer.append(StringBuffer.java:224)
  at 
 
org.apache.maven.surefire.report.ConsoleOutputFileReporter.writeMessage(C
on
 soleOutputFileReporter.java:115)
  at 
 
org.apache.maven.surefire.report.MulticastingReporter.writeMessage(Multic
as
 tingReporter.java:101)
  at 
 
org.apache.maven.surefire.report.TestSetRunListener.writeTestOutput(TestS
et
 RunListener.java:99)
  at 
 
org.apache.maven.plugin.surefire.booterclient.output.ForkClient.consumeLi
ne
 (ForkClient.java:132)
  at 
 
org.apache.maven.plugin.surefire.booterclient.output.ThreadedStreamConsum
er
 $Pumper.run(ThreadedStreamConsumer.java:67)
  at java.lang.Thread.run(Thread.java:680)
 --
 
 I need to execute 'export MAVEN_OPTS=-Xmx1G' before running the test.
 (So the argLine-Xmx1G -Djava.net.preferIPv4Stack=true/argLine in
 pom.xml takes no effect? My maven version is Apache Maven 3.0.3
(r1075438;
 2011-03-01 01:31:09+0800))
 
 It's not a problem for the release, but I think maybe we can reduce
 annoying zk logs for this test to get around this problem.
 
 
 Best,
 Jiannan
 
 
 
 
 On 9/11/13 6:52 AM, Ivan Kelly iv...@apache.org wrote:
 
 This is the first release candidate for Apache Bookkeeper, version
 4.2.2.
 
 This is a bugfix release for 4.2.1. There are some minor API
 improvements. Notably, it is now possible to check whether a ledger is
 closed without opening it, and it is now possible to get a list of
 ledgers available in the cluster.
 
 The full release notes is available at:
 
 
https://issues.apache.org/jira/secure/ReleaseNote.jspa?version=12324601;
st
 yleName=TextprojectId=12311293
 
 *** Please download, test and vote by September 16th 2013, 10:00 UTC+0.
 ***
 
 Note that we are voting upon the source (tag), binaries are provided
for
 convenience.
 
 Source and binary files:
 http://people.apache.org/~ivank/bookkeeper-4.2.2-candidate-0/
 
 Maven staging repo:
 
https://repository.apache.org/content/repositories/orgapachebookkeeper-0
25
 /
 
 The tag to be voted upon:
 
https://svn.apache.org/repos/asf/zookeeper/bookkeeper/tags/release-4.2.2
 
 Bookkeeper's KEYS file containing PGP keys we use to sign the release:
 http://svn.apache.org/repos/asf/zookeeper/bookkeeper/dist/KEYS
 
 Please download the the source package, and follow the README to build
 and run a bookkeeper and hedwig service.
 
 -Ivan