Mahadev konar commented on ZOOKEEPER-663:

This looks like a quorum peer was creating a new txn log file and was shut down 
in the middle of that. This probably led to corruption of the txn logs in the 
data directory of one of the quorum peers. We actually do not have a good story 
for handling corruption of the transaction logs. Currently we depend on admins 
manually going to the node and deciding how to resolve it.

As a part of this jira we can add documentation in the forrest docs for now on 
how to deal with such situations. Also, the logging needs to change to point 
out which file was corrupted.
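
Until the logging names the file, an admin could locate it by hand. The 
following is only a rough sketch, not ZooKeeper code: the class 
TxnLogMagicCheck and its methods are hypothetical, and it assumes txn log files 
are named log.* in the directory given. It compares the first four bytes of 
each file against the value from the exception in the quoted trace 
(1514884167, the ASCII bytes "ZKLG" read as a big-endian int).

```java
import java.io.DataInputStream;
import java.io.EOFException;
import java.io.IOException;
import java.nio.file.DirectoryStream;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper, not part of ZooKeeper.
public class TxnLogMagicCheck {
    // Expected header value taken from the exception message:
    // the ASCII bytes "ZKLG" read as a big-endian int.
    static final int TXNLOG_MAGIC = 1514884167;

    /** Returns true if the file starts with the expected 4-byte magic number. */
    static boolean hasValidMagic(Path file) throws IOException {
        try (DataInputStream in = new DataInputStream(Files.newInputStream(file))) {
            return in.readInt() == TXNLOG_MAGIC; // readInt() is big-endian
        } catch (EOFException e) {
            return false; // file is shorter than 4 bytes
        }
    }

    /** Lists regular files matching log.* in dir whose header does not match. */
    static List<Path> findSuspectLogs(Path dir) throws IOException {
        List<Path> suspects = new ArrayList<>();
        // Assumption: txn log files in this directory are named log.*
        try (DirectoryStream<Path> logs = Files.newDirectoryStream(dir, "log.*")) {
            for (Path log : logs) {
                if (Files.isRegularFile(log) && !hasValidMagic(log)) {
                    suspects.add(log);
                }
            }
        }
        return suspects;
    }

    public static void main(String[] args) throws IOException {
        Path dir = Paths.get(args.length > 0 ? args[0] : ".");
        for (Path p : findSuspectLogs(dir)) {
            System.out.println("Suspect txn log header: " + p);
        }
    }
}
```

This only inspects the file header; it does not validate the individual 
transaction records, and what to do with a flagged file is still the admin's 
decision.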

> hudson failure in ZKDatabaseCorruptionTest
> ------------------------------------------
>                 Key: ZOOKEEPER-663
>                 URL: https://issues.apache.org/jira/browse/ZOOKEEPER-663
>             Project: Zookeeper
>          Issue Type: Bug
>          Components: server
>            Reporter: Patrick Hunt
>            Assignee: Mahadev konar
>            Priority: Critical
>             Fix For: 3.3.0
> http://hudson.zones.apache.org/hudson/job/ZooKeeper-trunk/686/
> java.lang.RuntimeException: Unable to run quorum server 
>       at org.apache.zookeeper.server.quorum.QuorumPeer.start(QuorumPeer.java:380)
>       at org.apache.zookeeper.test.ZkDatabaseCorruptionTest.testCorruption(ZkDatabaseCorruptionTest.java:99)
> Caused by: java.io.IOException: Invalid magic number 0 != 1514884167
>       at org.apache.zookeeper.server.persistence.FileTxnLog$FileTxnIterator.inStreamCreated(FileTxnLog.java:455)
>       at org.apache.zookeeper.server.persistence.FileTxnLog$FileTxnIterator.createInputArchive(FileTxnLog.java:471)
>       at org.apache.zookeeper.server.persistence.FileTxnLog$FileTxnIterator.goToNextLog(FileTxnLog.java:438)
>       at org.apache.zookeeper.server.persistence.FileTxnLog$FileTxnIterator.next(FileTxnLog.java:519)
>       at org.apache.zookeeper.server.persistence.FileTxnSnapLog.restore(FileTxnSnapLog.java:145)
>       at org.apache.zookeeper.server.ZKDatabase.loadDataBase(ZKDatabase.java:193)
>       at org.apache.zookeeper.server.quorum.QuorumPeer.start(QuorumPeer.java:377)
