[jira] Updated: (ZOOKEEPER-773) Log visualisation

2010-06-02 Thread Ivan Kelly (JIRA)

 [ 
https://issues.apache.org/jira/browse/ZOOKEEPER-773?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ivan Kelly updated ZOOKEEPER-773:
-

Attachment: ZOOKEEPER-773_2.diff

Renamed to loggraph. Works for chrome (chrome doesn't like for each). Minor 
changes for comments.

 Log visualisation
 -

 Key: ZOOKEEPER-773
 URL: https://issues.apache.org/jira/browse/ZOOKEEPER-773
 Project: Zookeeper
  Issue Type: Improvement
  Components: contrib
Reporter: Ivan Kelly
Assignee: Ivan Kelly
Priority: Minor
 Fix For: 3.4.0

 Attachments: ZOOKEEPER-773.diff, ZOOKEEPER-773_2.diff


 Zkgraph is a log viewer for zookeeper. It can handle transaction logs and 
 message logs. There are currently two view.
  
   a) Server view
  The server view shows the interactions between the different servers in 
 an ensemble. The X axis represents time. 
 * Exceptions show up as red dots. Hovering your mouse over them will 
 give you more details of the exception
   * The colour of the line represents the election state of the server. 
  - orange means LOOKING for leader
  - dark green means the server is the leader
  - light green means the server is following a leader
  - yellow means there isn't enough information to determine the state 
 of the server. 
   * The gray arrows denote election messages between servers. Pink dashed 
 arrows are messages that were sent but never delivered.
b) Session view
   The session view shows the lifetime of sessions on a server. Use the 
 time filter to narrow down the view. Any more than about 2000 events will 
 take a long time to view in your browser. 
   The Y axis represents time in this case. Each line is a session. The 
 black dots represent events on the session. You can click on the black dots 
 for more details of the event.
 2 - Compiling  Running
 Run ant jar in src/contrib/zkgraph/. This will download all dependencies 
 and compile all the zkgraph code.
 Once compilation has finished, you can run it the the zkgraph.sh script in 
 src/contrib/zkgraph/bin. This will start and embedded web server on you 
 machine. Navigate to http://localhost:8182/graph/main.html.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Assigned: (ZOOKEEPER-704) GSoC 2010: Read-Only Mode

2010-06-02 Thread Henry Robinson (JIRA)

 [ 
https://issues.apache.org/jira/browse/ZOOKEEPER-704?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Henry Robinson reassigned ZOOKEEPER-704:


Assignee: Sergey Doroshenko

 GSoC 2010: Read-Only Mode
 -

 Key: ZOOKEEPER-704
 URL: https://issues.apache.org/jira/browse/ZOOKEEPER-704
 Project: Zookeeper
  Issue Type: Wish
Reporter: Henry Robinson
Assignee: Sergey Doroshenko

 Read-only mode
 Possible Mentor
 Henry Robinson (henry at apache dot org)
 Requirements
 Java and TCP/IP networking
 Description
 When a ZooKeeper server loses contact with over half of the other servers in 
 an ensemble ('loses a quorum'), it stops responding to client requests 
 because it cannot guarantee that writes will get processed correctly. For 
 some applications, it would be beneficial if a server still responded to read 
 requests when the quorum is lost, but caused an error condition when a write 
 request was attempted.
 This project would implement a 'read-only' mode for ZooKeeper servers (maybe 
 only for Observers) that allowed read requests to be served as long as the 
 client can contact a server.
 This is a great project for getting really hands-on with the internals of 
 ZooKeeper - you must be comfortable with Java and networking otherwise you'll 
 have a hard time coming up to speed.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Updated: (ZOOKEEPER-733) use netty to handle client connections

2010-06-02 Thread Benjamin Reed (JIRA)

 [ 
https://issues.apache.org/jira/browse/ZOOKEEPER-733?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Benjamin Reed updated ZOOKEEPER-733:


Attachment: flowctl.zip

here is my cut at flowctl with netty. flow control seems to be happening, but 
it doesn't seem to fix the problem.

 use netty to handle client connections
 --

 Key: ZOOKEEPER-733
 URL: https://issues.apache.org/jira/browse/ZOOKEEPER-733
 Project: Zookeeper
  Issue Type: Improvement
Reporter: Benjamin Reed
 Attachments: accessive.jar, flowctl.zip, moved.zip, 
 QuorumTestFailed_sessionmoved_TRACE_LOG.txt.gz, ZOOKEEPER-733.patch, 
 ZOOKEEPER-733.patch, ZOOKEEPER-733.patch


 we currently have our own asynchronous NIO socket engine to be able to handle 
 lots of clients with a single thread. over time the engine has become more 
 complicated. we would also like the engine to use multiple threads on 
 machines with lots of cores. plus, we would like to be able to support things 
 like SSL. if we switch to netty, we can simplify our code and get the 
 previously mentioned benefits.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Updated: (ZOOKEEPER-335) zookeeper servers should commit the new leader txn to their logs.

2010-06-02 Thread Patrick Hunt (JIRA)

 [ 
https://issues.apache.org/jira/browse/ZOOKEEPER-335?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Patrick Hunt updated ZOOKEEPER-335:
---

Priority: Blocker  (was: Major)

Raising to blocker level - we've seen this reported by users a couple times now.

 zookeeper servers should commit the new leader txn to their logs.
 -

 Key: ZOOKEEPER-335
 URL: https://issues.apache.org/jira/browse/ZOOKEEPER-335
 Project: Zookeeper
  Issue Type: Bug
  Components: server
Affects Versions: 3.1.0
Reporter: Mahadev konar
Assignee: Mahadev konar
Priority: Blocker
 Fix For: 3.4.0


 currently the zookeeper followers do not commit the new leader election. This 
 will cause problems in a failure scenarios with a follower acking to the same 
 leader txn id twice, which might be two different intermittent leaders and 
 allowing them to propose two different txn's of the same zxid.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Created: (ZOOKEEPER-784) server-side functionality for read-only mode

2010-06-02 Thread Sergey Doroshenko (JIRA)
server-side functionality for read-only mode


 Key: ZOOKEEPER-784
 URL: https://issues.apache.org/jira/browse/ZOOKEEPER-784
 Project: Zookeeper
  Issue Type: Sub-task
Reporter: Sergey Doroshenko
Assignee: Sergey Doroshenko


As per http://wiki.apache.org/hadoop/ZooKeeper/GSoCReadOnlyMode , create 
ReadOnlyZooKeeperServer which comes into play when peer is partitioned.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Updated: (ZOOKEEPER-784) server-side functionality for read-only mode

2010-06-02 Thread Sergey Doroshenko (JIRA)

 [ 
https://issues.apache.org/jira/browse/ZOOKEEPER-784?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Sergey Doroshenko updated ZOOKEEPER-784:


Attachment: ZOOKEEPER-784.patch

Patch includes ReadOnlyZooKeeperServer and ReadOnlyReqestProcessor.
This patch outlines the general idea: read-only server is spawned when peer 
becomes partitioned, and is able to serve read-only requests but drops 
state-changing ones.

It's by no means a final version, however it's already possible to connect to a 
partitioned peer (or remain connected to it when it loses a quorum) and make 
read requests (write requests are dropped with explanation message).

Next things to work on are:
* session handling during partition. For now ReadOnlyServer creates 
SessionTrackerImpl and it's ok for manual tests, but that's just to make this 
proof-of-concept patch work
* distinguish read-only clients and usual ones (depends on yet non-existent 
ticket about creation of read-only client), accept only read-onlies
* create test cases, make all current ones pass


 server-side functionality for read-only mode
 

 Key: ZOOKEEPER-784
 URL: https://issues.apache.org/jira/browse/ZOOKEEPER-784
 Project: Zookeeper
  Issue Type: Sub-task
Reporter: Sergey Doroshenko
Assignee: Sergey Doroshenko
 Attachments: ZOOKEEPER-784.patch


 As per http://wiki.apache.org/hadoop/ZooKeeper/GSoCReadOnlyMode , create 
 ReadOnlyZooKeeperServer which comes into play when peer is partitioned.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Created: (ZOOKEEPER-785) Zookeeper 3.3.1 shouldn't infinite loop if someone creates a server.0 line

2010-06-02 Thread Alex Newman (JIRA)
 Zookeeper 3.3.1 shouldn't infinite loop if someone creates a server.0 line
---

 Key: ZOOKEEPER-785
 URL: https://issues.apache.org/jira/browse/ZOOKEEPER-785
 Project: Zookeeper
  Issue Type: Bug
Affects Versions: 3.3.1
 Environment: Tested in linux with a new jvm
Reporter: Alex Newman


The following config causes an infinite loop

[zoo.cfg]
tickTime=2000
dataDir=/var/zookeeper/
clientPort=2181
initLimit=10
syncLimit=5
server.0=localhost:2888:3888

Output:

2010-06-01 16:20:32,471 - INFO [main:quorumpeerm...@119] - Starting quorum peer
2010-06-01 16:20:32,489 - INFO [main:nioservercnxn$fact...@143] - binding to 
port 0.0.0.0/0.0.0.0:2181
2010-06-01 16:20:32,504 - INFO [main:quorump...@818] - tickTime set to 2000
2010-06-01 16:20:32,504 - INFO [main:quorump...@829] - minSessionTimeout set to 
-1
2010-06-01 16:20:32,505 - INFO [main:quorump...@840] - maxSessionTimeout set to 
-1
2010-06-01 16:20:32,505 - INFO [main:quorump...@855] - initLimit set to 10
2010-06-01 16:20:32,526 - INFO [main:files...@82] - Reading snapshot 
/var/zookeeper/version-2/snapshot.c
2010-06-01 16:20:32,547 - INFO [Thread-1:quorumcnxmanager$liste...@436] - My 
election bind port: 3888
2010-06-01 16:20:32,554 - INFO 
[QuorumPeer:/0:0:0:0:0:0:0:0:2181:quorump...@620] - LOOKING
2010-06-01 16:20:32,556 - INFO 
[QuorumPeer:/0:0:0:0:0:0:0:0:2181:fastleaderelect...@649] - New election. My id 
= 0, Proposed zxid = 12
2010-06-01 16:20:32,558 - INFO 
[QuorumPeer:/0:0:0:0:0:0:0:0:2181:fastleaderelect...@689] - Notification: 0, 
12, 1, 0, LOOKING, LOOKING, 0
2010-06-01 16:20:32,560 - WARN 
[QuorumPeer:/0:0:0:0:0:0:0:0:2181:quorump...@623] - Unexpected exception
java.lang.NullPointerException
at 
org.apache.zookeeper.server.quorum.FastLeaderElection.totalOrderPredicate(FastLeaderElection.java:496)
at 
org.apache.zookeeper.server.quorum.FastLeaderElection.lookForLeader(FastLeaderElection.java:709)
at org.apache.zookeeper.server.quorum.QuorumPeer.run(QuorumPeer.java:621)
2010-06-01 16:20:32,560 - INFO 
[QuorumPeer:/0:0:0:0:0:0:0:0:2181:quorump...@620] - LOOKING
2010-06-01 16:20:32,560 - INFO 
[QuorumPeer:/0:0:0:0:0:0:0:0:2181:fastleaderelect...@649] - New election. My id 
= 0, Proposed zxid = 12
2010-06-01 16:20:32,561 - INFO 
[QuorumPeer:/0:0:0:0:0:0:0:0:2181:fastleaderelect...@689] - Notification: 0, 
12, 2, 0, LOOKING, LOOKING, 0
2010-06-01 16:20:32,561 - WARN 
[QuorumPeer:/0:0:0:0:0:0:0:0:2181:quorump...@623] - Unexpected exception
java.lang.NullPointerException
at 
org.apache.zookeeper.server.quorum.FastLeaderElection.totalOrderPredicate(FastLeaderElection.java:496)
at 
org.apache.zookeeper.server.quorum.FastLeaderElection.lookForLeader(FastLeaderElection.java:709)
at org.apache.zookeeper.server.quorum.QuorumPeer.run(QuorumPeer.java:621)
2010-06-01 16:20:32,561 - INFO 
[QuorumPeer:/0:0:0:0:0:0:0:0:2181:quorump...@620] - LOOKING
2010-06-01 16:20:32,562 - INFO 
[QuorumPeer:/0:0:0:0:0:0:0:0:2181:fastleaderelect...@649] - New election. My id 
= 0, Proposed zxid = 12
2010-06-01 16:20:32,562 - INFO 
[QuorumPeer:/0:0:0:0:0:0:0:0:2181:fastleaderelect...@689] - Notification: 0, 
12, 3, 0, LOOKING, LOOKING, 0
2010-06-01 16:20:32,562 - WARN 
[QuorumPeer:/0:0:0:0:0:0:0:0:2181:quorump...@623] - Unexpected exception
java.lang.NullPointerException


Things like HBase require that the zookeeper servers be listed in the zoo.cfg. 
This is a bug on their part, but zookeeper shouldn't null pointer in a loop 
though.


-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Assigned: (ZOOKEEPER-785) Zookeeper 3.3.1 shouldn't infinite loop if someone creates a server.0 line

2010-06-02 Thread Patrick Hunt (JIRA)

 [ 
https://issues.apache.org/jira/browse/ZOOKEEPER-785?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Patrick Hunt reassigned ZOOKEEPER-785:
--

Assignee: Patrick Hunt

  Zookeeper 3.3.1 shouldn't infinite loop if someone creates a server.0 line
 ---

 Key: ZOOKEEPER-785
 URL: https://issues.apache.org/jira/browse/ZOOKEEPER-785
 Project: Zookeeper
  Issue Type: Bug
Affects Versions: 3.3.1
 Environment: Tested in linux with a new jvm
Reporter: Alex Newman
Assignee: Patrick Hunt

 The following config causes an infinite loop
 [zoo.cfg]
 tickTime=2000
 dataDir=/var/zookeeper/
 clientPort=2181
 initLimit=10
 syncLimit=5
 server.0=localhost:2888:3888
 Output:
 2010-06-01 16:20:32,471 - INFO [main:quorumpeerm...@119] - Starting quorum 
 peer
 2010-06-01 16:20:32,489 - INFO [main:nioservercnxn$fact...@143] - binding to 
 port 0.0.0.0/0.0.0.0:2181
 2010-06-01 16:20:32,504 - INFO [main:quorump...@818] - tickTime set to 2000
 2010-06-01 16:20:32,504 - INFO [main:quorump...@829] - minSessionTimeout set 
 to -1
 2010-06-01 16:20:32,505 - INFO [main:quorump...@840] - maxSessionTimeout set 
 to -1
 2010-06-01 16:20:32,505 - INFO [main:quorump...@855] - initLimit set to 10
 2010-06-01 16:20:32,526 - INFO [main:files...@82] - Reading snapshot 
 /var/zookeeper/version-2/snapshot.c
 2010-06-01 16:20:32,547 - INFO [Thread-1:quorumcnxmanager$liste...@436] - My 
 election bind port: 3888
 2010-06-01 16:20:32,554 - INFO 
 [QuorumPeer:/0:0:0:0:0:0:0:0:2181:quorump...@620] - LOOKING
 2010-06-01 16:20:32,556 - INFO 
 [QuorumPeer:/0:0:0:0:0:0:0:0:2181:fastleaderelect...@649] - New election. My 
 id = 0, Proposed zxid = 12
 2010-06-01 16:20:32,558 - INFO 
 [QuorumPeer:/0:0:0:0:0:0:0:0:2181:fastleaderelect...@689] - Notification: 0, 
 12, 1, 0, LOOKING, LOOKING, 0
 2010-06-01 16:20:32,560 - WARN 
 [QuorumPeer:/0:0:0:0:0:0:0:0:2181:quorump...@623] - Unexpected exception
 java.lang.NullPointerException
 at 
 org.apache.zookeeper.server.quorum.FastLeaderElection.totalOrderPredicate(FastLeaderElection.java:496)
 at 
 org.apache.zookeeper.server.quorum.FastLeaderElection.lookForLeader(FastLeaderElection.java:709)
 at org.apache.zookeeper.server.quorum.QuorumPeer.run(QuorumPeer.java:621)
 2010-06-01 16:20:32,560 - INFO 
 [QuorumPeer:/0:0:0:0:0:0:0:0:2181:quorump...@620] - LOOKING
 2010-06-01 16:20:32,560 - INFO 
 [QuorumPeer:/0:0:0:0:0:0:0:0:2181:fastleaderelect...@649] - New election. My 
 id = 0, Proposed zxid = 12
 2010-06-01 16:20:32,561 - INFO 
 [QuorumPeer:/0:0:0:0:0:0:0:0:2181:fastleaderelect...@689] - Notification: 0, 
 12, 2, 0, LOOKING, LOOKING, 0
 2010-06-01 16:20:32,561 - WARN 
 [QuorumPeer:/0:0:0:0:0:0:0:0:2181:quorump...@623] - Unexpected exception
 java.lang.NullPointerException
 at 
 org.apache.zookeeper.server.quorum.FastLeaderElection.totalOrderPredicate(FastLeaderElection.java:496)
 at 
 org.apache.zookeeper.server.quorum.FastLeaderElection.lookForLeader(FastLeaderElection.java:709)
 at org.apache.zookeeper.server.quorum.QuorumPeer.run(QuorumPeer.java:621)
 2010-06-01 16:20:32,561 - INFO 
 [QuorumPeer:/0:0:0:0:0:0:0:0:2181:quorump...@620] - LOOKING
 2010-06-01 16:20:32,562 - INFO 
 [QuorumPeer:/0:0:0:0:0:0:0:0:2181:fastleaderelect...@649] - New election. My 
 id = 0, Proposed zxid = 12
 2010-06-01 16:20:32,562 - INFO 
 [QuorumPeer:/0:0:0:0:0:0:0:0:2181:fastleaderelect...@689] - Notification: 0, 
 12, 3, 0, LOOKING, LOOKING, 0
 2010-06-01 16:20:32,562 - WARN 
 [QuorumPeer:/0:0:0:0:0:0:0:0:2181:quorump...@623] - Unexpected exception
 java.lang.NullPointerException
 Things like HBase require that the zookeeper servers be listed in the 
 zoo.cfg. This is a bug on their part, but zookeeper shouldn't null pointer in 
 a loop though.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Updated: (ZOOKEEPER-785) Zookeeper 3.3.1 shouldn't infinite loop if someone creates a server.0 line

2010-06-02 Thread Patrick Hunt (JIRA)

 [ 
https://issues.apache.org/jira/browse/ZOOKEEPER-785?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Patrick Hunt updated ZOOKEEPER-785:
---

Fix Version/s: 3.3.2
   3.4.0
  Component/s: server

  Zookeeper 3.3.1 shouldn't infinite loop if someone creates a server.0 line
 ---

 Key: ZOOKEEPER-785
 URL: https://issues.apache.org/jira/browse/ZOOKEEPER-785
 Project: Zookeeper
  Issue Type: Bug
  Components: server
Affects Versions: 3.3.1
 Environment: Tested in linux with a new jvm
Reporter: Alex Newman
Assignee: Patrick Hunt
 Fix For: 3.3.2, 3.4.0


 The following config causes an infinite loop
 [zoo.cfg]
 tickTime=2000
 dataDir=/var/zookeeper/
 clientPort=2181
 initLimit=10
 syncLimit=5
 server.0=localhost:2888:3888
 Output:
 2010-06-01 16:20:32,471 - INFO [main:quorumpeerm...@119] - Starting quorum 
 peer
 2010-06-01 16:20:32,489 - INFO [main:nioservercnxn$fact...@143] - binding to 
 port 0.0.0.0/0.0.0.0:2181
 2010-06-01 16:20:32,504 - INFO [main:quorump...@818] - tickTime set to 2000
 2010-06-01 16:20:32,504 - INFO [main:quorump...@829] - minSessionTimeout set 
 to -1
 2010-06-01 16:20:32,505 - INFO [main:quorump...@840] - maxSessionTimeout set 
 to -1
 2010-06-01 16:20:32,505 - INFO [main:quorump...@855] - initLimit set to 10
 2010-06-01 16:20:32,526 - INFO [main:files...@82] - Reading snapshot 
 /var/zookeeper/version-2/snapshot.c
 2010-06-01 16:20:32,547 - INFO [Thread-1:quorumcnxmanager$liste...@436] - My 
 election bind port: 3888
 2010-06-01 16:20:32,554 - INFO 
 [QuorumPeer:/0:0:0:0:0:0:0:0:2181:quorump...@620] - LOOKING
 2010-06-01 16:20:32,556 - INFO 
 [QuorumPeer:/0:0:0:0:0:0:0:0:2181:fastleaderelect...@649] - New election. My 
 id = 0, Proposed zxid = 12
 2010-06-01 16:20:32,558 - INFO 
 [QuorumPeer:/0:0:0:0:0:0:0:0:2181:fastleaderelect...@689] - Notification: 0, 
 12, 1, 0, LOOKING, LOOKING, 0
 2010-06-01 16:20:32,560 - WARN 
 [QuorumPeer:/0:0:0:0:0:0:0:0:2181:quorump...@623] - Unexpected exception
 java.lang.NullPointerException
 at 
 org.apache.zookeeper.server.quorum.FastLeaderElection.totalOrderPredicate(FastLeaderElection.java:496)
 at 
 org.apache.zookeeper.server.quorum.FastLeaderElection.lookForLeader(FastLeaderElection.java:709)
 at org.apache.zookeeper.server.quorum.QuorumPeer.run(QuorumPeer.java:621)
 2010-06-01 16:20:32,560 - INFO 
 [QuorumPeer:/0:0:0:0:0:0:0:0:2181:quorump...@620] - LOOKING
 2010-06-01 16:20:32,560 - INFO 
 [QuorumPeer:/0:0:0:0:0:0:0:0:2181:fastleaderelect...@649] - New election. My 
 id = 0, Proposed zxid = 12
 2010-06-01 16:20:32,561 - INFO 
 [QuorumPeer:/0:0:0:0:0:0:0:0:2181:fastleaderelect...@689] - Notification: 0, 
 12, 2, 0, LOOKING, LOOKING, 0
 2010-06-01 16:20:32,561 - WARN 
 [QuorumPeer:/0:0:0:0:0:0:0:0:2181:quorump...@623] - Unexpected exception
 java.lang.NullPointerException
 at 
 org.apache.zookeeper.server.quorum.FastLeaderElection.totalOrderPredicate(FastLeaderElection.java:496)
 at 
 org.apache.zookeeper.server.quorum.FastLeaderElection.lookForLeader(FastLeaderElection.java:709)
 at org.apache.zookeeper.server.quorum.QuorumPeer.run(QuorumPeer.java:621)
 2010-06-01 16:20:32,561 - INFO 
 [QuorumPeer:/0:0:0:0:0:0:0:0:2181:quorump...@620] - LOOKING
 2010-06-01 16:20:32,562 - INFO 
 [QuorumPeer:/0:0:0:0:0:0:0:0:2181:fastleaderelect...@649] - New election. My 
 id = 0, Proposed zxid = 12
 2010-06-01 16:20:32,562 - INFO 
 [QuorumPeer:/0:0:0:0:0:0:0:0:2181:fastleaderelect...@689] - Notification: 0, 
 12, 3, 0, LOOKING, LOOKING, 0
 2010-06-01 16:20:32,562 - WARN 
 [QuorumPeer:/0:0:0:0:0:0:0:0:2181:quorump...@623] - Unexpected exception
 java.lang.NullPointerException
 Things like HBase require that the zookeeper servers be listed in the 
 zoo.cfg. This is a bug on their part, but zookeeper shouldn't null pointer in 
 a loop though.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Updated: (ZOOKEEPER-785) Zookeeper 3.3.1 shouldn't infinite loop if someone creates a server.0 line

2010-06-02 Thread Patrick Hunt (JIRA)

 [ 
https://issues.apache.org/jira/browse/ZOOKEEPER-785?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Patrick Hunt updated ZOOKEEPER-785:
---

Attachment: ZOOKEEPER-785.patch

this patch logs and error if only a single server is specified and starts in 
standalone mode.

This is similar to what happened pre-3.3 (we ignored the server line but didn't 
log it). In 3.3 having a single server line would cause the server to fail 
(infinite loop if server.0 for example)

  Zookeeper 3.3.1 shouldn't infinite loop if someone creates a server.0 line
 ---

 Key: ZOOKEEPER-785
 URL: https://issues.apache.org/jira/browse/ZOOKEEPER-785
 Project: Zookeeper
  Issue Type: Bug
  Components: server
Affects Versions: 3.3.1
 Environment: Tested in linux with a new jvm
Reporter: Alex Newman
Assignee: Patrick Hunt
 Fix For: 3.3.2, 3.4.0

 Attachments: ZOOKEEPER-785.patch


 The following config causes an infinite loop
 [zoo.cfg]
 tickTime=2000
 dataDir=/var/zookeeper/
 clientPort=2181
 initLimit=10
 syncLimit=5
 server.0=localhost:2888:3888
 Output:
 2010-06-01 16:20:32,471 - INFO [main:quorumpeerm...@119] - Starting quorum 
 peer
 2010-06-01 16:20:32,489 - INFO [main:nioservercnxn$fact...@143] - binding to 
 port 0.0.0.0/0.0.0.0:2181
 2010-06-01 16:20:32,504 - INFO [main:quorump...@818] - tickTime set to 2000
 2010-06-01 16:20:32,504 - INFO [main:quorump...@829] - minSessionTimeout set 
 to -1
 2010-06-01 16:20:32,505 - INFO [main:quorump...@840] - maxSessionTimeout set 
 to -1
 2010-06-01 16:20:32,505 - INFO [main:quorump...@855] - initLimit set to 10
 2010-06-01 16:20:32,526 - INFO [main:files...@82] - Reading snapshot 
 /var/zookeeper/version-2/snapshot.c
 2010-06-01 16:20:32,547 - INFO [Thread-1:quorumcnxmanager$liste...@436] - My 
 election bind port: 3888
 2010-06-01 16:20:32,554 - INFO 
 [QuorumPeer:/0:0:0:0:0:0:0:0:2181:quorump...@620] - LOOKING
 2010-06-01 16:20:32,556 - INFO 
 [QuorumPeer:/0:0:0:0:0:0:0:0:2181:fastleaderelect...@649] - New election. My 
 id = 0, Proposed zxid = 12
 2010-06-01 16:20:32,558 - INFO 
 [QuorumPeer:/0:0:0:0:0:0:0:0:2181:fastleaderelect...@689] - Notification: 0, 
 12, 1, 0, LOOKING, LOOKING, 0
 2010-06-01 16:20:32,560 - WARN 
 [QuorumPeer:/0:0:0:0:0:0:0:0:2181:quorump...@623] - Unexpected exception
 java.lang.NullPointerException
 at 
 org.apache.zookeeper.server.quorum.FastLeaderElection.totalOrderPredicate(FastLeaderElection.java:496)
 at 
 org.apache.zookeeper.server.quorum.FastLeaderElection.lookForLeader(FastLeaderElection.java:709)
 at org.apache.zookeeper.server.quorum.QuorumPeer.run(QuorumPeer.java:621)
 2010-06-01 16:20:32,560 - INFO 
 [QuorumPeer:/0:0:0:0:0:0:0:0:2181:quorump...@620] - LOOKING
 2010-06-01 16:20:32,560 - INFO 
 [QuorumPeer:/0:0:0:0:0:0:0:0:2181:fastleaderelect...@649] - New election. My 
 id = 0, Proposed zxid = 12
 2010-06-01 16:20:32,561 - INFO 
 [QuorumPeer:/0:0:0:0:0:0:0:0:2181:fastleaderelect...@689] - Notification: 0, 
 12, 2, 0, LOOKING, LOOKING, 0
 2010-06-01 16:20:32,561 - WARN 
 [QuorumPeer:/0:0:0:0:0:0:0:0:2181:quorump...@623] - Unexpected exception
 java.lang.NullPointerException
 at 
 org.apache.zookeeper.server.quorum.FastLeaderElection.totalOrderPredicate(FastLeaderElection.java:496)
 at 
 org.apache.zookeeper.server.quorum.FastLeaderElection.lookForLeader(FastLeaderElection.java:709)
 at org.apache.zookeeper.server.quorum.QuorumPeer.run(QuorumPeer.java:621)
 2010-06-01 16:20:32,561 - INFO 
 [QuorumPeer:/0:0:0:0:0:0:0:0:2181:quorump...@620] - LOOKING
 2010-06-01 16:20:32,562 - INFO 
 [QuorumPeer:/0:0:0:0:0:0:0:0:2181:fastleaderelect...@649] - New election. My 
 id = 0, Proposed zxid = 12
 2010-06-01 16:20:32,562 - INFO 
 [QuorumPeer:/0:0:0:0:0:0:0:0:2181:fastleaderelect...@689] - Notification: 0, 
 12, 3, 0, LOOKING, LOOKING, 0
 2010-06-01 16:20:32,562 - WARN 
 [QuorumPeer:/0:0:0:0:0:0:0:0:2181:quorump...@623] - Unexpected exception
 java.lang.NullPointerException
 Things like HBase require that the zookeeper servers be listed in the 
 zoo.cfg. This is a bug on their part, but zookeeper shouldn't null pointer in 
 a loop though.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Updated: (ZOOKEEPER-785) Zookeeper 3.3.1 shouldn't infinite loop if someone creates a server.0 line

2010-06-02 Thread Patrick Hunt (JIRA)

 [ 
https://issues.apache.org/jira/browse/ZOOKEEPER-785?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Patrick Hunt updated ZOOKEEPER-785:
---

Status: Patch Available  (was: Open)

  Zookeeper 3.3.1 shouldn't infinite loop if someone creates a server.0 line
 ---

 Key: ZOOKEEPER-785
 URL: https://issues.apache.org/jira/browse/ZOOKEEPER-785
 Project: Zookeeper
  Issue Type: Bug
  Components: server
Affects Versions: 3.3.1
 Environment: Tested in linux with a new jvm
Reporter: Alex Newman
Assignee: Patrick Hunt
 Fix For: 3.3.2, 3.4.0

 Attachments: ZOOKEEPER-785.patch


 The following config causes an infinite loop
 [zoo.cfg]
 tickTime=2000
 dataDir=/var/zookeeper/
 clientPort=2181
 initLimit=10
 syncLimit=5
 server.0=localhost:2888:3888
 Output:
 2010-06-01 16:20:32,471 - INFO [main:quorumpeerm...@119] - Starting quorum 
 peer
 2010-06-01 16:20:32,489 - INFO [main:nioservercnxn$fact...@143] - binding to 
 port 0.0.0.0/0.0.0.0:2181
 2010-06-01 16:20:32,504 - INFO [main:quorump...@818] - tickTime set to 2000
 2010-06-01 16:20:32,504 - INFO [main:quorump...@829] - minSessionTimeout set 
 to -1
 2010-06-01 16:20:32,505 - INFO [main:quorump...@840] - maxSessionTimeout set 
 to -1
 2010-06-01 16:20:32,505 - INFO [main:quorump...@855] - initLimit set to 10
 2010-06-01 16:20:32,526 - INFO [main:files...@82] - Reading snapshot 
 /var/zookeeper/version-2/snapshot.c
 2010-06-01 16:20:32,547 - INFO [Thread-1:quorumcnxmanager$liste...@436] - My 
 election bind port: 3888
 2010-06-01 16:20:32,554 - INFO 
 [QuorumPeer:/0:0:0:0:0:0:0:0:2181:quorump...@620] - LOOKING
 2010-06-01 16:20:32,556 - INFO 
 [QuorumPeer:/0:0:0:0:0:0:0:0:2181:fastleaderelect...@649] - New election. My 
 id = 0, Proposed zxid = 12
 2010-06-01 16:20:32,558 - INFO 
 [QuorumPeer:/0:0:0:0:0:0:0:0:2181:fastleaderelect...@689] - Notification: 0, 
 12, 1, 0, LOOKING, LOOKING, 0
 2010-06-01 16:20:32,560 - WARN 
 [QuorumPeer:/0:0:0:0:0:0:0:0:2181:quorump...@623] - Unexpected exception
 java.lang.NullPointerException
 at 
 org.apache.zookeeper.server.quorum.FastLeaderElection.totalOrderPredicate(FastLeaderElection.java:496)
 at 
 org.apache.zookeeper.server.quorum.FastLeaderElection.lookForLeader(FastLeaderElection.java:709)
 at org.apache.zookeeper.server.quorum.QuorumPeer.run(QuorumPeer.java:621)
 2010-06-01 16:20:32,560 - INFO 
 [QuorumPeer:/0:0:0:0:0:0:0:0:2181:quorump...@620] - LOOKING
 2010-06-01 16:20:32,560 - INFO 
 [QuorumPeer:/0:0:0:0:0:0:0:0:2181:fastleaderelect...@649] - New election. My 
 id = 0, Proposed zxid = 12
 2010-06-01 16:20:32,561 - INFO 
 [QuorumPeer:/0:0:0:0:0:0:0:0:2181:fastleaderelect...@689] - Notification: 0, 
 12, 2, 0, LOOKING, LOOKING, 0
 2010-06-01 16:20:32,561 - WARN 
 [QuorumPeer:/0:0:0:0:0:0:0:0:2181:quorump...@623] - Unexpected exception
 java.lang.NullPointerException
 at 
 org.apache.zookeeper.server.quorum.FastLeaderElection.totalOrderPredicate(FastLeaderElection.java:496)
 at 
 org.apache.zookeeper.server.quorum.FastLeaderElection.lookForLeader(FastLeaderElection.java:709)
 at org.apache.zookeeper.server.quorum.QuorumPeer.run(QuorumPeer.java:621)
 2010-06-01 16:20:32,561 - INFO 
 [QuorumPeer:/0:0:0:0:0:0:0:0:2181:quorump...@620] - LOOKING
 2010-06-01 16:20:32,562 - INFO 
 [QuorumPeer:/0:0:0:0:0:0:0:0:2181:fastleaderelect...@649] - New election. My 
 id = 0, Proposed zxid = 12
 2010-06-01 16:20:32,562 - INFO 
 [QuorumPeer:/0:0:0:0:0:0:0:0:2181:fastleaderelect...@689] - Notification: 0, 
 12, 3, 0, LOOKING, LOOKING, 0
 2010-06-01 16:20:32,562 - WARN 
 [QuorumPeer:/0:0:0:0:0:0:0:0:2181:quorump...@623] - Unexpected exception
 java.lang.NullPointerException
 Things like HBase require that the zookeeper servers be listed in the 
 zoo.cfg. This is a bug on their part, but zookeeper shouldn't null pointer in 
 a loop though.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Commented: (ZOOKEEPER-335) zookeeper servers should commit the new leader txn to their logs.

2010-06-02 Thread Charity Majors (JIRA)

[ 
https://issues.apache.org/jira/browse/ZOOKEEPER-335?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12874888#action_12874888
 ] 

Charity Majors commented on ZOOKEEPER-335:
--

I ran into this bug this morning, but it also seemed to put my cluster into an 
unusable state.  The cluster stopped accepting all connections, until I 
restarted node one.  After node one departed the cluster, nodes two and three 
formed a quorum and started serving again.  Node one was unable to rejoin, and 
had this error:

2010-06-02 17:04:56,486 - FATAL [QuorumPeer:/0:0:0:0:0:0:0:0:2181:follo...@71] 
- Leader epoch a is less than our epoch b
2010-06-02 17:04:56,486 - WARN  [QuorumPeer:/0:0:0:0:0:0:0:0:2181:follo...@82] 
- Exception when following the leader
java.io.IOException: Error: Epoch of leader is lower

until I cleared the data directory and restarted again.

 zookeeper servers should commit the new leader txn to their logs.
 -

 Key: ZOOKEEPER-335
 URL: https://issues.apache.org/jira/browse/ZOOKEEPER-335
 Project: Zookeeper
  Issue Type: Bug
  Components: server
Affects Versions: 3.1.0
Reporter: Mahadev konar
Assignee: Mahadev konar
Priority: Blocker
 Fix For: 3.4.0


 currently the zookeeper followers do not commit the new leader election. This 
 will cause problems in a failure scenarios with a follower acking to the same 
 leader txn id twice, which might be two different intermittent leaders and 
 allowing them to propose two different txn's of the same zxid.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.