ZOOKEEPER-4766 - Avoid snapshots during leader election

2024-06-07 Thread Anup Ghatage
info used for something upon restart that is necessary for correctness? Is it mainly important for telemetry? If it is not needed for correctness could skipping this snapshotting step during leader election be made a configurable option? As per my knowledge, If session info is not persisted, sessions wil

[jira] [Created] (ZOOKEEPER-4771) Fast leader election taking too long

2023-11-22 Thread Ivo Vrdoljak (Jira)
Ivo Vrdoljak created ZOOKEEPER-4771: --- Summary: Fast leader election taking too long Key: ZOOKEEPER-4771 URL: https://issues.apache.org/jira/browse/ZOOKEEPER-4771 Project: ZooKeeper Issue

[jira] [Created] (ZOOKEEPER-4766) Ensure leader election time does not unnecessarily scale with tree size due to snapshotting

2023-10-30 Thread Rishabh Rai (Jira)
Rishabh Rai created ZOOKEEPER-4766: -- Summary: Ensure leader election time does not unnecessarily scale with tree size due to snapshotting Key: ZOOKEEPER-4766 URL: https://issues.apache.org/jira/browse/ZOOKEEPER

[jira] [Created] (ZOOKEEPER-4620) zookeeper leader election time metric not reported correctly

2022-09-30 Thread Benton Liang (Jira)
Benton Liang created ZOOKEEPER-4620: --- Summary: zookeeper leader election time metric not reported correctly Key: ZOOKEEPER-4620 URL: https://issues.apache.org/jira/browse/ZOOKEEPER-4620 Project

[jira] [Created] (ZOOKEEPER-4502) SyncRequestProcessor leak when leader election occurred

2022-03-28 Thread xiongjianbo (Jira)
xiongjianbo created ZOOKEEPER-4502: -- Summary: SyncRequestProcessor leak when leader election occurred Key: ZOOKEEPER-4502 URL: https://issues.apache.org/jira/browse/ZOOKEEPER-4502 Project: ZooKeeper

[jira] [Created] (ZOOKEEPER-4316) Leader election fails due to SocketTimeoutException in QuorumCnxManager

2021-06-11 Thread Arun Subramanian R (Jira)
Arun Subramanian R created ZOOKEEPER-4316: - Summary: Leader election fails due to SocketTimeoutException in QuorumCnxManager Key: ZOOKEEPER-4316 URL: https://issues.apache.org/jira/browse/ZOOKEEPER-4316

[jira] [Created] (ZOOKEEPER-4220) Redundant connection attempts during leader election

2021-02-23 Thread Alex Mirgorodskiy (Jira)
Alex Mirgorodskiy created ZOOKEEPER-4220: Summary: Redundant connection attempts during leader election Key: ZOOKEEPER-4220 URL: https://issues.apache.org/jira/browse/ZOOKEEPER-4220 Project

[jira] [Created] (ZOOKEEPER-4185) Addition of new ZooTrace Mask LEADER_ELECTION_MASK and related log lines for better leader election telemetry

2021-01-19 Thread Julian Chow (Jira)
Julian Chow created ZOOKEEPER-4185: -- Summary: Addition of new ZooTrace Mask LEADER_ELECTION_MASK and related log lines for better leader election telemetry Key: ZOOKEEPER-4185 URL: https://issues.apache.org/jira

[jira] [Created] (ZOOKEEPER-4183) Leader election not working when using hostname in server config and hostname resolves to an internal IP addresses

2021-01-19 Thread Harald Musum (Jira)
Harald Musum created ZOOKEEPER-4183: --- Summary: Leader election not working when using hostname in server config and hostname resolves to an internal IP addresses Key: ZOOKEEPER-4183 URL: https

[jira] [Created] (ZOOKEEPER-3923) Leader election issues with Istio

2020-08-28 Thread Matteo Merli (Jira)
Matteo Merli created ZOOKEEPER-3923: --- Summary: Leader election issues with Istio Key: ZOOKEEPER-3923 URL: https://issues.apache.org/jira/browse/ZOOKEEPER-3923 Project: ZooKeeper Issue Type

[jira] [Created] (ZOOKEEPER-3769) fast leader election does not end if leader is taken down

2020-03-24 Thread Lasaro Camargos (Jira)
Lasaro Camargos created ZOOKEEPER-3769: -- Summary: fast leader election does not end if leader is taken down Key: ZOOKEEPER-3769 URL: https://issues.apache.org/jira/browse/ZOOKEEPER-3769 Project

[jira] [Created] (ZOOKEEPER-3757) Transaction log sync can take 20+ seconds after leader election when there is a large snapCount

2020-03-14 Thread Alex Kaiser (Jira)
Alex Kaiser created ZOOKEEPER-3757: -- Summary: Transaction log sync can take 20+ seconds after leader election when there is a large snapCount Key: ZOOKEEPER-3757 URL: https://issues.apache.org/jira/browse

[jira] [Created] (ZOOKEEPER-3537) Leader election - Use of out of election messages

2019-09-07 Thread Karolos Antoniadis (Jira)
Karolos Antoniadis created ZOOKEEPER-3537: - Summary: Leader election - Use of out of election messages Key: ZOOKEEPER-3537 URL: https://issues.apache.org/jira/browse/ZOOKEEPER-3537 Project

[jira] [Created] (ZOOKEEPER-3479) Logging false leader election times

2019-07-30 Thread Karolos Antoniadis (JIRA)
Karolos Antoniadis created ZOOKEEPER-3479: - Summary: Logging false leader election times Key: ZOOKEEPER-3479 URL: https://issues.apache.org/jira/browse/ZOOKEEPER-3479 Project: ZooKeeper

Re: Leader election logging during reconfiguration

2019-07-30 Thread Michael Han
w long the conventional FLE actually took. On Mon, Jul 29, 2019 at 7:13 PM Alexander Shraer wrote: > Please see comments inline. > > Thanks, > Alex > > On Mon, Jul 29, 2019 at 5:29 PM Karolos Antoniadis > wrote: > > > Hi ZooKeeper developers, > > > > ZooKee

Re: Leader election logging during reconfiguration

2019-07-29 Thread Alexander Shraer
Please see comments inline. Thanks, Alex On Mon, Jul 29, 2019 at 5:29 PM Karolos Antoniadis wrote: > Hi ZooKeeper developers, > > ZooKeeper seems to be logging a "*LEADER ELECTION TOOK*" message even > though no leader election takes place during a reconfiguration. >

Leader election logging during reconfiguration

2019-07-29 Thread Karolos Antoniadis
Hi ZooKeeper developers, ZooKeeper seems to be logging a "*LEADER ELECTION TOOK*" message even though no leader election takes place during a reconfiguration. This can be reproduced by following these steps: 1) start a ZooKeeper cluster (e.g., 3 participants) 2) start a client tha

[jira] [Created] (ZOOKEEPER-3456) Service temporarily unavailable due to an ongoing leader election. Please refresh

2019-07-07 Thread Marzieh (JIRA)
Marzieh created ZOOKEEPER-3456: -- Summary: Service temporarily unavailable due to an ongoing leader election. Please refresh Key: ZOOKEEPER-3456 URL: https://issues.apache.org/jira/browse/ZOOKEEPER-3456

[jira] [Commented] (ZOOKEEPER-3336) Leader election terminated, two leaders or not following leader or not having state

2019-05-10 Thread Simin Oraee (JIRA)
election from the beginning. I mean it is not an infinite sequence of message passing and I just provide 3 slices of this infinite sequence. Therefore each case is just one leader election protocol which starts with 3 nodes and finishes when there was no enabled message. Therefore in case 2

[jira] [Commented] (ZOOKEEPER-3336) Leader election terminated, two leaders or not following leader or not having state

2019-05-10 Thread Fangmin Lv (JIRA)
changed it's state to following, it received vote from 4, and changed mind to vote for 4, so 3 and 4 will keep voting 4 and in LOOKING state.  > Leader election terminated, two leaders or not following leader or not having >

[GitHub] [zookeeper] Lagrang edited a comment on issue #863: ZOOKEEPER-3320: Leader election port stop listen when hostname unresolvable for some time

2019-03-29 Thread GitBox
Lagrang edited a comment on issue #863: ZOOKEEPER-3320: Leader election port stop listen when hostname unresolvable for some time URL: https://github.com/apache/zookeeper/pull/863#issuecomment-477946006 Zookeeper binds to LE port on start, if you can't bind to it, I think fail-fast

[GitHub] [zookeeper] Lagrang commented on issue #863: ZOOKEEPER-3320: Leader election port stop listen when hostname unresolvable for some time

2019-03-29 Thread GitBox
Lagrang commented on issue #863: ZOOKEEPER-3320: Leader election port stop listen when hostname unresolvable for some time URL: https://github.com/apache/zookeeper/pull/863#issuecomment-477946006 Zookeeper binds to LE port on start, if you can't bind to it, I think fail-fast is more

[jira] [Commented] (ZOOKEEPER-3336) Leader election terminated, two leaders or not following leader or not having state

2019-03-29 Thread Simin Oraee (JIRA)
and I appreciate your time. Then what about the other two conditions I mentioned? 2) There are some nodes that follow nodes other than the leaders. 3) There are some nodes that neither following nor leading > Leader election terminated, two leaders or not following leader or not having >

[GitHub] [zookeeper] maoling commented on issue #863: ZOOKEEPER-3320: Leader election port stop listen when hostname unresolvable for some time

2019-03-29 Thread GitBox
maoling commented on issue #863: ZOOKEEPER-3320: Leader election port stop listen when hostname unresolvable for some time URL: https://github.com/apache/zookeeper/pull/863#issuecomment-477893575 After extending the `ZooKeeperCriticalThread`, When `QuorumCnxManager.Listener `had failed

[GitHub] [zookeeper] Lagrang commented on issue #863: ZOOKEEPER-3320: Leader election port stop listen when hostname unresolvable for some time

2019-03-29 Thread GitBox
Lagrang commented on issue #863: ZOOKEEPER-3320: Leader election port stop listen when hostname unresolvable for some time URL: https://github.com/apache/zookeeper/pull/863#issuecomment-477882906 @maoling Can you please describe your concerns

[GitHub] [zookeeper] maoling commented on issue #863: ZOOKEEPER-3320: Leader election port stop listen when hostname unresolvable for some time

2019-03-28 Thread GitBox
maoling commented on issue #863: ZOOKEEPER-3320: Leader election port stop listen when hostname unresolvable for some time URL: https://github.com/apache/zookeeper/pull/863#issuecomment-477839636 I have reservations about extending `QuorumCnxManager.Listener

[GitHub] [zookeeper] enixon commented on issue #863: ZOOKEEPER-3320: Leader election port stop listen when hostname unresolvable for some time

2019-03-28 Thread GitBox
enixon commented on issue #863: ZOOKEEPER-3320: Leader election port stop listen when hostname unresolvable for some time URL: https://github.com/apache/zookeeper/pull/863#issuecomment-477804567 Good find @maoling ! I don't see any discussion in ZOOKEEPER-602 about criteria for what

[jira] [Commented] (ZOOKEEPER-3336) Leader election terminated, two leaders or not following leader or not having state

2019-03-28 Thread Fangmin Lv (JIRA)
of leader election # node 4 and 5 mentioned 5 is leader, but 1, 2, 3 only following 5 when there is another majority confirmed it, which is not # node 1, 2, 3 voted for 3, 3 gets majority so it's start leading, meanwhile 5 is still waiting for another peer to join before it's timed out > Lea

[GitHub] [zookeeper] maoling commented on issue #863: ZOOKEEPER-3320: Leader election port stop listen when hostname unresolvable for some time

2019-03-28 Thread GitBox
maoling commented on issue #863: ZOOKEEPER-3320: Leader election port stop listen when hostname unresolvable for some time URL: https://github.com/apache/zookeeper/pull/863#issuecomment-477471802 - There was a [sum-up](https://issues.apache.org/jira/browse/ZOOKEEPER-602?focusedCommentId

[GitHub] [zookeeper] enixon commented on issue #863: ZOOKEEPER-3320: Leader election port stop listen when hostname unresolvable for some time

2019-03-27 Thread GitBox
enixon commented on issue #863: ZOOKEEPER-3320: Leader election port stop listen when hostname unresolvable for some time URL: https://github.com/apache/zookeeper/pull/863#issuecomment-477303890 Thanks for the new test

[GitHub] [zookeeper] Lagrang edited a comment on issue #863: ZOOKEEPER-3320: Leader election port stop listen when hostname unresolvable for some time

2019-03-27 Thread GitBox
Lagrang edited a comment on issue #863: ZOOKEEPER-3320: Leader election port stop listen when hostname unresolvable for some time URL: https://github.com/apache/zookeeper/pull/863#issuecomment-477172489 Push changes which extend `QuorumCnxManager.Listener` from `ZookeeperCriticalThread

[GitHub] [zookeeper] Lagrang commented on issue #863: ZOOKEEPER-3320: Leader election port stop listen when hostname unresolvable for some time

2019-03-27 Thread GitBox
Lagrang commented on issue #863: ZOOKEEPER-3320: Leader election port stop listen when hostname unresolvable for some time URL: https://github.com/apache/zookeeper/pull/863#issuecomment-477172489 Push changes to extends QuorumCnxManager.Listener from ZookeeperCriticalThread. Plus test

[GitHub] [zookeeper] enixon commented on issue #863: ZOOKEEPER-3320: Leader election port stop listen when hostname unresolvable for some time

2019-03-25 Thread GitBox
enixon commented on issue #863: ZOOKEEPER-3320: Leader election port stop listen when hostname unresolvable for some time URL: https://github.com/apache/zookeeper/pull/863#issuecomment-476324387 Thanks for adding docs! I'm wary of adding an infinite loop for the same reasons

[GitHub] [zookeeper] Lagrang commented on issue #863: ZOOKEEPER-3320: Leader election port stop listen when hostname unresolvable for some time

2019-03-25 Thread GitBox
Lagrang commented on issue #863: ZOOKEEPER-3320: Leader election port stop listen when hostname unresolvable for some time URL: https://github.com/apache/zookeeper/pull/863#issuecomment-476284089 Add doc for introduced property to `zookeeperAdmin.md` file, `Cluster Options` section

[GitHub] [zookeeper] Lagrang commented on issue #863: ZOOKEEPER-3320: Leader election port stop listen when hostname unresolvable for some time

2019-03-24 Thread GitBox
Lagrang commented on issue #863: ZOOKEEPER-3320: Leader election port stop listen when hostname unresolvable for some time URL: https://github.com/apache/zookeeper/pull/863#issuecomment-475969622 > An infinite loop is a better option, especially for the availability? because the contai

[GitHub] [zookeeper] maoling commented on issue #863: ZOOKEEPER-3320: Leader election port stop listen when hostname unresolvable for some time

2019-03-24 Thread GitBox
maoling commented on issue #863: ZOOKEEPER-3320: Leader election port stop listen when hostname unresolvable for some time URL: https://github.com/apache/zookeeper/pull/863#issuecomment-475965281 @Lagrang - you also need to doc this property in the `zookeeperAdmin.md` - Look back

[GitHub] [zookeeper] Lagrang commented on issue #863: ZOOKEEPER-3320: Leader election port stop listen when hostname unresolvable for some time

2019-03-23 Thread GitBox
Lagrang commented on issue #863: ZOOKEEPER-3320: Leader election port stop listen when hostname unresolvable for some time URL: https://github.com/apache/zookeeper/pull/863#issuecomment-475885134 retest maven build

[GitHub] [zookeeper] eolivelli commented on issue #863: ZOOKEEPER-3320: Leader election port stop listen when hostname unresolvable for some time

2019-03-22 Thread GitBox
eolivelli commented on issue #863: ZOOKEEPER-3320: Leader election port stop listen when hostname unresolvable for some time URL: https://github.com/apache/zookeeper/pull/863#issuecomment-475788393 retest maven build

[GitHub] [zookeeper] eolivelli commented on issue #863: ZOOKEEPER-3320: Leader election port stop listen when hostname unresolvable for some time

2019-03-22 Thread GitBox
eolivelli commented on issue #863: ZOOKEEPER-3320: Leader election port stop listen when hostname unresolvable for some time URL: https://github.com/apache/zookeeper/pull/863#issuecomment-475767323 Even Travis

[GitHub] [zookeeper] enixon commented on issue #863: ZOOKEEPER-3320: Leader election port stop listen when hostname unresolvable for some time

2019-03-22 Thread GitBox
enixon commented on issue #863: ZOOKEEPER-3320: Leader election port stop listen when hostname unresolvable for some time URL: https://github.com/apache/zookeeper/pull/863#issuecomment-475741434 Some thing is wrong with JenkinsMaven here

[GitHub] [zookeeper] Lagrang opened a new pull request #863: ZOOKEEPER-3320: Leader election port stop listen when hostname unresolvable for some time

2019-03-22 Thread GitBox
Lagrang opened a new pull request #863: ZOOKEEPER-3320: Leader election port stop listen when hostname unresolvable for some time URL: https://github.com/apache/zookeeper/pull/863 This is an automated message from

[GitHub] [zookeeper] Lagrang closed pull request #863: ZOOKEEPER-3320: Leader election port stop listen when hostname unresolvable for some time

2019-03-22 Thread GitBox
Lagrang closed pull request #863: ZOOKEEPER-3320: Leader election port stop listen when hostname unresolvable for some time URL: https://github.com/apache/zookeeper/pull/863 This is an automated message from the Apache Git

[GitHub] [zookeeper] Lagrang closed pull request #863: ZOOKEEPER-3320: Leader election port stop listen when hostname unresolvable for some time

2019-03-22 Thread GitBox
Lagrang closed pull request #863: ZOOKEEPER-3320: Leader election port stop listen when hostname unresolvable for some time URL: https://github.com/apache/zookeeper/pull/863 This is an automated message from the Apache Git

[GitHub] [zookeeper] Lagrang opened a new pull request #863: ZOOKEEPER-3320: Leader election port stop listen when hostname unresolvable for some time

2019-03-22 Thread GitBox
Lagrang opened a new pull request #863: ZOOKEEPER-3320: Leader election port stop listen when hostname unresolvable for some time URL: https://github.com/apache/zookeeper/pull/863 This is an automated message from

[GitHub] [zookeeper] enixon commented on issue #863: ZOOKEEPER-3320: Leader election port stop listen when hostname unresolvable for some time

2019-03-22 Thread GitBox
enixon commented on issue #863: ZOOKEEPER-3320: Leader election port stop listen when hostname unresolvable for some time URL: https://github.com/apache/zookeeper/pull/863#issuecomment-475707431 LGTM! The test failure (NonRecoverableErrorTest::testZooKeeperServiceAvailableOnLeader) looks

[GitHub] [zookeeper] maoling commented on issue #863: ZOOKEEPER-3320: Leader election port stop listen when hostname unresolvable for some time

2019-03-22 Thread GitBox
maoling commented on issue #863: ZOOKEEPER-3320: Leader election port stop listen when hostname unresolvable for some time URL: https://github.com/apache/zookeeper/pull/863#issuecomment-475644114 @Lagrang if you're interested in the same issue, you can step into [this ](https

[GitHub] [zookeeper] Lagrang commented on a change in pull request #863: ZOOKEEPER-3320: Leader election port stop listen when hostname unresolvable for some time

2019-03-22 Thread GitBox
Lagrang commented on a change in pull request #863: ZOOKEEPER-3320: Leader election port stop listen when hostname unresolvable for some time URL: https://github.com/apache/zookeeper/pull/863#discussion_r268053495 ## File path: zookeeper-server/src/main/java/org/apache/zookeeper

[GitHub] [zookeeper] enixon commented on a change in pull request #863: ZOOKEEPER-3320: Leader election port stop listen when hostname unresolvable for some time

2019-03-21 Thread GitBox
enixon commented on a change in pull request #863: ZOOKEEPER-3320: Leader election port stop listen when hostname unresolvable for some time URL: https://github.com/apache/zookeeper/pull/863#discussion_r267946154 ## File path: zookeeper-server/src/main/java/org/apache/zookeeper

[jira] [Commented] (ZOOKEEPER-3320) Leader election port stop listen when hostname unresolvable for some time

2019-03-21 Thread Igor Skokov (JIRA)
-3.5, but as far as a know, this error can happen on master branch. If needed, I can create another pull request for master. > Leader election port stop listen when hostname unresolvable for some t

[jira] [Updated] (ZOOKEEPER-3320) Leader election port stop listen when hostname unresolvable for some time

2019-03-21 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/ZOOKEEPER-3320?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ZOOKEEPER-3320: -- Labels: pull-request-available (was: ) > Leader election port stop listen w

[GitHub] [zookeeper] Lagrang opened a new pull request #863: ZOOKEEPER-3320: Leader election port stop listen when hostname unresolvable for some time

2019-03-21 Thread GitBox
Lagrang opened a new pull request #863: ZOOKEEPER-3320: Leader election port stop listen when hostname unresolvable for some time URL: https://github.com/apache/zookeeper/pull/863 This is an automated message from

[jira] [Commented] (ZOOKEEPER-3320) Leader election port stop listen when hostname unresolvable for some time

2019-03-20 Thread Brian Nixon (JIRA)
. Either something like "election port bind time" or "dns unavailable time" if we want to be more general. Do you want to contribute a short diff? This may also be related to ZOOKEEPER-2982 (or may not, making a note to check later). > Leader election port stop listen whe

[jira] [Commented] (ZOOKEEPER-3320) Leader election port stop listen when hostname unresolvable for some time

2019-03-20 Thread Igor Skokov (JIRA)
server will continue to run without leader election participation:) ??Looking at this from the opposite direction, can you add the desired delay in the startup sequence of your Kubernetes container? My concern is that the pattern of "DNS is currently unreliable but will be reliable soon"

[jira] [Commented] (ZOOKEEPER-3320) Leader election port stop listen when hostname unresolvable for some time

2019-03-19 Thread Brian Nixon (JIRA)
in the startup sequence of your Kubernetes container? My concern is that the pattern of "DNS is currently unreliable but will be reliable soon" seems specific to the container management and may result in strange behavior when applied to other environments. > Leader election port sto

[jira] [Updated] (ZOOKEEPER-3320) Leader election port stop listen when hostname unresolvable for some time

2019-03-17 Thread Igor Skokov (JIRA)
that in some circumstances Zookeeper node stop listening on leader election port. This cause unavailability of ZK cluster. Zookeeper deployed  as StatefulSet in term of Kubernetes and has following dynamic configuration: {code:java} zookeeper-0.zookeeper:2182:2183:participant;2181 zookeeper-1

[jira] [Updated] (ZOOKEEPER-3320) Leader election port stop listen when hostname unresolvable for some time

2019-03-17 Thread Igor Skokov (JIRA)
[ https://issues.apache.org/jira/browse/ZOOKEEPER-3320?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Igor Skokov updated ZOOKEEPER-3320: --- Summary: Leader election port stop listen when hostname unresolvable for some time

[jira] [Created] (ZOOKEEPER-3320) Don't give up on bind of leader election port

2019-03-17 Thread Igor Skokov (JIRA)
Igor Skokov created ZOOKEEPER-3320: -- Summary: Don't give up on bind of leader election port Key: ZOOKEEPER-3320 URL: https://issues.apache.org/jira/browse/ZOOKEEPER-3320 Project: ZooKeeper

[jira] [Commented] (ZOOKEEPER-2164) fast leader election keeps failing

2019-01-24 Thread Ian Spence (JIRA)
cluster, we restarted one node and after an hour it still has not joined the quorum. stat and mntr show "This ZooKeeper instance is not currently serving requests". > fast leader election keeps failing > -- > > Key: ZOOKEEPER-2164

[jira] [Created] (ZOOKEEPER-3247) New lest admin command to get leader election time

2019-01-12 Thread Dinesh Appavoo (JIRA)
Dinesh Appavoo created ZOOKEEPER-3247: - Summary: New lest admin command to get leader election time Key: ZOOKEEPER-3247 URL: https://issues.apache.org/jira/browse/ZOOKEEPER-3247 Project: ZooKeeper

Re: Leader election

2018-12-12 Thread Michael Han
>> Can we reduce this time by configuring "syncLimit" and "tickTime" to let's say 5 seconds? Can we have a strong guarantee on this time bound? It's not possible to guarantee the time bound, because of FLP impossibility (reliable failure detection is not possible in async environment). Though

Re: Leader election

2018-12-11 Thread Michael Borokhovich
hat all replicas only > acknowledge writes with their idea of the current epoch for an object. > > What happens in the even of partition is that we have a few possible cases, > but in any case where data replicas are split by a partition, writes will > fail triggering a new leader election. Only

[jira] [Commented] (ZOOKEEPER-2461) There is no difference between the observer and the participants in the leader election algorithm

2018-12-11 Thread John Kim (JIRA)
een the observer and the participants in the > leader election algorithm > - > > Key: ZOOKEEPER-2461 > URL: https://issues.apache.org/jira/browse/ZOOKEEPER-2461 >

Re: Leader election

2018-12-10 Thread Ted Dunning
will fail triggering a new leader election. Only replicas on the side of the new ZK quorum (which may be the old quorum) have a chance of succeeding here. If the replicas are split away from the ZK quorum, writes may not be possible until the partition heals. If a new leader is elected

Re: Leader election

2018-12-10 Thread Michael Borokhovich
Thanks, Maciej. That sounds good. We will try playing with the parameters and have at least a known upper limit on the inconsistency interval. On Fri, Dec 7, 2018 at 2:11 AM Maciej Smoleński wrote: > On Fri, Dec 7, 2018 at 3:03 AM Michael Borokhovich > wrote: > > > We are planning to run

Re: Leader election

2018-12-10 Thread Michael Borokhovich
her ZK nodes. So, another client may take leadership while the > > current leader still unaware of the change. Is it true? > > > > Another follow up question. If Zookeeper can guarantee a single leader, > is > > it worth using it just for leader election? Maybe

Re: Leader election

2018-12-10 Thread Michael Borokhovich
Makes sense. Thanks, Ted. We will design our system to cope with the short periods where we might have two leaders. On Thu, Dec 6, 2018 at 11:03 PM Ted Dunning wrote: > ZK is able to guarantee that there is only one leader for the purposes of > updating ZK data. That is because all commits have

Re: Leader election

2018-12-07 Thread Maciej Smoleński
On Fri, Dec 7, 2018 at 3:03 AM Michael Borokhovich wrote: > We are planning to run Zookeeper nodes embedded with the client nodes. > I.e., each client runs also a ZK node. So, network partition will > disconnect a ZK node and not only the client. > My concern is about the following statement

Re: Leader election

2018-12-06 Thread Ted Dunning
ZK is able to guarantee that there is only one leader for the purposes of updating ZK data. That is because all commits have to originate with the current quorum leader and then be acknowledged by a quorum of the current cluster. IF the leader can't get enough acks, then it has de facto lost

Re: Leader election

2018-12-06 Thread Michael Borokhovich
We are planning to run Zookeeper nodes embedded with the client nodes. I.e., each client runs also a ZK node. So, network partition will disconnect a ZK node and not only the client. My concern is about the following statement from the ZK documentation: "Timeliness: The clients view of the system

Re: Leader election

2018-12-06 Thread Michael Han
Tweak timeout is tempting as your solution might work most of the time yet fail in certain cases (which others have pointed out). If the goal is absolute correctness then we should avoid timeout, which does not guarantee correctness as it only makes the problem hard to manifest. Fencing is the

Re: Leader election

2018-12-06 Thread Jordan Zimmerman
> Old service leader will detect network partition max 15 seconds after it > happened. If the old service leader is in a very long GC it will not detect the partition. In the face of VM pauses, etc. it's not possible to avoid 2 leaders for a short period of time. -JZ

Re: Leader election

2018-12-06 Thread Maciej Smoleński
er" client is connected to the partitioned ZK > node, > > it may be notified not at the same time as the other clients connected to > > the other ZK nodes. So, another client may take leadership while the > > current leader still unaware of the change. Is it true? > > > >

Re: Leader election

2018-12-06 Thread Jordan Zimmerman
nt may take leadership while the > current leader still unaware of the change. Is it true? > > Another follow up question. If Zookeeper can guarantee a single leader, is > it worth using it just for leader election? Maybe we can use a more > lightweight Hazelcast for example? > >

Re: Leader election

2018-12-06 Thread Michael Borokhovich
same time as the other clients connected to the other ZK nodes. So, another client may take leadership while the current leader still unaware of the change. Is it true? Another follow up question. If Zookeeper can guarantee a single leader, is it worth using it just for leader election? Maybe

Re: Leader election

2018-12-06 Thread Jordan Zimmerman
e have a service that runs on 3 hosts for high availability. However, at > any given time, exactly one instance must be active. So, we are thinking to > use Leader election using Zookeeper. > To this goal, on each service host we also start a ZK server, so we have a > 3-nodes ZK cluster and e

Reply: Re: Leader election

2018-12-06 Thread 毛蛤丝
> suggest you use the ready-made implements of curator: > http://curator.apache.org/curator-recipes/leader-election.html > - Original message - > From: Michael Borokhovich > To: "dev@zookeeper.apache.org" > Subject: Leader election > Date: 2018-12-06 07:29 > > Hello,

Re: Leader election

2018-12-05 Thread Enrico Olivelli
Michael, Leader election is not enough. You must have some mechanism to fence off the partitioned leader. If you are building a replicated state machine Apache Zookeeper + Apache Bookkeeper can be a good choice See this just an example: https://github.com/ivankelly/bookkeeper-tutorial

Re: Leader election

2018-12-05 Thread Michael Borokhovich
ade implements of curator: > http://curator.apache.org/curator-recipes/leader-election.html > - Original message - > From: Michael Borokhovich > To: "dev@zookeeper.apache.org" > Subject: Leader election > Date: 2018-12-06 07:29 > > Hello, > We have a service that runs o

Reply: Leader election

2018-12-05 Thread 毛蛤丝
suggest you use the ready-made implements of curator: http://curator.apache.org/curator-recipes/leader-election.html - Original message - From: Michael Borokhovich To: "dev@zookeeper.apache.org" Subject: Leader election Date: 2018-12-06 07:29 Hello, We have a service that runs on 3 host

Leader election

2018-12-05 Thread Michael Borokhovich
Hello, We have a service that runs on 3 hosts for high availability. However, at any given time, exactly one instance must be active. So, we are thinking to use Leader election using Zookeeper. To this goal, on each service host we also start a ZK server, so we have a 3-nodes ZK cluster and each

[jira] [Commented] (ZOOKEEPER-1814) Reduction of waiting time during Fast Leader Election

2018-11-22 Thread Daniel Peon (JIRA)
ing time during Fast Leader Election > - > > Key: ZOOKEEPER-1814 > URL: https://issues.apache.org/jira/browse/ZOOKEEPER-1814 > Project: ZooKeeper > Issue Type: Bug > Co

[jira] [Commented] (ZOOKEEPER-1814) Reduction of waiting time during Fast Leader Election

2018-11-21 Thread Hadoop QA (JIRA)
or modified tests. -1 patch. The patch command could not apply the patch. Console output: https://builds.apache.org/job/PreCommit-ZOOKEEPER-Build/3715//console This message is automatically generated. > Reduction of waiting time during Fast Leader Elect

[jira] [Commented] (ZOOKEEPER-1814) Reduction of waiting time during Fast Leader Election

2018-11-21 Thread Michael K. Edwards (JIRA)
branch-3.5? > Reduction of waiting time during Fast Leader Election > - > > Key: ZOOKEEPER-1814 > URL: https://issues.apache.org/jira/browse/ZOOKEEPER-1814 > Project: ZooKeeper >

[jira] [Commented] (ZOOKEEPER-2164) fast leader election keeps failing

2018-11-21 Thread Michael K. Edwards (JIRA)
branch-3.5 code? > fast leader election keeps failing > -- > > Key: ZOOKEEPER-2164 > URL: https://issues.apache.org/jira/browse/ZOOKEEPER-2164 > Project: ZooKeeper > Issue Type: Bug

[jira] [Updated] (ZOOKEEPER-2845) Data inconsistency issue due to retain database in leader election

2018-09-13 Thread ASF GitHub Bot (JIRA)
ain database in leader election > -- > > Key: ZOOKEEPER-2845 > URL: https://issues.apache.org/jira/browse/ZOOKEEPER-2845 > Project: ZooKeeper > Issue Type: Bug

[jira] [Comment Edited] (ZOOKEEPER-2845) Data inconsistency issue due to retain database in leader election

2018-09-13 Thread Fangmin Lv (JIRA)
again for moving this forward!  > Data inconsistency issue due to retain database in leader election > -- > > Key: ZOOKEEPER-2845 > URL: https://issues.apache.org/jira/browse

[jira] [Commented] (ZOOKEEPER-2845) Data inconsistency issue due to retain database in leader election

2018-09-13 Thread Fangmin Lv (JIRA)
 heavier and complexity, we may change to use this simpler solution as well. Thanks again for moving this forward!  > Data inconsistency issue due to retain database in leader election > -- > > Key: ZO

[jira] [Commented] (ZOOKEEPER-3084) Exit when ZooKeeper cannot bind to the leader election port

2018-07-12 Thread Hadoop QA (JIRA)
://builds.apache.org/job/PreCommit-ZOOKEEPER-github-pr-build/1932//console This message is automatically generated. > Exit when ZooKeeper cannot bind to the leader election port > --- > > Key: ZOOKEEPER-3084 >

[jira] [Commented] (ZOOKEEPER-3084) Exit when ZooKeeper cannot bind to the leader election port

2018-07-11 Thread Andor Molnar (JIRA)
a separate pull request for the 3.5 branch? I think that would be the easiest to get this done. > Exit when ZooKeeper cannot bind to the leader election port > --- > > Key: ZOOKEEPER-3084 >

[jira] [Commented] (ZOOKEEPER-3084) Exit when ZooKeeper cannot bind to the leader election port

2018-07-10 Thread Hudson (JIRA)
[https://builds.apache.org/job/ZooKeeper-trunk/95/]) ZOOKEEPER-3084: Exit when ZooKeeper cannot bind to the leader election (hanm: rev c2e7ed1e6f8f2de48778db7f3d63f9629c086ea8) * (edit) src/java/main/org/apache/zookeeper/server/quorum/QuorumCnxManager.java > Exit when ZooKeeper cannot b

[jira] [Commented] (ZOOKEEPER-3084) Exit when ZooKeeper cannot bind to the leader election port

2018-07-10 Thread Michael Han (JIRA)
/zookeeper/commit/c2e7ed1e6f8f2de48778db7f3d63f9629c086ea8] tried to merge to 3.5 but there are merge conflicts. have this on record so we can get this into 3.5 at some point before next release. > Exit when ZooKeeper cannot bind to the leader election p

[jira] [Issue Comment Deleted] (ZOOKEEPER-3084) Exit when ZooKeeper cannot bind to the leader election port

2018-07-10 Thread Michael Han (JIRA)
not bind to the leader election port > --- > > Key: ZOOKEEPER-3084 > URL: https://issues.apache.org/jira/browse/ZOOKEEPER-3084 > Project: ZooKeeper > Issue Type: Improvement

[jira] [Resolved] (ZOOKEEPER-3084) Exit when ZooKeeper cannot bind to the leader election port

2018-07-10 Thread Michael Han (JIRA)
[ https://issues.apache.org/jira/browse/ZOOKEEPER-3084?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Han resolved ZOOKEEPER-3084. Resolution: Fixed > Exit when ZooKeeper cannot bind to the leader election p

[jira] [Commented] (ZOOKEEPER-3084) Exit when ZooKeeper cannot bind to the leader election port

2018-07-09 Thread Mohamedvolt (JIRA)
per cannot bind to the leader election port > --- > > Key: ZOOKEEPER-3084 > URL: https://issues.apache.org/jira/browse/ZOOKEEPER-3084 > Project: ZooKeeper > Is

[jira] [Updated] (ZOOKEEPER-3084) Exit when ZooKeeper cannot bind to the leader election port

2018-07-08 Thread Fangmin Lv (JIRA)
[ https://issues.apache.org/jira/browse/ZOOKEEPER-3084?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Fangmin Lv updated ZOOKEEPER-3084: -- Summary: Exit when ZooKeeper cannot bind to the leader election port (was: Exit when zeus

[jira] [Commented] (ZOOKEEPER-3084) Exit when zeus cannot bind to the leader election port

2018-07-06 Thread Hadoop QA (JIRA)
Console output: https://builds.apache.org/job/PreCommit-ZOOKEEPER-github-pr-build/1898//console This message is automatically generated. > Exit when zeus cannot bind to the leader election port > -- > > Key: ZO

[jira] [Updated] (ZOOKEEPER-3084) Exit when zeus cannot bind to the leader election port

2018-07-05 Thread ASF GitHub Bot (JIRA)
not bind to the leader election port > -- > > Key: ZOOKEEPER-3084 > URL: https://issues.apache.org/jira/browse/ZOOKEEPER-3084 > Project: ZooKeeper > Issue Type: Improvement >

[jira] [Created] (ZOOKEEPER-3084) Exit when zeus cannot bind to the leader election port

2018-07-05 Thread Fangmin Lv (JIRA)
Fangmin Lv created ZOOKEEPER-3084: - Summary: Exit when zeus cannot bind to the leader election port Key: ZOOKEEPER-3084 URL: https://issues.apache.org/jira/browse/ZOOKEEPER-3084 Project: ZooKeeper

[jira] [Commented] (ZOOKEEPER-2845) Data inconsistency issue due to retain database in leader election

2018-02-27 Thread ASF GitHub Bot (JIRA)
: https://github.com/apache/zookeeper/pull/453 Thanks @afine I closed them. > Data inconsistency issue due to retain database in leader election > -- > > Key: ZOOKEEPER-2845 >

[jira] [Commented] (ZOOKEEPER-2845) Data inconsistency issue due to retain database in leader election

2018-02-27 Thread ASF GitHub Bot (JIRA)
at: https://github.com/apache/zookeeper/pull/455 > Data inconsistency issue due to retain database in leader election > -- > > Key: ZOOKEEPER-2845 > URL: https://issues.apache.
