Raul Gutierrez Segales created ZOOKEEPER-2098:
-------------------------------------------------

             Summary: QuorumCnxManager: use BufferedOutputStream for initial msg
                 Key: ZOOKEEPER-2098
                 URL: https://issues.apache.org/jira/browse/ZOOKEEPER-2098
             Project: ZooKeeper
          Issue Type: Improvement
          Components: quorum, server
    Affects Versions: 3.5.0
            Reporter: Raul Gutierrez Segales
            Assignee: Raul Gutierrez Segales
             Fix For: 3.5.1


Whilst writing fle-dump (a tool like 
[zk-dump|https://github.com/twitter/zktraffic/], but to dump FastLeaderElection 
messages), I noticed that QCM is using DataOutputStream (which doesn't buffer) 
directly.

So all calls to write() are written immediately to the network, which means 
simple messaages like two participants exchanging Votes can take a couple RTTs! 
This is specially terrible for global clusters (i.e.: x-country RTTs).

The solution is to use BufferedOutputStream for the initial negotiation between 
members of the cluster. Note that there are other places were suboptimal (but 
not entirely unbuffered) writes to the network still exist. I'll get those in 
separate tickets.

After using BufferedOutputStream we get only 1 RTT for the initial message, so 
elections & time for for participants to join a cluster is reduced.




--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to