Re: permanent ZSESSIONMOVED

Michael Bauland Thu, 18 Mar 2010 02:13:07 -0700

Hi Patrick,

thanks for your detailed answer.



>> I'm not sure, but maybe my input could help, too. As I mentioned
>> earlier, I also run the three zookeeper servers not in a local
>> environment but across two sites in different countries (soon, in
>> production, it'll be three sites).
>> I'm accessing the zookeeper ensemble continuously with about 5 clients
>> for testing purposes. These clients are running on the same machine as
>> one of the three zookeeper servers. I have attached logs of two of the
>> zookeeper servers (one leader and one follower).
>> My clients have a way to recover from connectionloss by trying ten
>> times, then they wait for 5 seconds, close the zookeeper connection and
>> open a new one and try up to ten times again; then again waiting 10
>> secs, closing, reconnecting, trying, etc., up to 50 seconds, then they
>> fail.
> 
> The zk client lib will handle connection loss automatically, not sure
> what you mean by "try ten times" to recover from conn loss.
> 
> Basically the client lib will notify your watcher when the session is
> disconnected from the cluster. If you have "in process" operations they
> will get the "connection loss" exception. However once this happens you
> want to wait, the ZK client lib will attempt to connect to the cluster
> again (one of the servers in the connect string), once it does it will
> notify you again via the watcher (sync connected event).
> 
> If the session reconnects to the cluster within the session timeout then
> all watches are restored and nothing is necessary on your part.

Unfortunately there are things I have to take care of. As explained in
http://wiki.apache.org/hadoop/ZooKeeper/ErrorHandling, I don't know
whether my previous request went through. Of course this is no problem
for idempotent requests (I can just reissue the request), but for
non-indempotent requests I first have to find out, whether the request
succeeded or failed.
So for read-requests I have to repeat it to get the information I need
and for write requests I have to first issue a read request to find out,
whether the request went through. Those follow-up recovery requests are
the ones I ment by trying ten times before I close the connection, wait
and re-connect to try again.

If I understand you correctly, I should not retry my request right away,
but wait until I get notified.
So, when I establish a connection to the Zookeeper ensemble, I also
provide a watcher:
zk = new ZooKeeper (connectString, timeOut, watcher);

This watcher is very simple in the way that it has a
public final Integer mutex = -1;

and a method

@Override
public void process (WatchedEvent event)
{
  synchronized (mutex)
  {
    mutex.notifyAll ();
  }
}

So probably I will have to add some code that checks the event and
stores a special flag in the watcher for the "sync connected event" and
another one for the "session expired event".
Whenever I get a connectionloss or sessionmoved exception (those are the
recoverable ones) during my work with the zk object, I will need to issue a
watcher.mutex.wait ();
command and only retry if the watcher has the special sync connected
flag showing. And if I see the session expired flag in the watcher I
know I'll have to close and re-create a new zookeeper connection.

Is that the correct way to handle those recoverable errors?




>> Sometimes all of the clients fail at about the same time. When I look at
>> the zookeeper logs I see that they report that the client connection
>> limit has been reached (which is set to 10, since I didn't set the value
>> at all), although this shouldn't happen since I have just 5 clients
>> running and none of them opens two connections at the same time, since
>> they're just single threaded.
>>
> 
> These clients are all on the same ip right?

At the moment, yes, and it's the same IP as one of the zookeeper
servers. Though this is just for testing. Later in production the
clients will be on different IPs (though there may still be several
threads running on the same IP).


> Are you sure you are closing the sessions in all cases (the old ones)?

Yes, I always call zk.close (); in the end.

> It could also be the case that you
> 
> 1) create a session, it gets disco, so you close it and
> 2) create a new session (and so on)
> 
> the session created in 1) (even if you closed it) may still be
> considered alive by the server until a) it expires, b) the session close
> in 1) eventually makes it's way back to the server.

I see. But I set the session time-out to 3000 (i.e., 3 seconds), so they
shouldn't stay active too long after I disconnect, I guess.


>> To me it seems that there are some connections which actually shouldn't
>> be there (anymore).
> 
> The easiest way to see this is using the "dump" command on the leader.
> It will tell you the sessions that the cluster thinks is still active.
> Run this command while running your test and see what it reports...
> http://hadoop.apache.org/zookeeper/docs/current/zookeeperAdmin.html#sc_zkCommands
> 
> 
>> For the logs, note that the clients are running also on the server which
>> is at that time the follower and I replaced my IPs by letters.
> 
> Take a look at this session 0x226f49bb5d78ea8 for example:
> 
> 2010-03-08 15:27:20,525 - INFO
> [NIOServerCxn.Factory:2181:nioserverc...@639] - Creating new session
> 0x226f49bb5d78ea8
> 2010-03-08 15:27:22,253 - INFO  [CommitProcessor:0:nioserverc...@992] -
> Finished init of 0x226f49bb5d78ea8 valid:true
> 2010-03-08 15:27:22,254 - WARN
> [NIOServerCxn.Factory:2181:nioserverc...@518] - Exception causing close
> of session 0x226f49bb5d78ea8 due to java.io.IOException: Read error
> 2010-03-08 15:27:22,254 - INFO
> [NIOServerCxn.Factory:2181:nioserverc...@857] - closing
> session:0x226f49bb5d78ea8 NIOServerCnxn:
> java.nio.channels.SocketChannel[connected local=/A.B.C.E:2181
> remote=/A.B.C.D:49258]
> 2010-03-08 15:27:28,000 - INFO  [SessionTracker:sessiontrackeri...@133]
> - Expiring session 0x226f49bb5d78ea8
> 2010-03-08 15:27:28,001 - INFO  [SessionTracker:zookeeperser...@326] -
> Expiring session 0x226f49bb5d78ea8
> 2010-03-08 15:27:28,001 - INFO
> [ProcessThread:-1:preprequestproces...@384] - Processed session
> termination request for id: 0x226f49bb5d78ea8
> 
> Looks like it's using a 5sec timeout, I can see that it's connecting to
> the server, then the connection is failing (or client closing? I don't
> have the log for that) after just a fraction of a second. Then the
> server is expiring the session 5 seconds later. This means that the
> server never saw a close from the client...
> 
> here's an example where the client called close:
> 2010-03-08 15:27:19,053 - INFO
> [ProcessThread:-1:preprequestproces...@384] - Processed session
> termination request for id: 0x226f49bb5d78ea6
> 2010-03-08 15:27:20,099 - INFO  [CommitProcessor:0:nioserverc...@857] -
> closing session:0x226f49bb5d78ea6 NIOServerCnxn:
> java.nio.channels.SocketChannel[connected local=/A.B.C.E:2181
> remote=/A.B.C.D:49249]
> 
> checkout time index  15:27:20,605 in the leader logs. Looks to me like
> network connectivity problems... notice that a bit later the clients are
> renewing the sessions successfully.
> 
> 
> As I mentioned, try using the dump command:
> http://hadoop.apache.org/zookeeper/docs/current/zookeeperAdmin.html#sc_zkCommands
> 
> 
> and see if that sheds any light on the maxclientcnxns issue.
> 
> You could also look at both the client and server logs and correlate the
> sessions in your client against information in the server (ie which
> sessions are active at which times).
> 
> Hope this helps, regards,

Yes, thanks a lot. I will investigate a bit further.

Cheers,

Michael


-- 
____________________________________________________________________
     |       |
     | knipp |            Knipp  Medien und Kommunikation GmbH
      -------                    Technologiepark
                                 Martin-Schmeißer-Weg 9
                                 44227 Dortmund
                                 Deutschland

     Dipl.-Informatiker          Tel:    +49 231 9703-284
                                 Fax:    +49 231 9703-200
     Dr. Michael Bauland         SIP:    michael.baul...@knipp.de
     Software-Entwicklung        E-Mail: michael.baul...@knipp.de

                                 Registereintrag:
                                 Amtsgericht Dortmund, HRB 13728

                                 Geschäftsführer:
                                 Dietmar Knipp, Elmar Knipp

Re: permanent ZSESSIONMOVED

Reply via email to