Messenger doesn't seem to have any clean way to recover from errors.

Fraser Adams Sat, 04 Oct 2014 10:41:01 -0700

Is there any way to recover from Messenger errors short of completelyfreeing the messenger instance and starting with a new one?

I've been deliberately making it fail, so for example starting amessenger with subscriptions like this:

amqp://~0.0.0.0,localhost:5672

with no broker running the first subscription should succeed and thesecond one should fail

In my case it's a bit more awkward because it's fully asynchronous, butwhat I see in this case is that it creates a connection instance tolocalhost:5672 because in pn_connect there is a test for


  if (connect(sock, addr->ai_addr, addr->ai_addrlen) == -1) {
    if (errno != EINPROGRESS) {
      pn_i_error_from_errno(io->error, "connect");
      freeaddrinfo(addr);
      close(sock);
      return PN_INVALID_SOCKET;
    }
  }

with my connect on a non-blocking socket EINPROGRESS is set so thesocket ends up being valid, but subsequently it will fail to connect.

I've actually got a listener that can detect the Connection refused, butwhat I can't seem to do is to cleanly clear the connection object.

I've tried all sorts of hacks aroundpn_messenger_resolve/pni_messenger_reclaim (in that casepn_messenger_resolve found the connection object given the name"localhost:5672" which was found OK then I tried a pni_messenger_reclaimhack to clear it, but that didn't seem to close the underlying socket).

I also tried to find the relevant selectable pn_messenger_selectablethat matched the file descriptor of the failed connection I then tried apni_connection_finalize(sel) hack. In that case I seem to free up theconnection and the underlying socket gets closed, but when Isubsequently try to connect (to the working amqp://~0.0.0.0) although Iget an accept on the right file descriptor I subsequently get anassertion failed at messenger.c,151,pni_context at Error

So in short given that a connection object gets created because of aconnect on a non-blocking socket, which subsequently and asynchronouslyfails to connect there doesn't seem any way to tidy up that failedconnection.


To be clear if I have subscriptions
amqp://~0.0.0.0,localhost:5672

And ignore any errors and don't bother to try and tidy up and Isubsequently do a client connection to amqp://0.0.0.0 my client connectsfine but on the next file descriptor up from the one created by thefailed localhost:5672 connection so basically my failed subscription hasleaked a connection. That is the listen fd for amqp://~0.0.0.0 is 3 the(failed) fd for localhost:5672 is 4 and when I connect toamqp://~0.0.0.0 the accept fd is 5, it really should be 4 but I can'tget shot of the connection object etc. for localhost:5672.

The only way to deal with it seems to be to free and create a newmessenger when anything fails, which is a pain because the subscriptionamqp://~0.0.0.0 is actually fine.

TBH messenger's error handling is driving me nuts, it has been mentionedin a few threads that it might be better to give up on messenger andjust use engine.

Is messenger really irredeemably broken? Without decent errorhandling/recovery it's very little use in a production environment.


Frase





---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Messenger doesn't seem to have any clean way to recover from errors.

Reply via email to