On 10:35, Wed 24 Nov 10, Dejan Muhamedagic wrote:
> Hi,
> 
> On Tue, Nov 23, 2010 at 11:03:33PM +0000, Dave Williams wrote:
> > Hi,
> > I have a problem that looks similar to the one reported as "possible
> > deadlock in lrmd" on 21st Oct.
> > 
> > When running lrmadmin -C to list classes, the first time it comes back
> > immediately with the expected list, e.g.
> > 
> > r...@node1:/home# lrmadmin -C
> > There are 5 RA classes supported:
> > lsb
> > ocf
> > stonith
> > upstart
> > heartbeat
> > 
> > On all subsequent attempts lrmadmin hangs and never comes back (you have
> > to kill it with Ctrl-C). This is repeatable on all the machines I have
> > tried it on.
> 
> I'm afraid that this was to be expected. 
Hi Dejan - thanks for your reply.

I'm not sure which you imply:
a) It's known to be buggy?
b) It's working as designed?
I presume a).

.....

> > On the surface the overall sequence makes sense but the hang doesn't and
> > clearly shouldn't happen. I am at a loss as to whether it is a GLib
> > issue (unlikely, I would have thought?) or an lrmd bug.
> 
> It's neither. It's bad usage of glib.
> 
Is there anyone working on resolving this?  I'm happy to help but don't
have the time to debug further at present - not being a glib expert.
I have other critical software projects to work on and just need
something that works in this area!

> > IMHO lrmd should NEVER hang!
> 
> If you don't use upstart, it won't hang.

Sadly I need upstart. That's one reason I got into this situation in the
first place!

I currently have a production clustered server down because of this and
the fact that Ubuntu (I'm advised) have an inconsistently compiled set
of HA components. Certainly both the lucid and maverick released packages
leave defunct processes lying around and give highly unreliable
operation :-(

Dave
