On Tue, Aug 17, 2010 at 03:06:45PM +0200, Dejan Muhamedagic wrote:
> Hi,
> 
> On Tue, Aug 17, 2010 at 04:50:27PM +0900, Simon Horman wrote:
> > On Wed, Jul 21, 2010 at 01:41:09AM -0600, Tim Serong wrote:
> > > Hi All,
> > > 
> > > A while ago (April, from memory), there was an ABI change in
> > > clplumbing in cluster-glue.  Presumably this went mostly unnoticed
> > > in general usage, however I have twice seen systems where the cluster
> > > could not run because of a missing (or incorrect) libglue2 package.
> > > One was my development system, with a dodgy build, the other was
> > > mentioned on #linux-ha yesterday, and was the result of ignoring a
> > > conflict error when installing the pacemaker RPM on openSUSE.  So,
> > > let me be clear, this is not something anyone should need to worry
> > > about...  But I thought I'd mention it here, because the error
> > > messages you get are, IMO, not very obvious.
> > > 
> > > Symptoms of a mismatched pacemaker/libglue build are errors like:
> > > 
> > >   lrmd: [3004]: ERROR:
> > >     main: can not create wait connection for command.
> > >   lrmd: [3004]: ERROR:
> > >     Startup aborted (can't create comm channel).  Shutting down.
> > >   ...
> > >   pengine: [4011]: ERROR:
> > >     init_client_ipc_comms_nodispatch: Could not access channel on:
> > >     /var/run/crm/pengine
> > >   corosync[4000]: [pcmk  ] ERROR:
> > >     pcmk_wait_dispatch: Child process pengine exited (pid=4011, rc=1)
> > >   corosync[4000]: [pcmk  ] notice:
> > >     pcmk_wait_dispatch: Respawning failed child process: pengine
> > > 
> > > If your cluster won't start and you see this in /var/log/messages,
> > > make sure libglue2 is up to date.  And now that I've mentioned this
> > > here and it's made it to the mailing list archive, Google will know,
> > > and nobody else will ever have this problem again.
> > > 
> > > This has been a public service announcement.  Thank you for reading.
> > 
> > Could we get the .so bumped accordingly in the next release of
> > cluster glue? That would at least help in managing the problem
> > once the new release has been made.
> 
> I don't think that that is necessary. The ABI change in the
> _released_ cluster-glue packages was done in such a way as not to
> disturb the existing pacemaker installations, i.e. by adding
> fields to the end of the struct. Further, the library version has
> been bumped to 3:0:1 (with libtool's -version-info) at the time.
> For whatever reason that translates to so.2.1.0. Users of the new
> ABI are also using domain sockets of the new type if they want
> the new functionality.
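
On the "for whatever reason": as far as I know this is just how libtool
maps -version-info current:revision:age onto file names on Linux/ELF
platforms. The major number is current minus age, followed by age and
then revision. A minimal sketch of that arithmetic (the function name is
made up for illustration, and other platforms use different schemes):

```python
def libtool_linux_suffix(version_info: str) -> str:
    """Map libtool's current:revision:age to the Linux .so suffix.

    Assumes the Linux/ELF naming scheme; the helper name is hypothetical.
    """
    current, revision, age = (int(part) for part in version_info.split(":"))
    if age > current:
        raise ValueError("age must not be greater than current")
    major = current - age  # this is also the number in the soname
    return f"so.{major}.{age}.{revision}"


print(libtool_linux_suffix("3:0:1"))
```

So 3:0:1 ends up as so.2.1.0, and the soname major stays at 2, which
fits the package still being called libglue2.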
> 
> I guess that what Tim was seeing was Pacemaker built against the
> unreleased glue versions, which did have a different ABI, i.e. the
> fields were inserted somewhere in the middle of the struct.
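
To illustrate the difference between appending a field and inserting
one, here is a rough sketch using Python's ctypes to model the layouts.
The struct and field names are invented for the example and are not the
real clplumbing definitions. It lists which existing fields end up at a
different byte offset in the new layout:

```python
import ctypes


# Hypothetical layouts; the real cluster-glue structs differ.
class OldChannel(ctypes.Structure):
    _fields_ = [("ch_status", ctypes.c_int), ("refcount", ctypes.c_int)]


class AppendedChannel(ctypes.Structure):
    # New field added at the end: existing offsets are unchanged.
    _fields_ = [
        ("ch_status", ctypes.c_int),
        ("refcount", ctypes.c_int),
        ("new_option", ctypes.c_int),
    ]


class InsertedChannel(ctypes.Structure):
    # New field added in the middle: later fields move.
    _fields_ = [
        ("ch_status", ctypes.c_int),
        ("new_option", ctypes.c_int),
        ("refcount", ctypes.c_int),
    ]


def shifted_fields(old, new):
    """Return names of fields in `old` whose byte offset differs in `new`."""
    return [
        name
        for name, _ctype in old._fields_
        if getattr(old, name).offset != getattr(new, name).offset
    ]


print(shifted_fields(OldChannel, AppendedChannel))
print(shifted_fields(OldChannel, InsertedChannel))
```

Code compiled against the old layout keeps reading the right bytes in
the appended case, but in the inserted case it would read new_option
where it expects refcount. Appending is only safe as long as callers do
not allocate or embed the struct themselves, since its size still
changes.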

Ok, so no ABI incompatibility was introduced in 1.0.6. Great!
I will go ahead and close the related Debian bugs,
#593319, #593321, #593322 and #593323.

_______________________________________________
Linux-HA mailing list
[email protected]
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems
