I tried to reproduce it today, without success, after shutting down the whole cluster and changing "lsb" back to "ocf" in /var/lib/heartbeat/crm/cib.xml.
But this one is perfectly reproducible: when ldirectord is started as an OCF resource, the packaged init script (/etc/init.d/ldirectord) is completely useless. It displays the wrong status, and calling it with "start" leaves a second, hanging ldirectord.

#########################################################################

[r...@server:~]$ rpm -ql heartbeat-ldirectord-2.1.4-2.1
/etc/ha.d/resource.d/ldirectord
/etc/init.d/ldirectord
/etc/logrotate.d/ldirectord
/sbin/rcldirectord
/usr/sbin/ldirectord
/usr/share/doc/heartbeat-ldirectord-2.1.4
/usr/share/doc/heartbeat-ldirectord-2.1.4/COPYING
/usr/share/doc/heartbeat-ldirectord-2.1.4/README
/usr/share/doc/heartbeat-ldirectord-2.1.4/ldirectord.cf
/usr/share/man/man8/ldirectord.8.gz

[r...@server:~]$ /etc/init.d/heartbeat start

[r...@server:~]$ /etc/init.d/ldirectord status
ldirectord is stopped for /etc/ha.d/ldirectord.cf

But the process is still running:

[r...@server:~]$ ps -eaf | grep ldi | grep -v grep
root 747 1 0 15:14 ? 00:00:00 /usr/bin/perl -w /usr/sbin/ldirectord /etc/ha.d/ldirectord.cf start

Calling it with the config file is OK:

[r...@server:~]$ /usr/sbin/ldirectord /etc/ha.d/ldirectord.cf status
ldirectord for /etc/ha.d/ldirectord.cf is running with pid: 747

Without it, it isn't:

[r...@server:~]$ /usr/sbin/ldirectord status
ldirectord is stopped for /etc/ha.d/ldirectord.cf

So the init script fails as well:

[r...@server:~]$ /etc/init.d/ldirectord stop
Stopping ldirectord... success

[r...@server:~]$ ps -eaf | grep ldi | grep -v grep
root 747 1 0 15:14 ? 00:00:04 /usr/bin/perl -w /usr/sbin/ldirectord /etc/ha.d/ldirectord.cf start

A second, hanging ldirectord:

[r...@lnxp-1170:~]$ /etc/init.d/ldirectord restart
Restarting ldirectord... success

[r...@lnxp-1170:~]$ ps -eaf | grep ldirector | grep -v grep
root 29684 1 0 15:45 ? 00:00:00 /usr/bin/perl -w /usr/sbin/ldirectord /etc/ha.d/ldirectord.cf start
root 30731 1 0 15:46 ? 00:00:00 /usr/bin/perl -w /usr/sbin/ldirectord restart

[r...@lnxp-1170:~]$ /etc/init.d/ldirectord stop
Stopping ldirectord... success

[r...@lnxp-1170:~]$ /etc/init.d/ldirectord start
Starting ldirectord... success

[r...@lnxp-1170:~]$ sudo su - -c "ps -eaf | grep ldirector | grep -v grep"
root 534 1 0 15:49 ? 00:00:00 /usr/bin/perl -w /usr/sbin/ldirectord start
root 29684 1 0 15:45 ? 00:00:00 /usr/bin/perl -w /usr/sbin/ldirectord /etc/ha.d/ldirectord.cf start

####################################################################

Regards,
Arndt

-----Original Message-----
From: [email protected] [mailto:[email protected]] On Behalf Of Andrew Beekhof
Sent: Friday, 12 December 2008 12:04
To: General Linux-HA mailing list
Subject: Re: [Linux-HA] clone resource displayed lsb instead of ocf?

Can you create a bug for this please?
Be sure to include a hb_report archive covering the period when you used the lsb script to restart the resource.

On Thu, Dec 11, 2008 at 18:38, Arndt Roth <[email protected]> wrote:
> Hi *,
>
> I found a puzzling behaviour using ldirectord as a clone resource in a 3
> node setup.
>
> On one of the nodes "server1" (not the master) I restarted ldirectord
> by hand using the lsb-script, though I know that all resources are
> managed by OCF-classes.
>
_______________________________________________
Linux-HA mailing list
[email protected]
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems
