Hello all,
I need some help with LSB nfssserver on SLES9 64bit Linux.
We run a two Node cluster with heartbeat 2.0.8 and crm=yes...
I configured DRDB/NFS/etc. All think works fine, when I start them
manually without HA.
The Problem:
Heartbeat starts IMHO correctly. I can see the nodes on the crm_mon
with status "online".
For a few seconds all resources was started on my preferred master,
but heartbeat stopped them all;(
On the the secondary the same problem;(
-- cib.xml -- resources --------------
<primitive class="lsb" id="nfsserver_4" type="nfsserver">
<operations>
<op id="nfsserver_4_mon" interval="120s" name="monitor"
timeout="240s"/>
</operations>
</primitive>
- - - - - - - - - - - - - - - -
When I remove this entry from the cib.xml alle resources will start
via HA (drbddisk/FileSystem/IPaddr/Mailto) correctly!
So I look at /var/log/messages and find some strange log entries:
--/var/log/messages -------------------
crmd: info: do_lrm_rsc_op: Performing op=nfsserver_4_start_0
lrmd: WARN: For LSB init script, no additional parameters are needed.
lrmd: info: RA output: (nfsserver_4:start:stdout) Starting kernel \
based NFS server
lrmd: WARN: on_op_timeout_expired: TIMEOUT: operation \
start[13] on lsb::nfsserver::nfsserver_4 for client, its \
parameters: CRM_meta_op_target_rc=[7] \
CRM_meta_timeout=[5000] crm_feature_set=[1.0.7] .
----> ^^^^^^^^^^^^^^^^^
crmd: [12358]: ERROR: process_lrm_event: LRM operation
nfsserver_4_start_0 (13) Timed Out (timeout=5000ms)
----> ^^^^^^^^^^^^^^^^^
crmd: [12358]: info: append_restart_list: Resource nfsserver_4 does
not support reloads
tengine: [12364]: WARN: status_from_rc: Action start on sot0000140
failed (target: (null) vs. rc: -1): Timed Out
----> ^^^^^^^^^^^^^^^^^
pengine: [12365]: notice: StopRsc: sot0000140 Stop datadisk_2
pengine: [12365]: notice: StopRsc: sot0000140 Stop Filesystem_3
pengine: [12365]: notice: StopRsc: sot0000140 Stop nfsserver_4
--------------------
Why failed the resources?
AFAIK should use HA /etc/init.d/nfssserver as init-script. When I call
the nfssserver init-script with arg "status" I got an exit code "0".
Isn't it correct?
Any hints?
Best regards.
Andre
PS: You can see the full /var/log/message && cib.xml here:
http://www.linux-experience.de/cib.xml
http://www.linux-experience.de/messages.1
http://www.linux-experience.de/messages.2
_______________________________________________
Linux-HA mailing list
[email protected]
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems