[Linux-HA] nfsserver error

David . Livingstone Fri, 04 Jun 2010 13:49:34 -0700

I have a 2-node drbd/heartbeat cluster running on RHEL5.4(proliant 
dl380G6) I am attempting
to add ha nfs(nfsserver)to and have run into an error. Specifically after 
adding the nfsserver resource
and then running "crm resource cleanup nfsserver_nlsmtl" I have the 
resource running however I also have
the below failed action :


[r...@nlsmtl6 ~]# crm status
============
Last updated: Fri Jun  4 14:16:14 2010
Stack: Heartbeat
Current DC: nlsmtl6 (16fd6af0-429e-402d-a5d8-a00a818f139a) - partition 
with quorum
Version: 1.0.8-3225fc0d98c8fcd0f7b24f0134e89967136a9b00
2 Nodes configured, unknown expected votes
3 Resources configured.
============

Online: [ nlsmtl5 nlsmtl6 ]

Resource Group: grp_1
drbddisk_2 (heartbeat:drbddisk):   Started nlsmtl5
Filesystem_3       (ocf::heartbeat:Filesystem):    Started nlsmtl5
rc.primary_5       (lsb:rc.primary):       Started nlsmtl5
IPaddr_nlsmtl      (ocf::heartbeat:IPaddr):        Started nlsmtl5
nfsserver_nlsmtl   (ocf::heartbeat:nfsserver):     Started nlsmtl5
CL_stonithset_node01   (stonith:external/riloe-iders): Started nlsmtl6
CL_stonithset_node02   (stonith:external/riloe-iders): Started nlsmtl5

Failed actions:
nfsserver_nlsmtl_monitor_0 (node=nlsmtl6, call=12, rc=2, status=complete): 
invalid parameter
[r...@nlsmtl6 ~]#

At this point I cannot migrate to nlsmtl6.  I have not been able to find 
much 
documentation on nfsserver. I've found 
http://linux-ha.org/doc/re-ra-nfsserver.html
but nothing else on setup.

My config is :

r...@nlsmtl5 init.d]# crm configure show
node $id="16fd6af0-429e-402d-a5d8-a00a818f139a" nlsmtl6 \
attributes standby="off"
node $id="2f6b429e-74c3-482e-bf20-5a6b0c94cd46" nlsmtl5 \
attributes standby="off"
primitive CL_stonithset_node01 stonith:external/riloe-iders \
op monitor interval="30s" timeout="20s" on-fail="ignore" \
op start interval="0" timeout="60s" on-fail="restart" \
params hostlist="nlsmtl5" ilo_hostname="nlsmtl5-ilo" ilo_user="Heartbeat" 
ilo_password="xxx" ilo_can_reset="0" ilo_protocol="2.0" 
ilo_powerdown_method="button"
primitive CL_stonithset_node02 stonith:external/riloe-iders \
op monitor interval="30s" timeout="20s" on-fail="ignore" \
op start interval="0" timeout="60s" on-fail="restart" \
params hostlist="nlsmtl6" ilo_hostname="nlsmtl6-ilo" ilo_user="Heartbeat" 
ilo_password="xxx" ilo_can_reset="0" ilo_protocol="2.0" 
ilo_powerdown_method="button"
primitive Filesystem_3 ocf:heartbeat:Filesystem \
op monitor interval="120s" timeout="60s" \
params device="/dev/drbd0" directory="/drbd" fstype="ext3" 
options="defaults"
primitive IPaddr_nlsmtl ocf:heartbeat:IPaddr \
op monitor interval="5s" timeout="5s" \
params ip="165.115.204.222"
primitive drbddisk_2 heartbeat:drbddisk \
op monitor interval="120s" timeout="60s" \
params 1="r0"
primitive nfsserver_nlsmtl ocf:heartbeat:nfsserver \
op monitor interval="30s" timeout="60s" \
params nfs_init_script="/etc/init.d/nfs" nfs_notify_cmd="/sbin/rpc.statd" 
nfs_shared_infodir="/drbd/nfs" nfs_ip="165.115.204.222"
primitive rc.primary_5 lsb:rc.primary \
op monitor interval="120s" timeout="60s"
group grp_1 drbddisk_2 Filesystem_3 rc.primary_5 IPaddr_nlsmtl 
nfsserver_nlsmtl
location node-1-dont-run CL_stonithset_node01 -inf: nlsmtl5
location node-2-dont-run CL_stonithset_node02 -inf: nlsmtl6
location rsc_location_group_1 grp_1 100: nlsmtl5
property $id="cib-bootstrap-options" \
dc-version="1.0.8-3225fc0d98c8fcd0f7b24f0134e89967136a9b00" \
cluster-infrastructure="Heartbeat" \
no-quorum-policy="ignore" \
last-lrm-refresh="1275682003" 

My packages are :
drbd-pacemaker-8.3.7-1
heartbeat-3.0.2-2.el5
pacemaker-1.0.8-2.el5
pacemaker-libs-1.0.8-2.el5
cluster-glue-1.0.3-1.el5
cluster-glue-libs-1.0.3-1.el5
corosynclib-1.2.0-1.el5
corosync-1.2.0-1.el5

I looked at  /usr/lib/ocf/resource.d/heartbeat/nfsserver and 
nfsserver_monitor which is pretty simple.
nfsserver_monitor ()
{
fn=`/bin/mktemp`
${OCF_RESKEY_nfs_init_script} status > $fn 2>&1
rc=$?
ocf_log debug `cat $fn`
rm -f $fn

#Adapte LSB status code to OCF return code
if [ $rc -eq 0 ]; then
return $OCF_SUCCESS
elif [ $rc -eq 3 ]; then
return $OCF_NOT_RUNNING
else
return $OCF_ERR_GENERIC
fi
}

In my case I presume it would call  "/etc/init.d/nfs status"  which on :
nlsmtl5 returns :
[r...@nlsmtl5 init.d]# /etc/init.d/nfs status
rpc.mountd (pid 24994) is running...
nfsd (pid 24991 24990 24989 24988 24987 24980 24979 24969) is running...
rpc.rquotad (pid 24963) is running...
[r...@nlsmtl5 init.d]#
- return code is 0

nlsmtl6 returns :
[r...@nlsmtl6 ~]# /etc/init.d/nfs status
rpc.mountd is stopped
nfsd is stopped
rpc.rquotad is stopped
[r...@nlsmtl6 ~]#
- return code is 3

Why am I getting a rc=2 and how can I debug ? Am I missing something on 
setup ? Is 
this the best way to run nfs ?

Thanks


_______________________________________________
Linux-HA mailing list
[email protected]
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems

[Linux-HA] nfsserver error

Reply via email to