Hi Dimitri, Could you show me a link to this kill -9 modification?
Karl Kloppenborg Head of Development Phone: 1300 884 839 (AU Only - Business Hours) Website: AU http://www.crucial.com.au| US http://www.crucialp.com On 05/10/2010, at 12:43, Dimitri Maziuk wrote: > Karl Kloppenborg wrote: >> Hi Linux HA users, >> >> We have setup a two node DRBD cluster running DRBD / Heartbeat and NFS. >> >> Every now and again the Linux ha will suddenly NFS crash and people > will not be able to read / write to the shared folders, when it goes > into this mode, running rpcinfo -p on the float IP shows NFS daemons not > RPC accessible like shown... >> >> I try restarting the daemons and all processes I can think of but unless I >> reboot it doesn't come back up... >> >> My system setup is as follows: >> >> -------------HARESOURCES----------------- >> storage1.clusterfarm.net.au IPaddr::[FLOATIP]/24/eth1 > drbddisk::repdata Filesystem::/dev/drbd0::/storage::ext3 portmap nfslock > nfs rpcidmapd >> ----------------------------------------------------- > > Mine is > <hostname> drbddisk::raid Filesystem::/dev/drbd0::/raid::ext3 \ > <ip addr> rpcidmapd nfslock nfsd > > (where nfsd is the init.d script modified to run kill -9 as per TFM) > >> ----------DRBD CONF--------------- >> handlers { pri-on-incon-degr "halt -f"; } >> disk { on-io-error detach; } > > I don't have those, only > net { > after-sb-0pri discard-zero-changes; > } > (in the common section) > > HTH > Dima > -- > Dimitri Maziuk > Programmer/sysadmin > BioMagResBank, UW-Madison -- http://www.bmrb.wisc.edu > _______________________________________________ > Linux-HA mailing list > [email protected] > http://lists.linux-ha.org/mailman/listinfo/linux-ha > See also: http://linux-ha.org/ReportingProblems _______________________________________________ Linux-HA mailing list [email protected] http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http://linux-ha.org/ReportingProblems
