Hi Phil Bayfiled,

We are not un-mount the nfs because i believe heartbeat is take care to 
start and stop the nfs service. For testing the failover, we stop the 
heartbeat at primary node, but secondary node is not able to take the 
resource and we are getting the below error :

Filesystem[7192]:       2009/12/04_16:22:20 ERROR: Couldn't unmount 
/dvshare; trying cleanup with SIGKILL
Filesystem[7192]:       2009/12/04_16:22:20 INFO: No processes on 
/dvshare were signalled
Filesystem[7192]:       2009/12/04_16:22:21 ERROR: Couldn't unmount 
/dvshare, giving up!
Filesystem[7181]:       2009/12/04_16:22:21 ERROR:  Generic error
ResourceManager[5157]:  2009/12/04_16:22:21 ERROR: Return code 1 from 
/etc/ha.d/resource.d/Filesystem
Filesystem[7353]:       2009/12/04_16:22:21 INFO:  Running OK
ResourceManager[5157]:  2009/12/04_16:22:21 CRIT: Resource STOP failure. 
Reboot required!
ResourceManager[5157]:  2009/12/04_16:22:21 CRIT: Killing heartbeat 
ungracefully!


Phil Bayfield wrote:
> Are you stopping the NFS server before trying to unmount?
> If the resource is busy heartbeat will not be able to unmount it.
>
> Rajkumar Agrawal wrote:
>   
>> Hi,
>> We installed the NFS-ha for the high availability of NFS server. For the 
>> testing, when we stop the heartbeat service at primary node, primary 
>> node is not releasing the resource for the secondary. We get this from 
>> ha-log. So secondary node is not able to take over the resource. The 
>> /var/log/ha-log of primary node are :
>>
>> Filesystem[7192]:       2009/12/04_16:22:20 ERROR: Couldn't unmount 
>> /dvshare; trying cleanup with SIGKILL
>> Filesystem[7192]:       2009/12/04_16:22:20 INFO: No processes on 
>> /dvshare were signalled
>> Filesystem[7192]:       2009/12/04_16:22:21 ERROR: Couldn't unmount 
>> /dvshare, giving up!
>> Filesystem[7181]:       2009/12/04_16:22:21 ERROR:  Generic error
>> ResourceManager[5157]:  2009/12/04_16:22:21 ERROR: Return code 1 from 
>> /etc/ha.d/resource.d/Filesystem
>> Filesystem[7353]:       2009/12/04_16:22:21 INFO:  Running OK
>> ResourceManager[5157]:  2009/12/04_16:22:21 CRIT: Resource STOP failure. 
>> Reboot required!
>> ResourceManager[5157]:  2009/12/04_16:22:21 CRIT: Killing heartbeat 
>> ungracefully!
>>
>>
>> Plz help us to troubleshoot this.
>>
>> Thanks
>> Rajkumar Agrawal
>>
>> _______________________________________________
>> Linux-HA mailing list
>> [email protected]
>> http://lists.linux-ha.org/mailman/listinfo/linux-ha
>> See also: http://linux-ha.org/ReportingProblems
>>   
>>     
> _______________________________________________
> Linux-HA mailing list
> [email protected]
> http://lists.linux-ha.org/mailman/listinfo/linux-ha
> See also: http://linux-ha.org/ReportingProblems
>
>   


-- 
Rajkumar Agrawal. Systems Administrator Deep Value Technology Pvt Ltd
+1 646 651 4686 x122 ? +91 44 42630403 x26 www.deepvalue.net ? 90 Anna Salai 
Chennai 600 002 India

_______________________________________________
Linux-HA mailing list
[email protected]
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems

Reply via email to