[one-users] Error "Stale NFS file handle"

Carlos Jiménez Wed, 29 Aug 2012 06:29:31 -0700

Hello everybody,

I have 4 physical computers: 1 as a FrontEnd with CentOS 6.2 andOpenNebula 3.6, two hosts with CentOS 6.2 and qemu-kvm and another as annfs server with a debian based distro.Each time I try to create a new VM, it waits for a long time in PROLOGstatus and then it fails. According to the log, it seems to be a problemconnected to NFS.In the NFS server side there are no relevant information in the logs,just a few entries like this:

Aug 29 14:56:37 localhost kernel: svc: unknown version (4)

Aug 29 14:56:37 localhost rpc.mountd: authenticated mount request fromhost2:751 for /opt/virtual/one (/opt/virtual/one)



This is the output of the VM log:

Wed Aug 29 14:58:28 2012 [DiM][I]: New VM state is ACTIVE.
Wed Aug 29 14:58:29 2012 [LCM][I]: New VM state is PROLOG.
Wed Aug 29 14:58:29 2012 [VM][I]: Virtual Machine has no context

Wed Aug 29 15:01:12 2012 [TM][I]: Command execution fail:/var/lib/one/remotes/tm/shared/clonefrontend:/var/lib/one/datastores/1/5c0455eb494fd43b5c7c576e0c642fbeHost2:/var/lib/one//datastores/0/32/disk.0 32 1Wed Aug 29 15:01:12 2012 [TM][I]: clone: Cloning../../1/5c0455eb494fd43b5c7c576e0c642fbe inHost2:/var/lib/one//datastores/0/32/disk.0Wed Aug 29 15:01:12 2012 [TM][E]: clone: Command "cd/var/lib/one/datastores/0/32; cp -r../../1/5c0455eb494fd43b5c7c576e0c642fbe/var/lib/one/datastores/0/32/disk.0" failed: cp: reading`../../1/5c0455eb494fd43b5c7c576e0c642fbe': Stale NFS file handleWed Aug 29 15:01:12 2012 [TM][E]: Error copyingfrontend:/var/lib/one/datastores/1/5c0455eb494fd43b5c7c576e0c642fbe toHost2:/var/lib/one//datastores/0/32/disk.0

Wed Aug 29 15:01:12 2012 [TM][I]: ExitCode: 1

Wed Aug 29 15:01:12 2012 [TM][E]: Error executing image transfer script:Error copyingfrontend:/var/lib/one/datastores/1/5c0455eb494fd43b5c7c576e0c642fbe toHost2:/var/lib/one//datastores/0/32/disk.0

Wed Aug 29 15:01:12 2012 [DiM][I]: New VM state is FAILED

I've checked that path and filenames, and both are correct. The fileitself is a 300MG sized and the error appears in this case just a fewminutes later but in other cases aprox. 30 minutes after the start ofthe copy, so it doesn't finish the file copy.At first I managed to solve it unmounting and remounting the shareddirectory and it was possible to fully copy the file (after aprox. 30minutes) and run the VM. But today, the problem persists after doing it(even after rebooting the clients).Additionally, I've tried doing the same operation (to copy the same filefrom a directory to another) locally in the NFS Server and it takes justa minute (aprox.) to accomplish it.

May it be related to OpenNebula? Could you give me advice ontroubleshooting it?



Thanks in advance.

Carlos.

_______________________________________________
Users mailing list
[email protected]
http://lists.opennebula.org/listinfo.cgi/users-opennebula.org

[one-users] Error "Stale NFS file handle"

Reply via email to