jgotteswinter commented on issue #13010:
URL: https://github.com/apache/cloudstack/issues/13010#issuecomment-4251213000

   > Hi [@jgotteswinter](https://github.com/jgotteswinter) thanks for reporting 
the issue. I do observe the following exception:
   > 
   > ```
   > Unable to migrate VM instance 
{"id":4930,"instanceName":"i-55-4930-VM","state":"Running","type":"User","uuid":"cf19-00b6-465e-98f1-c63b4860498d"}
 to Host 
{"id":18,"name":"XXXch02","type":"Routing","uuid":"dc51-a18d-4f7d-9a2e-7dfbb7a1b908"}
 due to [Resource [Host:18] is unreachable: Host 18: Operation timed out] 
   > ```
   > 
   > Besides host 18, are there other free hosts in the cluster? Can you share 
the agent logs from host 18 for any failures? Are you able to start back the 
stopped VM or migrate it manually and then start it on a different host?
   
   yes, there where other healthy hosts available. It seems like it does not 
jump to the next available host after a live migration error. I showed the 
error to the AI, which ended in this patch:
   
   `diff --git 
a/engine/orchestration/src/main/java/com/cloud/vm/VirtualMachineManagerImpl.java
 
b/engine/orchestration/src/main/java/com/cloud/vm/VirtualMachineManagerImpl.java
   index b3e672e2..67b8d7b2 100755
   --- 
a/engine/orchestration/src/main/java/com/cloud/vm/VirtualMachineManagerImpl.java
   +++ 
b/engine/orchestration/src/main/java/com/cloud/vm/VirtualMachineManagerImpl.java
   @@ -3902,13 +3902,14 @@ public class VirtualMachineManagerImpl extends 
ManagerBase implements VirtualMac
                } catch (ResourceUnavailableException | 
ConcurrentOperationException e) {
   -                logger.warn("Unable to migrate {} to {} due to [{}]", ...);
   +                logger.warn("Unable to migrate {} to {} due to [{}], trying 
next destination", ...);
   +                continue;
                }
    
                try {
                    advanceStop(vmUuid, true);
                    throw new CloudRuntimeException("Unable to migrate " + vm);
   -            } catch (final ResourceUnavailableException | 
ConcurrentOperationException | OperationTimedoutException e) {
   +            } catch (final ResourceUnavailableException | 
ConcurrentOperationException | OperationTimedoutException | 
AgentUnavailableException e) {
    `
   
   i was not yet able to test this so far


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to