David Mabry created CLOUDSTACK-4523:
---------------------------------------

             Summary: Secondary Storage was offline, but Cloudstack was still 
attempting to use it to create a template
                 Key: CLOUDSTACK-4523
                 URL: https://issues.apache.org/jira/browse/CLOUDSTACK-4523
             Project: CloudStack
          Issue Type: Bug
      Security Level: Public (Anyone can view this level - this is the default.)
          Components: KVM, Template
    Affects Versions: 4.1.1
         Environment: CentOS 6.3 KVM Host with NFS Secondary Storage.
            Reporter: David Mabry
            Priority: Minor


We had an issue with our Secondary Storage that caused it to become 
unavailable.  The IP wasn't available and NFS was definitely not running.  When 
looking in the Cloudstack interface the Secondary Storage still showed as 
enabled and showed green.  During the outage, a user attempted to create a 
template from a VM and Cloudstack said that is was created successfully and 
sure enough the template showed up to be used in the web interface.  Then the 
user tried to provision a new VM from the template and Cloudstack returned an 
error on the VM creation that is detailed in the logs below.

Once we cleared the issue, the template was still no where to be found and the 
user had to recreate the template from the source VM again.

Below are what I think are relevant logs to this issue.  Please let me know if 
you need more information.



====Template Creation====
2013-08-27 11:06:43,775 DEBUG [agent.transport.Request] 
(Job-Executor-148:job-1897) Seq 1-1768326350: Sending  { Cmd , MgmtId: 
159090354809909, via: 1, Ver: v1, Flags: 100011, 
[{"CreatePrivateTemplateFromVolumeCommand":{"_vmName":"i-48-359-VM","_volumePath":"a8ce8492-34d7-4ec2-86c2-fed1d8077557","_userSpecifiedName":"Wk8r2
 Gold - IIS & 
.net4.5","_uniqueName":"41d2b232-6a60-43b4-b2d7-58e82c095f91","_templateId":317,"_accountId":48,"_primaryPool":{"id":201,"uuid":"d37a92a6-8e01-41d2-ae43-0fad35f092ad","host":"localhost","path":"/store01","port":0,"type":"CLVM"},"_secondaryStorageUrl":"nfs://172.27.15.30/nfs/cssec","primaryStoragePoolNameLabel":"d37a92a6-8e01-41d2-ae43-0fad35f092ad","wait":10800}}]
 }
2013-08-27 11:15:46,557 DEBUG [agent.transport.Request] 
(AgentManager-Handler-8:null) Seq 1-1768326350: Processing:  { Ans: , MgmtId: 
159090354809909, via: 1, Ver: v1, Flags: 10, 
[{"storage.CreatePrivateTemplateAnswer":{"_path":"/template/tmpl/48/317/41d2b232-6a60-43b4-b2d7-58e82c095f91.qcow2","_virtualSize":21474836480,"_physicalSize":12816744448,"_uniqueName":"41d2b232-6a60-43b4-b2d7-58e82c095f91","_format":"QCOW2","result":true,"wait":0}}]
 }
2013-08-27 11:15:46,566 DEBUG [agent.transport.Request] 
(Job-Executor-148:job-1897) Seq 28-183400063: Sending  { Cmd , MgmtId: 
159090354809909, via: 28, Ver: v1, Flags: 100111, 
[{"ComputeChecksumCommand":{"templatePath":"/template/tmpl/48/317/41d2b232-6a60-43b4-b2d7-58e82c095f91.qcow2","secUrl":"nfs://192.168.15.30/nfs/cssec","wait":0}}]
 }
2013-08-27 11:36:37,287 DEBUG [cloud.user.AccountManagerImpl] 
(catalina-exec-5:null) Access to 
Tmpl[317-QCOW2-41d2b232-6a60-43b4-b2d7-58e82c095f91 granted to Acct[31-jsmith] 
by DomainChecker_EnhancerByCloudStack_6e37dedb
2013-08-27 11:36:37,468 DEBUG [cloud.user.AccountManagerImpl] 
(Job-Executor-149:job-1898) Access to 
Tmpl[317-QCOW2-41d2b232-6a60-43b4-b2d7-58e82c095f91 granted to Acct[31-jsmith] 
by DomainChecker_EnhancerByCloudStack_6e37dedb
2013-08-27 11:36:50,207 DEBUG [agent.transport.Request] 
(Job-Executor-149:job-1898) Seq 3-416367865: Sending  { Cmd , MgmtId: 
159090354809909, via: 3, Ver: v1, Flags: 100111, 
[{"storage.CreateCommand":{"volId":416,"pool":{"id":201,"uuid":"d37a92a6-8e01-41d2-ae43-0fad35f092ad","host":"localhost","path":"/NAzone01cluster01store01","port":0,"type":"CLVM"},"diskCharacteristics":{"size":21474836480,"tags":["prod"],"type":"ROOT","name":"ROOT-361","useLocalStorage":false,"recreatable":true,"diskOfferingId":16,"volumeId":416,"hyperType":"KVM"},"templateUrl":"nfs://192.168.15.30/nfs/cssec//template/tmpl/48/317/41d2b232-6a60-43b4-b2d7-58e82c095f91.qcow2","wait":0}}]
 }

======Failed VM Deployment======
2013-08-27 11:39:12,475 DEBUG [agent.transport.Request] 
(Job-Executor-149:job-1898) Seq 3-416367910: Received:  { Ans: , MgmtId: 
159090354809909, via: 3, Ver: v1, Flags: 110, { StopAnswer } }
2013-08-27 11:39:12,482 DEBUG [cloud.network.NetworkManagerImpl] 
(Job-Executor-149:job-1898) Changing active number of nics for network id=206 
on -1
2013-08-27 11:39:12,486 DEBUG [cloud.network.NetworkManagerImpl] 
(Job-Executor-149:job-1898) Changing active number of nics for network id=207 
on -1
2013-08-27 11:39:12,487 DEBUG [cloud.vm.VirtualMachineManagerImpl] 
(Job-Executor-149:job-1898) Successfully released network resources for the vm 
VM[User|BuildTwo]
2013-08-27 11:39:12,487 DEBUG [cloud.vm.VirtualMachineManagerImpl] 
(Job-Executor-149:job-1898) Successfully cleanued up resources for the vm 
VM[User|BuildTwo] in Starting state
2013-08-27 11:39:12,489 DEBUG [cloud.deploy.FirstFitPlanner] 
(Job-Executor-149:job-1898) DeploymentPlanner allocation algorithm: random
2013-08-27 11:39:12,489 DEBUG [cloud.deploy.FirstFitPlanner] 
(Job-Executor-149:job-1898) Trying to allocate a host and storage pools from 
dc:1, pod:1,cluster:1, requested cpu: 4000, requested ram: 2147483648
2013-08-27 11:39:12,489 DEBUG [cloud.deploy.FirstFitPlanner] 
(Job-Executor-149:job-1898) Is ROOT volume READY (pool already allocated)?: No
2013-08-27 11:39:12,489 DEBUG [cloud.deploy.FirstFitPlanner] 
(Job-Executor-149:job-1898) DeploymentPlan has host_id specified, choosing this 
host and making no checks on this host: 3
2013-08-27 11:39:12,490 DEBUG [cloud.deploy.FirstFitPlanner] 
(Job-Executor-149:job-1898) Looking for suitable pools for this host under 
zone: 1, pod: 1, cluster: 1
2013-08-27 11:39:12,492 DEBUG [cloud.deploy.FirstFitPlanner] 
(Job-Executor-149:job-1898) Checking suitable pools for volume (Id, Type): 
(416,ROOT)
2013-08-27 11:39:12,492 DEBUG [cloud.deploy.FirstFitPlanner] 
(Job-Executor-149:job-1898) We need to allocate new storagepool for this volume
2013-08-27 11:39:12,493 DEBUG [cloud.deploy.FirstFitPlanner] 
(Job-Executor-149:job-1898) Calling StoragePoolAllocators to find suitable pools
2013-08-27 11:39:12,493 DEBUG [storage.allocator.FirstFitStoragePoolAllocator] 
(Job-Executor-149:job-1898) Looking for pools in dc: 1  pod:1  cluster:1 having 
tags:[prod]
2013-08-27 11:39:12,494 DEBUG [storage.allocator.FirstFitStoragePoolAllocator] 
(Job-Executor-149:job-1898) FirstFitStoragePoolAllocator has 1 pools to check 
for allocation
2013-08-27 11:39:12,495 DEBUG [storage.allocator.AbstractStoragePoolAllocator] 
(Job-Executor-149:job-1898) Checking if storage pool is suitable, name: 
NAzone01cluster01store01 ,poolId: 201
2013-08-27 11:39:12,495 DEBUG [storage.allocator.AbstractStoragePoolAllocator] 
(Job-Executor-149:job-1898) StoragePool is in avoid set, skipping this pool
2013-08-27 11:39:12,495 DEBUG [storage.allocator.FirstFitStoragePoolAllocator] 
(Job-Executor-149:job-1898) FirstFitStoragePoolAllocator returning 0 suitable 
storage pools
2013-08-27 11:39:12,495 DEBUG [cloud.deploy.FirstFitPlanner] 
(Job-Executor-149:job-1898) No suitable pools found for volume: 
Vol[416|vm=361|ROOT] under cluster: 1
2013-08-27 11:39:12,495 DEBUG [cloud.deploy.FirstFitPlanner] 
(Job-Executor-149:job-1898) No suitable pools found
2013-08-27 11:39:12,495 DEBUG [cloud.deploy.FirstFitPlanner] 
(Job-Executor-149:job-1898) Cannnot deploy to specified host, returning.
2013-08-27 11:39:12,498 DEBUG [cloud.capacity.CapacityManagerImpl] 
(Job-Executor-149:job-1898) VM state transitted from :Starting to Stopped with 
event: OperationFailedvm's original host id: null new host id: null host id 
before state transition: 3
2013-08-27 11:39:12,502 DEBUG [cloud.capacity.CapacityManagerImpl] 
(Job-Executor-149:job-1898) Hosts's actual total CPU: 60000 and CPU after 
applying overprovisioning: 90000
2013-08-27 11:39:12,502 DEBUG [cloud.capacity.CapacityManagerImpl] 
(Job-Executor-149:job-1898) release cpu from host: 3, old used: 60596,reserved: 
0, actual total: 60000, total with overprovisioning: 90000; new used: 
56596,reserved:0; movedfromreserved: false,moveToReserveredfalse
2013-08-27 11:39:12,502 DEBUG [cloud.capacity.CapacityManagerImpl] 
(Job-Executor-149:job-1898) release mem from host: 3, old used: 
39132856320,reserved: 0, total: 101395275776; new used: 36985372672,reserved:0; 
movedfromreserved: false,moveToReserveredfalse
2013-08-27 11:39:12,505 DEBUG [cloud.vm.UserVmManagerImpl] 
(Job-Executor-149:job-1898) Destroying vm VM[User|BuildTwo] as it failed to 
create on Host with Id:null
2013-08-27 11:39:12,510 DEBUG [cloud.capacity.CapacityManagerImpl] 
(Job-Executor-149:job-1898) VM state transitted from :Stopped to Error with 
event: OperationFailedToErrorvm's original host id: null new host id: null host 
id before state transition: null
2013-08-27 11:39:22,592 INFO  [user.vm.DeployVMCmd] (Job-Executor-149:job-1898) 
com.cloud.exception.InsufficientServerCapacityException: Unable to create a 
deployment for VM[User|BuildTwo]Scope=interface com.cloud.dc.DataCenter; id=1
2013-08-27 11:39:22,592 INFO  [user.vm.DeployVMCmd] (Job-Executor-149:job-1898) 
Unable to create a deployment for VM[User|BuildTwo]
2013-08-27 11:39:22,593 DEBUG [cloud.async.AsyncJobManagerImpl] 
(Job-Executor-149:job-1898) Complete async job-1898, jobStatus: 2, resultCode: 
530, result: Error Code: 533 Error text: Unable to create a deployment for 
VM[User|BuildTwo]



--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Reply via email to