> starting XS agent manually

Either unmanage and then re-manage the cluster, or perform a force reconnect to the host.
Regards,
Somesh

-----Original Message-----
From: tony_caot...@163.com [mailto:tony_caot...@163.com]
Sent: Wednesday, August 12, 2015 5:14 AM
To: users@cloudstack.apache.org
Subject: Re: XenServer is disconnected after CS hosts shutdown

After I disabled and re-enabled the XS cluster and primary, it works. It seems the XS agent has been started by the ACS host.
So what is the correct way of starting the XS agent manually?

-----------
Cao Tong

On 08/12/2015 04:41 PM, tony_caot...@163.com wrote:
>
> Hello,
>
> Almost one month has gone by and my problem is still here. I really
> need someone to help me.
>
> New settings: ACS 4.4.4, XS 6.2
> After reboot, the errors are:
>
> 2015-08-12 16:18:54,557 INFO [c.c.a.t.Request] (StatsCollector-3:ctx-b287085a) not building log message for '[{}]', _cmds.length == 1
> 2015-08-12 16:18:54,557 DEBUG [c.c.a.m.ClusteredAgentAttache] (StatsCollector-3:ctx-b287085a) Seq 9-4450963806725603425: Forwarding null to 191386435611186
> 2015-08-12 16:18:54,558 DEBUG [c.c.a.m.ClusteredAgentAttache] (AgentManager-Handler-12:null) Seq 9-4450963806725603425: Routing from 249082151178140
> 2015-08-12 16:18:54,558 DEBUG [c.c.a.m.ClusteredAgentAttache] (AgentManager-Handler-12:null) Seq 9-4450963806725603425: Link is closed
> 2015-08-12 16:18:54,558 DEBUG [c.c.a.m.ClusteredAgentManagerImpl] (AgentManager-Handler-12:null) Seq 9-4450963806725603425: MgmtId 249082151178140: Req: Resource [Host:9] is unreachable: Host 9: Link is closed
> 2015-08-12 16:18:54,558 DEBUG [c.c.a.m.ClusteredAgentManagerImpl] (AgentManager-Handler-12:null) Seq 9--1: MgmtId 249082151178140: Req: Routing to peer
> 2015-08-12 16:18:54,559 DEBUG [c.c.a.m.ClusteredAgentManagerImpl] (AgentManager-Handler-15:null) Seq 9--1: MgmtId 249082151178140: Req: Cancel request received
> 2015-08-12 16:18:54,559 DEBUG [c.c.a.m.AgentAttache] (AgentManager-Handler-15:null) Seq 9-4450963806725603425: Cancelling.
> 2015-08-12 16:18:54,559 DEBUG [c.c.a.m.AgentAttache] > (StatsCollector-3:ctx-b287085a) Seq 9-4450963806725603425: Waiting > some more time because this is the current command > 2015-08-12 16:18:54,559 DEBUG [c.c.a.m.AgentAttache] > (StatsCollector-3:ctx-b287085a) Seq 9-4450963806725603425: Waiting > some more time because this is the current command > 2015-08-12 16:18:54,559 INFO [c.c.u.e.CSExceptionErrorCode] > (StatsCollector-3:ctx-b287085a) Could not find exception: > com.cloud.exception.OperationTimedoutException in error code list for > exceptions > 2015-08-12 16:18:54,559 INFO [c.c.a.t.Request] > (StatsCollector-3:ctx-b287085a) not building log message for '[{}]', > _cmds.length == 1 > 2015-08-12 16:18:54,559 WARN [c.c.a.m.AgentAttache] > (StatsCollector-3:ctx-b287085a) Seq 9-4450963806725603425: Timed out > on null > 2015-08-12 16:18:54,559 DEBUG [c.c.a.m.AgentAttache] > (StatsCollector-3:ctx-b287085a) Seq 9-4450963806725603425: Cancelling. > 2015-08-12 16:18:54,559 DEBUG [c.c.s.StorageManagerImpl] > (StatsCollector-3:ctx-b287085a) Unable to send storage pool command to > Pool[3|NetworkFilesystem] via 9 > com.cloud.exception.OperationTimedoutException: Commands > 4450963806725603425 to Host 9 timed out after 3600 > at > com.cloud.agent.manager.AgentAttache.send(AgentAttache.java:434) > at > com.cloud.agent.manager.AgentManagerImpl.send(AgentManagerImpl.java:418) > at > com.cloud.agent.manager.AgentManagerImpl.send(AgentManagerImpl.java:362) > at > com.cloud.storage.StorageManagerImpl.sendToPool(StorageManagerImpl.java:965) > at > com.cloud.storage.StorageManagerImpl.sendToPool(StorageManagerImpl.java:390) > at > com.cloud.storage.StorageManagerImpl.sendToPool(StorageManagerImpl.java:404) > at > com.cloud.server.StatsCollector$StorageCollector.runInContext(StatsCollector.java:642) > at > org.apache.cloudstack.managed.context.ManagedContextRunnable$1.run(ManagedContextRunnable.java:49) > at > 
org.apache.cloudstack.managed.context.impl.DefaultManagedContext$1.call(DefaultManagedContext.java:56)
> at org.apache.cloudstack.managed.context.impl.DefaultManagedContext.callWithContext(DefaultManagedContext.java:103)
> at org.apache.cloudstack.managed.context.impl.DefaultManagedContext.runWithContext(DefaultManagedContext.java:53)
> at org.apache.cloudstack.managed.context.ManagedContextRunnable.run(ManagedContextRunnable.java:46)
> at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:471)
>
> -----------
> Cao Tong
>
> On 07/30/2015 10:55 AM, tony_caot...@163.com wrote:
>>
>> Hi Yiping, thanks for your reply.
>>
>> My NFS server is deployed on the ACS manager host; the other host is a
>> single XenServer.
>> A KVM environment is deployed on the ACS manager host too.
>> There are three storages, named Kprimary, Xprimary and Secondary.
>>
>> First, I added the KVM cluster with zone-wide Kprimary; it works fine,
>> even after a reboot.
>> Second, I added the XenServer with Xprimary (cluster-wide); it still
>> works fine.
>> Then I disabled the zone, shut down all system VMs and disabled
>> those two hosts. Finally I shut down the XenServer host.
>> When the XenServer shutdown finished, I stopped these services in order
>> (cloudstack-agent, cloudstack-management, libvirtd, nfs, rpcbind,
>> mysqld).
>> Last, I shut down the ACS host.
>>
>> The startup process is exactly the reverse of this order.
>>
>> When startup is done, ACS says it is unable to send a command to the
>> pool via the XenServer host.
>>
>> I am sure nothing changed in my environment during the reboot.
>> When I fixed this problem, the only change was switching Kprimary from
>> zone-wide to cluster-wide.
>>
>> My guess is that after the reboot some state was initialized from
>> scratch, and ACS found that the XenServer host had two usable primary
>> storages, so it used the higher-priority one, which was Kprimary.
>>
>> Anyway, maybe this can help you people make ACS better.
>>
>> BTW, the error logs I attached some days ago already point out that
>> this is a storage problem, like:
>>
>> Unable to send storage pool command
>> to Pool[4|NetworkFilesystem] via 4
>>
>> -----------
>> Cao Tong
>>
>> On 07/30/2015 12:44 AM, Yiping Zhang wrote:
>>> Well, sometimes people can’t answer a question because of a lack of
>>> relevant information, or simply because no one has encountered a
>>> similar situation before.
>>>
>>> Looking at your past messages on this thread, there was no mention
>>> of primary storage. Obviously, your primary storage configuration had
>>> changed between the time you shut down the CS manager and XenServers
>>> and the time you restarted them. That is the vital info the list
>>> didn’t know.
>>>
>>> To the best of my knowledge, zone-wide primary storage has never been
>>> supported for Xen hypervisors.
>>>
>>> I do have to say that quite often CloudStack error messages are very
>>> cryptic and do not provide enough *useful* information to help users
>>> identify and troubleshoot actual problems. That stack trace output
>>> might be a gold mine to developers, but it is utterly useless for
>>> end users.
>>>
>>> Just my $0.02
>>>
>>> Yiping
>>>
>>> On 7/28/15, 11:19 PM, "tony_caot...@163.com" <tony_caot...@163.com>
>>> wrote:
>>>
>>>> Hi, I finally resolved this problem by myself.
>>>>
>>>> * Primary Storage: A storage resource typically provided to a single
>>>> cluster for the actual running of instance disk images. (Zone-wide
>>>> primary storage is an option, though not typically used.)
>>>>
>>>> The line above is from
>>>> http://docs.cloudstack.apache.org/en/master/concepts.html
>>>>
>>>> Because I had a zone-wide primary storage, ACS could not find the
>>>> correct primary storage belonging to the XenServer cluster after
>>>> the reboot.
>>>>
>>>> Then I changed the zone-wide primary to cluster-wide, and that
>>>> resolved it.
>>>>
>>>> Right now I have two primary storages: one is KVM cluster-wide, the
>>>> other is XenServer cluster-wide.
>>>>
>>>> The above is for people who run into the same problem one day.
>>>>
>>>> By the way, I am very curious why I never received replies from such
>>>> a big community, except at the very beginning, of course.
>>>>
>>>> Is my English really so poor that nobody can understand what
>>>> language I am speaking?
>>>>
>>>> -----------
>>>> Cao Tong
>>>>
>>>> On 07/22/2015 09:03 PM, tony_caot...@163.com wrote:
>>>>> Hey! Help please...
>>>>>
>>>>> Some news.
>>>>> I think the cause is that the ACS host can't communicate with the
>>>>> XenServer host.
>>>>> ACS keeps outputting logs like this:
>>>>>
>>>>> 2015-07-22 20:42:13,555 DEBUG [c.c.a.m.ClusteredAgentAttache] (AgentManager-Handler-7:null) Seq 5-8174877748607582212: Forwarding Seq 5-8174877748607582212: { Cmd , MgmtId: 279278805451459, via: 5, Ver: v1, Flags: 100111, [{"com.cloud.agent.api.MaintainCommand":{"wait":0}}] } to 280345368052992
>>>>>
>>>>> I am not sure whether the ACS status is wrong or some services on
>>>>> the XenServer have not been started.
>>>>>
>>>>> On the XenServer, I found that *xenheartbeat.sh is not running.*
>>>>> *(/bin/bash /opt/cloud/bin/xenheartbeat.sh
>>>>> 00d8e0d0-8561-4b3d-9044-cbc496ff22cc 120 60)*
>>>>>
>>>>> As some operations on the XenServer were pending, the XenServer
>>>>> could not be deleted from the web UI.
>>>>>
>>>>> I found a temporary solution:
>>>>>
>>>>> 1. Delete the jobs from DB table cloud.vm_work_job.
>>>>> 2. Delete the XenServer from DB table cloud.host.
>>>>> 3. Add the XenServer host back from the web UI.
>>>>>
>>>>> Then it works.
>>>>>
>>>>> Does anyone have an idea about this?
>>>>>
>>>>> Could anyone tell me what ACS does on a XenServer host when
>>>>> adding it?
>>>>>
>>>>> Thanks,
>>>>>
>>>>> -----------
>>>>> Cao Tong
>>>>>
>>>>> On 07/22/2015 04:26 PM, tony_caot...@163.com wrote:
>>>>>> @prashant, following are the answers to your questions:
>>>>>>
>>>>>> 1. Yes, primary storage is connected fine for my XenServer.
>>>>>>
>>>>>> 2. No, the XenServer's password was not changed.
>>>>>>
>>>>>> 3. Yes, the web UI is fine, and I can log in.
>>>>>>
>>>>>> 4. Before the reboot, I unmanaged and disabled the resources, and
>>>>>> after the reboot I enabled all of them.
>>>>>>
>>>>>> 5. The hosts' state is Up.
>>>>>>
>>>>>> 6. No yum update was run anywhere.
>>>>>>
>>>>>> 7. The system VMs' status is fine, I think.
>>>>>>
>>>>>> -----------
>>>>>> Cao Tong
>>>>>>
>>>>>> On 07/22/2015 04:13 PM, tony_caot...@163.com wrote:
>>>>>>> Hi,
>>>>>>>
>>>>>>> After reinstalling, I got the problem again.
>>>>>>>
>>>>>>> So I will describe it once again.
>>>>>>>
>>>>>>> What my environment looks like:
>>>>>>>
>>>>>>> I have an ACS server host and a XenServer host. After rebooting
>>>>>>> both, I cannot create a VM on the XenServer through ACS.
>>>>>>> A KVM and an NFS server run together on the ACS manager host.
>>>>>>>
>>>>>>> The status of the new VM is always 'Starting' in the web UI, but
>>>>>>> I can create a new VM using XenCenter.
>>>>>>> >>>>>>> ------------- ERR LOGS ---------- >>>>>>> 2015-07-22 15:56:56,357 DEBUG [c.c.s.StorageManagerImpl] >>>>>>> (StatsCollector-3:ctx-1aa2e8c9) Unable to send storage pool command >>>>>>> to Pool[4|NetworkFilesystem] via 4 >>>>>>> com.cloud.exception.OperationTimedoutException: Commands >>>>>>> 2829104990918803478 to Host 4 timed out after 3600 >>>>>>> >>>>>>> 2015-07-22 15:56:56,358 INFO [c.c.s.StatsCollector] >>>>>>> (StatsCollector-3:ctx-1aa2e8c9) Unable to reach >>>>>>> Pool[4|NetworkFilesystem] >>>>>>> com.cloud.exception.StorageUnavailableException: Resource >>>>>>> [StoragePool:4] is unreachable: Unable to send command to the pool >>>>>>> >>>>>>> >>>>>>> ------------- and there are lots of DEBUG infos ------- repeat >>>>>>> again and again ----------- >>>>>>> >>>>>>> 2015-07-22 15:36:12,887 DEBUG [c.c.a.m.ClusteredAgentAttache] >>>>>>> (AgentManager-Handler-14:null) Seq 4-8064821032713715922: >>>>>>> Forwarding >>>>>>> Seq 4-8064821032713715922: { Cmd , MgmtId: 227448510156211, >>>>>>> via: 4, >>>>>>> Ver: v1, Flags: 100111, >>>>>>> [{"com.cloud.agent.api.MaintainCommand":{"wait":0}}] } to >>>>>>> 116784073679673 >>>>>>> 2015-07-22 15:36:12,889 DEBUG [c.c.a.m.ClusteredAgentAttache] >>>>>>> (AgentManager-Handler-10:null) Seq 4-8064821032713715883: >>>>>>> Forwarding >>>>>>> Seq 4-8064821032713715883: { Cmd , MgmtId: 227448510156211, >>>>>>> via: 4, >>>>>>> Ver: v1, Flags: 100111, >>>>>>> >>>>>>> [{"org.apache.cloudstack.storage.command.CopyCommand":{"srcTO":{"org.ap >>>>>>> >>>>>>> ache.cloudstack.storage.to.TemplateObjectTO":{"path":"template/tmpl/1/5 >>>>>>> >>>>>>> /af949612-838f-3a6d-931b-312e612db740.vhd","origUrl":"http://download.c >>>>>>> >>>>>>> loud.com/templates/builtin/centos56-x86_64.vhd.bz2","uuid":"80b60e46-30 >>>>>>> >>>>>>> 17-11e5-8736-00259091a13a","id":5,"format":"VHD","accountId":1,"checksu >>>>>>> >>>>>>> m":"905cec879afd9c9d22ecc8036131a180","hvm":false,"displayText":"CentOS >>>>>>> >>>>>>> >>>>>>> 5.6(64-bit) no GUI >>>>>>> 
>>>>>>> (XenServer)","imageDataStore":{"com.cloud.agent.api.to.NfsTO":{"_url":" >>>>>>> >>>>>>> nfs://10.0.0.100/storage/secondary","_role":"Image"}},"name":"centos56- >>>>>>> >>>>>>> x86_64-xen","hypervisorType":"XenServer"}},"destTO":{"org.apache.clouds >>>>>>> >>>>>>> tack.storage.to.TemplateObjectTO":{"origUrl":"http://download.cloud.com >>>>>>> >>>>>>> /templates/builtin/centos56-x86_64.vhd.bz2","uuid":"80b60e46-3017-11e5- >>>>>>> >>>>>>> 8736-00259091a13a","id":5,"format":"VHD","accountId":1,"checksum":"905c >>>>>>> >>>>>>> ec879afd9c9d22ecc8036131a180","hvm":false,"displayText":"CentOS >>>>>>> 5.6(64-bit) no GUI >>>>>>> >>>>>>> (XenServer)","imageDataStore":{"org.apache.cloudstack.storage.to.Primar >>>>>>> >>>>>>> yDataStoreTO":{"uuid":"2df26406-31bf-3a95-8a61-f5008defd9a0","id":4,"po >>>>>>> >>>>>>> olType":"NetworkFilesystem","host":"10.0.0.100","path":"/storage/xen/pr >>>>>>> >>>>>>> imary","port":2049,"url":"NetworkFilesystem://10.0.0.100/storage/xen/pr >>>>>>> >>>>>>> imary/?ROLE=Primary&STOREUUID=2df26406-31bf-3a95-8a61-f5008defd9a0"}}," >>>>>>> >>>>>>> name":"centos56-x86_64-xen","hypervisorType":"XenServer"}},"executeInSe >>>>>>> >>>>>>> quence":true,"options":{},"wait":10800}}] >>>>>>> } to 116784073679673 >>>>>>> >>>>>>> >>>>>>> ----------------------------------------- >>>>>>> >>>>>>> Anyone have Any ideas? thanks. >>>>>>> >>>>>>> ----------- >>>>>>> Cao Tong >>>>>>> >>>>>>> On 07/21/2015 06:14 PM, tony_caot...@163.com wrote: >>>>>>>> Thanks all, >>>>>>>> >>>>>>>> I have already reinstall my hosts for preparing a new clear >>>>>>>> environment to restart my research. >>>>>>>> >>>>>>>> ----------- >>>>>>>> Cao Tong >>>>>>>> >>>>>>>> On 07/20/2015 09:24 PM, Prashant s wrote: >>>>>>>>> some questions : >>>>>>>>> >>>>>>>>> can you please tell ... >>>>>>>>> >>>>>>>>> 1. is your NFS storage or your primary Storage Repository in >>>>>>>>> connected >>>>>>>>> mode with no red cross mark on them in xencenter. >>>>>>>>> 2. 
did you change any passwords on the xenservers ? >>>>>>>>> 3. is the cloudstack web ui up , can you login to the cloudstack >>>>>>>>> web page. >>>>>>>>> 4. *are the zone , pod, or clusters in unmanaged or disabled >>>>>>>>> state >>>>>>>>> ? * >>>>>>>>> *5. is all the hosts in connected state ? * >>>>>>>>> *6. did you run yum update on host reboot on the cs manager >>>>>>>>> vm ? * >>>>>>>>> *7. system vms are stateless you can kill them and cs will >>>>>>>>> recreate a new >>>>>>>>> one .. so dont worry :-) * >>>>>>>>> >>>>>>>>> >>>>>>>>> *thanks * >>>>>>>>> *prashant * >>>>>>>>> >>>>>>>>> >>>>>>>>> >>>>>>>>> On Mon, Jul 20, 2015 at 3:47 AM, <tony_caot...@163.com> wrote: >>>>>>>>> >>>>>>>>>> Hi, I restartd All hosts (one mgr and xenserver) again. >>>>>>>>>> >>>>>>>>>> >>>>>>>>>> Following is the error log. >>>>>>>>>> >>>>>>>>>> >>>>>>>>>> 2015-07-20 15:33:49,688 INFO [c.c.u.e.CSExceptionErrorCode] >>>>>>>>>> (StatsCollector-3:ctx-692a5392) Could not find exception: >>>>>>>>>> com.cloud.exception.OperationTimedoutException in error code >>>>>>>>>> list >>>>>>>>>> for >>>>>>>>>> exceptions >>>>>>>>>> 2015-07-20 15:33:49,688 WARN [c.c.a.m.AgentAttache] >>>>>>>>>> (StatsCollector-3:ctx-692a5392) Seq 1-3176445112179752972: Timed >>>>>>>>>> out on null >>>>>>>>>> 2015-07-20 15:33:49,689 DEBUG [c.c.a.m.AgentAttache] >>>>>>>>>> (StatsCollector-3:ctx-692a5392) Seq 1-3176445112179752972: >>>>>>>>>> Cancelling. 
>>>>>>>>>> 2015-07-20 15:33:49,689 DEBUG [c.c.s.StorageManagerImpl] >>>>>>>>>> (StatsCollector-3:ctx-692a5392) Unable to send storage pool >>>>>>>>>> command to >>>>>>>>>> Pool[1|NetworkFilesystem] via 1 >>>>>>>>>> com.cloud.exception.OperationTimedoutException: Commands >>>>>>>>>> 3176445112179752972 to Host 1 timed out after 3600 >>>>>>>>>> at >>>>>>>>>> com.cloud.agent.manager.AgentAttache.send(AgentAttache.java:436) >>>>>>>>>> at >>>>>>>>>> >>>>>>>>>> com.cloud.agent.manager.AgentManagerImpl.send(AgentManagerImpl.java: >>>>>>>>>> >>>>>>>>>> 433) >>>>>>>>>> >>>>>>>>>> at >>>>>>>>>> >>>>>>>>>> com.cloud.agent.manager.AgentManagerImpl.send(AgentManagerImpl.java: >>>>>>>>>> >>>>>>>>>> 362) >>>>>>>>>> >>>>>>>>>> at >>>>>>>>>> >>>>>>>>>> com.cloud.storage.StorageManagerImpl.sendToPool(StorageManagerImpl.j >>>>>>>>>> >>>>>>>>>> ava:1000) >>>>>>>>>> >>>>>>>>>> at >>>>>>>>>> >>>>>>>>>> com.cloud.storage.StorageManagerImpl.sendToPool(StorageManagerImpl.j >>>>>>>>>> >>>>>>>>>> ava:392) >>>>>>>>>> >>>>>>>>>> at >>>>>>>>>> >>>>>>>>>> com.cloud.storage.StorageManagerImpl.sendToPool(StorageManagerImpl.j >>>>>>>>>> >>>>>>>>>> ava:406) >>>>>>>>>> >>>>>>>>>> at >>>>>>>>>> >>>>>>>>>> com.cloud.server.StatsCollector$StorageCollector.runInContext(StatsC >>>>>>>>>> >>>>>>>>>> ollector.java:642) >>>>>>>>>> >>>>>>>>>> at >>>>>>>>>> >>>>>>>>>> org.apache.cloudstack.managed.context.ManagedContextRunnable$1.run(M >>>>>>>>>> >>>>>>>>>> anagedContextRunnable.java:49) >>>>>>>>>> >>>>>>>>>> at >>>>>>>>>> >>>>>>>>>> org.apache.cloudstack.managed.context.impl.DefaultManagedContext$1.c >>>>>>>>>> >>>>>>>>>> all(DefaultManagedContext.java:56) >>>>>>>>>> >>>>>>>>>> at >>>>>>>>>> >>>>>>>>>> org.apache.cloudstack.managed.context.impl.DefaultManagedContext.cal >>>>>>>>>> >>>>>>>>>> lWithContext(DefaultManagedContext.java:103) >>>>>>>>>> >>>>>>>>>> at >>>>>>>>>> >>>>>>>>>> org.apache.cloudstack.managed.context.impl.DefaultManagedContext.run >>>>>>>>>> >>>>>>>>>> 
WithContext(DefaultManagedContext.java:53) >>>>>>>>>> >>>>>>>>>> at >>>>>>>>>> >>>>>>>>>> org.apache.cloudstack.managed.context.ManagedContextRunnable.run(Man >>>>>>>>>> >>>>>>>>>> agedContextRunnable.java:46) >>>>>>>>>> >>>>>>>>>> at >>>>>>>>>> >>>>>>>>>> java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:4 >>>>>>>>>> >>>>>>>>>> 71) >>>>>>>>>> >>>>>>>>>> at >>>>>>>>>> java.util.concurrent.FutureTask.runAndReset(FutureTask.java:304) >>>>>>>>>> at >>>>>>>>>> >>>>>>>>>> java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask >>>>>>>>>> >>>>>>>>>> .access$301(ScheduledThreadPoolExecutor.java:178) >>>>>>>>>> >>>>>>>>>> at >>>>>>>>>> >>>>>>>>>> java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask >>>>>>>>>> >>>>>>>>>> .run(ScheduledThreadPoolExecutor.java:293) >>>>>>>>>> >>>>>>>>>> at >>>>>>>>>> >>>>>>>>>> java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor >>>>>>>>>> >>>>>>>>>> .java:1145) >>>>>>>>>> >>>>>>>>>> at java.lang.Thread.run(Thread.java:745) >>>>>>>>>> 2015-07-20 15:33:49,689 INFO [c.c.s.StatsCollector] >>>>>>>>>> (StatsCollector-3:ctx-692a5392) Unable to reach >>>>>>>>>> Pool[1|NetworkFilesystem] >>>>>>>>>> com.cloud.exception.StorageUnavailableException: Resource >>>>>>>>>> [StoragePool:1] >>>>>>>>>> is unreachable: Unable to send command to the pool >>>>>>>>>> at >>>>>>>>>> >>>>>>>>>> com.cloud.storage.StorageManagerImpl.sendToPool(StorageManagerImpl.j >>>>>>>>>> >>>>>>>>>> ava:1010) >>>>>>>>>> >>>>>>>>>> at >>>>>>>>>> >>>>>>>>>> com.cloud.storage.StorageManagerImpl.sendToPool(StorageManagerImpl.j >>>>>>>>>> >>>>>>>>>> ava:392) >>>>>>>>>> >>>>>>>>>> at >>>>>>>>>> >>>>>>>>>> com.cloud.storage.StorageManagerImpl.sendToPool(StorageManagerImpl.j >>>>>>>>>> >>>>>>>>>> ava:406) >>>>>>>>>> >>>>>>>>>> at >>>>>>>>>> >>>>>>>>>> com.cloud.server.StatsCollector$StorageCollector.runInContext(StatsC >>>>>>>>>> >>>>>>>>>> ollector.java:642) >>>>>>>>>> >>>>>>>>>> at >>>>>>>>>> >>>>>>>>>> 
org.apache.cloudstack.managed.context.ManagedContextRunnable$1.run(M >>>>>>>>>> >>>>>>>>>> anagedContextRunnable.java:49) >>>>>>>>>> >>>>>>>>>> at >>>>>>>>>> >>>>>>>>>> org.apache.cloudstack.managed.context.impl.DefaultManagedContext$1.c >>>>>>>>>> >>>>>>>>>> all(DefaultManagedContext.java:56) >>>>>>>>>> >>>>>>>>>> at >>>>>>>>>> >>>>>>>>>> org.apache.cloudstack.managed.context.impl.DefaultManagedContext.cal >>>>>>>>>> >>>>>>>>>> lWithContext(DefaultManagedContext.java:103) >>>>>>>>>> >>>>>>>>>> at >>>>>>>>>> >>>>>>>>>> org.apache.cloudstack.managed.context.impl.DefaultManagedContext.run >>>>>>>>>> >>>>>>>>>> WithContext(DefaultManagedContext.java:53) >>>>>>>>>> >>>>>>>>>> at >>>>>>>>>> >>>>>>>>>> org.apache.cloudstack.managed.context.ManagedContextRunnable.run(Man >>>>>>>>>> >>>>>>>>>> agedContextRunnable.java:46) >>>>>>>>>> >>>>>>>>>> at >>>>>>>>>> >>>>>>>>>> java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:4 >>>>>>>>>> >>>>>>>>>> 71) >>>>>>>>>> >>>>>>>>>> at >>>>>>>>>> java.util.concurrent.FutureTask.runAndReset(FutureTask.java:304) >>>>>>>>>> at >>>>>>>>>> >>>>>>>>>> java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask >>>>>>>>>> >>>>>>>>>> .access$301(ScheduledThreadPoolExecutor.java:178) >>>>>>>>>> >>>>>>>>>> at >>>>>>>>>> >>>>>>>>>> java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask >>>>>>>>>> >>>>>>>>>> .run(ScheduledThreadPoolExecutor.java:293) >>>>>>>>>> >>>>>>>>>> at >>>>>>>>>> >>>>>>>>>> java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor >>>>>>>>>> >>>>>>>>>> .java:1145) >>>>>>>>>> >>>>>>>>>> at >>>>>>>>>> >>>>>>>>>> java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecuto >>>>>>>>>> >>>>>>>>>> r.java:615) >>>>>>>>>> >>>>>>>>>> at java.lang.Thread.run(Thread.java:745) >>>>>>>>>> >>>>>>>>>> ----------- >>>>>>>>>> Cao Tong >>>>>>>>>> >>>>>>>>>> >>>>>>>>>> On 07/20/2015 02:52 PM, tony_caot...@163.com wrote: >>>>>>>>>> >>>>>>>>>>> No, no one's IP was changed. 
>>>>>>>>>>>
>>>>>>>>>>> 1. On the XenServer I cannot log in to the system VMs using
>>>>>>>>>>> an internal IP like '169.254.1.112'. There should be a bridge
>>>>>>>>>>> network for this, right? It is gone.
>>>>>>>>>>>
>>>>>>>>>>> 2. I tried to delete the XenServer host from CS in the web
>>>>>>>>>>> UI. It also failed, with lots of logs like the following;
>>>>>>>>>>> then memory filled up and the management server went down...
>>>>>>>>>>>
>>>>>>>>>>> 2015-07-20 14:47:30,580 DEBUG [c.c.a.m.ClusteredAgentAttache] (AgentManager-Handler-15:null) Seq 1-7282039122481381399: Forwarding Seq 1-7282039122481381399: { Cmd , MgmtId: 104062526015411, via: 1, Ver: v1, Flags: 100111, [{"com.cloud.agent.api.MaintainCommand":{"wait":0}}] } to 192405008094602
>>>>>>>>>>> 2015-07-20 14:47:30,582 DEBUG [c.c.a.m.ClusteredAgentAttache] (AgentManager-Handler-5:null) Seq 1-7282039122481381399: Forwarding Seq 1-7282039122481381399: { Cmd , MgmtId: 104062526015411, via: 1, Ver: v1, Flags: 100111, [{"com.cloud.agent.api.MaintainCommand":{"wait":0}}] } to 192405008094602
>>>>>>>>>>> 2015-07-20 14:47:30,583 DEBUG [c.c.a.m.ClusteredAgentAttache] (AgentManager-Handler-1:null) Seq 1-7282039122481381399: Forwarding Seq 1-7282039122481381399: { Cmd , MgmtId: 104062526015411, via: 1, Ver: v1, Flags: 100111, [{"com.cloud.agent.api.MaintainCommand":{"wait":0}}] } to 192405008094602
>>>>>>>>>>> 2015-07-20 14:47:30,584 DEBUG [c.c.a.m.ClusteredAgentAttache] (AgentManager-Handler-14:null) Seq 1-7282039122481381399: Forwarding Seq 1-7282039122481381399: { Cmd , MgmtId: 104062526015411, via: 1, Ver: v1, Flags: 100111, [{"com.cloud.agent.api.MaintainCommand":{"wait":0}}] } to 192405008094602
>>>>>>>>>>>
>>>>>>>>>>> My guess is that some service or daemon that works for CS is
>>>>>>>>>>> not up on the XenServer?
>>>>>>>>>>>
>>>>>>>>>>> -----------
>>>>>>>>>>> Cao Tong
>>>>>>>>>>>
>>>>>>>>>>> On 07/20/2015 02:35 PM, Rajani Karuturi wrote:
>>>>>>>>>>>
>>>>>>>>>>>> Did the management server IP change?
>>>>>>>>>>>> The management server IP in the configuration table is used
>>>>>>>>>>>> by the system VMs.
>>>>>>>>>>>> select * from configuration where name like 'host';
>>>>>>>>>>>>
>>>>>>>>>>>> If it changed, correct the value in the DB and restart the
>>>>>>>>>>>> system VMs.
>>>>>>>>>>>>
>>>>>>>>>>>> ~Rajani
>>>>>>>>>>>>
>>>>>>>>>>>> On Mon, Jul 20, 2015 at 11:56 AM, <tony_caot...@163.com>
>>>>>>>>>>>> wrote:
>>>>>>>>>>>>
>>>>>>>>>>>>> Hello,
>>>>>>>>>>>>> I shut down my CS manager and XenServer last weekend, and
>>>>>>>>>>>>> now the SSVM and CPVM are disconnected. Those two were
>>>>>>>>>>>>> running on the XenServer. So what should I do right now?
>>>>>>>>>>>>> Could anybody please help me? Thanks.
>>>>>>>>>>>>>
>>>>>>>>>>>>> On the XenServer I found that the three system VMs are not
>>>>>>>>>>>>> running.
>>>>>>>>>>>>> My XenServer seems unable to reconnect to the CS manager,
>>>>>>>>>>>>> and it seems not to be under the control of CS.
>>>>>>>>>>>>>
>>>>>>>>>>>>> What are the right steps to shut down all machines in a CS
>>>>>>>>>>>>> group and resume them?
>>>>>>>>>>>>> How can I get my XenServer reconnected?
>>>>>>>>>>>>>
>>>>>>>>>>>>> Thanks,
>>>>>>>>>>>>>
>>>>>>>>>>>>> --
>>>>>>>>>>>>> -----------
>>>>>>>>>>>>> Cao Tong
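
For anyone hitting the same symptoms: the errors quoted throughout this thread follow two recurring patterns ("Unable to send storage pool command to Pool[...] via N" and "Commands ... to Host N timed out after ..."). Below is a minimal Python sketch, not part of CloudStack, that counts those two patterns per pool and per host in a pasted log excerpt. The function name `summarize` and both regular expressions are assumptions derived only from the log lines shown above; other CloudStack versions may word these messages differently.

```python
import re
from collections import Counter

# Both patterns are assumptions copied from the log excerpts in this thread.
POOL_RE = re.compile(
    r"Unable to send storage pool command to Pool\[(\d+)\|(\w+)\] via (\d+)"
)
TIMEOUT_RE = re.compile(r"Commands (\d+) to Host (\d+) timed out after (\d+)")


def summarize(log_text):
    """Return (pool_failures, host_timeouts) as two Counters.

    pool_failures is keyed by (pool id, pool type, host id);
    host_timeouts is keyed by host id.
    """
    # Strip leading mail quote markers ('>') from every line, then join the
    # lines so that entries wrapped by the mail client match again.
    unquoted = (re.sub(r"^[>\s]+", "", line) for line in log_text.splitlines())
    cleaned = re.sub(r"\s+", " ", " ".join(unquoted))

    pool_failures = Counter(POOL_RE.findall(cleaned))
    host_timeouts = Counter(host for _seq, host, _secs in TIMEOUT_RE.findall(cleaned))
    return pool_failures, host_timeouts


if __name__ == "__main__":
    sample = """\
>>>>>>> 2015-07-22 15:56:56,357 DEBUG [c.c.s.StorageManagerImpl]
>>>>>>> (StatsCollector-3:ctx-1aa2e8c9) Unable to send storage pool command
>>>>>>> to Pool[4|NetworkFilesystem] via 4
>>>>>>> com.cloud.exception.OperationTimedoutException: Commands
>>>>>>> 2829104990918803478 to Host 4 timed out after 3600
"""
    pools, timeouts = summarize(sample)
    for (pool_id, pool_type, host_id), count in pools.items():
        print(f"pool {pool_id} ({pool_type}) via host {host_id}: {count} failure(s)")
    for host_id, count in timeouts.items():
        print(f"host {host_id}: {count} timeout(s)")
```

If the same pool and host IDs keep appearing together, that points at the storage-to-host mapping (as turned out to be the case here with the zone-wide primary) rather than at a general network outage.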