[
https://issues.apache.org/jira/browse/CLOUDSTACK-4606?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13911485#comment-13911485
]
Remi Bergsma commented on CLOUDSTACK-4606:
------------------------------------------
Wouldn't it be easiest to have the virtual router reboot itself in case of a
kernel panic after, say, 10 seconds?
/etc/sysctl.conf:
kernel.panic = 10
The backup node will then take over. Or if it's non-HA, it will work again
after the reboot.
> HA does not kick on Routing VM hanging on kernel panic.
> -------------------------------------------------------
>
> Key: CLOUDSTACK-4606
> URL: https://issues.apache.org/jira/browse/CLOUDSTACK-4606
> Project: CloudStack
> Issue Type: Bug
> Security Level: Public(Anyone can view this level - this is the
> default.)
> Components: Virtual Router
> Affects Versions: 4.1.1
> Reporter: Roeland Kuipers
>
> If a routing VM hangs on kernel panic. It will never be recovered by the HA
> process of Cloudstack.
> In this case the Virtual Router cannot be accessed anymore over link-local
> and/or management address and cannot be managed anymore.
> We think HA should reboot a router when this occurs.
> See also CLOUDSTACK-4607 and CLOUDSTACK-4605
> Errors observed while hanging:
> 013-09-04 18:57:42,108 DEBUG [xen.resource.CitrixResourceBase]
> (DirectAgent-446:null) callHostPlugin failed for cmd: routerProxy with args
> args: vpc_netusage.sh 169.254.3.166 -l 95.142.107.111 -g, due to Failed to
> create input stream: Read timed out
> com.cloud.utils.exception.CloudRuntimeException: callHostPlugin failed for
> cmd: routerProxy with args args: vpc_netusage.sh 169.254.3.166 -l
> 95.142.107.111 -g, due to Failed to create input stream: Read timed out
> 2013-09-04 18:57:42,109 DEBUG [agent.transport.Request]
> (DirectAgent-446:null) Seq 105-1580215291: Processing: { Ans: , MgmtId:
> 345052370017, via: 105, Ver: v1, Flags: 10,
> [{"NetworkUsageAnswer":{"result":false,"details":"Exception:
> com.cloud.utils.exception.CloudRuntimeException\nMessage: callHostPlugin
> failed for cmd: routerProxy with args args: vpc_netusage.sh 169.254.3.166 -l
> 95.142.107.111 -g, due to Failed to create input stream: Read timed
> out\nStack: com.cloud.utils.exception.CloudRuntimeException: callHostPlugin
> failed for cmd: routerProxy with args args: vpc_netusage.sh 169.254.3.166 -l
> 95.142.107.111 -g, due to Failed to create input stream: Read timed
> out\n\tat
> com.cloud.hypervisor.xen.resource.CitrixResourceBase.callHostPlugin(CitrixResourceBase.java:3745)\n\tat
>
> com.cloud.hypervisor.xen.resource.XenServer56Resource.VPCNetworkUsage(XenServer56Resource.java:186)\n\tat
>
> com.cloud.hypervisor.xen.resource.XenServer56Resource.execute(XenServer56Resource.java:211)\n\tat
>
> com.cloud.hypervisor.xen.resource.XenServer56Resource.executeRequest(XenServer56Resource.java:71)\n\tat
>
> com.cloud.agent.manager.DirectAgentAttache$Task.run(DirectAgentAttache.java:191)\n\tat
>
> java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:441)\n\tat
> java.util.concurrent.FutureTask$Sync.innerRun(FutureTask.java:303)\n\tat
> java.util.concurrent.FutureTask.run(FutureTask.java:138)\n\tat
> java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.access$301(ScheduledThreadPoolExecutor.java:98)\n\tat
>
> java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.run(ScheduledThreadPoolExecutor.java:206)\n\tat
>
> java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:886)\n\tat
>
> java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:908)\n\tat
> java.lang.Thread.run(Thread.java:662)\n","wait":0}}] }
> Message: callHostPlugin failed for cmd: routerProxy with args args:
> vpc_netusage.sh 169.254.3.166 -l 95.142.107.111 -g, due to Failed to create
> input stream: Read timed out
> Stack: com.cloud.utils.exception.CloudRuntimeException: callHostPlugin failed
> for cmd: routerProxy with args args: vpc_netusage.sh 169.254.3.166 -l
> 95.142.107.111 -g, due to Failed to create input stream: Read timed out
> Message: callHostPlugin failed for cmd: routerProxy with args args:
> vpc_netusage.sh 169.254.3.166 -l 95.142.107.111 -g, due to Failed to create
> input stream: Read timed out
> Stack: com.cloud.utils.exception.CloudRuntimeException: callHostPlugin failed
> for cmd: routerProxy with args args: vpc_netusage.sh 169.254.3.166 -l
> 95.142.107.111 -g, due to Failed to create input stream: Read timed out
> 2013-09-04 19:08:20,909 DEBUG [xen.resource.CitrixResourceBase]
> (DirectAgent-367:null) callHostPlugin failed for cmd: routerProxy with args
> args: vpc_netusage.sh 169.254.3.166 -l 95.142.107.111 -g, due to Failed to
> create input stream: Read timed out
> com.cloud.utils.exception.CloudRuntimeException: callHostPlugin failed for
> cmd: routerProxy with args args: vpc_netusage.sh 169.254.3.166 -l
> 95.142.107.111 -g, due to Failed to create input stream: Read timed out
--
This message was sent by Atlassian JIRA
(v6.1.5#6160)