Actually, the only issue I'm noticing now is the SSVM being automatically paused shortly after being created (while creating a new cloud).
If I go to XenCenter and forcefully shut the VM down, CloudStack restarts it OK. On Mon, Apr 28, 2014 at 7:34 PM, Mike Tutkowski < mike.tutkow...@solidfire.com> wrote: > Figured I'd CC Anthony and Edison to see if they have any input on this > (it looks like most of the changes on the relevant file > (Xenserver625StorageProcessor.java) were performed by one or the other). > > > On Mon, Apr 28, 2014 at 12:40 PM, Mike Tutkowski < > mike.tutkow...@solidfire.com> wrote: > >> Thanks for the reply, guys. >> >> Just wanted to point out that this is on 4.4 for me (although the issue >> may also be present on master). >> >> I have a sufficient number of IP addresses for both system and user VMs, >> so that should be OK (but good thought, Punith). >> >> I plan to continue debugging this later this afternoon, but have been in >> meetings all morning. >> >> Thanks! >> >> >> On Mon, Apr 28, 2014 at 10:41 AM, Dave Scott <dave.sc...@citrix.com>wrote: >> >>> Hi, >>> >>> (sorry to reply to my own email!) >>> >>> On 28 Apr 2014, at 11:42, Dave Scott <dave.sc...@citrix.com> wrote: >>> >>> > >>> > Hi Mike, >>> > >>> > On 28 Apr 2014, at 04:44, Mike Tutkowski <mike.tutkow...@solidfire.com> >>> wrote: >>> > >>> >> Hi, >>> >> >>> >> I recently installed 6.2 with XS62ESP1 and XS62ESP1004 (so that >>> >> Xenserver625StorageProcessor would be utilized). >>> >> >>> >> When I create a cloud from scratch, my SSVM starts up fine, but CPVM >>> ends >>> >> up in the Paused state. I have to force a shutdown of that VM and then >>> >> CloudStack restarts it and it works. This consistently happens. The >>> system >>> >> VMs are being deployed to the local storage of the one XS host I have >>> in my >>> >> one and only cluster. >>> >> >>> >> Any thoughts on that? >>> > >>> > I'm seeing the same symptom on my test cloud with 6.2 and XS62ESP1004. >>> I think there's a problem with XenAPI session and task handling in the >>> cloudstack master branch, although I've not tracked it down yet. In my >>> management server log I see: >>> > >>> > WARN [c.c.h.x.r.CitrixResourceBase] (DirectAgent-5:ctx-47dccee1) >>> Unable to start VM(v-2-VM) on host(1c4a31e9-469e-45c3-a0ad-9792ac7b >>> > 20f6) due to You gave an invalid session reference. It may have been >>> invalidated by a server restart, or timed out. You should get >>> > a new session handle, using one of the session.login_ calls. This >>> error does not invalidate the current connection. The handle para >>> > meter echoes the bad value given. >>> > You gave an invalid session reference. It may have been invalidated >>> by a server restart, or timed out. You should get a new session >>> > handle, using one of the session.login_ calls. This error does not >>> invalidate the current connection. The handle parameter echoes >>> > the bad value given. >>> > at com.xensource.xenapi.Types.checkResponse(Types.java:218) >>> > at com.xensource.xenapi.Connection.dispatch(Connection.java:395) >>> > at >>> com.cloud.hypervisor.xen.resource.XenServerConnectionPool$XenServerConnection.dispatch(XenServerConnectionPool.java:463) >>> > at com.xensource.xenapi.Event.from(Event.java:270) >>> > at >>> org.apache.cloudstack.hypervisor.xenserver.XenServerResourceNewBase.waitForTask(XenServerResourceNewBase.java:113) >>> > at >>> com.cloud.hypervisor.xen.resource.CitrixResourceBase.startVM(CitrixResourceBase.java:3455) >>> > >>> > Somehow the XenAPI session being used by the Event.from in the >>> XenServerResourceNewBase.waitForTask (used for recent 6.2 XenServers only) >>> is being logged-out somewhere. When this happens, the cloudstack cleanup >>> code calls Task.cancel and Task.destroy, and then the XenServer >>> Async.VM.start fails trying to update Task.progress before it internally >>> calls VM.unpause. >>> > >>> > I made a hack to disable caching of Connection/sessions: >>> > >>> > >>> https://github.com/djs55/cloudstack/commit/a388b71279086e42710e26340df0632d0d8135e4 >>> >>> For reference / experimentation, I've made a slightly more plausible >>> patch: >>> >>> >>> https://github.com/djs55/cloudstack/commit/9d40f56c6384d04a5f0fb22e5b97530c0164e0b2 >>> >>> It catches the SESSION_INVALID in the XenServerConnection and >>> transparently logs back in. This would prevent the higher level bits of the >>> XenServer plugin from having to deal with sessions being expired beneath >>> them. >>> >>> Chers, >>> Dave >>> >>> > >>> > I suspect this now leaks Connections/sessions, but the symptom goes >>> away. >>> > >>> > So far my thoughts are: >>> > >>> > 1. we need to find who's calling session.logout and why -- this will >>> help fix the problem in the short term >>> > >>> > 2. The XenServer XenAPI bindings are harder to use than they should be >>> (IMHO). In particular I think the bindings should take care of handling >>> SESSION_INVALID exceptions and re-authenticating transparently, to avoid >>> polluting the cloudstack code with rarely-used exception handlers. >>> > >>> > 3. the semantics of XenAPI task.destroy could be improved: instead of >>> immediately removing the task (which then causes cleanup code to fail >>> randomly it seems), it should be more like Unix waitpid with NOHANG i.e. >>> set a bit which says, "I'm done with this. Destroy it when you are finished >>> with it." >>> > >>> > >>> >> >>> >> Also, if I try to kick off a user VM to local storage, I get the >>> >> general-purpose InsufficientCapacityException and the virtual router >>> does >>> >> not even start up. >>> > >>> > No idea about this one :) >>> > >>> > Cheers, >>> > Dave >>> > >>> >> >>> >> Can anyone create a similar cloud to what I've described here with XS >>> 6.2, >>> >> XS62ESP1, and XS62ESP1004? I re-ran this test using a XS 6.1 host and >>> it >>> >> works just fine. >>> >> >>> >> At the moment, this is blocking a test case I'm trying to execute to >>> verify >>> >> code I had to write in Xenserver625StorageProcessor. >>> >> >>> >> Thanks! >>> >> >>> >> -- >>> >> *Mike Tutkowski* >>> >> *Senior CloudStack Developer, SolidFire Inc.* >>> >> e: mike.tutkow...@solidfire.com >>> >> o: 303.746.7302 >>> >> Advancing the way the world uses the >>> >> cloud<http://solidfire.com/solution/overview/?video=play> >>> >> *(tm)* >>> > >>> >>> >> >> >> -- >> *Mike Tutkowski* >> *Senior CloudStack Developer, SolidFire Inc.* >> e: mike.tutkow...@solidfire.com >> o: 303.746.7302 >> Advancing the way the world uses the >> cloud<http://solidfire.com/solution/overview/?video=play> >> *(tm)* >> > > > > -- > *Mike Tutkowski* > *Senior CloudStack Developer, SolidFire Inc.* > e: mike.tutkow...@solidfire.com > o: 303.746.7302 > Advancing the way the world uses the > cloud<http://solidfire.com/solution/overview/?video=play> > *(tm)* > -- *Mike Tutkowski* *Senior CloudStack Developer, SolidFire Inc.* e: mike.tutkow...@solidfire.com o: 303.746.7302 Advancing the way the world uses the cloud<http://solidfire.com/solution/overview/?video=play> *(tm)*