OK, finally got it. I had to get a terminal ready with the virsh command, guess what the instance number was, and then run suspend right after starting with --vm-start-paused. That got it to really stay paused. I got into the console, booted the old kernel, and have now been repairing a bad yum transaction. I *think* I've finished that.
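For anyone who lands here with the same problem, this is roughly the sequence, written up as a sketch rather than a tested script. Assumptions on my part: the function names and the `run` dry-run wrapper are made up for illustration, the 30-second wait is a guess at "suitable time", the domain is addressed by the name HostedEngine (what --console reported) instead of the numeric instance id I actually guessed at, and virsh is already able to authenticate on the host. By default it only prints the commands.

```shell
#!/bin/sh
# Sketch only. With DRY_RUN=1 (the default) commands are printed, not executed.
DRY_RUN="${DRY_RUN:-1}"
DOMAIN="${DOMAIN:-HostedEngine}"          # assumed libvirt domain name of the engine VM
SETTLE_SECONDS="${SETTLE_SECONDS:-30}"    # assumed wait after poweroff; adjust to taste

# Hypothetical helper: print the command in dry-run mode, otherwise run it.
run() {
    if [ "$DRY_RUN" = "1" ]; then
        echo "+ $*"
    else
        "$@"
    fi
}

# Power the engine VM off, start it paused, and immediately suspend it again
# through virsh, since something was resuming it right after the paused start.
pause_engine_at_boot() {
    run hosted-engine --vm-poweroff
    run sleep "$SETTLE_SECONDS"
    run hosted-engine --vm-start-paused
    run virsh suspend "$DOMAIN"
    # Set a VNC password so the console can be attached before resuming.
    run hosted-engine --add-console-password
}

# Once the VNC console is connected, let the VM continue so the boot menu shows.
resume_engine() {
    run virsh resume "$DOMAIN"
}
```

Call pause_engine_at_boot, connect VNC, then call resume_engine and pick the older kernel at the grub menu. Set DRY_RUN=0 only after checking that the printed commands match your setup, and keep global maintenance mode on throughout.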
So, if I understand correctly, after the yum update I should run engine-setup? Do I run that inside the engine VM, or on the host it's running on?

BTW: I did look up the upgrade procedures in the documentation for the release. It links through two or three levels of other documents, then ends in an error 404.

--Jim

On Mon, Sep 3, 2018 at 6:39 PM, Jim Kusznir <[email protected]> wrote:

> Global maintenance mode is already on. hosted-engine --vm-start-paused
> results in a non-paused VM being started. Of course, this is executed
> after hosted-engine --vm-poweroff and suitable time left to let things shut
> down.
>
> I just ran another test, and did in fact see the engine was briefly
> paused, but then was quickly put in the running state. I don't know by
> what, though. Global maintenance mode is definitely enabled; every run of
> the hosted-engine command reminds me!
>
> On Mon, Sep 3, 2018 at 11:12 AM, Darrell Budic <[email protected]>
> wrote:
>
>> Don't know if there's anything special, it's been a while since I've
>> needed to start it in paused mode. Try putting it in HA maintenance mode
>> from the CLI and then start it in paused mode maybe?
>>
>> ------------------------------
>> *From:* Jim Kusznir <[email protected]>
>> *Subject:* Re: [ovirt-users] Upgraded host, engine now won't boot
>> *Date:* September 3, 2018 at 1:08:27 PM CDT
>> *To:* Darrell Budic
>> *Cc:* users
>>
>> Unfortunately, I seem unable to get connected to the console early enough
>> to actually see a kernel list.
>>
>> I've tried the hosted-engine --vm-start-paused command, but it just
>> starts it (running mode, not paused). By the time I can get vnc connected,
>> I have just that last line. ctrl-alt-del doesn't do anything with it,
>> either. Sending a reset through virsh seems to just kill the VM (it
>> doesn't respawn).
>>
>> HA seems to have some trouble with this too. Originally I allowed HA to
>> start it, and it would take a good long while before it gave up on the
>> engine and reset it. It instantly booted to the same crashed state, and
>> again waited a "good long while" (sorry, never timed it, but I know it was
>> >5 min).
>>
>> My current thought is that I need to get the engine started in paused
>> mode, connect vnc, then unpause it with virsh to catch what is happening.
>> Is there any magic to getting it started in paused mode?
>>
>> On Mon, Sep 3, 2018 at 11:03 AM, Darrell Budic <[email protected]>
>> wrote:
>>
>>> Send it a ctrl-alt-delete and see what happens. Possibly try an older
>>> kernel at the grub boot menu. Could also try stopping it with hosted-engine
>>> --vm-stop and let HA reboot it, see if it boots or get onto the console
>>> quickly and try and watch more of the boot.
>>>
>>> Ssh and yum upgrade is fine for the OS, although it's a good idea to
>>> enable Global HA Maintenance first so the HA watchdogs don't reboot it in
>>> the middle of that. After that, run "engine-setup" again, at least if there
>>> are new ovirt engine updates to be done. Then disable Global HA
>>> Maintenance, and run "shutdown -h now" to stop the Engine VM (rebooting
>>> seems to cause it to exit anyway, HA seems to run it as a single execution
>>> VM. Or at least in the past, it seems to quit anyway on me and shutdown
>>> triggered HA faster). Wait a few minutes, and HA will respawn it on a new
>>> instance and you can log into your engine again.
>>>
>>> ------------------------------
>>> *From:* Jim Kusznir <[email protected]>
>>> *Subject:* Re: [ovirt-users] Upgraded host, engine now won't boot
>>> *Date:* September 3, 2018 at 12:45:22 PM CDT
>>> *To:* Darrell Budic
>>> *Cc:* users
>>>
>>> Thanks to Jayme who pointed me to the --add-console-password
>>> hosted-engine command to set a password for vnc. Using that, I see only
>>> the single line:
>>>
>>> Probing EDD (edd=off to disable)... ok
>>>
>>> --Jim
>>>
>>> On Mon, Sep 3, 2018 at 10:26 AM, Jim Kusznir <[email protected]>
>>> wrote:
>>>
>>>> Is there a way to get a graphical console on boot of the engine VM so I
>>>> can see what's causing the failure to boot?
>>>>
>>>> On Mon, Sep 3, 2018 at 10:23 AM, Jim Kusznir <[email protected]>
>>>> wrote:
>>>>
>>>>> Thanks; I guess I didn't mention that I started there.
>>>>>
>>>>> The virsh list shows it in state running, and gluster is showing fully
>>>>> online and healed. However, I cannot bring up a console of the engine VM
>>>>> to see why it's not booting, even though it shows in running state.
>>>>>
>>>>> In any case, the hosts and engine were running happily. I applied the
>>>>> latest updates on the host, and the engine went unstable. I thought, OK,
>>>>> maybe there's an update to ovirt that also needs to be applied to the
>>>>> engine, so I ssh'ed in and ran yum update (never did find clear
>>>>> instructions on how one is supposed to maintain the engine, but I did see
>>>>> that listed online). A while later, it reset and never booted again.
>>>>>
>>>>> --Jim
>>>>>
>>>>> On Sun, Sep 2, 2018 at 4:28 PM, Darrell Budic <[email protected]>
>>>>> wrote:
>>>>>
>>>>>> It's definitely not starting, you'll have to see if you can figure
>>>>>> out why. A couple things to try:
>>>>>>
>>>>>> - Check "virsh list" and see if it's running, or paused for storage.
>>>>>> (google "virsh saslpasswd2
>>>>>> <https://www.google.com/search?client=safari&rls=en&q=virsh+saslpasswd2&ie=UTF-8&oe=UTF-8>"
>>>>>> if you need to add a user to do this with, it's per host)
>>>>>> - It's hyperconverged, so check your gluster volume for healing
>>>>>> and/or split brains and wait/resolve those.
>>>>>> - Check "gluster peer status" on each host and make sure your
>>>>>> gluster hosts are all talking. I've seen an upgrade screw up the firewall;
>>>>>> easy fix is to add a rule to allow the hosts to talk to each other on
>>>>>> your gluster network, no questions asked (-j ACCEPT, no port, etc).
>>>>>>
>>>>>> Good luck!
>>>>>>
>>>>>> ------------------------------
>>>>>> *From:* Jim Kusznir <[email protected]>
>>>>>> *Subject:* [ovirt-users] Upgraded host, engine now won't boot
>>>>>> *Date:* September 1, 2018 at 8:38:12 PM CDT
>>>>>> *To:* users
>>>>>>
>>>>>> Hello:
>>>>>>
>>>>>> I saw that there were updates to my ovirt-4.2 3 node hyperconverged
>>>>>> system, so I proceeded to apply them the usual way through the UI.
>>>>>>
>>>>>> At one point, the hosted engine was migrated to one of the upgraded
>>>>>> hosts, and then went "unstable" on me. Now, the hosted engine appears to
>>>>>> be crashed: It gets powered up, but it never boots up to the point where
>>>>>> it responds to pings or allows logins. After a while, the hosted engine
>>>>>> shows status (via console "hosted-engine --vm-status" command) "Powering
>>>>>> Down". It stays there for a long time.
>>>>>>
>>>>>> I tried forcing a poweroff then powering it on, but again, it never
>>>>>> gets up to where it will respond to pings. --vm-status shows bad health,
>>>>>> but up.
>>>>>>
>>>>>> I tried running the hosted-engine --console command, but got:
>>>>>>
>>>>>> [root@ovirt1 ~]# hosted-engine --console
>>>>>> The engine VM is running on this host
>>>>>> Connected to domain HostedEngine
>>>>>> Escape character is ^]
>>>>>> error: internal error: cannot find character device <null>
>>>>>>
>>>>>> [root@ovirt1 ~]#
>>>>>>
>>>>>> I tried to run the hosted-engine --upgrade-appliance command, but it
>>>>>> hangs at obtaining certificate (understandably, as the hosted-engine is
>>>>>> not up).
>>>>>>
>>>>>> How do I recover from this? And what caused this?
>>>>>>
>>>>>> --Jim
>>>>>> _______________________________________________
>>>>>> Users mailing list -- [email protected]
>>>>>> To unsubscribe send an email to [email protected]
>>>>>> Privacy Statement: https://www.ovirt.org/site/privacy-policy/
>>>>>> oVirt Code of Conduct: https://www.ovirt.org/community/about/community-guidelines/
>>>>>> List Archives: https://lists.ovirt.org/archives/list/[email protected]/message/XBNOOF4OA5C5AFGCT3KGUPUTRSOLIPXX/
_______________________________________________
Users mailing list -- [email protected]
To unsubscribe send an email to [email protected]
Privacy Statement: https://www.ovirt.org/site/privacy-policy/
oVirt Code of Conduct: https://www.ovirt.org/community/about/community-guidelines/
List Archives: https://lists.ovirt.org/archives/list/[email protected]/message/ZB3RJDJVNV3RZYVXCQSMEZFYLMUYBBKI/

