On Friday, September 14, 2018 at 4:48:17 AM UTC-5, Marcus Linsner wrote: > On Thursday, September 13, 2018 at 7:43:57 PM UTC+2, Guy Frank wrote: > > On Tuesday, September 11, 2018 at 5:12:00 PM UTC-5, Guy Frank wrote: > > > On Tuesday, September 11, 2018 at 4:29:02 PM UTC-5, awokd wrote: > > > > On Tue, September 11, 2018 7:10 pm, Guy Frank wrote: > > > > > On Tuesday, September 11, 2018 at 1:44:13 PM UTC-5, Guy Frank wrote: > > > > > > > > > >> Was a bit premature thinking that my qubes installation was stable. > > > > >> About half the time I start the system, it locks up and I am only > > > > >> able > > > > >> to access Dom0 (qubes manager will not open, nor will any qubes, even > > > > >> from command line). The system gives a serious 'filenotfound' error > > > > >> msg. I've looked at previous posts on problems like this, but my > > > > >> problem doesn't seem to fit what others reported--qubes.xml is not > > > > >> empty and disk utilization is minimal (or near 50% in one case). The > > > > >> error message is: > > > > >> > > > > >> # > > > > >> Whoops. A critical error has occurred. This is most likely a bug in > > > > >> Qubes Manager > > > > >> FileNotFoundError: [Errno 2] > > > > >> No such file or directory > > > > >> at line 9 of file /usr/bin/qubes-qube-manager # > > > > >> > > > > >> > > > > >> Line 9 reads: load_entry_point('qubesmanager==4.0.16', > > > > >> 'console_scripts', 'qubes-qube-manager')() > > > > >> > > > > >> > > > > >> Ok, so the weird thing is that this works fine half the time. On > > > > >> half > > > > >> of my boot ups, I don't encounter this problem. So if there is no > > > > >> such > > > > >> file or directory, it's not there half the time. qubes.xml looks > > > > >> good > > > > >> (to my untrained eyes), and df -h shows nothing at more than 1% > > > > >> utilization except for /dev/nvme0n1p1 mounted on /boot/efi which is > > > > >> 56% > > > > >> of 200MB. nvme0n1p1 is, I believe, the GPT table? > > > > >> > > > > >> I'm worried about coming to rely on this installation if at some > > > > >> point > > > > >> the error doesn't go away every other reboot and becomes permanent. > > > > >> Am > > > > >> trying updates now--maybe that will help. > > > > >> > > > > >> Guy > > > > >> > > > > > > > > > > Updating the software in dom0 doesn't make the problem disappear, > > > > > though > > > > > now the main error message is: > > > > > > > > > > QubesDaemonCommunicationError: Failed to connect to qubesd service: > > > > > [Errno 2] No such file or directory > > > > > at line 9 of file /usr/bin/qubes-qube-manager > > > > > > > > Nothing related earlier in the "sudo journalctl -e" log? Try "sudo > > > > systemctl restart qubesd"? > > > > > > Thanks awokd! I'll give these a try next time I run into the problem > > > > Ok, so on my next reboot, it ran into this problem again. I made a copy of > > the journalctl log and tried to restart qubesd, to no effect. > > > > The attached file, jnlctlErr.txt, if you scroll down to 09:24:43, I think > > you can see where the Qubes OS daemon fails. It is immediately preceded by > > the 1d.2 pci device worker failing, suggesting that something about this > > failure is causing the daemon from starting (which occurs below the blank > > line I added to the log). 1d.2 is a PCI Bridge, Intel Corp Device a332. No > > idea what exactly this is or how to find out (not a hardware person). > > > > One thing I thought of is the fact that there's a PS/2 card in the machine > > to which a PS/2 keyboard & mouse are attached. Neither has ever worked in > > Qubes (though they worked in Windows), so maybe that's what's triggering > > the problem? Will do some testing. > > > > When I attempt to start qubes daemon w/ sudo systemctl restart qubesd, > > journalctl log shows other errors. The qubes daemon doesn't get started > > and I can't use the system. > > > > What I can do is reboot. And about every other time, Qubes comes up and is > > fine. My concern is that at some point it'll stop doing this, so I'd > > really like to figure out how to solve this problem. > > > > Guy > > Looking the the relevant errors, in context (and the time between them): > > ... > Sep 13 09:20:23 localhost kernel: usb 1-10.1: New USB device found, > idVendor=413c, idProduct=2002 > Sep 13 09:20:23 localhost kernel: usb 1-10.1: New USB device strings: Mfr=1, > Product=2, SerialNumber=0 > Sep 13 09:20:23 localhost kernel: usb 1-10.1: Product: Dell USB Keyboard Hub > Sep 13 09:20:23 localhost kernel: usb 1-10.1: Manufacturer: Dell > Sep 13 09:20:23 localhost kernel: input: Dell Dell USB Keyboard Hub as > /devices/pci0000:00/0000:00:14.0/usb1/1-10/1-10.1/1-10.1:1.0/0003:413C:2002.0001/input/input3 > Sep 13 09:20:23 localhost kernel: hid-generic 0003:413C:2002.0001: > input,hidraw0: USB HID v1.10 Keyboard [Dell Dell USB Keyboard Hub] on > usb-0000:00:14.0-10.1/input0 > Sep 13 09:20:23 localhost kernel: input: Dell Dell USB Keyboard Hub as > /devices/pci0000:00/0000:00:14.0/usb1/1-10/1-10.1/1-10.1:1.1/0003:413C:2002.0002/input/input4 > Sep 13 09:20:23 localhost kernel: usb 4-3: new low-speed USB device number 2 > using ohci-pci > ... > Sep 13 09:20:23 localhost kernel: hid-generic 0003:413C:2002.0002: > input,hidraw1: USB HID v1.10 Device [Dell Dell USB Keyboard Hub] on > usb-0000:00:14.0-10.1/input1 > ... > Sep 13 09:21:43 dom0 kernel: dcdbas dcdbas: Dell Systems Management Base > Driver (version 5.6.0-3.2) > ... > Sep 13 09:21:44 dom0 kernel: acpi PNP0C14:03: duplicate WMI GUID > 05901221-D566-11D1-B2F0-00A0C9062910 (first instance was on PNP0C14:00) > Sep 13 09:21:44 dom0 kernel: wmi_bus wmi_bus-PNP0C14:04: WQBC data block > query control method not found > Sep 13 09:21:44 dom0 kernel: acpi PNP0C14:04: duplicate WMI GUID > 05901221-D566-11D1-B2F0-00A0C9062910 (first instance was on PNP0C14:00) > Sep 13 09:21:44 dom0 kernel: input: PC Speaker as > /devices/platform/pcspkr/input/input12 > Sep 13 09:21:44 dom0 kernel: input: Dell AIO WMI hotkeys as > /devices/virtual/input/input13 > ... > Sep 13 09:21:44 dom0 kernel: dell-wmi 9DBB5994-A997-11DA-B012-B622A1EF5492: > Dell descriptor buffer has invalid buffer length (32768) > Sep 13 09:21:44 dom0 kernel: dell-wmi 9DBB5994-A997-11DA-B012-B622A1EF5492: > Detected Dell WMI interface version 1 > Sep 13 09:21:44 dom0 kernel: input: Dell WMI hotkeys as > /devices/platform/PNP0C14:04/wmi_bus/wmi_bus-PNP0C14:04/9DBB5994-A997-11DA-B012-B622A1EF5492/input/input14 > Sep 13 09:21:44 dom0 systemd[1]: Found device /dev/disk/by-uuid/A482-5EDF. > Sep 13 09:21:44 dom0 systemd-udevd[1677]: Error calling EVIOCSKEYCODE on > device node '/dev/input/event14' (scan code 0x150, key code 190): Invalid > argument > ... > Sep 13 09:21:49 dom0 kernel: snd_hda_codec_realtek hdaudioC0D0: Failed to > find dell wmi symbol dell_micmute_led_set > ... > Sep 13 09:22:00 dom0 kernel: input: HDA Intel PCH HDMI/DP,pcm=10 as > /devices/pci0000:00/0000:00:1f.3/sound/card0/input21 > Sep 13 09:22:43 dom0 systemd-udevd[1652]: seq 2961 > '/devices/pci0000:00/0000:00:1d.2/0000:05:00.0/0000:06:00.0/usb4' is taking a > long time > Sep 13 09:23:44 dom0 systemd[1]: systemd-udev-settle.service: Main process > exited, code=exited, status=1/FAILURE > Sep 13 09:23:44 dom0 audit[1]: SERVICE_START pid=1 uid=0 auid=4294967295 > ses=4294967295 msg='unit=systemd-udev-settle comm="systemd" > exe="/usr/lib/systemd/systemd" hostname=? addr=? terminal=? res=failed' > Sep 13 09:23:44 dom0 systemd[1]: Failed to start udev Wait for Complete > Device Initialization. > Sep 13 09:23:44 dom0 kernel: audit: type=1130 audit(1536848624.020:69): pid=1 > uid=0 auid=4294967295 ses=4294967295 msg='unit=systemd-udev-settle > comm="systemd" exe="/usr/lib/systemd/systemd" hostname=? addr=? terminal=? > res=failed' > Sep 13 09:23:44 dom0 systemd[1]: systemd-udev-settle.service: Unit entered > failed state. > Sep 13 09:23:44 dom0 systemd[1]: systemd-udev-settle.service: Failed with > result 'exit-code'. > ... > Sep 13 09:23:44 dom0 systemd-logind[2748]: Watching system buttons on > /dev/input/event14 (Dell WMI hotkeys) > Sep 13 09:23:44 dom0 systemd-logind[2748]: Watching system buttons on > /dev/input/event13 (Dell AIO WMI hotkeys) > ... > Sep 13 09:23:45 dom0 qmemman.systemstate[2758]: stat: xenfree=29212956823 > memset_reqs=[] > Sep 13 09:24:43 dom0 systemd-udevd[1652]: seq 2961 > '/devices/pci0000:00/0000:00:1d.2/0000:05:00.0/0000:06:00.0/usb4' killed > Sep 13 09:24:43 dom0 systemd-udevd[1652]: worker [1674] terminated by signal > 9 (Killed) > Sep 13 09:24:43 dom0 systemd-udevd[1652]: worker [1674] failed while handling > '/devices/pci0000:00/0000:00:1d.2/0000:05:00.0/0000:06:00.0/usb4' > #pci device 1d.2 is: 00:1d.2 PCI bridge: Intel Corporation Device a332 (rev > f0), which I imagine might be related to: 00:17.0 SATA controller: Intel > Corporation Device a352 (rev 10) > > Sep 13 09:25:14 dom0 systemd[1]: qubesd.service: Start operation timed out. > Terminating. > Sep 13 09:25:14 dom0 systemd[1]: Failed to start Qubes OS daemon. > > > If this guy is correct https://bugs.freedesktop.org/show_bug.cgi?id=75875#c1 > about `you can safely ignore these errors` then the following workaround > attempts are probably not going to make any difference(but hey, I tried): > > > What's the output of `lsmod|grep -i wmi` ? I'm guessing there should be > something like `dell_wmi` which means it's possible to get it blacklisted (if > even temporarily), so with something like `modprobe.blacklist=dell_wmi` as a > kernel parameter (ie. cat /proc/cmdline) (or should it be dell-wmi ? that is, > with dash instead of underscore). There's a list of ways on how to do that > here https://askubuntu.com/questions/110341/how-to-blacklist-kernel-modules > but if you're using UEFI, that would mean appending that to the `kernel=` > line (of the default= kernel) in /boot/efi/EFI/qubes/xen.cfg then rebooting; > however I do recommend having a working copy first, as a backup, just in case > your xen.cfg modifications render the system unbootable, for inspiration on > how to do that, maybe see from here: > https://groups.google.com/d/msg/qubes-users/CZ5vMNL_c7k/btiRvk9eBAAJ > > If you don't want to blacklist that dell_wmi(guessing) module, or can't, then > maybe consider temporarily commenting out this whole block of lines: > # Dell Latitude microphone mute > evdev:name:Dell WMI hotkeys:dmi:bvn*:bvr*:bd*:svnDell*:pnLatitude* > > > # Dell Precision microphone mute > evdev:name:Dell WMI hotkeys:dmi:bvn*:bvr*:bd*:svnDell*:pnPrecision* > KEYBOARD_KEY_150=f20 # Mic mute toggle, > should be micmute > > in dom0 file: /usr/lib/udev/hwdb.d/60-keyboard.hwdb > which would mean that some multimedia keys won't work but it should also not > stall your boot process. Inspiration for this solution is from: > https://ubuntuforums.org/showthread.php?t=2250210&p=13153308#post13153308 > (or maybe more/different lines need to be blacklisted? unsure) > > Another solution would be to use a newer kernel (like 4.18.7 from the > unstable repo, see file /etc/yum.repos.d/qubes-dom0.repo if you want to > `enabled = 1` it, under section [qubes-dom0-unstable]), but before you do I > still recommend having another UEFI entry with the normal kernel(s) just in > case the new one will not boot at all (tho unlikely), so you can use your > BIOS boot menu to select between the two (tho be aware the currently running > one will be updated(xen.cfg -wise) when you `sudo qubes-dom0-update` to a > newer kernel). However I do note that your UEFI partition is only 200MiB so > you might need to only copy the running kernel (the one referenced by > `default=`) instead of everything, or you may not have enough space on it; > this only makes sense in this context: > https://groups.google.com/d/msg/qubes-users/CZ5vMNL_c7k/btiRvk9eBAAJ
Thanks awokd and Marcus! The points you made got me to unplugging the non-working PS/2 keyboard and mouse on my computer (which plug into a PCI card). I've rebooted 5 times since and have not run into an error. So it looks like something about the PS/2 peripherals were causing the problem. Which of course leads to the next question of why does this cause problems and why don't the keyboard & mouse work. Well, questions for another thread. -- You received this message because you are subscribed to the Google Groups "qubes-users" group. To unsubscribe from this group and stop receiving emails from it, send an email to qubes-users+unsubscr...@googlegroups.com. To post to this group, send email to qubes-users@googlegroups.com. To view this discussion on the web visit https://groups.google.com/d/msgid/qubes-users/82f1145e-94cf-445f-a2ed-cf341fb12cd7%40googlegroups.com. For more options, visit https://groups.google.com/d/optout.