On Friday, September 14, 2018 at 4:48:17 AM UTC-5, Marcus Linsner wrote:
> On Thursday, September 13, 2018 at 7:43:57 PM UTC+2, Guy Frank wrote:
> > On Tuesday, September 11, 2018 at 5:12:00 PM UTC-5, Guy Frank wrote:
> > > On Tuesday, September 11, 2018 at 4:29:02 PM UTC-5, awokd wrote:
> > > > On Tue, September 11, 2018 7:10 pm, Guy Frank wrote:
> > > > > On Tuesday, September 11, 2018 at 1:44:13 PM UTC-5, Guy Frank wrote:
> > > > >
> > > > >> Was a bit premature thinking that my qubes installation was stable.
> > > > >> About half the time I start the system, it locks up and I am only 
> > > > >> able
> > > > >> to access Dom0 (qubes manager will not open, nor will any qubes, even
> > > > >> from command line).  The system gives a serious 'filenotfound' error
> > > > >> msg.  I've looked at previous posts on problems like this, but my
> > > > >> problem doesn't seem to fit what others reported--qubes.xml is not
> > > > >> empty and disk utilization is minimal (or near 50% in one case).  The
> > > > >> error message is:
> > > > >>
> > > > >> #
> > > > >> Whoops.  A critical error has occurred.  This is most likely a bug in
> > > > >> Qubes Manager
> > > > >> FileNotFoundError:  [Errno 2]
> > > > >> No such file or directory
> > > > >> at line 9 of file /usr/bin/qubes-qube-manager #
> > > > >>
> > > > >>
> > > > >> Line 9 reads:  load_entry_point('qubesmanager==4.0.16',
> > > > >> 'console_scripts', 'qubes-qube-manager')()
> > > > >>
> > > > >>
> > > > >> Ok, so the weird thing is that this works fine half the time.  On 
> > > > >> half
> > > > >> of my boot ups, I don't encounter this problem.  So if there is no 
> > > > >> such
> > > > >> file or directory, it's not there half the time.  qubes.xml looks 
> > > > >> good
> > > > >> (to my untrained eyes), and df -h shows nothing at more than 1%
> > > > >> utilization except for /dev/nvme0n1p1 mounted on /boot/efi which is 
> > > > >> 56%
> > > > >> of 200MB.  nvme0n1p1 is, I believe, the GPT table?
> > > > >>
> > > > >> I'm worried about coming to rely on this installation if at some 
> > > > >> point
> > > > >> the error doesn't go away every other reboot and becomes permanent.  
> > > > >> Am
> > > > >> trying updates now--maybe that will help.
> > > > >>
> > > > >> Guy
> > > > >>
> > > > >
> > > > > Updating the software in dom0 doesn't make the problem disappear, 
> > > > > though
> > > > > now the main error message is:
> > > > >
> > > > > QubesDaemonCommunicationError: Failed to connect to qubesd service:
> > > > > [Errno 2] No such file or directory
> > > > > at line 9 of file /usr/bin/qubes-qube-manager
> > > > 
> > > > Nothing related earlier in the "sudo journalctl -e" log? Try "sudo
> > > > systemctl restart qubesd"?
> > > 
> > > Thanks awokd!  I'll give these a try next time I run into the problem
> > 
> > Ok, so on my next reboot, it ran into this problem again.  I made a copy of 
> > the journalctl log and tried to restart qubesd, to no effect.  
> > 
> > The attached file, jnlctlErr.txt, if you scroll down to 09:24:43, I think 
> > you can see where the Qubes OS daemon fails.  It is immediately preceded by 
> > the 1d.2 pci device worker failing, suggesting that something about this 
> > failure is causing the daemon from starting (which occurs below the blank 
> > line I added to the log). 1d.2 is a PCI Bridge, Intel Corp Device a332.  No 
> > idea what exactly this is or how to find out (not a hardware person).  
> > 
> > One thing I thought of is the fact that there's a PS/2 card in the machine 
> > to which a PS/2 keyboard & mouse are attached.  Neither has ever worked in 
> > Qubes (though they worked in Windows), so maybe that's what's triggering 
> > the problem?  Will do some testing.
> > 
> > When I attempt to start qubes daemon w/ sudo systemctl restart qubesd, 
> > journalctl log shows other errors.  The qubes daemon doesn't get started 
> > and I can't use the system.
> > 
> > What I can do is reboot.  And about every other time, Qubes comes up and is 
> > fine.  My concern is that at some point it'll stop doing this, so I'd 
> > really like to figure out how to solve this problem.
> > 
> > Guy
> 
> Looking the the relevant errors, in context (and the time between them):
> 
> ...
> Sep 13 09:20:23 localhost kernel: usb 1-10.1: New USB device found, 
> idVendor=413c, idProduct=2002
> Sep 13 09:20:23 localhost kernel: usb 1-10.1: New USB device strings: Mfr=1, 
> Product=2, SerialNumber=0
> Sep 13 09:20:23 localhost kernel: usb 1-10.1: Product: Dell USB Keyboard Hub
> Sep 13 09:20:23 localhost kernel: usb 1-10.1: Manufacturer: Dell
> Sep 13 09:20:23 localhost kernel: input: Dell Dell USB Keyboard Hub as 
> /devices/pci0000:00/0000:00:14.0/usb1/1-10/1-10.1/1-10.1:1.0/0003:413C:2002.0001/input/input3
> Sep 13 09:20:23 localhost kernel: hid-generic 0003:413C:2002.0001: 
> input,hidraw0: USB HID v1.10 Keyboard [Dell Dell USB Keyboard Hub] on 
> usb-0000:00:14.0-10.1/input0
> Sep 13 09:20:23 localhost kernel: input: Dell Dell USB Keyboard Hub as 
> /devices/pci0000:00/0000:00:14.0/usb1/1-10/1-10.1/1-10.1:1.1/0003:413C:2002.0002/input/input4
> Sep 13 09:20:23 localhost kernel: usb 4-3: new low-speed USB device number 2 
> using ohci-pci
> ...
> Sep 13 09:20:23 localhost kernel: hid-generic 0003:413C:2002.0002: 
> input,hidraw1: USB HID v1.10 Device [Dell Dell USB Keyboard Hub] on 
> usb-0000:00:14.0-10.1/input1
> ...
> Sep 13 09:21:43 dom0 kernel: dcdbas dcdbas: Dell Systems Management Base 
> Driver (version 5.6.0-3.2)
> ...
> Sep 13 09:21:44 dom0 kernel: acpi PNP0C14:03: duplicate WMI GUID 
> 05901221-D566-11D1-B2F0-00A0C9062910 (first instance was on PNP0C14:00)
> Sep 13 09:21:44 dom0 kernel: wmi_bus wmi_bus-PNP0C14:04: WQBC data block 
> query control method not found
> Sep 13 09:21:44 dom0 kernel: acpi PNP0C14:04: duplicate WMI GUID 
> 05901221-D566-11D1-B2F0-00A0C9062910 (first instance was on PNP0C14:00)
> Sep 13 09:21:44 dom0 kernel: input: PC Speaker as 
> /devices/platform/pcspkr/input/input12
> Sep 13 09:21:44 dom0 kernel: input: Dell AIO WMI hotkeys as 
> /devices/virtual/input/input13
> ...
> Sep 13 09:21:44 dom0 kernel: dell-wmi 9DBB5994-A997-11DA-B012-B622A1EF5492: 
> Dell descriptor buffer has invalid buffer length (32768)
> Sep 13 09:21:44 dom0 kernel: dell-wmi 9DBB5994-A997-11DA-B012-B622A1EF5492: 
> Detected Dell WMI interface version 1
> Sep 13 09:21:44 dom0 kernel: input: Dell WMI hotkeys as 
> /devices/platform/PNP0C14:04/wmi_bus/wmi_bus-PNP0C14:04/9DBB5994-A997-11DA-B012-B622A1EF5492/input/input14
> Sep 13 09:21:44 dom0 systemd[1]: Found device /dev/disk/by-uuid/A482-5EDF.
> Sep 13 09:21:44 dom0 systemd-udevd[1677]: Error calling EVIOCSKEYCODE on 
> device node '/dev/input/event14' (scan code 0x150, key code 190): Invalid 
> argument
> ...
> Sep 13 09:21:49 dom0 kernel: snd_hda_codec_realtek hdaudioC0D0: Failed to 
> find dell wmi symbol dell_micmute_led_set
> ...
> Sep 13 09:22:00 dom0 kernel: input: HDA Intel PCH HDMI/DP,pcm=10 as 
> /devices/pci0000:00/0000:00:1f.3/sound/card0/input21
> Sep 13 09:22:43 dom0 systemd-udevd[1652]: seq 2961 
> '/devices/pci0000:00/0000:00:1d.2/0000:05:00.0/0000:06:00.0/usb4' is taking a 
> long time
> Sep 13 09:23:44 dom0 systemd[1]: systemd-udev-settle.service: Main process 
> exited, code=exited, status=1/FAILURE
> Sep 13 09:23:44 dom0 audit[1]: SERVICE_START pid=1 uid=0 auid=4294967295 
> ses=4294967295 msg='unit=systemd-udev-settle comm="systemd" 
> exe="/usr/lib/systemd/systemd" hostname=? addr=? terminal=? res=failed'
> Sep 13 09:23:44 dom0 systemd[1]: Failed to start udev Wait for Complete 
> Device Initialization.
> Sep 13 09:23:44 dom0 kernel: audit: type=1130 audit(1536848624.020:69): pid=1 
> uid=0 auid=4294967295 ses=4294967295 msg='unit=systemd-udev-settle 
> comm="systemd" exe="/usr/lib/systemd/systemd" hostname=? addr=? terminal=? 
> res=failed'
> Sep 13 09:23:44 dom0 systemd[1]: systemd-udev-settle.service: Unit entered 
> failed state.
> Sep 13 09:23:44 dom0 systemd[1]: systemd-udev-settle.service: Failed with 
> result 'exit-code'.
> ...
> Sep 13 09:23:44 dom0 systemd-logind[2748]: Watching system buttons on 
> /dev/input/event14 (Dell WMI hotkeys)
> Sep 13 09:23:44 dom0 systemd-logind[2748]: Watching system buttons on 
> /dev/input/event13 (Dell AIO WMI hotkeys)
> ...
> Sep 13 09:23:45 dom0 qmemman.systemstate[2758]: stat: xenfree=29212956823 
> memset_reqs=[]
> Sep 13 09:24:43 dom0 systemd-udevd[1652]: seq 2961 
> '/devices/pci0000:00/0000:00:1d.2/0000:05:00.0/0000:06:00.0/usb4' killed
> Sep 13 09:24:43 dom0 systemd-udevd[1652]: worker [1674] terminated by signal 
> 9 (Killed)
> Sep 13 09:24:43 dom0 systemd-udevd[1652]: worker [1674] failed while handling 
> '/devices/pci0000:00/0000:00:1d.2/0000:05:00.0/0000:06:00.0/usb4'
> #pci device 1d.2 is: 00:1d.2 PCI bridge: Intel Corporation Device a332 (rev 
> f0), which I imagine might be related to: 00:17.0 SATA controller: Intel 
> Corporation Device a352 (rev 10)
> 
> Sep 13 09:25:14 dom0 systemd[1]: qubesd.service: Start operation timed out. 
> Terminating.
> Sep 13 09:25:14 dom0 systemd[1]: Failed to start Qubes OS daemon.
> 
> 
> If this guy is correct https://bugs.freedesktop.org/show_bug.cgi?id=75875#c1 
> about `you can safely ignore these errors` then the following workaround 
> attempts are probably not going to make any difference(but hey, I tried):
> 
> 
> What's the output of `lsmod|grep -i wmi` ? I'm guessing there should be 
> something like `dell_wmi` which means it's possible to get it blacklisted (if 
> even temporarily), so with something like `modprobe.blacklist=dell_wmi` as a 
> kernel parameter (ie. cat /proc/cmdline) (or should it be dell-wmi ? that is, 
> with dash instead of underscore). There's a list of ways on how to do that 
> here https://askubuntu.com/questions/110341/how-to-blacklist-kernel-modules
> but if you're using UEFI, that would mean appending that to the `kernel=` 
> line (of the default= kernel) in /boot/efi/EFI/qubes/xen.cfg then rebooting; 
> however I do recommend having a working copy first, as a backup, just in case 
> your xen.cfg modifications render the system unbootable, for inspiration on 
> how to do that, maybe see from here: 
> https://groups.google.com/d/msg/qubes-users/CZ5vMNL_c7k/btiRvk9eBAAJ
> 
> If you don't want to blacklist that dell_wmi(guessing) module, or can't, then 
> maybe consider temporarily commenting out this whole block of lines:
> # Dell Latitude microphone mute
> evdev:name:Dell WMI hotkeys:dmi:bvn*:bvr*:bd*:svnDell*:pnLatitude*            
>                                                                               
>        
> # Dell Precision microphone mute
> evdev:name:Dell WMI hotkeys:dmi:bvn*:bvr*:bd*:svnDell*:pnPrecision*
>  KEYBOARD_KEY_150=f20                                   # Mic mute toggle, 
> should be micmute
> 
> in dom0 file: /usr/lib/udev/hwdb.d/60-keyboard.hwdb
> which would mean that some multimedia keys won't work but it should also not 
> stall your boot process. Inspiration for this solution is from: 
> https://ubuntuforums.org/showthread.php?t=2250210&p=13153308#post13153308
> (or maybe more/different lines need to be blacklisted? unsure)
> 
> Another solution would be to use a newer kernel (like 4.18.7 from the 
> unstable repo, see file /etc/yum.repos.d/qubes-dom0.repo if you want to 
> `enabled = 1` it, under section [qubes-dom0-unstable]), but before you do I 
> still recommend having another UEFI entry with the normal kernel(s) just in 
> case the new one will not boot at all (tho unlikely), so you can use your 
> BIOS boot menu to select between the two (tho be aware the currently running 
> one will be updated(xen.cfg -wise) when you `sudo qubes-dom0-update` to a 
> newer kernel). However I do note that your UEFI partition is only 200MiB so 
> you might need to only copy the running kernel (the one referenced by 
> `default=`) instead of everything, or you may not have enough space on it; 
> this only makes sense in this context: 
> https://groups.google.com/d/msg/qubes-users/CZ5vMNL_c7k/btiRvk9eBAAJ

Thanks awokd and Marcus!  The points you made got me to unplugging the 
non-working PS/2 keyboard and mouse on my computer (which plug into a PCI 
card).  I've rebooted 5 times since and have not run into an error.  So it 
looks like something about the PS/2 peripherals were causing the problem.  
Which of course leads to the next question of why does this cause problems and 
why don't the keyboard & mouse work.  Well, questions for another thread.

-- 
You received this message because you are subscribed to the Google Groups 
"qubes-users" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to qubes-users+unsubscr...@googlegroups.com.
To post to this group, send email to qubes-users@googlegroups.com.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/qubes-users/82f1145e-94cf-445f-a2ed-cf341fb12cd7%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

Reply via email to