Your message dated Tue, 03 Jul 2012 22:54:49 +0400 with message-id <[email protected]> and subject line Re: Bug#637675: qemu-kvm: 2.6.32 kernels don't boot 1/3 of times has caused the Debian Bug report #637675, regarding squeeze 2.6.32 smp guest don't boot randomly to be marked as done.
This means that you claim that the problem has been dealt with. If this is not the case it is now your responsibility to reopen the Bug report if necessary, and/or fix the problem forthwith.

(NB: If you are a system administrator and have no idea what this message is talking about, this may indicate a serious mail system misconfiguration somewhere. Please contact [email protected] immediately.)

--
637675: http://bugs.debian.org/cgi-bin/bugreport.cgi?bug=637675
Debian Bug Tracking System
Contact [email protected] with problems
--- Begin Message ---
Package: qemu-kvm
Version: 0.12.5+dfsg-5+squeeze3
Severity: important

When I try to boot multiple 2.6.32 kernels in KVM, about one third of the time they fail to boot. They get stuck at 'Loading, please wait...', leaving CPU usage at 0% and not showing any message.

I think the IDE interface is not initialized, so the root fs cannot be mounted, but no message says so. I have tried removing the 'quiet' parameter to see more messages, but nothing special shows up (it prints some messages about SCSI, but my machine uses IDE, so that is nothing special).

To boot it, I have to run virsh destroy && virsh start until it finally boots.

Guest kernel (always 2.6.32) combinations that I've tried:
- Lenny + 2.6.32 from backports, installed in LVM
- Squeeze with stock kernel, installed in LVM
- Squeeze with stock kernel, installed on /dev/sda
- Squeeze with custom latest longterm from kernel.org, installed on /dev/sda

On the host I use a plain image file; I don't have obscure storage configurations. I'm using an AMD desktop machine with hardware virtualization support, running an up-to-date Squeeze (stock amd64 kernel).

I attach the machine configuration file and a VNC screenshot with the last message it shows (I don't know how to get a text version).

Thank you in advance.
Best regards.

-- Package-specific info:

/proc/cpuinfo:

processor       : 0
vendor_id       : AuthenticAMD
cpu family      : 15
model           : 107
model name      : AMD Athlon(tm) 64 X2 Dual Core Processor 4000+
stepping        : 1
cpu MHz         : 1000.000
cache size      : 512 KB
physical id     : 0
siblings        : 2
core id         : 0
cpu cores       : 2
apicid          : 0
initial apicid  : 0
fpu             : yes
fpu_exception   : yes
cpuid level     : 1
wp              : yes
flags           : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ht syscall nx mmxext fxsr_opt rdtscp lm 3dnowext 3dnow rep_good extd_apicid pni cx16 lahf_lm cmp_legacy svm extapic cr8_legacy 3dnowprefetch
bogomips        : 2009.10
TLB size        : 1024 4K pages
clflush size    : 64
cache_alignment : 64
address sizes   : 40 bits physical, 48 bits virtual
power management: ts fid vid ttp tm stc 100mhzsteps

processor       : 1
vendor_id       : AuthenticAMD
cpu family      : 15
model           : 107
model name      : AMD Athlon(tm) 64 X2 Dual Core Processor 4000+
stepping        : 1
cpu MHz         : 1000.000
cache size      : 512 KB
physical id     : 0
siblings        : 2
core id         : 1
cpu cores       : 2
apicid          : 1
initial apicid  : 1
fpu             : yes
fpu_exception   : yes
cpuid level     : 1
wp              : yes
flags           : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ht syscall nx mmxext fxsr_opt rdtscp lm 3dnowext 3dnow rep_good extd_apicid pni cx16 lahf_lm cmp_legacy svm extapic cr8_legacy 3dnowprefetch
bogomips        : 2009.10
TLB size        : 1024 4K pages
clflush size    : 64
cache_alignment : 64
address sizes   : 40 bits physical, 48 bits virtual
power management: ts fid vid ttp tm stc 100mhzsteps

-- System Information:
Debian Release: 6.0.2
  APT prefers stable
  APT policy: (500, 'stable')
Architecture: amd64 (x86_64)

Kernel: Linux 2.6.32-5-amd64 (SMP w/2 CPU cores)
Locale: LANG=es_ES.utf8, LC_CTYPE=es_ES.utf8 (charmap=UTF-8)
Shell: /bin/sh linked to /bin/bash

Versions of packages qemu-kvm depends on:
ii  adduser          3.112+nmu2         add and remove users and groups
ii  bridge-utils     1.4-5              Utilities for configuring the Linu
ii  iproute          20100519-3         networking and traffic control too
ii  libaio1          0.3.107-7          Linux kernel AIO access library -
ii  libasound2       1.0.23-2.1         shared library for ALSA applicatio
ii  libbluetooth3    4.66-3             Library to use the BlueZ Linux Blu
ii  libbrlapi0.5     4.2-7              braille display access via BRLTTY
ii  libc6            2.11.2-10          Embedded GNU C Library: Shared lib
ii  libcurl3-gnutls  7.21.0-1           Multi-protocol file transfer libra
ii  libgnutls26      2.8.6-1            the GNU TLS library - runtime libr
ii  libncurses5      5.7+20100313-5     shared libraries for terminal hand
ii  libpci3          1:3.1.7-6          Linux PCI Utilities (shared librar
ii  libpulse0        0.9.21-3+squeeze1  PulseAudio client libraries
ii  libsasl2-2       2.1.23.dfsg1-7     Cyrus SASL - authentication abstra
ii  libsdl1.2debian  1.2.14-6.1         Simple DirectMedia Layer
ii  libuuid1         2.17.2-9           Universally Unique ID library
ii  libvdeplug2      2.2.3-3            Virtual Distributed Ethernet - Plu
ii  libx11-6         2:1.3.3-4          X11 client-side library
ii  python           2.6.6-3+squeeze6   interactive high-level object-orie
ii  zlib1g           1:1.2.3.4.dfsg-3   compression library - runtime

Versions of packages qemu-kvm recommends:
ii  linux-image-2.6.32-5-amd64 [l  2.6.32-35  Linux 2.6.32 for 64-bit PCs

Versions of packages qemu-kvm suggests:
ii  debootstrap      1.0.26+squeeze1    Bootstrap a basic Debian system
pn  samba            <none>             (no description available)
ii  vde2             2.2.3-3            Virtual Distributed Ethernet

<domain type='kvm'>
  <name>virtualm</name>
  <uuid>36df1bad-d640-e4f2-3084-bc9ab4b3aa56</uuid>
  <memory>629600</memory>
  <currentMemory>629408</currentMemory>
  <vcpu>2</vcpu>
  <os>
    <type arch='x86_64' machine='pc-0.12'>hvm</type>
    <boot dev='hd'/>
  </os>
  <features>
    <acpi/>
    <apic/>
    <pae/>
  </features>
  <clock offset='utc'/>
  <on_poweroff>destroy</on_poweroff>
  <on_reboot>restart</on_reboot>
  <on_crash>restart</on_crash>
  <devices>
    <emulator>/usr/bin/kvm</emulator>
    <disk type='file' device='disk'>
      <driver name='qemu' type='raw'/>
      <source file='/etc/libvirt/qemu/virtualm_kvm.img'/>
      <target dev='hda' bus='ide'/>
      <address type='drive' controller='0' bus='0' unit='0'/>
    </disk>
    <controller type='ide' index='0'>
      <address type='pci' domain='0x0000' bus='0x00' slot='0x01' function='0x1'/>
    </controller>
    <interface type='bridge'>
      <mac address='52:54:00:b4:ab:0b'/>
      <source bridge='br0'/>
      <model type='e1000'/>
      <address type='pci' domain='0x0000' bus='0x00' slot='0x03' function='0x0'/>
    </interface>
    <serial type='pty'>
      <target port='0'/>
    </serial>
    <console type='pty'>
      <target type='serial' port='0'/>
    </console>
    <input type='tablet' bus='usb'/>
    <input type='mouse' bus='ps2'/>
    <graphics type='vnc' port='-1' autoport='yes' keymap='es'/>
    <video>
      <model type='cirrus' vram='9216' heads='1'/>
      <address type='pci' domain='0x0000' bus='0x00' slot='0x02' function='0x0'/>
    </video>
    <memballoon model='virtio'>
      <address type='pci' domain='0x0000' bus='0x00' slot='0x04' function='0x0'/>
    </memballoon>
  </devices>
</domain>

<<attachment: loading_vnc.png>>
--- End Message ---
--- Begin Message ---
Version: 0.13.0+dfsg-1

[Acting on old bugreports]

On 21.01.2012 12:59, Michael Tokarev wrote:
> retitle 637675 squeeze 2.6.32 smp guest don't boot randomly
> thanks
>
> On 15.08.2011 18:36, Miguel Mismo wrote:
>> --- On Mon, 15/8/11, Michael Tokarev <[email protected]> wrote:
>>
>>> Ok, that gives us something as a starting point. Now to find
>>> out what breaks when you start it from libvirt.
>>> Several vCPUs?
>>
>> Yes, I repeated the testing process, and the problem appears to arise
>> when, in the machine's xml, I increase
>>
>>   <vcpu>1</vcpu>
>>
>> to
>>
>>   <vcpu>2</vcpu>
>>
>> When I use only one virtual CPU, also with libvirt, it always boots
>> correctly.
>>
>> If I use several of them, then it fails to boot as reported in the bug.
>
> Please excuse me for the long delay in replying to this.
>
> I retitled the bug to include "smp" to reflect the
> fact that it only happens with smp guests. This is
> very important.
>
> In the initial report you mentioned:
>
>> They get stuck at 'Loading, please wait...', leaving
>> CPU usage at 0% and non showing any message.
>
> In order to debug this, you may enable boot messages.
> For this, while in the grub menu, hit "e" (for Edit) at
> the entry you're about to boot, scroll to the line starting
> with "linux" - it should end with the word "quiet", or
> include that word somewhere -- remove this word and
> hit Ctrl+x to boot the modified entry. After this, the
> kernel will show all boot messages on the screen, and
> you'll be able to see where it hangs.
>
> I suspect it is hanging in timer-related code --
> there were several bugs fixed in 2.6.32 kernels related
> to "virtual" timers...
>
>> Ok, in stock squeeze guest it is:
>>
>> ii linux-image-2.6-amd64 2.6.32+29
>> Linux 2.6 for 64-bit PCs (meta-package)
>> ii linux-image-2.6.32-5-amd64 2.6.32-35
>> Linux 2.6.32 for 64-bit PCs
>
> ..but you said your guest kernel is quite recent
> (2.6.32-35 is new enough, which should include all
> required fixes). The same kernel is used on the host
> as well, as far as I can see -- 2.6.32-35.
>
> So, I'm not really sure what to do here. Can you
> check if the problem still exists with current
> kernels as well?
>
> One more thing I noticed: you're using an older AMD
> CPU -- "AMD Athlon(tm) 64 X2 Dual Core Processor
> 4000+" -- that's their first-generation X2-64 which
> does not have a synced TSC (this is the time stamp
> counter register in the CPU, and in these models it
> shows different values for different cores).
>
> Do you have cpu frequency scaling enabled on the
> host? If yes, it is worth trying to disable it.
>
> Anyway, please boot in non-quiet mode and see
> where it hangs. From this point it will be more
> clear what's going on.
>
> One additional tip: you can save boot messages in
> a file by adding:
>
>   -serial file:/some/where/file/name
>
> to the qemu-kvm command line, and adding
>
>   console=ttyS0 console=tty1
>
> to the linux line in grub (the same line where you
> want to remove the "quiet" parameter). Yes, these are
> 2 consoles, to show messages in two places - on the regular
> guest screen and on the serial line.

I guess this has been fixed in a more recent host or guest kernel or qemu-kvm. Since you haven't provided any more information for about half a year, I'm closing this bugreport now, using a somewhat arbitrary version number.

If you still think the issue is present, let's reopen it and actually work to resolve it.

Thank you!

/mjt
--- End Message ---
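
The boot-message tip in the reply above (drop "quiet" from the grub "linux" line and add the two console= parameters) can be sketched as a small shell helper. This is only an illustration, not something from the bug report: the function name verbose_cmdline and the example kernel path are hypothetical, and the helper only rewrites a string, it does not touch grub itself. Capturing the serial output on the host still needs the -serial file:... option quoted in the reply.

```shell
# Hypothetical helper (not part of the bug report): take a grub "linux"
# line, drop the standalone word "quiet", and append the two consoles
# suggested in the reply (serial line plus regular guest screen).
verbose_cmdline() (
    set -f                  # no globbing while splitting the line into words
    out=""
    for word in $1; do      # intentional word splitting on whitespace
        [ "$word" = "quiet" ] && continue
        out="${out:+$out }$word"
    done
    printf '%s console=ttyS0 console=tty1\n' "$out"
)

# Example call; the kernel path and root device are made up for illustration.
verbose_cmdline 'linux /boot/vmlinuz-2.6.32-5-amd64 root=/dev/sda1 ro quiet'
```

The function body runs in a subshell, so the set -f and the temporary variables do not leak into the calling shell.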

