Public bug reported:
Hello,
we run several hundred servers with Ubuntu 10.04 as virtualisation nodes
(with about 20-30 virtual machines each) with qemu-kvm 0.12.3 and had to
find out the hard way that some kernel regression was introduced in
linux-image-2.6.32-32-server that made our servers quite instable.
Once every few days they just froze randomly and showed nothing but a
black screen. We then changed back to linux-image-2.6.32-31-server and
got our systems stable again. Unfortunately we were not able to
reproduce this behaviour.
After booting a crashed machine, we found entries like the following in
syslog. Opposed to what it says, I'm quite sure that it's not a hardware
bug, as the same machines just run fine with 2.6.32-31 kernel.
We also tried several upstream kernels from 2.6.36, 2.6.37, 2.6.38 and
even 2.6.39 series - all with the same problem.
------------
mcelog: failed to prefill DIMM database from DMI data
mcelog: Kernel does not support page offline interface
mcelog: HARDWARE ERROR. This is *NOT* a software problem!
mcelog: Please contact your hardware vendor
mcelog: MCE 0
mcelog: CPU 0 BANK 5
mcelog: MISC 7fff ADDR 3fff81024ae8
mcelog: TIME 1310033176 Thu Jul 7 12:06:16 2011
mcelog: MCG status:
mcelog: MCi status:
mcelog: Error overflow
mcelog: Uncorrected error
mcelog: Error enabled
mcelog: MCi_MISC register valid
mcelog: MCi_ADDR register valid
mcelog: Processor context corrupt
mcelog: MCA: Internal Timer error
mcelog: STATUS fe00000000800400 MCGSTATUS 0
mcelog: MCGCAP 1c09 APICID 0 SOCKETID 0
mcelog: CPUID Vendor Intel Family 6 Model 26
mcelog: HARDWARE ERROR. This is *NOT* a software problem!
mcelog: Please contact your hardware vendor
mcelog: MCE 1
mcelog: CPU 1 BANK 5
mcelog: MISC 7fff ADDR 3fffa003b652
mcelog: TIME 1310033176 Thu Jul 7 12:06:16 2011
mcelog: MCG status:
mcelog: MCi status:
mcelog: Error overflow
mcelog: Uncorrected error
mcelog: Error enabled
mcelog: MCi_MISC register valid
mcelog: MCi_ADDR register valid
mcelog: Processor context corrupt
mcelog: MCA: Internal Timer error
mcelog: STATUS fe00000000800400 MCGSTATUS 0
mcelog: MCGCAP 1c09 APICID 2 SOCKETID 0
mcelog: CPUID Vendor Intel Family 6 Model 26
------------
Cheers,
David
ProblemType: Bug
DistroRelease: Ubuntu 10.04
Package: linux-image-2.6.32-32-server (not installed)
Regression: Yes
Reproducible: No
ProcVersionSignature: Ubuntu 2.6.32-32.62-server 2.6.32.38+drm33.16
Uname: Linux 2.6.32-32-server x86_64
NonfreeKernelModules: sch_htb xt_physdev xt_mac ib_iser rdma_cm ib_cm iw_cm
ib_sa ib_mad ib_core ib_addr iscsi_tcp libiscsi_tcp libiscsi
scsi_transport_iscsi ebtable_nat ebtables snd_hda_codec_atihdmi fbcon tileblit
font bitblit softcursor vga16fb vgastate kvm_intel kvm ip6table_filter
ip6_tables xt_tcpudp bridge nf_conntrack_ipv4 nf_defrag_ipv4 xt_state
nf_conntrack iptable_filter ip_tables snd_hda_intel x_tables radeon
snd_hda_codec stp ttm snd_hwdep drm_kms_helper snd_pcm snd_timer snd drm
i2c_algo_bit soundcore snd_page_alloc lp parport multipath linear 3w_9xxx
3w_xxxx raid10 raid456 async_pq async_xor xor async_memcpy async_raid6_recov
raid6_pq async_tx raid1 raid0 e1000 sata_nv ahci aacraid r8169 mii sata_sil
sata_via
AlsaVersion: Advanced Linux Sound Architecture Driver Version 1.0.21.
AplayDevices: Error: [Errno 2] No such file or directory
Architecture: amd64
ArecordDevices: Error: [Errno 2] No such file or directory
AudioDevicesInUse: Error: command ['fuser', '-v', '/dev/snd/by-path',
'/dev/snd/controlC0', '/dev/snd/hwC0D0', '/dev/snd/pcmC0D3p', '/dev/snd/timer']
failed with exit code 1:
CRDA: Error: [Errno 2] No such file or directory
Card0.Amixer.info: Error: [Errno 2] No such file or directory
Card0.Amixer.values: Error: [Errno 2] No such file or directory
Date: Tue Jul 12 14:02:14 2011
Frequency: Once every few days.
HibernationDevice: RESUME=UUID=27b2f2d3-0ec2-4f22-9f6d-c857b6830ab6
InstallationMedia:
IwConfig: Error: [Errno 2] No such file or directory
MachineType: MSI MS-7522
ProcCmdLine: root=/dev/mapper/vg0-root ro
ProcEnviron:
LANG=en_US.UTF-8
SHELL=/bin/bash
RelatedPackageVersions: linux-firmware 1.34.7
RfKill: Error: [Errno 2] No such file or directory
SourcePackage: linux
WifiSyslog:
dmi.bios.date: 11/02/2010
dmi.bios.vendor: American Megatrends Inc.
dmi.bios.version: V8.14
dmi.board.asset.tag: To Be Filled By O.E.M.
dmi.board.name: MSI X58 Pro-E (MS-7522)
dmi.board.vendor: MSI
dmi.board.version: 3.0
dmi.chassis.asset.tag: To Be Filled By O.E.M.
dmi.chassis.type: 3
dmi.chassis.vendor: MICRO-STAR INTERNATIONAL CO.,LTD
dmi.chassis.version: 3.0
dmi.modalias:
dmi:bvnAmericanMegatrendsInc.:bvrV8.14:bd11/02/2010:svnMSI:pnMS-7522:pvr3.0:rvnMSI:rnMSIX58Pro-E(MS-7522):rvr3.0:cvnMICRO-STARINTERNATIONALCO.,LTD:ct3:cvr3.0:
dmi.product.name: MS-7522
dmi.product.version: 3.0
dmi.sys.vendor: MSI
** Affects: linux (Ubuntu)
Importance: Undecided
Status: New
** Tags: amd64 apport-bug lucid needs-upstream-testing regression-update
--
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/809313
Title:
mcelog errors and server freeze with qemu-kvm 0.12.3 and linux-
image-2.6.32-32-server
To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/809313/+subscriptions
--
ubuntu-bugs mailing list
[email protected]
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs