Package: mcelog
Version: 1.0~pre3-72-gcbd4da4-1
Severity: important
Dear Maintainer,
after upgrading from Squeeze to Wheezy, I am no longer informed about memory
ECC errors.
*** Please consider answering these questions, where appropriate ***
* What led up to the situation?
Upgrade/fresh install of Wheezy
* What exactly did you do (or not do) that was effective (or
ineffective)?
I was checking the configuration, and even did a reinstall of the mcelog package
since I had a "logfile=..." line appended to /etc/mcelog/mcelog.conf originally.
* What was the outcome of this action?
root@o011:~# debsums mcelog
/usr/sbin/mcelog OK
/usr/share/doc/mcelog/NEWS.Debian.gz OK
/usr/share/doc/mcelog/README.Debian OK
/usr/share/doc/mcelog/README.gz OK
/usr/share/doc/mcelog/changelog.Debian.gz OK
/usr/share/doc/mcelog/changelog.gz OK
/usr/share/doc/mcelog/copyright OK
/usr/share/man/man8/mcelog.8.gz OK
root@o011:~# apt-get install --reinstall mcelog
Reading package lists... Done
Building dependency tree
Reading state information... Done
0 upgraded, 0 newly installed, 1 reinstalled, 0 to remove and 0 not upgraded.
Need to get 62.9 kB of archives.
After this operation, 0 B of additional disk space will be used.
Get:1 http://10.100.200.98/o/debian/ wheezy/main mcelog amd64
1.0~pre3-72-gcbd4da4-1 [62.9 kB]
Fetched 62.9 kB in 0s (0 B/s)
(Reading database ... 101643 files and directories currently installed.)
Preparing to replace mcelog 1.0~pre3-72-gcbd4da4-1 (using
.../mcelog_1.0~pre3-72-gcbd4da4-1_amd64.deb) ...
Stopping Machine Check Exceptions decoder: mcelog.
Unpacking replacement mcelog ...
Processing triggers for man-db ...
Setting up mcelog (1.0~pre3-72-gcbd4da4-1) ...
Starting Machine Check Exceptions decoder: mcelog.
root@o011:~# ps auxw | grep mcelog
root 15443 0.0 0.0 4884 324 ? Ss 13:32 0:00
/usr/sbin/mcelog --daemon
root 15452 0.0 0.0 7832 836 pts/0 S+ 13:32 0:00 grep mcelog
root@o011:~# find /etc -name mcelog\*
/etc/default/mcelog
/etc/mcelog
/etc/mcelog/mcelog.conf
/etc/init.d/mcelog
root@o011:~# lsmod | grep edac
amd64_edac_mod 22334 0
edac_mce_amd 17103 1 amd64_edac_mod
edac_core 35258 3 amd64_edac_mod
ECC errors are reported to the syslog as
Feb 26 13:35:24 o011 kernel: [79200.804037] [Hardware Error]: CPU:1
MC0_STATUS[Over|CE|-|-|AddrV|CECC]: 0xd458400000000833
Feb 26 13:35:24 o011 kernel: [79200.804286] [Hardware Error]: MC0_ADDR:
0x00000000370fb080
Feb 26 13:35:24 o011 kernel: [79200.804380] [Hardware Error]: Data Cache Error:
during system linefill.
Feb 26 13:35:24 o011 kernel: [79200.804537] [Hardware Error]: cache level:
L3/GEN, mem/io: MEM, mem-tx: DRD, part-proc: SRC (no timeout)
Feb 26 13:35:24 o011 kernel: [79200.804899] [Hardware Error]: CPU:1
MC1_STATUS[Over|CE|-|-|AddrV|CECC]: 0xd400400000000853
Feb 26 13:35:24 o011 kernel: [79200.805138] [Hardware Error]: MC1_ADDR:
0x00000000012396c0
Feb 26 13:35:24 o011 kernel: [79200.805230] [Hardware Error]: Instruction Cache
Error: during system linefill.
Feb 26 13:35:24 o011 kernel: [79200.805403] [Hardware Error]: cache level:
L3/GEN, mem/io: MEM, mem-tx: IRD, part-proc: SRC (no timeout)
Feb 26 13:35:24 o011 kernel: [79200.805764] [Hardware Error]: CPU:1
MC2_STATUS[Over|CE|-|-|-|CECC]: 0xd000400000000863
Feb 26 13:35:24 o011 kernel: [79200.805999] [Hardware Error]: Bus Unit Error:
PRF/ECC error in data read from NB: SRC.
Feb 26 13:35:24 o011 kernel: [79200.806174] [Hardware Error]: cache level:
L3/GEN, mem/io: MEM, mem-tx: PRF, part-proc: SRC (no timeout)
but "mcelog --client" doesn't return anything.
Probably related warning when restarting the mcelog daemon:
root@o011:~# grep mcelog /var/log/syslog
Feb 26 13:23:56 o011 mcelog: failed to prefill DIMM database from DMI data
Feb 26 13:32:42 o011 mcelog: failed to prefill DIMM database from DMI data
* What outcome did you expect instead?
The manual pages suggest that "mcelog --client" would read a memory buffer
filled with error messages.
Although memory errors happened, the return of this command is empty.
Information:
Debian Release: 7.4
APT prefers stable
APT policy: (500, 'stable')
Architecture: amd64 (x86_64)
Kernel: Linux 3.2.0-4-amd64 (SMP w/2 CPU cores)
Locale: LANG=C, LC_CTYPE=C (charmap=ANSI_X3.4-1968)
Shell: /bin/sh linked to /bin/dash
Versions of packages mcelog depends on:
ii debconf [debconf-2.0] 1.5.49
ii libc6 2.13-38+deb7u1
ii udev 175-7.2
mcelog recommends no packages.
mcelog suggests no packages.
-- Configuration Files:
-- no debconf information
--
To UNSUBSCRIBE, email to [email protected]
with a subject of "unsubscribe". Trouble? Contact [email protected]