Here is output from MCE log, for that one in ML

mare...@queeg:~$ echo CPU 0: Machine Check Exception: 4 Bank 4: fe28a001fd080813 TSC 2eefd49369 ADDR f0050 MISC c0090e7e00000000 | /usr/sbin/mcelog --ascii --k8
mcelog: Cannot open /dev/mem for DMI decoding: Permission denied
HARDWARE ERROR. This is *NOT* a software problem!
Please contact your hardware vendor
CPU 0 4 northbridge   Northbridge RAM Chipkill ECC error
  Chipkill ECC syndrome = fd51
       bit32 = err cpu0
       bit45 = uncorrected ecc error
       bit57 = processor context corrupt
       bit59 = misc error valid
       bit61 = error uncorrected
       bit62 = error overflow (multiple errors)
  bus error 'local node origin, request didn't time out
      generic read mem transaction
      memory access, level generic'
STATUS fe28a001fd080813 MCGSTATUS 4


And from Ward:
CPU 0 4 northbridge   Northbridge RAM Chipkill ECC error
  Chipkill ECC syndrome = fd51
       bit32 = err cpu0
       bit45 = uncorrected ecc error
       bit57 = processor context corrupt
       bit59 = misc error valid
       bit61 = error uncorrected
       bit62 = error overflow (multiple errors)
  bus error 'local node origin, request didn't time out
      generic read mem transaction
      memory access, level generic'
STATUS fe28a001fd080813 MCGSTATUS 4

Seems something went wrong with ECC. Maybe because the memory is not cleared anymore?

Can someone test if http://tracker.coreboot.org/trac/coreboot/changeset/4099/trunk/coreboot-v2/src/cpu/amd/car/clear_init_ram.c

this change is reverted problem goes away?

Rudolf

--
coreboot mailing list: [email protected]
http://www.coreboot.org/mailman/listinfo/coreboot

Reply via email to