On Sunday 29 Mar 2015 16:42:10 Sebas Pedersen wrote:
> On 28-03-2015 08:50 PM, Mick wrote:
> > On Saturday 28 Mar 2015 22:48:48 Sebas Pedersen wrote:
> >> On 28-03-2015 07:37 PM, Volker Armin Hemmann wrote:
> >> > Am 28.03.2015 um 23:00 schrieb Sebas Pedersen:
> >> >> On 28-03-2015 06:45 PM, Volker Armin Hemmann wrote:
> >> >>> Am 28.03.2015 um 14:58 schrieb Sebas Pedersen:
> >> >>>> Hi guys,
> >> >>>> 
> >> >>>> From a few days ago I am experimenting an MCE error.
> >> >>>> Sometimes I turn on the computer and at some point while booting
> >> >>>> the kernel (after the grub menu) just freezes and puts this:
> >> >>>> 
> >> >>>> CPU 0: Machine Check Exception: 4 Bank 4: b200000000070f0f
> >> >>>> TSC f5acc9180
> >> >>>> PROCESSOR 2:20fc2 TIME 1427486735 SOCKET 0 APIC 0 microcode 0
> >> >>>> 
> >> >>>> the number for TSC may vary, but the b200000000070f0f it's always
> >> >>>> the
> >> >>>> same (at least for now). The error message suggest to parse the
> >> >>>> above
> >> >>>> error with mcelog. I did that and the result was:
> >> >>>> 
> >> >>>> Hardware event. This is not a software error.
> >> >>>> CPU 0 4 northbridge TSC f5acc9180
> >> >>>> TIME 1427486735 Fri Mar 27 17:05:35 2015
> >> >>>> 
> >> >>>>   Northbridge Watchdog error
> >> >>>>   
> >> >>>>        bit57 = processor context corrupt
> >> >>>>        bit61 = error uncorrected
> >> >>>>   
> >> >>>>   bus error 'generic participation, request timed out
> >> >>>>   
> >> >>>>              generic error mem transaction
> >> >>>>              generic access, level generic'
> >> >>>> 
> >> >>>> STATUS b200000000070f0f MCGSTATUS 4
> >> >>>> CPUID Vendor AMD Family 15 Model 44
> >> >>>> SOCKET 0 APIC 0 microcode 0
> >> >>>> 
> >> >>>> The error suggest it's a hardware problem. I replace de RAM with no
> >> >>>> luck. Same error keeps happening.
> >> >>>> 
> >> >>>> Any suggestion for identifying the problem or how to procede?
> >> >>>> 
> >> >>>> Many thanks in advance!
> >> >>>> 
> >> >>>> Sebas
> >> >>> 
> >> >>> bios update/microcode update. A google search suggests that you have
> >> >>> run
> >> >>> into an errata.
> >> >> 
> >> >> Oh OK, thank you. Must have miss that in the search. So you are
> >> >> saying that the error comes from a bios errata (and don't know what
> >> >> microdode is), and the fix is to update bios?
> >> > 
> >> > no, possibly from a CPU errata and a bios update might bring in the
> >> > microcode update that works around that.
> >> 
> >> I see, thanks for clarifying that. So looks like not too many options,
> >> either try to update the bios and/or replace the CPU.
> >> 
> >> I really appreciated you replys and time.
> >> 
> >> Thanks!,
> >> Sebas
> > 
> > There's 'CONFIG_MICROCODE=y' and friends in the kernel which along with
> > sys-
> > apps/microcode-ctl will load what ever is the latest Intel/AMD CPU code
> > (firmware) to patch any bugs with instructions that the CPU
> > manufacturers have
> > discovered.
> 
> That's nice. I'm gonna compile the kernel and see what happends.
> 
> Many thanks!

Don't forget to enable the relevant module for your type of CPU.
-- 
Regards,
Mick

Attachment: signature.asc
Description: This is a digitally signed message part.

Reply via email to