Hi Pluggers,

I assumed this is a Xeon processor. 

It could be a failure processor and affects the channels  of memory banks that 
link to it. This is the difference of Xeon to Itanium. If Xeon proc fails or 
one of the core fails server halt and it won't  start.

Regards,
Boker

________________________________________
From: plug-boun...@lists.linux.org.ph [plug-boun...@lists.linux.org.ph] on 
behalf of plug-requ...@lists.linux.org.ph [plug-requ...@lists.linux.org.ph]
Sent: Tuesday, October 25, 2016 12:00 PM
To: plug@lists.linux.org.ph
Subject: PLUG Digest, Vol 130, Issue 2

Send PLUG mailing list submissions to
        plug@lists.linux.org.ph

To subscribe or unsubscribe via the World Wide Web, visit
        http://lists.linux.org.ph/mailman/listinfo/plug
or, via email, send a message with subject or body 'help' to
        plug-requ...@lists.linux.org.ph

You can reach the person managing the list at
        plug-ow...@lists.linux.org.ph

When replying, please edit your Subject line so it is more specific
than "Re: Contents of PLUG digest..."


Today's Topics:

   1. decoding further a Machine Check Excepton (Michael Tinsay)


----------------------------------------------------------------------

Message: 1
Date: Tue, 25 Oct 2016 01:31:38 +0000 (UTC)
From: Michael Tinsay <tinsa...@yahoo.com>
To: "Philippine Linux Users' Group (PLUG) Technical Discussion List"
        <plug@lists.linux.org.ph>
Subject: [plug] decoding further a Machine Check Excepton
Message-ID: <1193330576.64910.1477359098...@mail.yahoo.com>
Content-Type: text/plain; charset="utf-8"

Hi!
Yesterday one of our servers had this on the console:
[ 1184.087973] mce: [Hardware Error]: CPU 0: Machine Check Exception: 4 Bank 8: 
ba000000000000b2[ 1184.087973] mce: [Hardware Error]: TSC 3a3965b65c0 MISC 
80000[ 1184.087973] mce: [Hardware Error]: PROCESSOR 0:206c2 TIME 1477301538 
SOCKET 0 APIC 0 microcode 2[ 1184.087973] mce: [Hardware Error]: Machine check: 
Processor context corrupt
So I did some research and found out that I can use an app named mcelog to 
decode this. ?This was the output from it:
Hardware event. This is not a software error.CPU 0 BANK 8 TSC 3a3965b65c0?MISC 
80000?TIME 1477301538 Mon Oct 24 17:32:18 2016MCG status:MCIP?MCi 
status:Uncorrected errorError enabledMCi_MISC register validProcessor context 
corruptMCA: MEMORY CONTROLLER AC_CHANNEL2_ERRTransaction: Address/Command 
errorMemory corrected error count (CORE_ERR_CNT): 0Memory transaction Tracker 
ID (RTId): 0Memory DIMM ID of error: 0Memory channel ID of error: 2Memory ECC 
syndrome: 0STATUS ba000000000000b2 MCGSTATUS 4CPUID Vendor Intel Family 6 Model 
44SOCKET 0 APIC 0 microcode 2tinsaymc@IT-046641:~$ ?cat mce.txtCPU 0: Machine 
Check Exception: 4 Bank 8: ba000000000000b2TSC 3a3965b65c0 MISC 80000PROCESSOR 
0:206c2 TIME 1477301538 SOCKET 0 APIC 0 microcode 2
So my question now, for those who know more about this area than I, is: ?Is the 
exception due to a problem in the CPU itself or somewhere on the motherboard?
Regards.

--- mike t.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 
<http://lists.linux.org.ph/mailman/private/plug/attachments/20161025/9f7ecb76/attachment.html>

------------------------------

_________________________________________________
Philippine Linux Users' Group (PLUG) Mailing List
http://lists.linux.org.ph/mailman/listinfo/plug
Searchable Archives: http://archives.free.net.ph

End of PLUG Digest, Vol 130, Issue 2
************************************
_________________________________________________
Philippine Linux Users' Group (PLUG) Mailing List
http://lists.linux.org.ph/mailman/listinfo/plug
Searchable Archives: http://archives.free.net.ph

Reply via email to