Hi Pluggers, I assumed this is a Xeon processor.
It could be a failure processor and affects the channels of memory banks that link to it. This is the difference of Xeon to Itanium. If Xeon proc fails or one of the core fails server halt and it won't start. Regards, Boker ________________________________________ From: plug-boun...@lists.linux.org.ph [plug-boun...@lists.linux.org.ph] on behalf of plug-requ...@lists.linux.org.ph [plug-requ...@lists.linux.org.ph] Sent: Tuesday, October 25, 2016 12:00 PM To: plug@lists.linux.org.ph Subject: PLUG Digest, Vol 130, Issue 2 Send PLUG mailing list submissions to plug@lists.linux.org.ph To subscribe or unsubscribe via the World Wide Web, visit http://lists.linux.org.ph/mailman/listinfo/plug or, via email, send a message with subject or body 'help' to plug-requ...@lists.linux.org.ph You can reach the person managing the list at plug-ow...@lists.linux.org.ph When replying, please edit your Subject line so it is more specific than "Re: Contents of PLUG digest..." Today's Topics: 1. decoding further a Machine Check Excepton (Michael Tinsay) ---------------------------------------------------------------------- Message: 1 Date: Tue, 25 Oct 2016 01:31:38 +0000 (UTC) From: Michael Tinsay <tinsa...@yahoo.com> To: "Philippine Linux Users' Group (PLUG) Technical Discussion List" <plug@lists.linux.org.ph> Subject: [plug] decoding further a Machine Check Excepton Message-ID: <1193330576.64910.1477359098...@mail.yahoo.com> Content-Type: text/plain; charset="utf-8" Hi! Yesterday one of our servers had this on the console: [ 1184.087973] mce: [Hardware Error]: CPU 0: Machine Check Exception: 4 Bank 8: ba000000000000b2[ 1184.087973] mce: [Hardware Error]: TSC 3a3965b65c0 MISC 80000[ 1184.087973] mce: [Hardware Error]: PROCESSOR 0:206c2 TIME 1477301538 SOCKET 0 APIC 0 microcode 2[ 1184.087973] mce: [Hardware Error]: Machine check: Processor context corrupt So I did some research and found out that I can use an app named mcelog to decode this. ?This was the output from it: Hardware event. This is not a software error.CPU 0 BANK 8 TSC 3a3965b65c0?MISC 80000?TIME 1477301538 Mon Oct 24 17:32:18 2016MCG status:MCIP?MCi status:Uncorrected errorError enabledMCi_MISC register validProcessor context corruptMCA: MEMORY CONTROLLER AC_CHANNEL2_ERRTransaction: Address/Command errorMemory corrected error count (CORE_ERR_CNT): 0Memory transaction Tracker ID (RTId): 0Memory DIMM ID of error: 0Memory channel ID of error: 2Memory ECC syndrome: 0STATUS ba000000000000b2 MCGSTATUS 4CPUID Vendor Intel Family 6 Model 44SOCKET 0 APIC 0 microcode 2tinsaymc@IT-046641:~$ ?cat mce.txtCPU 0: Machine Check Exception: 4 Bank 8: ba000000000000b2TSC 3a3965b65c0 MISC 80000PROCESSOR 0:206c2 TIME 1477301538 SOCKET 0 APIC 0 microcode 2 So my question now, for those who know more about this area than I, is: ?Is the exception due to a problem in the CPU itself or somewhere on the motherboard? Regards. --- mike t. -------------- next part -------------- An HTML attachment was scrubbed... URL: <http://lists.linux.org.ph/mailman/private/plug/attachments/20161025/9f7ecb76/attachment.html> ------------------------------ _________________________________________________ Philippine Linux Users' Group (PLUG) Mailing List http://lists.linux.org.ph/mailman/listinfo/plug Searchable Archives: http://archives.free.net.ph End of PLUG Digest, Vol 130, Issue 2 ************************************ _________________________________________________ Philippine Linux Users' Group (PLUG) Mailing List http://lists.linux.org.ph/mailman/listinfo/plug Searchable Archives: http://archives.free.net.ph