[CentOS] kernel: Machine check events logged

2010-07-07 Thread Alexander Farber
Hello, every few hours I get the following message in /var/log/message: Jul 5 20:23:28 hXXX kernel: Machine check events logged Jul 5 20:53:28 hXXX kernel: Machine check events logged Jul 5 22:13:28 hXXX kernel: Machine check events logged Jul 5 23:53:28 hXXX kernel: Machine check events

Re: [CentOS] kernel: Machine check events logged

2010-07-07 Thread m . roth
Alexander Farber wrote: Hello, every few hours I get the following message in /var/log/message: Jul 5 20:23:28 hXXX kernel: Machine check events logged snip And in the /var/log/mcelog I see: MCE 0 HARDWARE ERROR. This is *NOT* a software problem! Please contact your hardware vendor CPU

Re: [CentOS] kernel: Machine check events logged

2010-07-07 Thread Alexander Farber
Hello Mark, On Wed, Jul 7, 2010 at 2:51 PM, m.r...@5-cent.us wrote: First, this is *very* bad - I'm not good enough on this to tell you if it's the CPU, or the motherboard, but it's one of the two, *not* just memory. Second, if you're paying for hosting, and it's *their* server, you need to

Re: [CentOS] kernel: Machine check events logged

2010-07-07 Thread m . roth
Alexander Farber wrote: Hello Mark, On Wed, Jul 7, 2010 at 2:51 PM, m.r...@5-cent.us wrote: First, this is *very* bad - I'm not good enough on this to tell you if it's the CPU, or the motherboard, but it's one of the two, *not* just memory. Second, if you're paying for hosting, and it's

Re: [CentOS] kernel: Machine check events logged

2010-07-07 Thread Peter Kjellstrom
On Wednesday 07 July 2010, m.r...@5-cent.us wrote: Alexander Farber wrote: every few hours I get the following message in /var/log/message: Jul 5 20:23:28 hXXX kernel: Machine check events logged ... MCE 0 HARDWARE ERROR. This is *NOT* a software problem! Please contact your hardware

Re: [CentOS] kernel: Machine check events logged

2010-07-07 Thread m . roth
Peter Kjellstrom wrote: On Wednesday 07 July 2010, m.r...@5-cent.us wrote: Alexander Farber wrote: every few hours I get the following message in /var/log/message: Jul 5 20:23:28 hXXX kernel: Machine check events logged ... MCE 0 HARDWARE ERROR. This is *NOT* a software problem!

Re: [CentOS] kernel: Machine check events logged

2010-07-07 Thread Alexander Farber
I've only found this Solaris blog, but don't understand it well enough: http://blogs.sun.com/gavinm/entry/amd_opteron_athlon64_turion64_fault Can't provide you more details, because my dedicated server is under hoster's hardware tests since 5 hours :-( (and I guess everyone will run home for the

Re: [CentOS] kernel: Machine check events logged

2010-07-07 Thread m . roth
Alexander Farber wrote: I've only found this Solaris blog, but don't understand it well enough: http://blogs.sun.com/gavinm/entry/amd_opteron_athlon64_turion64_fault Can't provide you more details, because my dedicated server is under hoster's hardware tests since 5 hours :-( (and I guess

Re: [CentOS] kernel: Machine check events logged

2010-07-07 Thread Peter Kjellstrom
On Wednesday 07 July 2010, m.r...@5-cent.us wrote: Peter Kjellstrom wrote: On Wednesday 07 July 2010, m.r...@5-cent.us wrote: Alexander Farber wrote: ... MISC c0080100 ADDR 1148f5940 Northbridge NB Array Error bit35 = err cpu3 bit42 = L3 subcache in error

Re: [CentOS] kernel: Machine check events logged

2010-07-07 Thread Alexander Farber
Anyway my hoster has finished the hardware tests (probably just kept running memtest86 or some vendor CD?) on my CentOS 5.5/64bit machine with quad Opteron 1381 and said that they haven't found any issues. I'll post here a short note if I will experience any issues on my LAPP server (preferans.de