Re: MCE: Does this look possibly like a slot issue?

2022-06-21 Thread Chris
On 2022-06-21 12:23, Larry Rosenman wrote: On 06/21/2022 1:23 pm, Chris wrote: On 2022-06-20 17:23, Larry Rosenman wrote: I'm seeing them constantly: FWIW it looks like a sync(ing) problem between your RAM && CPU cache. Are are your clocks set correctly for your CPU && RAM? Is your CPU too

Re: MCE: Does this look possibly like a slot issue?

2022-06-21 Thread Larry Rosenman
On 06/21/2022 1:23 pm, Chris wrote: On 2022-06-20 17:23, Larry Rosenman wrote: I'm seeing them constantly: FWIW it looks like a sync(ing) problem between your RAM && CPU cache. Are are your clocks set correctly for your CPU && RAM? Is your CPU too hot? Is the CPU cache ECC? root@freenas[~]#

Re: MCE: Does this look possibly like a slot issue?

2022-06-21 Thread Chris
On 2022-06-20 17:23, Larry Rosenman wrote: I'm seeing them constantly: FWIW it looks like a sync(ing) problem between your RAM && CPU cache. Are are your clocks set correctly for your CPU && RAM? Is your CPU too hot? Is the CPU cache ECC? root@freenas[~]# mcelog --dmi Hardware event. This is

Re: MCE: Does this look possibly like a slot issue?

2022-06-21 Thread Ultima
Completely agree with you, Rodney. The LGA on the motherboard can be bent very easy when moving so I wanted to recommend this last. Larry, as Rodney mentioned, it's more or less your last option. This is likely the CPU and not the module itself. There is still a small chance that is

Re: MCE: Does this look possibly like a slot issue?

2022-06-21 Thread Larry Rosenman
Looks like it might be just that, Rodney: root@freenas[~]# mcelog Hardware event. This is not a software error. MCE 0 CPU 14 BANK 8 TSC 525efc019bb6 MISC ac29890200040083 ADDR ee2f6e800 TIME 1655827944 Tue Jun 21 11:12:24 2022 MCG status: Memory read ECC error Memory corrected error count

Re: MCE: Does this look possibly like a slot issue?

2022-06-21 Thread Rodney W. Grimes
> > > Swapped 2 DIMMS, now we wait for the ZFS ARC to fill and start using all > the memory. Depending on the results of that one thing that is often overlooked when trying to trouble shoot memory systems in modern Intel systems is the fact that the DIMM now talks directly to the CPU chip that