Are you running the latest firmware for the NICs and the BIOS?

Cameron

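One way to answer the firmware question from the running Linux host is sketched below. This is only a rough sketch, not a Dell-provided tool: it assumes `ethtool` and `dmidecode` are installed, that `dmidecode` is run as root, and that the interface name `em1` is a placeholder for whichever port the X540 actually shows up as. The helper names (`parse_ethtool_info`, `nic_firmware`, `bios_version`) are made up for illustration.

```python
import shutil
import subprocess


def parse_ethtool_info(text):
    """Parse the 'key: value' lines printed by `ethtool -i <iface>` into a dict."""
    info = {}
    for line in text.splitlines():
        # Split on the first colon only, so values such as PCI addresses stay whole.
        key, sep, value = line.partition(":")
        if sep:
            info[key.strip()] = value.strip()
    return info


def nic_firmware(iface):
    """Return (driver, firmware-version) for iface, or None if it cannot be read."""
    if shutil.which("ethtool") is None:
        return None
    result = subprocess.run(["ethtool", "-i", iface], capture_output=True, text=True)
    if result.returncode != 0:
        return None
    info = parse_ethtool_info(result.stdout)
    return info.get("driver"), info.get("firmware-version")


def bios_version():
    """Return the BIOS version reported by dmidecode (usually needs root), or None."""
    if shutil.which("dmidecode") is None:
        return None
    result = subprocess.run(
        ["dmidecode", "-s", "bios-version"], capture_output=True, text=True
    )
    if result.returncode != 0:
        return None
    return result.stdout.strip() or None


if __name__ == "__main__":
    # "em1" is a placeholder; substitute the name of the X540 port on your host.
    print("NIC driver/firmware:", nic_firmware("em1"))
    print("BIOS version:", bios_version())
```

The versions it prints can then be compared by hand against the latest releases listed for the server's service tag.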
On Fri, Jan 20, 2017 at 1:18 PM, Russell Kackley <rkack...@naoj.org> wrote:

> Hi Tuc,
>
> We had what sounds like a similar problem to yours. Fortunately, the
> change to the fan speed offset solved the problem for us. Here are the
> details:
>
> We have 3 PE R720 servers that were purchased in mid-2013. All of them
> have Intel X540/2P I350 rNDC 10 Gb/s NICs in them. I'm guessing that is
> similar to what you have in your R720XD. Two of the servers have worked
> flawlessly for the past three years. However, one of them, even from the
> early days of use, would intermittently report the following error:
>
> The system board NDC PG voltage is outside of range.
>
> That would cause a reboot event, which was obviously a serious problem.
> The server was under warranty and the technician tried replacing both the
> NIC and the motherboard. We still got the same error and reboot problem.
> Eventually, the issue got elevated to an L3 technician at Dell and they
> advised us to set the "Fan speed offset" to the "Low Fan Speed Offset"
> setting. We made that change to the problem server and its performance has
> been perfectly fine since then. I'm guessing that the fan speed change
> solved the NIC overheating problem.
>
> I'm sorry to hear that it doesn't seem to have solved your problem. I
> think that the iDRAC Settings-Thermal GUI offers the "High Fan Speed
> Offset", which runs the fans faster than the "Low Fan Speed Offset"
> setting. Did you try the "High Fan Speed Offset" setting to see if that
> corrects the problem?
>
> On Fri, Jan 20, 2017 at 6:47 AM, Tuc at Beach House <tuct...@gmail.com>
> wrote:
>
>> Hi,
>>
>> We have an older R720XD (Ship date: September 07, 2012) that has X540-AT2
>> NICs (10G). The system seems to shut down the NICs for overheating.
>> Apparently, Dell told them just to change the "Thermal Profile" to Max, and
>> the "Fan Speed Offset" to low. That worked for a while, but now it's
>> happening again. The unit isn't under warranty, so I can't call Dell
>> anymore.
>>
>> Has anyone else found a way around this? It's a Hadoop node that just
>> "disappears" on us and causes problems.
>>
>> Thanks, Tuc
>
> --
> Russell Kackley
> Subaru Telescope
> Hilo, Hawaii

_______________________________________________
Linux-PowerEdge mailing list
Linux-PowerEdge@dell.com
https://lists.us.dell.com/mailman/listinfo/linux-poweredge