On 10/4/2006 4:40 PM, Jarrod Johnson wrote:
> On Wed, 2006-10-04 at 15:40 -0400, Eric A. Hall wrote:
>> On 10/4/2006 10:28 AM, Colin Keith wrote:
>>
>>> I'm still trying to work out my IPMI problems and I was wondering if anyone
>>> could make suggestions on the following. I have a server out at a data
>>> center its a dual AMD 254 Operton box on a Tyan m/b
>>> I can access the box via IPMI over the LAN without problems when it is up
>>> and running, but for some reason it is falling down every so few weeks
>>> (running FC5 and it seems to be a problem with the aacraid driver in 2.6.17
>>> ..). When it does I can't get any commands through at all, including a
>>> reset, which was our entire reason for getting the IPMI support on the box.
>> My Tyan S5162 with M3291 IPMI is just as flaky. If the machine is on then
>> IPMI works fine. If I power off the machine with one method, then IPMI
>> will usually continue working, but if I power it off with another method,
>> then IPMI connectivity will disappear altogether. They don't really seem

> This is another problem we faced and had broadcom implement a correction
> in firmware, so you may want to check for newer firmware for the NIC
> bit.
> 
> This case is actually very similar to the panic.  The root cause is
> likely the system powering down without unloading the NIC driver.  Just
> like in a panic, the NIC doesn't realize that the driver went away so it
> continues operating optimizing for NIC driver being loaded and will mess
> up when buffers fill.  For this problem the fix was simple for them,
> their firmware in some loop checks to see if it thinks the driver is
> loaded and if power is on.  If the system is off and thinks the driver
> is loaded, it takes corrective measures (since *obviously* the driver
> can't be loaded if the system is off).

This discussion in conjunction with another conversation elsewhere led me
to try using the e1000 drivers from Intel's site, instead of the e1000
drivers that are bundled into the suse distro. It seems that those drivers
do alleviate some of my problems, presumably because they leave the
hardware in a usable state.

While there are still some bad problems with management power from hard
power-loss, OS-level shutdown seems to be a lot cleaner.

FYI to others having similar problems

-- 
Eric A. Hall                                        http://www.ehsco.com/
Internet Core Protocols          http://www.oreilly.com/catalog/coreprot/

-------------------------------------------------------------------------
Using Tomcat but need to do more? Need to support web services, security?
Get stuff done quickly with pre-integrated technology to make your job easier
Download IBM WebSphere Application Server v.1.0.1 based on Apache Geronimo
http://sel.as-us.falkag.net/sel?cmd=lnk&kid=120709&bid=263057&dat=121642
_______________________________________________
Ipmitool-devel mailing list
Ipmitool-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/ipmitool-devel

Reply via email to