On Fri, 2006-12-01 at 18:06 -0500, Miguel wrote:
> I am an ipmitool newbie.
> 
> My questions must be FAQs, but I was unable to locate the answers.
> 
> Dell PowerEdge 850 running BMC version 1.52 ... the latest firmware.
> 
> playing around with basic 'power chassis' commands (status, on, off, soft,
> reset) across the 'lan' interface.
> 
> I observe that after sending a command that changes the power status (on,
> off, reset) the BMC controller does not respond for a while.
> 
> I send the following sequence:
> 
>   $ ipmitool -I lan -H pe850a-ipmi -U root -P pw chassis power off
>   Chassis Power Control: Down/Off
> 
>   $ ipmitool -I lan -H pe850a-ipmi -U root -P pw chassis power status
>   Error: Unable to establish LAN session
>   ipmi_lan_send_cmd failed to open intf
>   Unable to get Chassis Power Status
> 
>   $ ipmitool -I lan -H pe850a-ipmi -U root -P pw chassis power status
>   Chassis Power is off
> 
> If I power on or reset the same thing happens ... BMC does not respond for
> approx 15 seconds after a power change.
> 
> 
> Q: Is it normal for BMC to not be available, including checking status)
> for some period of time after a power transition?

Depends frequently on your switch configuration.  A system reset/on/off
means link renogtiation.  If your switch has LACP enabled for the port
or spanning tree is enabled and not edge-port or portfast (depending on
your switch vendor's terminalogy), it will take time.  15 seconds sounds
a lot like typical LACP link delay on some switches.  It could be
spanning-tree delay, but 15 seconds is a bit on the short side for such
a delay.  Also, it is not uncommon for the interface to operate at
different speeds when off and on.  Have had more than one customer try
to hard-set their switch ports to gigabit speed, then call wondering why
the BMCs were unreachable in the off state.  In standby power situation,
100Mbit link will often be used rather than gigabit.  This shouldn't be
your problem, but just a warning not to attempt hard-setting the link
speed to shorten negotiation (unless only 100Mbit is fine by you, then
you can try it).  Some poor switches could indeed incur a significant
penalty for mere link speed negotiation.

> 
> Q: Is there a specification that defines how long this period of time
> should be?

No, well, maybe if you dug into the various relevant protocols your
switch implements you might find something, but as far as IPMI is
concerned, there is nothing on this.

> 
> Q: If not, what kinds of times are typical?

Optimally, no more than 2-3 seconds.  With switches configured for LACP
and/or spanning tree, could block port from forwarding packets for up to
60 seconds or so.  Some switches will cause longer outages no matter
what.



> 
> Q: Does it vary from implementation to implementation?
> 

Probably, though it probably has more to do with the switching equipment
hooked up and configured, and somewhat to do with what NIC chip is used
in the server and how the server's NIC is set up in firmware.

> 
> 
> I wrote a command line and that asked for 'power chassis status' and slept
> for 2 seconds. I observe that there are times during OS driver
> initialization when it times out.
> 
> Presumably this happens because the driver is interfering with the NIC and
> blocking BMC access.
> 
> Q: What are 'typical' times for this blockage when the NIC driver
> initializes?
> 

See answer above since that event also evokes a link renegotiation,
though this is potentially more complicated.  You could sit on local
console, restart the networking service (best if system has static
rather than dhcp config), and leave a ping running to see how many
packets get lost during the restart.  That will measure delay due to
switch not negotiating the link/forwarding packets.  If it matches the
BMC outage (which it probably will), you will know any optimization to
be had is in the switch to NIC interaction.

If shorter than BMC outage, keep in mind that some NICs on driver load
and PXE rom execution will have resources that are required be tied up
for some period of time.  For example, the Broadcom 5704 chip will not
be able to forward IPMI traffic and PXE at the same time, though newer
chips (NetXtreme II family  and later NetXtreme nics like used in IBM
e326m and JS-21) can.  However, at least with broadcom NICs, driver load
time outage is pretty much on the order of the same delay incurred at
power transition times, in other words potentially neglibile.  If 15
seconds in all scenarios, I'd look at your switchport config or ask your
network administrator to do so and as is feasible shorten STP delays and
disable LACP if enabled unless you are using it.

Some other cases to be wary of is to panic the kernel while the system
NIC is up, and to chassis power off without proper shutdown.  Keep
traffic up to the BMC, maybe an amount of broadcast traffic on the
network and see if the BMC will continue to work indefinitely.  If not
investigate the possiblity of network driver and NIC firmware upgrades,
since the inability to access your BMC after a panic kinda puts a damper
on the whole manageability thing.   Alternatively, since a panicked/hung
system is not something you want to leave in a production environment
anyway, consider using some watchdog program with your BMC.  If linux,
OpenIPMI drivers included in most distros (just a 'modprobe
ipmi_si;modprobe ipmi_watchdog' away) and any standard linux watchdog
program (i.e. http://www.ibiblio.org/pub/Linux/system/daemons/watchdog/)
and it will auto-reset (or auto-off) your system in the event of a hang.
When setting this up I set up the BMC watchdog timeout to be a bit over
3 times the watchdog checkin interval, because I'm paranoid.  My typical
config is watchdog will fire after 60 seconds, and the daemon checks in
every 15 seconds to BMC.  I wrote an even simpler watchdog than what I
linked to, but it is nearly the simplest possible with even fewer
features.

  
> 
> 
> Thanks
> Miguel
> 
> 
> -------------------------------------------------------------------------
> Take Surveys. Earn Cash. Influence the Future of IT
> Join SourceForge.net's Techsay panel and you'll get the chance to share your
> opinions on IT & business topics through brief surveys - and earn cash
> http://www.techsay.com/default.php?page=join.php&p=sourceforge&CID=DEVDEV
> _______________________________________________
> Ipmitool-devel mailing list
> Ipmitool-devel@lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/ipmitool-devel


-------------------------------------------------------------------------
Take Surveys. Earn Cash. Influence the Future of IT
Join SourceForge.net's Techsay panel and you'll get the chance to share your
opinions on IT & business topics through brief surveys - and earn cash
http://www.techsay.com/default.php?page=join.php&p=sourceforge&CID=DEVDEV
_______________________________________________
Ipmitool-devel mailing list
Ipmitool-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/ipmitool-devel

Reply via email to