Per Jensen wrote, On 10/23/2009 06:52 AM:
> List,
>
> I have just received a T610 which is being setup for Xen dom0 backup use.
...
> When running 'ipmitool sdr' I get a 'cr' on one of the reported temperatures,
> as
> shown in the snip below.
...
> Temp | 62 degrees C | cr
...
> How do I find out what the "cr" is about, should I be concerned ?
Per,
You need to get the "Entity ID" via:
# ipmitool sdr type temp
Temp | 01h | ok | 3.1 | -56 degrees C
Temp | 02h | ok | 3.2 | -54 degrees C
Temp | 05h | ok | 10.1 | 37 degrees C
Temp | 06h | ok | 10.2 | 36 degrees C
Ambient Temp | 0Eh | ok | 7.1 | 25 degrees C
Planar Temp | 0Fh | ok | 7.1 | 42 degrees C
IOH THERMTRIP | 5Dh | ns | 7.1 | Disabled
CPU Temp Interf | 76h | ns | 7.1 | Disabled
Temp | 0Ah | ok | 8.1 | 34 degrees C
Temp | 0Bh | ok | 8.1 | 39 degrees C
Temp | 0Ch | ucr | 8.1 | 50 degrees C
I also appear to have an Upper Critical on entity 8.1 on an R710.
I'll have to check my other systems to see if this is an anomoly
or a "bug/deficiency" in Dell's implementation.
# ipmitool -v sdr entity 8.1
....
Sensor ID : Temp (0xc)
Entity ID : 8.1 (Memory Module)
Sensor Type (Analog) : Temperature
Sensor Reading : 49 (+/- 1) degrees C
Status : Upper Critical
Nominal Reading : 23.000
Normal Minimum : 11.000
Normal Maximum : 69.000
Upper critical : 47.000
Upper non-critical : 42.000
Lower critical : 3.000
Lower non-critical : 8.000
Positive Hysteresis : 1.000
Negative Hysteresis : 1.000
Minimum sensor range : Unspecified
Maximum sensor range : Unspecified
Event Message Control : Per-threshold
Readable Thresholds : lcr lnc unc ucr
Settable Thresholds : lcr lnc unc ucr
Threshold Read Mask : lcr lnc unc ucr
Event Status : Event Messages Disabled
Assertion Events : unc+ ucr+
Event Enable : Event Messages Disabled
Assertions Enabled :
This is a Memory Module. Not sure how to map that to any particular
DIMM/slot/cpu/sensor-location, though, as i have 6 DIMMs (3/cpu)
# dmidecode -t memory | sed -ne '/Memory Device/,/Part Number/ {
> /Size:/h; /^[[:space:]]*Locator:/ {p;x;p}; /Speed:/p}' | paste - - - | tr -s
> '\t
> ' | expand -t 1,20,50
Locator: DIMM_A1 Size: 4096 MB Speed: 1333 MHz (0.8 ns)
Locator: DIMM_A2 Size: 4096 MB Speed: 1333 MHz (0.8 ns)
Locator: DIMM_A3 Size: 4096 MB Speed: 1333 MHz (0.8 ns)
Locator: DIMM_A4 Size: No Module Installed Speed: Unknown
Locator: DIMM_A5 Size: No Module Installed Speed: Unknown
Locator: DIMM_A6 Size: No Module Installed Speed: Unknown
Locator: DIMM_A7 Size: No Module Installed Speed: Unknown
Locator: DIMM_A8 Size: No Module Installed Speed: Unknown
Locator: DIMM_A9 Size: No Module Installed Speed: Unknown
Locator: DIMM_B1 Size: 4096 MB Speed: 1333 MHz (0.8 ns)
Locator: DIMM_B2 Size: 4096 MB Speed: 1333 MHz (0.8 ns)
Locator: DIMM_B3 Size: 4096 MB Speed: 1333 MHz (0.8 ns)
Locator: DIMM_B4 Size: No Module Installed Speed: Unknown
Locator: DIMM_B5 Size: No Module Installed Speed: Unknown
Locator: DIMM_B6 Size: No Module Installed Speed: Unknown
Locator: DIMM_B7 Size: No Module Installed Speed: Unknown
Locator: DIMM_B8 Size: No Module Installed Speed: Unknown
Locator: DIMM_B9 Size: No Module Installed Speed: Unknown
But, the presumption from this, then, is that the memory is
overheating *IF* it's not some incomplete function of the BMC.
Well, to confirm, this seems to be common on the R710s i've checked.
lager:~# ipmitool sdr entity 8.1.0
Temp | 0Ah | ok | 8.1 | 27 degrees C
Temp | 0Bh | ok | 8.1 | 24 degrees C
Temp | 0Ch | ucr | 8.1 | 59 degrees C
pub:~# ipmitool sdr entity 8.1.0
Temp | 0Ah | ok | 8.1 | 32 degrees C
Temp | 0Bh | ok | 8.1 | 32 degrees C
Temp | 0Ch | unc | 8.1 | 45 degrees C
The last sensor is MUCH higher than the other two.
I think someone from Dell needs to chime in on this
--stephen
--
Stephen Dowdy - Systems Administrator - NCAR/RAL
303.497.2869 - [email protected] - http://www.ral.ucar.edu/~sdowdy/
_______________________________________________
Linux-PowerEdge mailing list
[email protected]
https://lists.us.dell.com/mailman/listinfo/linux-poweredge
Please read the FAQ at http://lists.us.dell.com/faq