Per Jensen wrote, On 10/23/2009 06:52 AM:
> List,
> 
> I have just received a T610 which is being setup for Xen dom0 backup use.
...
> When running 'ipmitool sdr' I get a 'cr' on one of the reported temperatures, 
> as
> shown in the snip below.
...
> Temp             | 62 degrees C      | cr
...
> How do I find out what the "cr" is about, should I be concerned ?

Per,

You need to get the "Entity ID" via:

# ipmitool sdr type temp
Temp             | 01h | ok  |  3.1 | -56 degrees C
Temp             | 02h | ok  |  3.2 | -54 degrees C
Temp             | 05h | ok  | 10.1 | 37 degrees C
Temp             | 06h | ok  | 10.2 | 36 degrees C
Ambient Temp     | 0Eh | ok  |  7.1 | 25 degrees C
Planar Temp      | 0Fh | ok  |  7.1 | 42 degrees C
IOH THERMTRIP    | 5Dh | ns  |  7.1 | Disabled
CPU Temp Interf  | 76h | ns  |  7.1 | Disabled
Temp             | 0Ah | ok  |  8.1 | 34 degrees C
Temp             | 0Bh | ok  |  8.1 | 39 degrees C
Temp             | 0Ch | ucr |  8.1 | 50 degrees C

I also appear to have an Upper Critical on entity 8.1 on an R710.
I'll have to check my other systems to see if this is an anomoly
or a "bug/deficiency" in Dell's implementation.

# ipmitool -v sdr entity 8.1
....
Sensor ID              : Temp (0xc)
 Entity ID             : 8.1 (Memory Module)
 Sensor Type (Analog)  : Temperature
 Sensor Reading        : 49 (+/- 1) degrees C
 Status                : Upper Critical
 Nominal Reading       : 23.000
 Normal Minimum        : 11.000
 Normal Maximum        : 69.000
 Upper critical        : 47.000
 Upper non-critical    : 42.000
 Lower critical        : 3.000
 Lower non-critical    : 8.000
 Positive Hysteresis   : 1.000
 Negative Hysteresis   : 1.000
 Minimum sensor range  : Unspecified
 Maximum sensor range  : Unspecified
 Event Message Control : Per-threshold
 Readable Thresholds   : lcr lnc unc ucr
 Settable Thresholds   : lcr lnc unc ucr
 Threshold Read Mask   : lcr lnc unc ucr
 Event Status          : Event Messages Disabled
 Assertion Events      : unc+ ucr+
 Event Enable          : Event Messages Disabled
 Assertions Enabled    :

This is a Memory Module.  Not sure how to map that to any particular
DIMM/slot/cpu/sensor-location, though, as i have 6 DIMMs (3/cpu)

# dmidecode -t memory | sed -ne '/Memory Device/,/Part Number/ {
> /Size:/h; /^[[:space:]]*Locator:/ {p;x;p}; /Speed:/p}' | paste - - - | tr -s 
> '\t
> ' | expand -t 1,20,50
 Locator: DIMM_A1   Size: 4096 MB                 Speed: 1333 MHz (0.8 ns)
 Locator: DIMM_A2   Size: 4096 MB                 Speed: 1333 MHz (0.8 ns)
 Locator: DIMM_A3   Size: 4096 MB                 Speed: 1333 MHz (0.8 ns)
 Locator: DIMM_A4   Size: No Module Installed     Speed: Unknown
 Locator: DIMM_A5   Size: No Module Installed     Speed: Unknown
 Locator: DIMM_A6   Size: No Module Installed     Speed: Unknown
 Locator: DIMM_A7   Size: No Module Installed     Speed: Unknown
 Locator: DIMM_A8   Size: No Module Installed     Speed: Unknown
 Locator: DIMM_A9   Size: No Module Installed     Speed: Unknown
 Locator: DIMM_B1   Size: 4096 MB                 Speed: 1333 MHz (0.8 ns)
 Locator: DIMM_B2   Size: 4096 MB                 Speed: 1333 MHz (0.8 ns)
 Locator: DIMM_B3   Size: 4096 MB                 Speed: 1333 MHz (0.8 ns)
 Locator: DIMM_B4   Size: No Module Installed     Speed: Unknown
 Locator: DIMM_B5   Size: No Module Installed     Speed: Unknown
 Locator: DIMM_B6   Size: No Module Installed     Speed: Unknown
 Locator: DIMM_B7   Size: No Module Installed     Speed: Unknown
 Locator: DIMM_B8   Size: No Module Installed     Speed: Unknown
 Locator: DIMM_B9   Size: No Module Installed     Speed: Unknown

But, the presumption from this, then, is that the memory is
overheating *IF* it's not some incomplete function of the BMC.

Well, to confirm, this seems to be common on the R710s i've checked.

lager:~# ipmitool  sdr entity 8.1.0
Temp             | 0Ah | ok  |  8.1 | 27 degrees C
Temp             | 0Bh | ok  |  8.1 | 24 degrees C
Temp             | 0Ch | ucr |  8.1 | 59 degrees C

pub:~# ipmitool  sdr entity 8.1.0
Temp             | 0Ah | ok  |  8.1 | 32 degrees C
Temp             | 0Bh | ok  |  8.1 | 32 degrees C
Temp             | 0Ch | unc |  8.1 | 45 degrees C

The last sensor is MUCH higher than the other two.

I think someone from Dell needs to chime in on this
--stephen

-- 
Stephen Dowdy  -  Systems Administrator  -  NCAR/RAL
303.497.2869   -  [email protected]        -  http://www.ral.ucar.edu/~sdowdy/

_______________________________________________
Linux-PowerEdge mailing list
[email protected]
https://lists.us.dell.com/mailman/listinfo/linux-poweredge
Please read the FAQ at http://lists.us.dell.com/faq

Reply via email to