On Thu, Sep 13, 2012 at 07:48:46AM -0400, Kamaraju S Kusumanchi wrote: > 1) The man page of smartctl is not much helpful in figuring out what various > fields in the output of "smartctl -a" stand for. For example, what does > fields such as Raw_Read_Error_Rate, Seek_Error_Rate, Hardware_ECC_Recovered, > Multi_Zone_Error_Rate mean? Is there any page that describes all these > fields in good detail?
Actually, the man page has useful information, just perhaps not in a useful format. Data, rather than information. In particular, it details that values in the ATRRIBUTES table are normalised (by the drive firmware) to a VALUE between 1 and 254. Also noted is a THREShold value (between 0 and 254). If an attribute is *less than or equal to* its threshold, it is deemed to have failed. The man page also notes that if this is a "Pre-Fail" attribute (rather than an "Old_age" attribute), then this indicates drive failure is imminent. > > 2) Can someone please tell me if this hard drive is dying. The following is > the difference between two smartctl outputs that are a week apart. Bearing the above in mind > > $diff smartctl_20120905 smartctl_20120912 > 14c14 > < Local Time is: Wed Sep 5 21:55:18 2012 EDT > --- > > Local Time is: Wed Sep 12 20:03:10 2012 EDT > 58c58 ID ATTRIBUTE FLAG VALUE WORST THRESH Type > < 1 Raw_Read_Error_Rate 0x000f 100 100 046 Pre-fail Always > > > 1 Raw_Read_Error_Rate 0x000f 100 100 046 Pre-fail Always > > 100 > 46, so this is fine > < 4 Start_Stop_Count 0x0032 099 099 000 Old_age Always > > > 4 Start_Stop_Count 0x0032 099 099 000 Old_age Always > > > < 7 Seek_Error_Rate 0x000e 100 100 000 Old_age Always > > > 7 Seek_Error_Rate 0x000e 100 100 000 Old_age Always > > > < 9 Power_On_Seconds 0x0032 023 023 000 Old_age Always > > > 9 Power_On_Seconds 0x0032 023 023 000 Old_age Always > > > < 12 Power_Cycle_Count 0x0032 100 100 000 Old_age Always > > > 12 Power_Cycle_Count 0x0032 100 100 000 Old_age Always > > These are also all steady and above their threshold. Note, however, that Power_On_Seconds is a little low. It's "Old_age" so when that reaches 0, you don't need to worry, but you can probably expect that that's what the drive manufacturer believes is the typical life of the drive. > < 193 Load_Cycle_Count 0x0032 069 069 000 Old_age Always > > - 635379 > < 194 Temperature_Celsius 0x0022 100 090 000 Old_age Always > > - 51 (Min/Max 8/62) Here we see that the temperature IS fine, but HAS BEEN slightly towards failure (in other words, it probably got warm). This is nothing too worry about. > < 195 Hardware_ECC_Recovered 0x001a 100 100 000 Old_age Always > > - 9258 > --- > > 193 Load_Cycle_Count 0x0032 069 069 000 Old_age Always > > > - 635384 > > 194 Temperature_Celsius 0x0022 100 090 000 Old_age Always > > > - 49 (Min/Max 8/62) > > 195 Hardware_ECC_Recovered 0x001a 100 100 000 Old_age Always > > > - 15601 > 76,77c76,77 > < 200 Multi_Zone_Error_Rate 0x000e 100 100 000 Old_age Always > > - 1038 > < 203 Run_Out_Cancel 0x0002 100 100 000 Old_age Always > > - 3732311179847 > --- > > 200 Multi_Zone_Error_Rate 0x000e 100 100 000 Old_age Always > > > - 5133 > > 203 Run_Out_Cancel 0x0002 100 100 000 Old_age Always > > > - 2632791622157 And again, these are all well above their threshold. > > This is a hard drive in Dell Inspiron E1505 laptop. Any help is greatly > appreciated. > According to SMART, your hard drive is fine.
signature.asc
Description: Digital signature