is this hard drive dying?

2012-09-13 Thread Kamaraju S Kusumanchi
1) The man page of smartctl is not much helpful in figuring out what various 
fields in the output of smartctl -a stand for. For example, what does 
fields such as Raw_Read_Error_Rate, Seek_Error_Rate, Hardware_ECC_Recovered, 
Multi_Zone_Error_Rate mean? Is there any page that describes all these 
fields in good detail?

2) Can someone please tell me if this hard drive is dying. The following is 
the difference between two smartctl outputs that are a week apart.

$diff smartctl_20120905 smartctl_20120912 
14c14
 Local Time is:Wed Sep  5 21:55:18 2012 EDT
---
 Local Time is:Wed Sep 12 20:03:10 2012 EDT
58c58
   1 Raw_Read_Error_Rate 0x000f   100   100   046Pre-fail  Always  
 
-   74727
---
   1 Raw_Read_Error_Rate 0x000f   100   100   046Pre-fail  Always  
  
-   108562
61c61
   4 Start_Stop_Count0x0032   099   099   000Old_age   Always  
 
-   2242
---
   4 Start_Stop_Count0x0032   099   099   000Old_age   Always  
  
-   2244
63c63
   7 Seek_Error_Rate 0x000e   100   100   000Old_age   Always  
 
-   1872
---
   7 Seek_Error_Rate 0x000e   100   100   000Old_age   Always  
  
-   3683
65c65
   9 Power_On_Seconds0x0032   023   023   000Old_age   Always  
 
-   38808h+46m+11s
---
   9 Power_On_Seconds0x0032   023   023   000Old_age   Always  
  
-   38965h+50m+24s
67c67
  12 Power_Cycle_Count   0x0032   100   100   000Old_age   Always  
 
-   1717
---
  12 Power_Cycle_Count   0x0032   100   100   000Old_age   Always  
  
-   1719
69,71c69,71
 193 Load_Cycle_Count0x0032   069   069   000Old_age   Always  
 
-   635379
 194 Temperature_Celsius 0x0022   100   090   000Old_age   Always  
 
-   51 (Min/Max 8/62)
 195 Hardware_ECC_Recovered  0x001a   100   100   000Old_age   Always  
 
-   9258
---
 193 Load_Cycle_Count0x0032   069   069   000Old_age   Always  
  
-   635384
 194 Temperature_Celsius 0x0022   100   090   000Old_age   Always  
  
-   49 (Min/Max 8/62)
 195 Hardware_ECC_Recovered  0x001a   100   100   000Old_age   Always  
  
-   15601
76,77c76,77
 200 Multi_Zone_Error_Rate   0x000e   100   100   000Old_age   Always  
 
-   1038
 203 Run_Out_Cancel  0x0002   100   100   000Old_age   Always  
 
-   3732311179847
---
 200 Multi_Zone_Error_Rate   0x000e   100   100   000Old_age   Always  
  
-   5133
 203 Run_Out_Cancel  0x0002   100   100   000Old_age   Always  
  
-   2632791622157


I am also attaching smartctl_20120912 as http://pastebin.com/L5gRvQPV . This 
file is generated by
$ sudo smartctl -a /dev/sda  smartctl_20120912

This is a hard drive in Dell Inspiron E1505 laptop. Any help is greatly 
appreciated.

thanks
raju
-- 
Kamaraju S Kusumanchi
http://malayamaarutham.blogspot.com/


-- 
To UNSUBSCRIBE, email to debian-user-requ...@lists.debian.org 
with a subject of unsubscribe. Trouble? Contact listmas...@lists.debian.org
Archive: http://lists.debian.org/k2shbm$1la$1...@ger.gmane.org



Re: is this hard drive dying?

2012-09-13 Thread David
On 13/09/2012, Kamaraju S Kusumanchi raju.mailingli...@gmail.com wrote:
 1) The man page of smartctl is not much helpful in figuring out what various
 fields in the output of smartctl -a stand for. For example, what does
 fields such as Raw_Read_Error_Rate, Seek_Error_Rate, Hardware_ECC_Recovered,
 Multi_Zone_Error_Rate mean? Is there any page that describes all these
 fields in good detail?

Read the links under the first answer in the smartmontools FAQ at
http://sourceforge.net/apps/trac/smartmontools/wiki/FAQ


-- 
To UNSUBSCRIBE, email to debian-user-requ...@lists.debian.org 
with a subject of unsubscribe. Trouble? Contact listmas...@lists.debian.org
Archive: 
http://lists.debian.org/CAMPXz=pGULdS+8yt_4sMNnJnfrGY55a=sl2x-j0ektyrz2l...@mail.gmail.com



Re: is this hard drive dying?

2012-09-13 Thread Gary Dale

On 13/09/12 07:48 AM, Kamaraju S Kusumanchi wrote:

1) The man page of smartctl is not much helpful in figuring out what various
fields in the output of smartctl -a stand for. For example, what does
fields such as Raw_Read_Error_Rate, Seek_Error_Rate, Hardware_ECC_Recovered,
Multi_Zone_Error_Rate mean? Is there any page that describes all these
fields in good detail?

2) Can someone please tell me if this hard drive is dying. The following is
the difference between two smartctl outputs that are a week apart.

$diff smartctl_20120905 smartctl_20120912
14c14
  Local Time is:Wed Sep  5 21:55:18 2012 EDT
---

Local Time is:Wed Sep 12 20:03:10 2012 EDT

58c58
1 Raw_Read_Error_Rate 0x000f   100   100   046Pre-fail  Always
-   74727
---

   1 Raw_Read_Error_Rate 0x000f   100   100   046Pre-fail  Always

-   108562
61c61
4 Start_Stop_Count0x0032   099   099   000Old_age   Always
-   2242
---

   4 Start_Stop_Count0x0032   099   099   000Old_age   Always

-   2244
63c63
7 Seek_Error_Rate 0x000e   100   100   000Old_age   Always
-   1872
---

   7 Seek_Error_Rate 0x000e   100   100   000Old_age   Always

-   3683
65c65
9 Power_On_Seconds0x0032   023   023   000Old_age   Always
-   38808h+46m+11s
---

   9 Power_On_Seconds0x0032   023   023   000Old_age   Always

-   38965h+50m+24s
67c67
   12 Power_Cycle_Count   0x0032   100   100   000Old_age   Always
-   1717
---

  12 Power_Cycle_Count   0x0032   100   100   000Old_age   Always

-   1719
69,71c69,71
  193 Load_Cycle_Count0x0032   069   069   000Old_age   Always
-   635379
  194 Temperature_Celsius 0x0022   100   090   000Old_age   Always
-   51 (Min/Max 8/62)
  195 Hardware_ECC_Recovered  0x001a   100   100   000Old_age   Always
-   9258
---

193 Load_Cycle_Count0x0032   069   069   000Old_age   Always

-   635384

194 Temperature_Celsius 0x0022   100   090   000Old_age   Always

-   49 (Min/Max 8/62)

195 Hardware_ECC_Recovered  0x001a   100   100   000Old_age   Always

-   15601
76,77c76,77
  200 Multi_Zone_Error_Rate   0x000e   100   100   000Old_age   Always
-   1038
  203 Run_Out_Cancel  0x0002   100   100   000Old_age   Always
-   3732311179847
---

200 Multi_Zone_Error_Rate   0x000e   100   100   000Old_age   Always

-   5133

203 Run_Out_Cancel  0x0002   100   100   000Old_age   Always

-   2632791622157


I am also attaching smartctl_20120912 as http://pastebin.com/L5gRvQPV . This
file is generated by
$ sudo smartctl -a /dev/sda  smartctl_20120912

This is a hard drive in Dell Inspiron E1505 laptop. Any help is greatly
appreciated.

thanks
raju


I can't be sure but I would take steps to ensure that I always have good 
backups just in case. Of course this is good advice at any time. New 
hard drives are cheap enough but data can be priceless.



--
To UNSUBSCRIBE, email to debian-user-requ...@lists.debian.org 
with a subject of unsubscribe. Trouble? Contact listmas...@lists.debian.org

Archive: http://lists.debian.org/5051df7d.1000...@rogers.com



Re: is this hard drive dying?

2012-09-13 Thread Darac Marjal
On Thu, Sep 13, 2012 at 07:48:46AM -0400, Kamaraju S Kusumanchi wrote:
 1) The man page of smartctl is not much helpful in figuring out what various 
 fields in the output of smartctl -a stand for. For example, what does 
 fields such as Raw_Read_Error_Rate, Seek_Error_Rate, Hardware_ECC_Recovered, 
 Multi_Zone_Error_Rate mean? Is there any page that describes all these 
 fields in good detail?

Actually, the man page has useful information, just perhaps not in a
useful format. Data, rather than information.

In particular, it details that values in the ATRRIBUTES table are
normalised (by the drive firmware) to a VALUE between 1 and 254. Also
noted is a THREShold value (between 0 and 254). If an attribute is *less
than or equal to* its threshold, it is deemed to have failed. The man
page also notes that if this is a Pre-Fail attribute (rather than an
Old_age attribute), then this indicates drive failure is imminent.

 
 2) Can someone please tell me if this hard drive is dying. The following is 
 the difference between two smartctl outputs that are a week apart.

Bearing the above in mind

 
 $diff smartctl_20120905 smartctl_20120912 
 14c14
  Local Time is:Wed Sep  5 21:55:18 2012 EDT
 ---
  Local Time is:Wed Sep 12 20:03:10 2012 EDT
 58c58

ID  ATTRIBUTE   FLAG VALUE WORST THRESH Type

1 Raw_Read_Error_Rate 0x000f   100   100   046Pre-fail  Always

1 Raw_Read_Error_Rate 0x000f   100   100   046Pre-fail  Always
 

100  46, so this is fine

4 Start_Stop_Count0x0032   099   099   000Old_age   Always

4 Start_Stop_Count0x0032   099   099   000Old_age   Always
 
7 Seek_Error_Rate 0x000e   100   100   000Old_age   Always

7 Seek_Error_Rate 0x000e   100   100   000Old_age   Always
 
9 Power_On_Seconds0x0032   023   023   000Old_age   Always

9 Power_On_Seconds0x0032   023   023   000Old_age   Always
 
   12 Power_Cycle_Count   0x0032   100   100   000Old_age   Always

   12 Power_Cycle_Count   0x0032   100   100   000Old_age   Always
 

These are also all steady and above their threshold. Note, however, that
Power_On_Seconds is a little low. It's Old_age so when that reaches 0,
you don't need to worry, but you can probably expect that that's what
the drive manufacturer believes is the typical life of the drive.

  193 Load_Cycle_Count0x0032   069   069   000Old_age   Always

 -   635379
  194 Temperature_Celsius 0x0022   100   090   000Old_age   Always

 -   51 (Min/Max 8/62)

Here we see that the temperature IS fine, but HAS BEEN slightly towards
failure (in other words, it probably got warm). This is nothing too
worry about.

  195 Hardware_ECC_Recovered  0x001a   100   100   000Old_age   Always

 -   9258
 ---
  193 Load_Cycle_Count0x0032   069   069   000Old_age   Always
 
 -   635384
  194 Temperature_Celsius 0x0022   100   090   000Old_age   Always
 
 -   49 (Min/Max 8/62)
  195 Hardware_ECC_Recovered  0x001a   100   100   000Old_age   Always
 
 -   15601
 76,77c76,77
  200 Multi_Zone_Error_Rate   0x000e   100   100   000Old_age   Always

 -   1038
  203 Run_Out_Cancel  0x0002   100   100   000Old_age   Always

 -   3732311179847
 ---
  200 Multi_Zone_Error_Rate   0x000e   100   100   000Old_age   Always
 
 -   5133
  203 Run_Out_Cancel  0x0002   100   100   000Old_age   Always
 
 -   2632791622157

And again, these are all well above their threshold.

 
 This is a hard drive in Dell Inspiron E1505 laptop. Any help is greatly 
 appreciated.
 

According to SMART, your hard drive is fine.



signature.asc
Description: Digital signature


Re: is this hard drive dying?

2012-09-13 Thread Miles Fidelman


Kamaraju S Kusumanchi wrote:

2) Can someone please tell me if this hard drive is dying. The following is
the difference between two smartctl outputs that are a week apart.

58c58
   1 Raw_Read_Error_Rate 0x000f   100   100   046Pre-fail  Always
-   74727
---

   1 Raw_Read_Error_Rate 0x000f   100   100   046Pre-fail  Always

-   108562



oh yes, badly -- I've generally found that if the raw_read_error_rate is
anything other than 0, a drive is on its last legs

I expect that you've noticed your machine running increasingly slowly of
late (as it takes larger numbers of attempts before the drive
successfully reads a data block)

Miles Fidelman

--
In theory, there is no difference between theory and practice.
In practice, there is.    Yogi Berra


--
To UNSUBSCRIBE, email to debian-user-requ...@lists.debian.org 
with a subject of unsubscribe. Trouble? Contact listmas...@lists.debian.org

Archive: http://lists.debian.org/5051e215.7060...@meetinghouse.net



Re: is this hard drive dying?

2012-09-13 Thread Camaleón
On Thu, 13 Sep 2012 07:48:46 -0400, Kamaraju S Kusumanchi wrote:

 1) The man page of smartctl is not much helpful in figuring out what
 various fields in the output of smartctl -a stand for. For example,
 what does fields such as Raw_Read_Error_Rate, Seek_Error_Rate,
 Hardware_ECC_Recovered, Multi_Zone_Error_Rate mean? Is there any page
 that describes all these fields in good detail?

http://en.wikipedia.org/wiki/S.M.A.R.T.#Known_ATA_S.M.A.R.T._attributes

 2) Can someone please tell me if this hard drive is dying. The following
 is the difference between two smartctl outputs that are a week apart.

(...)

We already talked about thist, right? :-?

Normally, when a hard disk is dying you start noticing becasue of SMART 
warnings and your logs get full of badblocks messages... Whether in 
doubt, run the hard disk manufacturer test tools to diagnose a problem 
with your hdd.

Greetings,

-- 
Camaleón


-- 
To UNSUBSCRIBE, email to debian-user-requ...@lists.debian.org 
with a subject of unsubscribe. Trouble? Contact listmas...@lists.debian.org
Archive: http://lists.debian.org/k2sq0a$jks$6...@ger.gmane.org



Re: Is my hard drive dying?

2003-03-26 Thread Paul Johnson
-BEGIN PGP SIGNED MESSAGE-
Hash: SHA1

Please turn your line wraps on to something like 72 columns instead of
1 paragraph.

On Wed, Mar 26, 2003 at 08:12:48AM +0800, Robert Storey wrote:
 I had these exact same error messages, starting about two weeks
 ago. I assumed the hard drive was dying, but it turned out to be a
 bad motherboard. Actually, I wish it had been the drive - hard
 drives are cheaper (and easier) to replace than motherboards.

Might hit outpost.com, or better yet if you live near one, a Fry's
Electronics.  There, hard drives and decent motherboards run around
the same price.

- -- 
 .''`. Baloo Ursidae [EMAIL PROTECTED]
: :'  :proud Debian admin and user
`. `'`
  `-  Debian - when you have better things to do than fix a system
-BEGIN PGP SIGNATURE-
Version: GnuPG v1.2.1 (GNU/Linux)

iD8DBQE+gqJkJ5vLSqVpK2kRAnrwAKCDGmZLi8UDZHzY3gg2Xiwh+T93lACgokCK
Rqer+YUDhMRhhHJZwl55FVg=
=/M+K
-END PGP SIGNATURE-


-- 
To UNSUBSCRIBE, email to [EMAIL PROTECTED] 
with a subject of unsubscribe. Trouble? Contact [EMAIL PROTECTED]



Re: Is my hard drive dying?

2003-03-25 Thread Klaus Imgrund
Dave Sherohman wrote:
On Mon, Mar 24, 2003 at 10:16:28PM -, Andrew Pritchard wrote:

Mar 24 17:14:51 orion kernel: hda: dma_intr: status=0x51 { DriveReady
SeekComplete Error }
Mar 24 17:14:51 orion kernel: hda: dma_intr: error=0x40 {
UncorrectableError }, LBAsect=3994439, sector=63232
Mar 24 17:14:51 orion kernel: end_request: I/O error, dev 03:01 (hda),
sector 63232
Does this mean my drive is dying? I've not had any problems with the machine
till now.
I got those errors twice on a machine with sis chipset and ext3.
Other partitions with different file systems never were affected.
Klaus



--
To UNSUBSCRIBE, email to [EMAIL PROTECTED] 
with a subject of unsubscribe. Trouble? Contact [EMAIL PROTECTED]



Re: Is my hard drive dying?

2003-03-25 Thread Andrew Pritchard
  I've been looking through the logcheck on one of my machines, and I've
seen
  a lot of these types of messages:
 
  Mar 24 17:14:51 orion kernel: hda: dma_intr: status=0x51 { DriveReady
  SeekComplete Error }
  Mar 24 17:14:51 orion kernel: hda: dma_intr: error=0x40 {
  UncorrectableError }, LBAsect=3994439, sector=63232
  Mar 24 17:14:51 orion kernel: end_request: I/O error, dev 03:01 (hda),
  sector 63232
 
  Does this mean my drive is dying? I've not had any problems with the
machine
  till now.

 not necessarily...

 === What did you CHANGE ??

I don't think I've changed anything :(

 -- if you changed the kernel ...  make sure yur kernel supports the
motherboard ( ide chipset )

Nope, been running with this kernel for months.

 -- did you add a new disk somewhere ??
 - use only one ide drive per cable if you're having these problems

Nope - and the drive I'm using is rather old :( It's a rather old machine.

 -- check your ide cables for kinks and corners and places where it touches
metal

I haven't opened the box recently, but I'll check anyway.

 -- is it ext2 or ext3 ??
 - does (e2fsck) ext2 come out clean

ext2 and the first time I ran e2fsck it didn't come out clean. The
subsequent runs did.

 -- make sure your IDE cables is secure and is 80 conductor ( not
fatter looking 40-conductor cables )

See above

 -- if the disks has been running for a while..
 - what changed

Not a lot. The machine doesn't get rebooted very often

 - when you make backups ... use a DIFFERENT media for backups
 when you already suspect a bad disk... as your good backup
 will be overwritten by bad data..

Because the machine isn't very busy it doesn't get backed up very often,
but I'll have a look at the the backups too.

 - when you restore the data, you might be restoring the
 original bad-disk-symptom vs the disk problem itself

Not much data to restore, only a small amount of data from a MySQL
database, which is emailed to me every day.

I was looking for a reason (other than the sheer hell of it) for rebuilding
this box - I guess I just found it!

Cheers everyone,


Andrew



-- 
To UNSUBSCRIBE, email to [EMAIL PROTECTED] 
with a subject of unsubscribe. Trouble? Contact [EMAIL PROTECTED]



Re: Is my hard drive dying?

2003-03-25 Thread Paul Johnson
-BEGIN PGP SIGNED MESSAGE-
Hash: SHA1

On Mon, Mar 24, 2003 at 10:16:28PM -, Andrew Pritchard wrote:
 Does this mean my drive is dying? I've not had any problems with the machine
 till now.

If you don't have hdparm spinning down your drives, odds are you are
looking at the early stages of a hrd drive failure.

- -- 
 .''`. Baloo Ursidae [EMAIL PROTECTED]
: :'  :proud Debian admin and user
`. `'`
  `-  Debian - when you have better things to do than fix a system
-BEGIN PGP SIGNATURE-
Version: GnuPG v1.2.1 (GNU/Linux)

iD8DBQE+gD49J5vLSqVpK2kRAohrAKCVFAYEZoyAg5L2xufMzeU3lWxytACfSfNB
Ny/Ee7fRWtJNfJ2ql0wC6kI=
=1VF0
-END PGP SIGNATURE-


-- 
To UNSUBSCRIBE, email to [EMAIL PROTECTED] 
with a subject of unsubscribe. Trouble? Contact [EMAIL PROTECTED]



Re: Is my hard drive dying?

2003-03-25 Thread Rogier Wolff
On Tue, Mar 25, 2003 at 03:32:13AM -0800, Paul Johnson wrote:
 -BEGIN PGP SIGNED MESSAGE-
 Hash: SHA1
 
 On Mon, Mar 24, 2003 at 10:16:28PM -, Andrew Pritchard wrote:
  Does this mean my drive is dying? I've not had any problems with the machine
  till now.
 
 If you don't have hdparm spinning down your drives, odds are you are
 looking at the early stages of a hrd drive failure.

Early? One of the blocks on the drive just went away. It's most
likely unrecoverable. Moreover, it's not just a block that wasn't 
used at all, because otherwise, the kernel wouldn't have requested
it from the drive. So, Andrew: Get your data off that drive ASAP, 
consider the drive a goner until proven otherwise. 

(Some people are giving lots of possibilities of the drive not 
giving the ghost. I find this dangerous: If I'd get lots of advice
it could be this or that, which all indicate that the data will
be safe on the drive, I'd consider checking that out while in fact
I should be backing up the data that can still be read from the 
drive. 

Andrew already got his important data off the drive. Good.  But
for the others: You might end up feeling very sorry if you push 
someone with a faulty drive into checking out a couple of other 
possiblities before they try to backup their data, and it's too 
late)

Roger. 

-- 
** [EMAIL PROTECTED] ** http://www.BitWizard.nl/ ** +31-15-2600998 **
*-- BitWizard writes Linux device drivers for any device you may have! --*
* The Worlds Ecosystem is a stable system. Stable systems may experience *
* excursions from the stable situation. We are currently in such an  * 
* excursion: The stable situation does not include humans. ***


-- 
To UNSUBSCRIBE, email to [EMAIL PROTECTED] 
with a subject of unsubscribe. Trouble? Contact [EMAIL PROTECTED]



Re: Is my hard drive dying?

2003-03-25 Thread Robert Storey
On Mon, 24 Mar 2003 22:16:28 -
Andrew Pritchard [EMAIL PROTECTED] wrote:

 I've been looking through the logcheck on one of my machines, and I've seen
 a lot of these types of messages:
 
 Mar 24 17:14:51 orion kernel: hda: dma_intr: status=0x51 { DriveReady
 SeekComplete Error }
 Mar 24 17:14:51 orion kernel: hda: dma_intr: error=0x40 {
 UncorrectableError }, LBAsect=3994439, sector=63232
 Mar 24 17:14:51 orion kernel: end_request: I/O error, dev 03:01 (hda),
 sector 63232
 
 Does this mean my drive is dying? I've not had any problems with the machine
 till now.

I had these exact same error messages, starting about two weeks ago. I assumed the 
hard drive was dying, but it turned out to be a bad motherboard. Actually, I wish it 
had been the drive - hard drives are cheaper (and easier) to replace than motherboards.

If possible, try to test the drive in another computer before tossing it out. But like 
others have said, back up your data IMMEDIATELY.



-- 
To UNSUBSCRIBE, email to [EMAIL PROTECTED] 
with a subject of unsubscribe. Trouble? Contact [EMAIL PROTECTED]



Re: Is my hard drive dying?

2003-03-25 Thread Daniel B.
Andrew Pritchard wrote:
 
 I've been looking through the logcheck on one of my machines, and I've seen
 a lot of these types of messages:
 
 Mar 24 17:14:51 orion kernel: hda: dma_intr: status=0x51 { DriveReady
 SeekComplete Error }
 Mar 24 17:14:51 orion kernel: hda: dma_intr: error=0x40 {
 UncorrectableError }, LBAsect=3994439, sector=63232
 Mar 24 17:14:51 orion kernel: end_request: I/O error, dev 03:01 (hda),
 sector 63232
 
 Does this mean my drive is dying? I've not had any problems with the machine
 till now.

You should probably disable IDE DMA (see hdparm) immediately.  If kernel
IDE problems are your problem, that should prevent additional corruption.

Daniel
-- 
Daniel Barclay
[EMAIL PROTECTED]


-- 
To UNSUBSCRIBE, email to [EMAIL PROTECTED] 
with a subject of unsubscribe. Trouble? Contact [EMAIL PROTECTED]



Re: Is my hard drive dying?

2003-03-25 Thread Sharninder

 You should probably disable IDE DMA (see hdparm) immediately.  If
 kernel IDE problems are your problem, that should prevent
 additional corruption.


also start using smartsuite utilities. they are a part of woody
atleast.
Sharninder Singh
National Institute Of Management, Calcutta

--
'M.C.S.E - Minesweeper Consultant  Solitaire Expert'



-- 
To UNSUBSCRIBE, email to [EMAIL PROTECTED] 
with a subject of unsubscribe. Trouble? Contact [EMAIL PROTECTED]



Is my hard drive dying?

2003-03-24 Thread Andrew Pritchard
I've been looking through the logcheck on one of my machines, and I've seen
a lot of these types of messages:

Mar 24 17:14:51 orion kernel: hda: dma_intr: status=0x51 { DriveReady
SeekComplete Error }
Mar 24 17:14:51 orion kernel: hda: dma_intr: error=0x40 {
UncorrectableError }, LBAsect=3994439, sector=63232
Mar 24 17:14:51 orion kernel: end_request: I/O error, dev 03:01 (hda),
sector 63232

Does this mean my drive is dying? I've not had any problems with the machine
till now.

A



-- 
To UNSUBSCRIBE, email to [EMAIL PROTECTED] 
with a subject of unsubscribe. Trouble? Contact [EMAIL PROTECTED]



Re: Is my hard drive dying?

2003-03-24 Thread Jonathan Matthews
On Mon, Mar 24, 2003 at 10:16:28PM -, Andrew Pritchard wrote:
 I've been looking through the logcheck on one of my machines, and I've seen
 a lot of these types of messages:
 
 Mar 24 17:14:51 orion kernel: hda: dma_intr: status=0x51 { DriveReady
 SeekComplete Error }
 Mar 24 17:14:51 orion kernel: hda: dma_intr: error=0x40 {
 UncorrectableError }, LBAsect=3994439, sector=63232
 Mar 24 17:14:51 orion kernel: end_request: I/O error, dev 03:01 (hda),
 sector 63232
 
 Does this mean my drive is dying? I've not had any problems with the machine
 till now.

I'd tend to say yes - get your backups done now!

Can you tell from the logs if there was a hard start date, or did 
the errors just start to trickle in?  How often are they occuring?

  jc


-- 
To UNSUBSCRIBE, email to [EMAIL PROTECTED] 
with a subject of unsubscribe. Trouble? Contact [EMAIL PROTECTED]



Re: Is my hard drive dying?

2003-03-24 Thread Matthias Hentges
Am Mon, 2003-03-24 um 23.16 schrieb Andrew Pritchard:

[...]

 Does this mean my drive is dying? I've not had any problems with the machine
 till now.

Hate to say it, but yes it looks like you'd better get a backup done.
Or try to mkfs the entire drive (delete and recreate all partitions).
That *may* help if your partition table is faulty.

HTH

-- 

Matthias Hentges 
Cologne / Germany

[www.hentges.net] - PGP welcome, HTML tolerated
ICQ: 97 26 97 4   - No files, no URL's

My OS: Debian Woody: Geek by Nature, Linux by Choice



signature.asc
Description: Dies ist ein digital signierter Nachrichtenteil


Re: Is my hard drive dying?

2003-03-24 Thread nate
Andrew Pritchard said:
 I've been looking through the logcheck on one of my machines, and I've
 seen a lot of these types of messages:

 Mar 24 17:14:51 orion kernel: hda: dma_intr: status=0x51 { DriveReady
 SeekComplete Error }
 Mar 24 17:14:51 orion kernel: hda: dma_intr: error=0x40 {
 UncorrectableError }, LBAsect=3994439, sector=63232
 Mar 24 17:14:51 orion kernel: end_request: I/O error, dev 03:01 (hda),
 sector 63232

 Does this mean my drive is dying? I've not had any problems with the
 machine till now.

that is a common symtom of a drive that is beginning to fail. I would
reccomend running the vendor's diagnostic tools on the disk asap, and
backup your important data just incase.

depending on the drive, you may be able to low level format it using
the vendor's tools(this will erase all data) and get more life out of
it, one of my maxtor drives got an extra 6 months of life by low level
formatting it.

nate




-- 
To UNSUBSCRIBE, email to [EMAIL PROTECTED] 
with a subject of unsubscribe. Trouble? Contact [EMAIL PROTECTED]



Re: Is my hard drive dying?

2003-03-24 Thread Dave Sherohman
On Mon, Mar 24, 2003 at 10:16:28PM -, Andrew Pritchard wrote:
 Mar 24 17:14:51 orion kernel: hda: dma_intr: status=0x51 { DriveReady
 SeekComplete Error }
 Mar 24 17:14:51 orion kernel: hda: dma_intr: error=0x40 {
 UncorrectableError }, LBAsect=3994439, sector=63232
 Mar 24 17:14:51 orion kernel: end_request: I/O error, dev 03:01 (hda),
 sector 63232
 
 Does this mean my drive is dying? I've not had any problems with the machine
 till now.

$ fortune -m LikeThis 
(knghtbrd)
%
Overfiend Thunder-: when you get { MessagesLikeThisFromYourHardDrive }
Overfiend Thunder-: it either means { TheDriverIsScrewy }
Overfiend or
Overfiend { YourDriveIsFlakingOut BackUpYourDataBeforeIt'sTooLate
PrayToGod }
%

-- 
The freedoms that we enjoy presently are the most important victories of the
White Hats over the past several millennia, and it is vitally important that
we don't give them up now, only because we are frightened.
  - Eolake Stobblehouse (http://stobblehouse.com/text/battle.html)


-- 
To UNSUBSCRIBE, email to [EMAIL PROTECTED] 
with a subject of unsubscribe. Trouble? Contact [EMAIL PROTECTED]



Re: Is my hard drive dying?

2003-03-24 Thread Matthew Daubenspeck
On Mon, Mar 24, 2003 at 03:06:43PM -0800, nate wrote:
 Andrew Pritchard said:
  I've been looking through the logcheck on one of my machines, and I've
  seen a lot of these types of messages:
 
  Mar 24 17:14:51 orion kernel: hda: dma_intr: status=0x51 { DriveReady
  SeekComplete Error }
  Mar 24 17:14:51 orion kernel: hda: dma_intr: error=0x40 {
  UncorrectableError }, LBAsect=3994439, sector=63232
  Mar 24 17:14:51 orion kernel: end_request: I/O error, dev 03:01 (hda),
  sector 63232
 
  Does this mean my drive is dying? I've not had any problems with the
  machine till now.

I have also found that this happens when you are not using the proper
IDE kernel driver...


-- 
To UNSUBSCRIBE, email to [EMAIL PROTECTED] 
with a subject of unsubscribe. Trouble? Contact [EMAIL PROTECTED]



Re: Is my hard drive dying?

2003-03-24 Thread Alvin Oga


On Mon, 24 Mar 2003, Andrew Pritchard wrote:

 I've been looking through the logcheck on one of my machines, and I've seen
 a lot of these types of messages:
 
 Mar 24 17:14:51 orion kernel: hda: dma_intr: status=0x51 { DriveReady
 SeekComplete Error }
 Mar 24 17:14:51 orion kernel: hda: dma_intr: error=0x40 {
 UncorrectableError }, LBAsect=3994439, sector=63232
 Mar 24 17:14:51 orion kernel: end_request: I/O error, dev 03:01 (hda),
 sector 63232
 
 Does this mean my drive is dying? I've not had any problems with the machine
 till now.

not necessarily...

=== What did you CHANGE ??

-- if you changed the kernel ...  make sure yur kernel supports the
   motherboard ( ide chipset )

-- did you add a new disk somewhere ??
- use only one ide drive per cable if you're having these problems

-- check your ide cables for kinks and corners and places where it touches
   metal
 
-- is it ext2 or ext3 ??
- does (e2fsck) ext2 come out clean

-- make sure your IDE cables is secure and is 80 conductor ( not
   fatter looking 40-conductor cables )

-- if the disks has been running for a while..
- what changed

- when you make backups ... use a DIFFERENT media for backups
when you already suspect a bad disk... as your good backup
will be overwritten by bad data..

- when you restore the data, you might be restoring the
original bad-disk-symptom vs the disk problem itself

-- if its a brand new disks..
- you could have a bad disk
- or lots of possible reasons

-- few other stuff to do before tossing the disks..

-- move the disk to a different box... does it show the same symptom
   ( and if it did... the disk might be bad ...

c ya
alvin


-- 
To UNSUBSCRIBE, email to [EMAIL PROTECTED] 
with a subject of unsubscribe. Trouble? Contact [EMAIL PROTECTED]