How To Monitor Disk Errors?

2005-11-22 Thread Drew Tomlinson
I have an old machine running FreeBSD 4.11.  It locked up sometime last
night from what I think was a disk problem.  The machine was still
running and still passing packets (it is my firewall), but I could not
log in via the console, ssh, or telnet.  I powered the machine off and
on and heard the click of death coming from one of the internal IDE
drives.  By some miracle, the machine did finally boot and is running again.


I'm sure I'm on borrowed time here.  However, I would like to find some
way to monitor drive errors so I know which drive is failing and can
replace the correct one.  I have two in the machine.  I've checked
/var/log/messages but see no entries there regarding the drives.  Is
there some utility that will let me see the number of errors each drive
has logged since boot?


Thanks,

Drew

--
Visit The Alchemist's Warehouse
Magic Tricks, DVDs, Videos, Books, & More!

http://www.alchemistswarehouse.com

___
freebsd-questions@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-questions
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: How To Monitor Disk Errors?

2005-11-22 Thread Nicolas Blais
On November 22, 2005 09:05 pm, Drew Tomlinson wrote:
> I have an old machine running 4.11.  It died sometime last night from
> what I think was a disk problem.  The machine was still running and
> still passing packets (it is my firewall) but I could not log in via the
> console, ssh, or telnet.  I powered the machine off/on and heard the
> click of death coming from one of the internal IDE drives.  By some
> miracle, the machine did finally boot and is running again.
>
> I'm sure I'm on borrowed time here.  However I would like to find some
> way to monitor drive errors so I know which drive is failing so replace
> the correct drive.  I have two in the machine.  I've checked
> /var/log/messages but see no entries there regarding the drive.  Is
> there some utility that will let me see the current number of errors
> since boot?
>
> Thanks,
>
> Drew

Check out sysutils/smartmontools in the ports collection.  It could be
what you need.  Once it is installed, a simple smartctl -H /dev/<disk>,
for example:

    smartctl -H /dev/ad0

will tell you whether your drive is healthy.

You can also set up smartd, which checks your drives at regular
intervals.
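
As a rough sketch of the two suggestions above (not taken from the smartmontools documentation), the verdict line that smartctl -H prints for ATA drives can be reduced to a single word per drive. The function names health_verdict and check_drives are made up for illustration, and /dev/ad1 in the usage line below is only a guess at the second drive's name.

```sh
#!/bin/sh
# Sketch: reduce `smartctl -H` output to a one-word verdict per drive.

# Reads smartctl output on stdin and prints PASSED, FAILED, or UNKNOWN.
# Assumes the ATA-style "overall-health ... test result:" line; anything
# else (errors, unsupported kernels) is reported as UNKNOWN.
health_verdict() {
    result=$(grep 'overall-health self-assessment test result:' | head -n 1)
    case "$result" in
        *PASSED*) echo PASSED ;;
        *FAILED*) echo FAILED ;;
        *)        echo UNKNOWN ;;
    esac
}

# Runs smartctl against each device named on the command line.
# Defined but not called here, so the file can be sourced on a machine
# that does not have smartctl installed.
check_drives() {
    for dev in "$@"; do
        verdict=$(smartctl -H "$dev" 2>&1 | health_verdict)
        echo "$dev: $verdict"
    done
}
```

Running check_drives /dev/ad0 /dev/ad1 from cron would give a crude periodic check. smartd does the same job properly once its configuration file has a line for each drive, for example /dev/ad0 -a -m root to monitor the drive and mail root about problems.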

Nicolas
-- 
FreeBSD 7.0-CURRENT #1: Sat Nov 19 12:36:29 EST 2005 
[EMAIL PROTECTED]:/usr/obj/usr/src/sys/CLK01A 
PGP? (updated 16 Nov 05) : http://www.clkroot.net/security/nb_root.asc




Re: How To Monitor Disk Errors?

2005-11-22 Thread Joao Barros
Try smartmontools: http://www.freshports.org/sysutils/smartmontools/

If you have IBM Deskstars, check whether newer firmware is available for them.

--
Joao Barros


Re: How To Monitor Disk Errors?

2005-11-22 Thread Drew Tomlinson

On 11/22/2005 6:18 PM Nicolas Blais wrote:

> Check out /sysutils/smartmontools in the ports.  It could be what you need.
> When installed, a simple smartctl -H /dev/*disk* like :
> smartctl -H /dev/ad0 will tell you if your drive is healthy or not.
>
> You can also set up the smartd which will check your drives at certain
> intervals.
>
> Nicolas

Thanks for your reply.  However, it appears I need to be running FreeBSD
5.x or later to use this tool.  Here's the output:


blacksheep# smartctl -H /dev/ad0
smartctl version 5.33 [i386-portbld-freebsd4.11] Copyright (C) 2002-4 Bruce Allen
Home page is http://smartmontools.sourceforge.net/

ATA support is not provided for this kernel version. Please ugrade to a recent 5-CURRENT kernel (post 09/01/2003 or so)
Smartctl: Device Read Identity Failed (not an ATA/ATAPI device)

A mandatory SMART command failed: exiting. To continue, add one or more '-T permissive' options.
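
The refusal above comes from the kernel rather than from the drive, so a script could check the release first and skip smartctl on old systems. This is a minimal sketch, assuming that comparing the major number from uname -r is good enough; freebsd_major and ata_smart_supported are hypothetical helper names, not part of smartmontools.

```sh
#!/bin/sh
# Sketch: decide from a `uname -r` style string whether smartctl's ATA
# support is likely to work on this kernel.

# Prints the major version from a release string such as "4.11-RELEASE".
freebsd_major() {
    echo "${1%%.*}"
}

# Succeeds (exit status 0) when the major version is 5 or higher.
# This is a coarse stand-in for the "post 09/01/2003 5-CURRENT" cutoff
# in the smartctl message; early 5.x kernels may still lack support.
ata_smart_supported() {
    major=$(freebsd_major "$1")
    case "$major" in
        ''|*[!0-9]*) return 1 ;;   # not a plain number: treat as unsupported
    esac
    [ "$major" -ge 5 ]
}
```

A caller might use it as: if ata_smart_supported "$(uname -r)"; then smartctl -H /dev/ad0; else echo "kernel too old for ATA SMART"; fi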


Cheers,

Drew
