Re: Care and feeding of RAID?

2006-09-10 Thread Benjamin Schieder
On 06.09.2006 20:49:45, Luca Berra wrote:
> On Wed, Sep 06, 2006 at 09:12:24AM +0200, Benjamin Schieder wrote:
> Personally, I use this script from cron:
> http://shellscripts.org/project/hdtest
> >>
> >>nice race :)
> >
> >As in race condition? Where?
> 
> mktemp
> rm
> touch
> why do you do that?

Probably because of a wrong understanding of mktemp. I fixed this now.

> i retested now with 5.36 and it seems the output did _not_ change, i
> don't know what i saw this morning.
> 
> but then it errors on the line
> IFS="   " read type status online < <( smartctl -d ata -a ${disk} | grep
> \#\ 1 | sed 's,  \+,   ,g' | cut -f 2,3,5 )

I think I know what you mean. I've seen these errors sometimes, too. But I
was too lazy to investigate yet since they only pop up once a month or so.


Greetings,
Benjamin

-- 
 _  _ _   __   
| \| |___| |_| |_  __ _ __| |__
| .` / -_)  _| ' \/ _` / _| / /
|_|\_\___|\__|_||_\__,_\__|_\_\
| |  (_)_ _ _  ___ __
| |__| | ' \ || \ \ /
||_|_||_\_,_/_\_\
Play Nethack anywhere with an x86 computer:
http://www.crash-override.net/nethacklinux.html


pgpZUXJSsS1ly.pgp
Description: PGP signature


Re: Care and feeding of RAID?

2006-09-09 Thread dean gaudet
On Tue, 5 Sep 2006, Paul Waldo wrote:

> What about bitmaps?  Nobody has mentioned them.  It is my understanding that
> you just turn them on with "mdadm /dev/mdX -b internal".  Any caveats for
> this?

bitmaps have been working great for me on a raid5 and raid1.  it makes it 
that much more tolerable when i accidentally crash the box and don't have 
to wait forever for a resync.

i don't notice the extra write traffic all that much... under heavy 
traffic i see about 3 writes/s to the spare disk in the raid5 -- i assume 
those are all due to the bitmap in the superblock on the spare.

i've considered using an external bitmap, i forget why i didn't do that 
initially.  the filesystem on the raid5 already has an external journal on 
raid1.

-dean
-
To unsubscribe from this list: send the line "unsubscribe linux-raid" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html


Re: Care and feeding of RAID?

2006-09-09 Thread Nix
On 5 Sep 2006, Paul Waldo uttered the following:
> What about bitmaps?  Nobody has mentioned them.  It is my
> understanding that you just turn them on with "mdadm /dev/mdX -b
> internal".  Any caveats for this?

Notably, how many additional writes does it incur? I have some RAID
arrays using drives which are quiet *until* you access them, and which
then make a bloody racket. The superblock updates are bad enough, but
bitmap updates, well, I don't really like seeing one write turned into
twelve-odd disk hits that much (just a back-of-the-envelope guess for a
three-disk RAID-5 array).

-- 
`In typical emacs fashion, it is both absurdly ornate and
 still not really what one wanted.' --- jdev
-
To unsubscribe from this list: send the line "unsubscribe linux-raid" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html


Re: Care and feeding of RAID?

2006-09-09 Thread Nix
On 6 Sep 2006, Mario Holbe spake:
> You don't necessarily need one. However, since Neil considers in-kernel
> RAID-autodetection a bad thing and since mdadm typically relies on
> mdadm.conf for RAID-assembly

You can specify the UUID on the command-line too (although I don't).

The advantage of the config file from my POV is that it lets me activate
*all* my RAID arrays with one command, and the command doesn't change, no
matter how complex the array configuration. (I'll admit that the sheer
number of options to mdadm has always overwhelmed me to some degree,
despite the excellent documentation, so I prefer approaches that keep
a working command-line unchanged, especially for something as critical
as boot-time assembly.)

-- 
`In typical emacs fashion, it is both absurdly ornate and
 still not really what one wanted.' --- jdev
-
To unsubscribe from this list: send the line "unsubscribe linux-raid" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html


Re: Care and feeding of RAID?

2006-09-06 Thread Luca Berra

On Wed, Sep 06, 2006 at 09:12:24AM +0200, Benjamin Schieder wrote:

>>Personally, I use this script from cron:
>>http://shellscripts.org/project/hdtest

nice race :)


As in race condition? Where?


mktemp
rm
touch
why do you do that?


I'm running smartmontools 5.33 here. When did the output change? It still
works fine here.


i retested now with 5.36 and it seems the output did _not_ change, i
don't know what i saw this morning.

but then it errors on the line
IFS="   " read type status online < <( smartctl -d ata -a ${disk} | grep
\#\ 1 | sed 's,  \+,   ,g' | cut -f 2,3,5 )

L.

--
Luca Berra -- [EMAIL PROTECTED]
   Communication Media & Services S.r.l.
/"\
\ / ASCII RIBBON CAMPAIGN
 XAGAINST HTML MAIL
/ \
-
To unsubscribe from this list: send the line "unsubscribe linux-raid" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html


Re: Care and feeding of RAID?

2006-09-06 Thread Mario 'BitKoenig' Holbe
Gordon Henderson <[EMAIL PROTECTED]> wrote:
> I don't have any mdadm.conf files ... What am I missing? (I've always been
> under the impression that after needing the /etc/raidtab file with the old
> raidtools, you didn't need a config file as such under mdadm...  However,

You don't necessarily need one. However, since Neil considers in-kernel
RAID-autodetection a bad thing and since mdadm typically relies on
mdadm.conf for RAID-assembly and since especially with newer kernels you
probably need to auto-create device nodes (2.6 and udev), it's more
convenient to have one. Though you could live without one even then.

I did also run without one over a long time and I also don't like it to
have one, however, sometimes in the past convenience won :)


regards
   Mario
-- 
There is nothing more deceptive than an obvious fact.
 -- Sherlock Holmes by Arthur Conan Doyle

-
To unsubscribe from this list: send the line "unsubscribe linux-raid" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html


Re: Care and feeding of RAID?

2006-09-06 Thread Mario 'BitKoenig' Holbe
Benjamin Schieder <[EMAIL PROTECTED]> wrote:
> I have made the experience that long tests slow down the raid to a point
> where the system becomes unusable.

Even though we're quite off-topic here with that since it's more
SMART-related... this is at least unusual.
I'm also running regular SMART selftests (short daily, long weekly) and
usually they don't affect drive's performance very much. However,
whenever I experienced massive slow-downs while selftests are running,
this did always point to disk problems... too much (and strategic
disadvantageous) reallocated sectors (keep an ear on the disks, hear
them seek()ing :)), non-reallocatable sectors or just temperature (for
example, WD drives tend to show a bad performance when getting too hot).


regards
   Mario
-- 
But after a while I learned the trick of speaking fast. You don't have
to think any faster; just use twice as many words to say everything.
-- Paul Graham

-
To unsubscribe from this list: send the line "unsubscribe linux-raid" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html


Re: Care and feeding of RAID?

2006-09-06 Thread Luca Berra

On Tue, Sep 05, 2006 at 05:47:57PM -0400, Steve Cousins wrote:



Luca Berra wrote:


On Tue, Sep 05, 2006 at 02:29:48PM -0400, Steve Cousins wrote:




Benjamin Schieder wrote:


On 05.09.2006 11:03:45, Steve Cousins wrote:

Would people be willing to list their setup? Including such things 
as mdadm.conf file, crontab -l, plus scripts that they use to check 
the smart data and the array, mdadm daemon parameters and anything 
else that is relevant to checking and maintaining an array? 




Personally, I use this script from cron:
http://shellscripts.org/project/hdtest



nice race :)


I'm not sure what you mean?


tmp="`mktemp`"
rm -f ${tmp}
touch ${tmp}

the last two lines are unneeded and can be tricked to overwrite
arbitrary filenames


I tried smartctl -t short -d scsi /dev/sdb where /dev/sdb is a 250GB 
SATA drive.


it is '-d ata'

What command do you use for SATA drives?  The sourceforge page implies 
that -d sata doesn't exist yet.  I'm using FC 5 with 2.6.17 kernel and 
smartmontools version 5.33.  Do you have a sample configuration script 
that you could show me?


# monitor two sata disks, show temperature in degrees,
# do a long test every sunday and a short every other day
# at 1am on sda and at 2am on sdb, YMMV
/dev/sda -d ata -a -R 194 -s (L/../../7|S/../../[123456])/01
/dev/sdb -d ata -a -R 194 -s (L/../../7|S/../../[123456])/02

--
Luca Berra -- [EMAIL PROTECTED]
   Communication Media & Services S.r.l.
/"\
\ / ASCII RIBBON CAMPAIGN
 XAGAINST HTML MAIL
/ \
-
To unsubscribe from this list: send the line "unsubscribe linux-raid" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html


Re: Care and feeding of RAID?

2006-09-06 Thread Benjamin Schieder
On 05.09.2006 22:57:27, Luca Berra wrote:
> On Tue, Sep 05, 2006 at 02:29:48PM -0400, Steve Cousins wrote:
> >
> >
> >Benjamin Schieder wrote:
> >>On 05.09.2006 11:03:45, Steve Cousins wrote:
> >>
> >>>Would people be willing to list their setup? Including such things as 
> >>>mdadm.conf file, crontab -l, plus scripts that they use to check the 
> >>>smart data and the array, mdadm daemon parameters and anything else that 
> >>>is relevant to checking and maintaining an array? 
> >>
> >>
> >>Personally, I use this script from cron:
> >>http://shellscripts.org/project/hdtest
> 
> nice race :)

As in race condition? Where?

> >I am checking this out and I see that you are the writer of this script.
> >I'm getting errors when it comes to lines 76 and 86-90 about the 
> >arithmetic symbols. This is on a Fedora Core 5 system with bash version 
> that is because smartctl output has changed and the grep above returns
> no number.

I'm running smartmontools 5.33 here. When did the output change? It still
works fine here.

> >What do you (or others) recommend for SATA drives?
> 
> smartmontools and a recent kernel just work.
> also you can schedule smart tests with smartmontools. so you don't need
> to cron scripts.

Interesting. I'll look into that.


Greetings,
Benjamin

-- 
The Nethack IdleRPG! Idle to your favorite Nethack messages!
http://pallas.crash-override.net/nethackidle/


pgpjqRitTFwXU.pgp
Description: PGP signature


Re: Care and feeding of RAID?

2006-09-05 Thread Gordon Henderson
On Tue, 5 Sep 2006, Paul Waldo wrote:

> Gordon Henderson wrote:
> > On Tue, 5 Sep 2006, Steve Cousins wrote:
> [snip]
> > and my weekly badblocks script looks like:
> >
> > #!/bin/csh
> >
> > echo "`uname -n`: Badblocks test starting at [`date`]"
> >
> > foreach disk ( a c )
> >   foreach partition ( 1 2 3 5 6 )
> > echo -n "hd$disk${partition}: "
> > badblocks -c 128 /dev/hd$disk$partition
> >   end
> >   echo ""
> > end
> >
> > echo "`uname -n`: Badblocks test   ending at [`date`]"
> [snip]
>
> Maybe I'm missing something, but are these partitions mounted?  Here's what I
> get when I do this on a mounted partition:
>
> [EMAIL PROTECTED] ~]# badblocks -nsv /dev/md0
> /dev/md0 is mounted; it's not safe to run badblocks!

Do not use the -n option... (and -s won't be much use in a cron job, nor
-v, probably!) -n will write to the device which might well have issues
with the filesystem cache...

By reading the underlying drives you won't trigger a raid array failure
should you do see a bad sector, which might give you time to go something
about it. There was some emails on this list some time back (year or 2,3?)
about badblocking the md? device - I imagine it might not read every block
of every device unless it was a raid-0 array...

Gordon
-
To unsubscribe from this list: send the line "unsubscribe linux-raid" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html


Re: Care and feeding of RAID?

2006-09-05 Thread Luca Berra

On Tue, Sep 05, 2006 at 02:29:48PM -0400, Steve Cousins wrote:



Benjamin Schieder wrote:

On 05.09.2006 11:03:45, Steve Cousins wrote:

Would people be willing to list their setup? Including such things as 
mdadm.conf file, crontab -l, plus scripts that they use to check the 
smart data and the array, mdadm daemon parameters and anything else that 
is relevant to checking and maintaining an array? 



Personally, I use this script from cron:
http://shellscripts.org/project/hdtest


nice race :)


I am checking this out and I see that you are the writer of this script.
I'm getting errors when it comes to lines 76 and 86-90 about the 
arithmetic symbols. This is on a Fedora Core 5 system with bash version 

that is because smartctl output has changed and the grep above returns
no number.

3.1.7(1).   I weeded out the smartctl command and tried it manually with 
no luck on my SATA /dev/sd? drives.


which command?


What do you (or others) recommend for SATA drives?


smartmontools and a recent kernel just work.
also you can schedule smart tests with smartmontools. so you don't need
to cron scripts.

L.

--
Luca Berra -- [EMAIL PROTECTED]
   Communication Media & Services S.r.l.
/"\
\ / ASCII RIBBON CAMPAIGN
 XAGAINST HTML MAIL
/ \
-
To unsubscribe from this list: send the line "unsubscribe linux-raid" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html


Re: Care and feeding of RAID?

2006-09-05 Thread Richard Scobie
This is a timely thread for me, as I am about to setup a software RAID 
10 (a striped pair of mirrors), on 4 x 500GB SATA.


Anything to watch for by not partitioning the drives at all? Or is it 
safer to make one partition, slightly smaller (suggestions of how much 
welcome), than the full drive, to allow for possible size discrepencies 
with replacemnets.


Also I am wondering as this is RAID0 on top of RAID1, if there are any 
special steps that need to be taken when maintaining the array (adding, 
removing, rebuilding etc), compared with a "single layer" RAID?


Regards,

Richard
-
To unsubscribe from this list: send the line "unsubscribe linux-raid" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html


Re: Care and feeding of RAID?

2006-09-05 Thread Steve Cousins



Rev. Jeffrey Paul wrote:

On Tue, Sep 05, 2006 at 11:03:45AM -0400, Steve Cousins wrote:

These are SATA drives and except for the one machine that has a 3Ware 
8506 card in it I haven't been able to get SMART programs to do anything 
with these drives.  How do others deal with this? 




I use the tw_cli program to check up on my 3ware stuff.


Hi Jeffrey,

Thanks.  I use tw_cli too and I have scripted a check to see if it 
degrades but this doesn't help with checking for disk problems before 
they happen which SMART should help with.  As it happens, smartctl works 
with 3Ware SATA drives.  It is my other SATA drives that I'm unable to 
monitor.


Steve




It took me quite a bit of time to figure that one out.  I don't
have any automated monitoring set up, but it'd be simple enough to
script.  I check on the array every so often and run a verify every few
months to see if it kicks a disk out (it hasn't yet).

0 [EMAIL PROTECTED]:~# tw_cli 
//datavibe> info


Ctl   ModelPorts   Drives   Units   NotOpt   RRate   VRate   BBU

c08006-2LP 2   21   02   -   -


//datavibe> info c0

Unit  UnitType  Status %Cmpl  Stripe  Size(GB)  Cache  AVerify  IgnECC
--
u0RAID-1OK -  -   232.885   ON -- 


Port   Status   Unit   SizeBlocksSerial
---
p0 OK   u0 232.88 GB   488397168 WD-WMAL718611 
p1 OK   u0 232.88 GB   488397168 WD-WMAL718619

//datavibe>

-j



--
__
 Steve Cousins, Ocean Modeling GroupEmail: [EMAIL PROTECTED]
 Marine Sciences, 452 Aubert Hall   http://rocky.umeoce.maine.edu
 Univ. of Maine, Orono, ME 04469Phone: (207) 581-4302


-
To unsubscribe from this list: send the line "unsubscribe linux-raid" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html


Re: Care and feeding of RAID?

2006-09-05 Thread Steve Cousins



Benjamin Schieder wrote:

On 05.09.2006 11:03:45, Steve Cousins wrote:

Would people be willing to list their setup? Including such things as 
mdadm.conf file, crontab -l, plus scripts that they use to check the 
smart data and the array, mdadm daemon parameters and anything else that 
is relevant to checking and maintaining an array? 



Personally, I use this script from cron:
http://shellscripts.org/project/hdtest


Hi Benjamin,

I am checking this out and I see that you are the writer of this script.
I'm getting errors when it comes to lines 76 and 86-90 about the 
arithmetic symbols. This is on a Fedora Core 5 system with bash version 
3.1.7(1).   I weeded out the smartctl command and tried it manually with 
no luck on my SATA /dev/sd? drives.


What do you (or others) recommend for SATA drives?

Thanks,

Steve


-
To unsubscribe from this list: send the line "unsubscribe linux-raid" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html


Re: Care and feeding of RAID?

2006-09-05 Thread Rev. Jeffrey Paul
On Tue, Sep 05, 2006 at 11:03:45AM -0400, Steve Cousins wrote:
> 
> These are SATA drives and except for the one machine that has a 3Ware 
> 8506 card in it I haven't been able to get SMART programs to do anything 
> with these drives.  How do others deal with this? 
> 

I use the tw_cli program to check up on my 3ware stuff.

It took me quite a bit of time to figure that one out.  I don't
have any automated monitoring set up, but it'd be simple enough to
script.  I check on the array every so often and run a verify every few
months to see if it kicks a disk out (it hasn't yet).

0 [EMAIL PROTECTED]:~# tw_cli 
//datavibe> info

Ctl   ModelPorts   Drives   Units   NotOpt   RRate   VRate   BBU

c08006-2LP 2   21   02   -   -

//datavibe> info c0

Unit  UnitType  Status %Cmpl  Stripe  Size(GB)  Cache  AVerify  IgnECC
--
u0RAID-1OK -  -   232.885   ON -- 

Port   Status   Unit   SizeBlocksSerial
---
p0 OK   u0 232.88 GB   488397168 WD-WMAL718611 
p1 OK   u0 232.88 GB   488397168 WD-WMAL718619
//datavibe>

-j

-- 

 Rev. Jeffrey Paul-datavibe- [EMAIL PROTECTED]
  aim:x736e65616b   pgp:0xD9B3C17D   phone:877-748-3467
   9440 0C7F C598 01CA 2F17  D098 0A3A 4B8F D9B3 C17D

-
To unsubscribe from this list: send the line "unsubscribe linux-raid" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html


Re: Care and feeding of RAID?

2006-09-05 Thread Paul Waldo

Gordon Henderson wrote:

On Tue, 5 Sep 2006, Steve Cousins wrote:

[snip]

and my weekly badblocks script looks like:

#!/bin/csh

echo "`uname -n`: Badblocks test starting at [`date`]"

foreach disk ( a c )
  foreach partition ( 1 2 3 5 6 )
echo -n "hd$disk${partition}: "
badblocks -c 128 /dev/hd$disk$partition
  end
  echo ""
end

echo "`uname -n`: Badblocks test   ending at [`date`]"

[snip]

Maybe I'm missing something, but are these partitions mounted?  Here's what I 
get when I do this on a mounted partition:


[EMAIL PROTECTED] ~]# badblocks -nsv /dev/md0
/dev/md0 is mounted; it's not safe to run badblocks!

If you are running RAID, is it safe to run badblocks on the underlying 
partition?
-
To unsubscribe from this list: send the line "unsubscribe linux-raid" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html


Re: Care and feeding of RAID?

2006-09-05 Thread Paul Waldo

Paul Waldo wrote:

Hi all,

I have a RAID6 array and I wondering about care and feeding instructions 
:-)


Here is what I currently do:
   - daily incremental and weekly full backups to a separate machine
   - run smartd tests (short once a day, long once a week)
   - check the raid for bad blocks every week

What else can I do make sure the array keeps humming?  Thanks in advance!
-
To unsubscribe from this list: send the line "unsubscribe linux-raid" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html



What about bitmaps?  Nobody has mentioned them.  It is my understanding that 
you just turn them on with "mdadm /dev/mdX -b internal".  Any caveats for this?


Paul
-
To unsubscribe from this list: send the line "unsubscribe linux-raid" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html


Re: Care and feeding of RAID?

2006-09-05 Thread Paul Waldo

Steve Cousins wrote:

Gordon Henderson wrote:


On Tue, 5 Sep 2006, Paul Waldo wrote:

 


Hi all,

I have a RAID6 array and I wondering about care and feeding 
instructions :-)


Here is what I currently do:
   - daily incremental and weekly full backups to a separate machine
   - run smartd tests (short once a day, long once a week)
   - check the raid for bad blocks every week

What else can I do make sure the array keeps humming?  Thanks in 
advance!
  


Stop fiddling with it :)

I run similar stuff, but don't forget running mdadm in daemon mode to 
send

you an email should a drive fail. I also check each device individually,
rather than the array although I don't know the value of doing this over
the SMART tests on modern drives though...
 



Would people be willing to list their setup? Including such things as 
mdadm.conf file, crontab -l, plus scripts that they use to check the 
smart data and the array, mdadm daemon parameters and anything else that 
is relevant to checking and maintaining an array?
I'm running the mdmonitor script at startup and a sample mdadm.conf  
(one of 3 machines) looks like:


MAILADDR [EMAIL PROTECTED]
ARRAY /dev/md0 level=raid5 num-devices=3 
UUID=39d07542:f3c97e69:fbb63d9d:64a052d3 
devices=/dev/sdb1,/dev/sdc1,/dev/sdd1


These are SATA drives and except for the one machine that has a 3Ware 
8506 card in it I haven't been able to get SMART programs to do anything 
with these drives.  How do others deal with this?

Thanks,

Steve



Excellent idea, Steve.

In my crontab, I have this:
# Check RAID arrays for bad blocks once a week
30 2 * * Tue echo check >> /sys/block/md0/md/sync_action ; echo "Checking md0 bad 
blocks"
30 2 * * Wed echo check >> /sys/block/md1/md/sync_action ; echo "Checking md1 bad 
blocks"

I have this in my smartd.conf:
/dev/hda -H -m root -S on -o on -I 194 -s (S/../.././02|L/../../6/03)
/dev/hdc -H -m root -S on -o on -I 194 -s (S/../.././02|L/../../6/03)
/dev/hde -H -m root -S on -o on -I 194 -s (S/../.././02|L/../../6/03)
/dev/hdg -H -m root -S on -o on -I 194 -s (S/../.././02|L/../../6/03)
/dev/sda -d ata -H -m root -S on -o on -I 194 -s (S/../.././02|L/../../6/03)
/dev/sdb -d ata -H -m root -S on -o on -I 194 -s (S/../.././02|L/../../6/03)
/dev/sdc -d ata -H -m root -S on -o on -I 194 -s (S/../.././02|L/../../6/03)

My Fedora Core box has this in /etc/init.d/mdmonitor:
daemon --check --user=root mdadm ${OPTIONS}
where OPTIONS="--monitor --scan -f --pid-file=/var/run/mdadm/mdadm.pid"


I have no mdadm.conf.  My entire filesystem consists of md0 (/boot) and md1(/).  
I figure if I have problems and need the file, it won't be available anyway.  If I am mistaken,

please do let me know!

Any other suggestions would be welcomed!

Paul
-
To unsubscribe from this list: send the line "unsubscribe linux-raid" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html


Re: Care and feeding of RAID?

2006-09-05 Thread Gordon Henderson
On Tue, 5 Sep 2006, Steve Cousins wrote:

> Would people be willing to list their setup? Including such things as
> mdadm.conf file, crontab -l, plus scripts that they use to check the
> smart data and the array, mdadm daemon parameters and anything else that
> is relevant to checking and maintaining an array?

I don't have any mdadm.conf files ... What am I missing? (I've always been
under the impression that after needing the /etc/raidtab file with the old
raidtools, you didn't need a config file as such under mdadm...  However,
I'm willing to be enlightened!)

For checking the smart stuff, I use the standard Debian packages and a
smartd.conf file typically looks like:

#DEVICESCAN

/dev/hda -d ata -o on -S on -a -m [EMAIL PROTECTED] -s 
(S/../.././04|L/../../1/20) -M daily -M test
/dev/hdc -d ata -o on -S on -a -m [EMAIL PROTECTED] -s 
(S/../.././04|L/../../1/20) -M daily
/dev/hde -d ata -o on -S on -a -m [EMAIL PROTECTED] -s 
(S/../.././04|L/../../1/20) -M daily
/dev/hdi -d ata -o on -S on -a -m [EMAIL PROTECTED] -s 
(S/../.././04|L/../../1/20) -M daily

The running mdadm in monitor mode looks like:

  /sbin/mdadm -F -i /var/run/mdadm.pid -m root -f -s

and my weekly badblocks script looks like:

#!/bin/csh

echo "`uname -n`: Badblocks test starting at [`date`]"

foreach disk ( a c )
  foreach partition ( 1 2 3 5 6 )
echo -n "hd$disk${partition}: "
badblocks -c 128 /dev/hd$disk$partition
  end
  echo ""
end

echo "`uname -n`: Badblocks test   ending at [`date`]"

I do loads of stuff with disk temperatures (when I can), etc. but thats
just for making pretty graphs I can point at my customers... (eg
http://lion.drogon.net/mrtg/diskTemp.html and tell me when that data
centre upgraded their AC ;-)

Gordon
-
To unsubscribe from this list: send the line "unsubscribe linux-raid" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html


Re: Care and feeding of RAID?

2006-09-05 Thread Gordon Henderson
On Tue, 5 Sep 2006, Patrik Jonsson wrote:

> mtbf seems to have an exponential dependence on temperature, so it pays
> off to keep temp down. Exactly what temp you consider safe is
> individual, but my drives only occasionally go above 40C.

I had a pair (2 x Hitachi IDE 80GB) that ran in a sealed case at the top
of a lift-shaft for 2 years. They averaged 55C... I never got to see the
box after it was decomissioned...

Gordon
-
To unsubscribe from this list: send the line "unsubscribe linux-raid" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html


Re: Care and feeding of RAID?

2006-09-05 Thread Patrik Jonsson
On 05.09.2006 08:48:44, Paul Waldo wrote:
>> Hi all,
>>
>> I have a RAID6 array and I wondering about care and feeding instructions :-)
>>
>> Here is what I currently do:
>>- daily incremental and weekly full backups to a separate machine
>>- run smartd tests (short once a day, long once a week)
>>- check the raid for bad blocks every week
>>
>> What else can I do make sure the array keeps humming?  Thanks in advance!
>> 
Make sure the drives are adequately cooled. I use this nifty utility to
look at my drive temps:
http://martybugs.net/linux/hddtemp.cgi

mtbf seems to have an exponential dependence on temperature, so it pays
off to keep temp down. Exactly what temp you consider safe is
individual, but my drives only occasionally go above 40C.

cheers,

/Patrik




signature.asc
Description: OpenPGP digital signature


Re: Care and feeding of RAID?

2006-09-05 Thread Mike Hardy


Steve Cousins wrote:

> MAILADDR [EMAIL PROTECTED]
> ARRAY /dev/md0 level=raid5 num-devices=3
> UUID=39d07542:f3c97e69:fbb63d9d:64a052d3
> devices=/dev/sdb1,/dev/sdc1,/dev/sdd1

If you list the devices explicitly, you're opening the possibility for
errors when the devices are re-ordered following insertion (or removal)
of any other SATA or SCSI (or USB storage) device

I think you want is a "DEVICE partitions" line accompanied by ARRAY
lines that have the UUID attribute you've already got in there.

-Mike
-
To unsubscribe from this list: send the line "unsubscribe linux-raid" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html


Re: Care and feeding of RAID?

2006-09-05 Thread Benjamin Schieder
On 05.09.2006 11:03:45, Steve Cousins wrote:
> Would people be willing to list their setup? Including such things as 
> mdadm.conf file, crontab -l, plus scripts that they use to check the 
> smart data and the array, mdadm daemon parameters and anything else that 
> is relevant to checking and maintaining an array? 

Personally, I use this script from cron:
http://shellscripts.org/project/hdtest
0 3 * * * root /root/sbin/hdtest.sh -l 
/var/log/smart_ata-ST3250624A_4ND33CLT.log 
/dev/disk/by-id/ata-ST3250624A_4ND33CLT short
1 3 * * * root /root/sbin/hdtest.sh -l 
/var/log/smart_ata-ST3250624A_4ND33EJE.log 
/dev/disk/by-id/ata-ST3250624A_4ND33EJE short
2 3 * * * root /root/sbin/hdtest.sh -l 
/var/log/smart_ata-ST3250624A_4ND33ELA.log 
/dev/disk/by-id/ata-ST3250624A_4ND33ELA short

I have made the experience that long tests slow down the raid to a point
where the system becomes unusable.

My mdadm.conf is like this:
---
DEVICE partitions
ARRAY /dev/md/0 level=raid1 num-devices=3 
UUID=3559ffcf:14eb9889:3826d6c2:c13731d7
ARRAY /dev/md/1 level=raid5 num-devices=3 
UUID=649fc7cc:d4b52c31:240fce2c:c64686e7
ARRAY /dev/md/2 level=raid5 num-devices=3 
UUID=9a3bf634:58f39e44:27ba8087:d5189766
   spares=1
ARRAY /dev/md/4 level=raid5 num-devices=3 
UUID=d4799be3:5b157884:e38718c2:c05ab840
   spares=1
ARRAY /dev/md/5 level=raid5 num-devices=3 
UUID=ca4a6110:4533d8d5:0e2ed4e1:2f5805b2
   spares=1

MAIL [EMAIL PROTECTED]
---

Also, I run

mdadm --monitor /dev/md/* --daemonise

from an init script.

Greetings,
Benjamin
-- 
 _  _ _   __   
| \| |___| |_| |_  __ _ __| |__
| .` / -_)  _| ' \/ _` / _| / /
|_|\_\___|\__|_||_\__,_\__|_\_\
| |  (_)_ _ _  ___ __
| |__| | ' \ || \ \ /
||_|_||_\_,_/_\_\
Play Nethack anywhere with an x86 computer:
http://www.crash-override.net/nethacklinux.html


pgp2ufnllRcnR.pgp
Description: PGP signature


Re: Care and feeding of RAID?

2006-09-05 Thread Steve Cousins

Gordon Henderson wrote:


On Tue, 5 Sep 2006, Paul Waldo wrote:

 


Hi all,

I have a RAID6 array and I wondering about care and feeding instructions :-)

Here is what I currently do:
   - daily incremental and weekly full backups to a separate machine
   - run smartd tests (short once a day, long once a week)
   - check the raid for bad blocks every week

What else can I do make sure the array keeps humming?  Thanks in advance!
   



Stop fiddling with it :)

I run similar stuff, but don't forget running mdadm in daemon mode to send
you an email should a drive fail. I also check each device individually,
rather than the array although I don't know the value of doing this over
the SMART tests on modern drives though...
 



Would people be willing to list their setup? Including such things as 
mdadm.conf file, crontab -l, plus scripts that they use to check the 
smart data and the array, mdadm daemon parameters and anything else that 
is relevant to checking and maintaining an array? 

I'm running the mdmonitor script at startup and a sample mdadm.conf  
(one of 3 machines) looks like:


MAILADDR [EMAIL PROTECTED]
ARRAY /dev/md0 level=raid5 num-devices=3 
UUID=39d07542:f3c97e69:fbb63d9d:64a052d3 
devices=/dev/sdb1,/dev/sdc1,/dev/sdd1


These are SATA drives and except for the one machine that has a 3Ware 
8506 card in it I haven't been able to get SMART programs to do anything 
with these drives.  How do others deal with this? 


Thanks,

Steve
-
To unsubscribe from this list: send the line "unsubscribe linux-raid" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html


Re: Care and feeding of RAID?

2006-09-05 Thread Gordon Henderson
On Tue, 5 Sep 2006, Paul Waldo wrote:

> Hi all,
>
> I have a RAID6 array and I wondering about care and feeding instructions :-)
>
> Here is what I currently do:
> - daily incremental and weekly full backups to a separate machine
> - run smartd tests (short once a day, long once a week)
> - check the raid for bad blocks every week
>
> What else can I do make sure the array keeps humming?  Thanks in advance!

Stop fiddling with it :)

I run similar stuff, but don't forget running mdadm in daemon mode to send
you an email should a drive fail. I also check each device individually,
rather than the array although I don't know the value of doing this over
the SMART tests on modern drives though...

Gordon
-
To unsubscribe from this list: send the line "unsubscribe linux-raid" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html


Re: Care and feeding of RAID?

2006-09-05 Thread Benjamin Schieder
On 05.09.2006 08:48:44, Paul Waldo wrote:
> Hi all,
> 
> I have a RAID6 array and I wondering about care and feeding instructions :-)
> 
> Here is what I currently do:
>- daily incremental and weekly full backups to a separate machine
>- run smartd tests (short once a day, long once a week)
>- check the raid for bad blocks every week
> 
> What else can I do make sure the array keeps humming?  Thanks in advance!

The mdadm man-page has information about running mdadm from cron to check
for 'unusual' activity.
You may want to consider that. I run it as daemon, personally.

Greetings,
Benjamin
-- 
#!/bin/sh #!/bin/bash #!/bin/tcsh #!/bin/csh #!/bin/kiss #!/bin/ksh
#!/bin/pdksh #!/usr/bin/perl #!/usr/bin/python #!/bin/zsh #!/bin/ash

Feel at home? Got some of them? Want to show some magic?

http://shellscripts.org


pgp0pYOEW9vym.pgp
Description: PGP signature


Care and feeding of RAID?

2006-09-05 Thread Paul Waldo

Hi all,

I have a RAID6 array and I wondering about care and feeding instructions :-)

Here is what I currently do:
   - daily incremental and weekly full backups to a separate machine
   - run smartd tests (short once a day, long once a week)
   - check the raid for bad blocks every week

What else can I do make sure the array keeps humming?  Thanks in advance!
-
To unsubscribe from this list: send the line "unsubscribe linux-raid" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html