Re: Raid-5 long write wait while reading

Bill Davidsen Mon, 04 Jun 2007 13:30:50 -0700

tj wrote:

Bill Davidsen wrote:
tj wrote:
Thomas Jager wrote:
Hi list.
I run a file server on MD raid-5.
If a client reads one big file and at the same time another client tries to write a file, the thread writing just sits in uninterruptible sleep until the reader has finished. Only a very small amount of writes get through while the reader is still working.
I'm having some trouble pinpointing the problem.
It's not consistent either; sometimes it works as expected and both the reader and writer get some transactions. On huge reads I've seen the writer blocked for 30-40 minutes without any significant writes happening (maybe a few megabytes, of several gigs waiting). It happens with NFS, SMB and FTP, and locally with dd, and seems to be connected to raid-5. This does not happen on block devices without raid-5. I'm also wondering if it can have anything to do with loop-aes? I use loop-aes on top of the md, but then again I have not observed this problem on loop devices with a disk backend. I do know that loop-aes degrades performance, but I didn't think it would do something like this.
I've seen this problem in 2.6.16-2.6.21
All disks in the array are connected to a controller with a SiI 3114 chip.
I just noticed something else. A couple of slow readers were running on my raid-5 array. Then I started a copy from another local disk to the array, and I got the extremely long wait. I noticed something in iostat:
avg-cpu:  %user   %nice %system %iowait  %steal   %idle
          3.90    0.00   48.05   31.93    0.00   16.12

Device:            tps    kB_read/s    kB_wrtn/s    kB_read    kB_wrtn
....
sdg               0.80        25.55         0.00        128          0
sdh             154.89       632.34         0.00       3168          0
sdi               0.20        12.77         0.00         64          0
sdj               0.40        25.55         0.00        128          0
sdk               0.40        25.55         0.00        128          0
sdl               0.80        25.55         0.00        128          0
sdm               0.80        25.55         0.00        128          0
sdn               0.60        23.95         0.00        120          0
md0             199.20       796.81         0.00       3992          0
All disks are members of the same raid array (md0). One of the disks has a ton of transactions compared to the other disks. Read operations, as far as I can tell. Why? Might it be connected with my problem?
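To make the imbalance in the iostat output above easier to spot on repeated samples, here is a minimal Python sketch that flags member disks whose tps is far above the median of the array members. The helper names (`parse_iostat`, `find_outliers`) and the 10x threshold are assumptions chosen for illustration, not part of iostat or md; the sample data is copied from the member-disk rows quoted above.

```python
from statistics import median

# Member-disk rows copied from the iostat output quoted in this thread.
IOSTAT_SAMPLE = """\
Device:            tps    kB_read/s    kB_wrtn/s    kB_read    kB_wrtn
sdg               0.80        25.55         0.00        128          0
sdh             154.89       632.34         0.00       3168          0
sdi               0.20        12.77         0.00         64          0
sdj               0.40        25.55         0.00        128          0
sdk               0.40        25.55         0.00        128          0
sdl               0.80        25.55         0.00        128          0
sdm               0.80        25.55         0.00        128          0
sdn               0.60        23.95         0.00        120          0
"""


def parse_iostat(text):
    """Return {device: tps} for every six-column device row in the text."""
    tps_by_dev = {}
    for line in text.splitlines():
        parts = line.split()
        if len(parts) != 6:
            continue
        try:
            tps_by_dev[parts[0]] = float(parts[1])
        except ValueError:
            # Header row ("Device: tps ...") or other non-numeric line.
            continue
    return tps_by_dev


def find_outliers(tps_by_dev, factor=10.0):
    """List devices whose tps exceeds `factor` times the median tps.

    The factor is an arbitrary threshold assumed for this sketch.
    """
    med = median(tps_by_dev.values())
    if med <= 0:
        return []
    return [dev for dev, tps in tps_by_dev.items() if tps > factor * med]


if __name__ == "__main__":
    print(find_outliers(parse_iostat(IOSTAT_SAMPLE)))
```

With the sample above, only sdh stands out. The same functions could be fed successive `iostat` samples to see whether the hot disk stays the same over time or moves around the array.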
Two thoughts on that. If you are doing a lot of directory operations, it's possible that the inodes being used most are all in one chunk.
Hi thanks for the reply.
It's not directory operations AFAIK. Reading a few files (3 in this case) and writing one.
The other possibility is that these are journal writes and reflect updates to the atime. The way to see if this is in some way related is to mount (remount) with noatime: "mount -o remount,noatime /dev/md0 /wherever" and retest. If this is journal activity you can do several things to reduce the problem, which I'll go into (a) if it seems to be the problem, and (b) if someone else doesn't point you to an existing document or old post on the topic. Oh, you could also try mounting the filesystem as ext2, assuming that it's ext3 now. I wouldn't run that way, but it's useful as a diagnostic tool.
I don't use ext3, I use ReiserFS. (It seemed like a good idea at the time.) It's mounted with -o noatime. I've done some more testing and it seems like it might be connected to mount --bind. If I write to a bind mount I get the slow writes, but if I write directly to the real mount I don't. It might just be a random occurrence, as the problem has always been inconsistent. Thoughts?
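When comparing the real mount against the bind mount, it can help to confirm that both mount points actually carry the same options (noatime in particular). The following is a rough Python sketch that scans text in the /proc/mounts format for every mount point backed by a given device. The device name /dev/loop0 and the mount points /srv/data and /export/data are hypothetical examples, not taken from this thread, and the sketch assumes a bind mount is listed with the same source device as the original mount.

```python
def mounts_for_device(mounts_text, device):
    """Return [(mount_point, has_noatime)] for each entry backed by `device`.

    Expects text in the /proc/mounts layout:
    device mount_point fstype options dump pass
    """
    result = []
    for line in mounts_text.splitlines():
        fields = line.split()
        if len(fields) < 4 or fields[0] != device:
            continue
        options = fields[3].split(",")
        result.append((fields[1], "noatime" in options))
    return result


# Hypothetical sample; on a live system read /proc/mounts instead.
SAMPLE_MOUNTS = """\
/dev/loop0 /srv/data reiserfs rw,noatime 0 0
/dev/loop0 /export/data reiserfs rw,noatime 0 0
/dev/sda1 / ext3 rw 0 0
"""

if __name__ == "__main__":
    for mount_point, has_noatime in mounts_for_device(SAMPLE_MOUNTS, "/dev/loop0"):
        print(mount_point, "noatime" if has_noatime else "atime updates enabled")
```

On a real system the text would come from `open("/proc/mounts").read()`; if one of the two mount points turned out to lack noatime, that would be worth ruling out before blaming the bind mount itself.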


I don't beat on the bind mounts, let me do a test and get back.

--
bill davidsen <[EMAIL PROTECTED]>
 CTO TMR Associates, Inc
 Doing interesting things with small computers since 1979
