Re: [Lustre-discuss] OSTs hanging while running IOR

2009-09-10 Thread Brian J. Murrell
On Wed, 2009-09-09 at 21:38 -0300, Rafael David Tinoco wrote: It seems that using 64 threads for OST solved the problem. Ahhh. As I suspected then. Good. Thanks for updating the thread. The archives at least will like that. :-) b.

Re: [Lustre-discuss] OSTs hanging while running IOR

2009-09-10 Thread Andreas Dilger
On Sep 09, 2009 19:32 -0300, Rafael David Tinoco wrote: Forgot the file, sorry. Lustre: 0:0:(watchdog.c:181:lcw_cb()) Watchdog triggered for pid 16372: it was inactive for 200.00s [stack trace] Since lots of users are confused by this message and think there is a crash, I think we should

[Lustre-discuss] OSTs hanging while running IOR

2009-09-09 Thread Rafael David Tinoco
Has anyone seen this kind of error while running IOR or some other benchmarks? I'm running Lustre 1.8.1 on CentOS 5.3. I have the following configuration: 4 J4400 JBODs connected to 4 OSSs. Each OSS has 3 OSTs (RAID 5, 8 disks) connected using multipathd, mdadm on /dev/dm* and

Re: [Lustre-discuss] OSTs hanging while running IOR

2009-09-09 Thread Brian J. Murrell
On Wed, 2009-09-09 at 14:31 -0300, Rafael David Tinoco wrote: Has anyone seen this kind of error while running IOR or some other benchmarks: On a note of e-mail formatting, so much vertical whitespace is not really needed and makes reading a bit more difficult. Also, personally, I don't

Re: [Lustre-discuss] OSTs hanging while running IOR

2009-09-09 Thread Oleg Drokin
Hello! On Sep 9, 2009, at 1:31 PM, Rafael David Tinoco wrote: One of my OSSs crashes, sometimes one, sometimes another. With the following error: That's not a crash. That's a watchdog timeout, indicating that Lustre is spending too much time waiting on I/O. As such you need to somehow decrease the

Re: [Lustre-discuss] OSTs hanging while running IOR

2009-09-09 Thread Rafael David Tinoco
I'm attaching the messages file (only the error part) so we don't have these mail formatting problems. -- Can you provide a bit more of the log before the above so we can see what the stack trace is in reference to? Also, try to eliminate the white-space between lines. Are you getting any

Re: [Lustre-discuss] OSTs hanging while running IOR

2009-09-09 Thread Rafael David Tinoco
I'm attaching the messages file (only the error part) so we don't have these mail formatting problems. -- Can you provide a bit more of the log before the above so we can see what the stack trace is in reference to? Also, try to eliminate

Re: [Lustre-discuss] OSTs hanging while running IOR

2009-09-09 Thread Rafael David Tinoco
Hello! On Sep 9, 2009, at 1:31 PM, Rafael David Tinoco wrote: One of my OSSs crashes, sometimes one, sometimes another. With the following error: That's not a crash