Re: Strange crash on Dell R720xd

2012-10-17 Thread Laurent CARON
On Tue, Oct 16, 2012 at 10:58:49AM -0700, Dan Williams wrote: > I think this may be a bug in __raid_run_ops that is only possible when > raid offload and CONFIG_MULTICORE_RAID456 are enabled. I'm thinking > the descriptor is completed and recycled to another requester in the > space between these

Re: Strange crash on Dell R720xd

2012-10-17 Thread Laurent CARON
On Tue, Oct 16, 2012 at 10:58:49AM -0700, Dan Williams wrote: I think this may be a bug in __raid_run_ops that is only possible when raid offload and CONFIG_MULTICORE_RAID456 are enabled. I'm thinking the descriptor is completed and recycled to another requester in the space between these two

Re: Strange crash on Dell R720xd

2012-10-16 Thread Dan Williams
On Tue, Oct 16, 2012 at 5:52 AM, Laurent CARON wrote: > On Tue, Oct 16, 2012 at 02:48:25PM +0200, Borislav Petkov wrote: >> On Tue, Oct 16, 2012 at 11:26:01AM +0200, Laurent CARON wrote: >> > On Tue, Oct 16, 2012 at 11:03:53AM +0200, Borislav Petkov wrote: >> > > That's: >> > > >> > >

Re: Strange crash on Dell R720xd

2012-10-16 Thread Laurent CARON
On Tue, Oct 16, 2012 at 02:48:25PM +0200, Borislav Petkov wrote: > On Tue, Oct 16, 2012 at 11:26:01AM +0200, Laurent CARON wrote: > > On Tue, Oct 16, 2012 at 11:03:53AM +0200, Borislav Petkov wrote: > > > That's: > > > > > > BUG_ON(async_tx_test_ack(depend_tx) || > > >

Re: Strange crash on Dell R720xd

2012-10-16 Thread Borislav Petkov
On Tue, Oct 16, 2012 at 11:26:01AM +0200, Laurent CARON wrote: > On Tue, Oct 16, 2012 at 11:03:53AM +0200, Borislav Petkov wrote: > > That's: > > > > BUG_ON(async_tx_test_ack(depend_tx) || txd_next(depend_tx) > > || > > txd_parent(tx)); > > > > but probably

Re: Strange crash on Dell R720xd

2012-10-16 Thread Laurent CARON
On Tue, Oct 16, 2012 at 11:03:53AM +0200, Borislav Petkov wrote: > That's: > > BUG_ON(async_tx_test_ack(depend_tx) || txd_next(depend_tx) || > txd_parent(tx)); > > but probably the b0rkage happens up the stack. And this __raid_run_ops > is probably starting

Re: Strange crash on Dell R720xd

2012-10-16 Thread Borislav Petkov
On Mon, Oct 15, 2012 at 09:42:58PM +0200, Laurent CARON wrote: > Hi, > > I'm currently replacing an old system (HP DL 380 G5) by new dell R720xd. > On those new boxes I did configure the H310 controler as plain JBOD. > > Those boxes appear to crash more often than not (from 5 mins to a couple >

Re: Strange crash on Dell R720xd

2012-10-16 Thread Borislav Petkov
On Mon, Oct 15, 2012 at 09:42:58PM +0200, Laurent CARON wrote: Hi, I'm currently replacing an old system (HP DL 380 G5) by new dell R720xd. On those new boxes I did configure the H310 controler as plain JBOD. Those boxes appear to crash more often than not (from 5 mins to a couple of

Re: Strange crash on Dell R720xd

2012-10-16 Thread Laurent CARON
On Tue, Oct 16, 2012 at 11:03:53AM +0200, Borislav Petkov wrote: That's: BUG_ON(async_tx_test_ack(depend_tx) || txd_next(depend_tx) || txd_parent(tx)); but probably the b0rkage happens up the stack. And this __raid_run_ops is probably starting the

Re: Strange crash on Dell R720xd

2012-10-16 Thread Borislav Petkov
On Tue, Oct 16, 2012 at 11:26:01AM +0200, Laurent CARON wrote: On Tue, Oct 16, 2012 at 11:03:53AM +0200, Borislav Petkov wrote: That's: BUG_ON(async_tx_test_ack(depend_tx) || txd_next(depend_tx) || txd_parent(tx)); but probably the b0rkage

Re: Strange crash on Dell R720xd

2012-10-16 Thread Laurent CARON
On Tue, Oct 16, 2012 at 02:48:25PM +0200, Borislav Petkov wrote: On Tue, Oct 16, 2012 at 11:26:01AM +0200, Laurent CARON wrote: On Tue, Oct 16, 2012 at 11:03:53AM +0200, Borislav Petkov wrote: That's: BUG_ON(async_tx_test_ack(depend_tx) || txd_next(depend_tx) ||

Re: Strange crash on Dell R720xd

2012-10-16 Thread Dan Williams
On Tue, Oct 16, 2012 at 5:52 AM, Laurent CARON lca...@unix-scripts.info wrote: On Tue, Oct 16, 2012 at 02:48:25PM +0200, Borislav Petkov wrote: On Tue, Oct 16, 2012 at 11:26:01AM +0200, Laurent CARON wrote: On Tue, Oct 16, 2012 at 11:03:53AM +0200, Borislav Petkov wrote: That's: