ZFS panic on a RELENG_8 NFS server (Was: panic: spin lock held too long (RELENG_8 from today))

2011-09-09 Thread Hiroki Sato
Hiroki Sato wrote in <20110907.094717.2272609566853905102@allbsd.org>: hr> During this investigation an disk has to be replaced and resilvering hr> it is now in progress. A deadlock and a forced reboot after that hr> make recovering of the zfs datasets take a long time (for committing h

Re: panic: spin lock held too long (RELENG_8 from today)

2011-09-06 Thread Hiroki Sato
Attilio Rao wrote in : at> This should be enough for someone NFS-aware to look into it. at> at> Were you also able to get a core? Yes. But as kib@ pointed out it seems a deadlock in ZFS. Some experiments I did showed that this deadlock can be triggered at least by doing "rm -rf" against a

Re: panic: spin lock held too long (RELENG_8 from today)

2011-09-03 Thread Kostik Belousov
On Sat, Sep 03, 2011 at 12:05:47PM +0200, Attilio Rao wrote: > This should be enough for someone NFS-aware to look into it. > > Were you also able to get a core? > > I'll try to look into it in the next days, in particular about the > softclock state. > I am absolutely sure that this is a zfs d

Re: panic: spin lock held too long (RELENG_8 from today)

2011-09-01 Thread Attilio Rao
2011/9/1 Trent Nelson : > > On Aug 19, 2011, at 7:53 PM, Attilio Rao wrote: > >> If nobody complains about it earlier, I'll propose the patch to re@ in 8 >> hours. > > Just a friendly 'me too', for the records.  22 hours of heavy network/disk > I/O and no panic yet -- prior to the patch it was a

Re: panic: spin lock held too long (RELENG_8 from today)

2011-09-01 Thread Trent Nelson
On Aug 19, 2011, at 7:53 PM, Attilio Rao wrote: > If nobody complains about it earlier, I'll propose the patch to re@ in 8 > hours. Just a friendly 'me too', for the records. 22 hours of heavy network/disk I/O and no panic yet -- prior to the patch it was a panic orgy. Any response from re@

Re: panic: spin lock held too long (RELENG_8 from today)

2011-08-19 Thread Hiroki Sato
Attilio Rao wrote in : at> If nobody complains about it earlier, I'll propose the patch to re@ in 8 hours. Running fine for 45 hours so far. Please go ahead! -- Hiroki pgp3JVRs7kKa0.pgp Description: PGP signature

Re: panic: spin lock held too long (RELENG_8 from today)

2011-08-19 Thread Attilio Rao
If nobody complains about it earlier, I'll propose the patch to re@ in 8 hours. Attilio 2011/8/19 Mike Tancsa : > On 8/18/2011 8:37 PM, Chip Camden wrote: > >>> st> Thanks, Attilio.  I've applied the patch and removed the extra debug >>> st> options I had added (though keeping debug symbols).  I'

Re: panic: spin lock held too long (RELENG_8 from today)

2011-08-19 Thread Chip Camden
Quoth Mike Tancsa on Friday, 19 August 2011: > On 8/18/2011 8:37 PM, Chip Camden wrote: > > >> st> Thanks, Attilio. I've applied the patch and removed the extra debug > >> st> options I had added (though keeping debug symbols). I'll let you know > >> if > >> st> I experience any more panics. >

Re: panic: spin lock held too long (RELENG_8 from today)

2011-08-19 Thread Mike Tancsa
On 8/18/2011 8:37 PM, Chip Camden wrote: >> st> Thanks, Attilio. I've applied the patch and removed the extra debug >> st> options I had added (though keeping debug symbols). I'll let you know if >> st> I experience any more panics. >> >> No panic for 20 hours at this moment, FYI. For my NFS s

Re: panic: spin lock held too long (RELENG_8 from today)

2011-08-18 Thread Chip Camden
Quoth Hiroki Sato on Friday, 19 August 2011: > Chip Camden wrote > in <20110818025550.ga1...@libertas.local.camdensoftware.com>: > > st> Quoth Attilio Rao on Thursday, 18 August 2011: > st> > In callout_cpu_switch() if a low priority thread is migrating the > st> > callout and gets preempted af

Re: panic: spin lock held too long (RELENG_8 from today)

2011-08-18 Thread Hiroki Sato
Chip Camden wrote in <20110818025550.ga1...@libertas.local.camdensoftware.com>: st> Quoth Attilio Rao on Thursday, 18 August 2011: st> > In callout_cpu_switch() if a low priority thread is migrating the st> > callout and gets preempted after the outcoming cpu queue lock is left st> > (and sched

Re: panic: spin lock held too long (RELENG_8 from today)

2011-08-17 Thread Chip Camden
Quoth Attilio Rao on Thursday, 18 August 2011: > 2011/8/18 Hiroki Sato : > > Hiroki Sato wrote > >  in <20110818.043332.27079545013461535@allbsd.org>: > > > > hr> Attilio Rao wrote > > hr>   in > > : > > hr> > > hr> at> 2011/8/17 Hiroki Sato : > > hr> at> > Hi, > > hr> at> > > > hr> at> > Mi

Re: panic: spin lock held too long (RELENG_8 from today)

2011-08-17 Thread Jeremy Chadwick
On Wed, Aug 17, 2011 at 05:01:05PM -0700, Chip Camden wrote: > Quoth Jeremy Chadwick on Wednesday, 17 August 2011: > > > > > > I'm also getting similar panics on 8.2-STABLE. Locks up everything and I > > > have to power off. Once, I happened to be looking at the console when it > > > happened an

Re: panic: spin lock held too long (RELENG_8 from today)

2011-08-17 Thread Attilio Rao
2011/8/18 Hiroki Sato : > Hiroki Sato wrote >  in <20110818.043332.27079545013461535@allbsd.org>: > > hr> Attilio Rao wrote > hr>   in : > hr> > hr> at> 2011/8/17 Hiroki Sato : > hr> at> > Hi, > hr> at> > > hr> at> > Mike Tancsa wrote > hr> at> >  in <4e15a08c.6090...@sentex.net>: > hr> at>

Re: panic: spin lock held too long (RELENG_8 from today)

2011-08-17 Thread Attilio Rao
2011/8/18 Hiroki Sato : > Hiroki Sato wrote >  in <20110818.043332.27079545013461535@allbsd.org>: > > hr> Attilio Rao wrote > hr>   in : > hr> > hr> at> 2011/8/17 Hiroki Sato : > hr> at> > Hi, > hr> at> > > hr> at> > Mike Tancsa wrote > hr> at> >  in <4e15a08c.6090...@sentex.net>: > hr> at>

Re: panic: spin lock held too long (RELENG_8 from today)

2011-08-17 Thread Hiroki Sato
Hiroki Sato wrote in <20110818.043332.27079545013461535@allbsd.org>: hr> Attilio Rao wrote hr> in : hr> hr> at> 2011/8/17 Hiroki Sato : hr> at> > Hi, hr> at> > hr> at> > Mike Tancsa wrote hr> at> >  in <4e15a08c.6090...@sentex.net>: hr> at> > hr> at> > mi> On 7/7/2011 7:32 AM, Mike Tan

Re: panic: spin lock held too long (RELENG_8 from today)

2011-08-17 Thread Chip Camden
Quoth Jeremy Chadwick on Wednesday, 17 August 2011: > > > > I'm also getting similar panics on 8.2-STABLE. Locks up everything and I > > have to power off. Once, I happened to be looking at the console when it > > happened and copied dow the following: > > > > Sleeping thread (tif 100037, pid 0

Re: panic: spin lock held too long (RELENG_8 from today)

2011-08-17 Thread Jeremy Chadwick
machine just after it occurred, but > > according to the stack trace it was the same as posted one. > > Switching to an 8.2R kernel can prevent this panic. > > > > Any progress on the investigation? > > > > -- > > spin lock 0x80cb46c0 (sched lock 0)

Re: panic: spin lock held too long (RELENG_8 from today)

2011-08-17 Thread Hiroki Sato
Attilio Rao wrote in : at> 2011/8/17 Hiroki Sato : at> > Hi, at> > at> > Mike Tancsa wrote at> >  in <4e15a08c.6090...@sentex.net>: at> > at> > mi> On 7/7/2011 7:32 AM, Mike Tancsa wrote: at> > mi> > On 7/7/2011 4:20 AM, Kostik Belousov wrote: at> > mi> >> at> > mi> >> BTW, we had a similar pa

Re: panic: spin lock held too long (RELENG_8 from today)

2011-08-17 Thread Attilio Rao
2011/8/17 Hiroki Sato : > Hi, > > Mike Tancsa wrote >  in <4e15a08c.6090...@sentex.net>: > > mi> On 7/7/2011 7:32 AM, Mike Tancsa wrote: > mi> > On 7/7/2011 4:20 AM, Kostik Belousov wrote: > mi> >> > mi> >> BTW, we had a similar panic, "spinlock held too long", the spinlock > mi> >> is the sched l

Re: panic: spin lock held too long (RELENG_8 from today)

2011-08-17 Thread Mike Tancsa
On 8/17/2011 1:38 PM, Hiroki Sato wrote: > Any progress on the investigation? Unfortunately, I cannot reproduce it yet with a debugging kernel :( ---Mike > > -- > spin lock 0x80cb46c0 (sched lock 0) held by 0xff01900458c0 (tid > 100489) too long > pan

Re: panic: spin lock held too long (RELENG_8 from today)

2011-08-17 Thread Chip Camden
vy I/O load. I could not get a kernel dump > because this panic locked up the machine just after it occurred, but > according to the stack trace it was the same as posted one. > Switching to an 8.2R kernel can prevent this panic. > > Any progress on the investigation? > &g

Re: panic: spin lock held too long (RELENG_8 from today)

2011-08-17 Thread Hiroki Sato
ed one. Switching to an 8.2R kernel can prevent this panic. Any progress on the investigation? -- spin lock 0x80cb46c0 (sched lock 0) held by 0xff01900458c0 (tid 100489) too long panic: spin lock held too long cpuid = 1 KDB: stack backtrace: db_trace_self_wrapper() at db_trace_se

Re: panic: spin lock held too long (RELENG_8 from today)

2011-07-07 Thread Mike Tancsa
ot; to see the conditions. > There is absolutely no warranty for GDB. Type "show warranty" for details. > This GDB was configured as "i386-marcel-freebsd"... > > Unread portion of the kernel message buffer: > spin lock 0xc0b1d200 (sched lock 1) held by 0xc5dac2e0

Re: panic: spin lock held too long (RELENG_8 from today)

2011-07-07 Thread Andriy Gapon
on 07/07/2011 14:41 Jeremy Chadwick said the following: > 1. info threads > 2. Find the index value that matches the tid in question (in the above >spin lock panic, that'd be tid 100109). The index value will be >the first number shown on the left > 3. thread {index} Just in case, in kgdb

Re: panic: spin lock held too long (RELENG_8 from today)

2011-07-07 Thread Jeremy Chadwick
t; Type "show copying" to see the conditions. > There is absolutely no warranty for GDB. Type "show warranty" for details. > This GDB was configured as "i386-marcel-freebsd"... > > Unread portion of the kernel message buffer: > spin lock 0xc0b1d200 (s

Re: panic: spin lock held too long (RELENG_8 from today)

2011-07-07 Thread Mike Tancsa
arcel-freebsd"... Unread portion of the kernel message buffer: spin lock 0xc0b1d200 (sched lock 1) held by 0xc5dac2e0 (tid 100109) too long panic: spin lock held too long cpuid = 0 Uptime: 5h37m43s Physical memory: 2035 MB Dumping 260 MB: 245 229 213 197 181 165 149 133 117 101 85 69 53 37 2

Re: panic: spin lock held too long (RELENG_8 from today)

2011-07-07 Thread Kostik Belousov
panic'd tonight with > > > > Unread portion of the kernel message buffer: > > spin lock 0xc0b1d200 (sched lock 1) held by 0xc5dac8a0 (tid 100107) too long > > panic: spin lock held too long > > cpuid = 0 > > Uptime: 13h30m4s > > Physical memory: 2035 MB > >

Re: panic: spin lock held too long (RELENG_8 from today)

2011-07-07 Thread Andriy Gapon
lock 0xc0b1d200 (sched lock 1) held by 0xc5dac8a0 (tid 100107) too long > panic: spin lock held too long > cpuid = 0 > Uptime: 13h30m4s > Physical memory: 2035 MB > > > Its a somewhat busy box taking in mail as well as backups for a few > servers over nfs. At the time, it

panic: spin lock held too long (RELENG_8 from today)

2011-07-06 Thread Mike Tancsa
I did a buildworld on this box to bring it up to RELENG_8 for the BIND fixes. Unfortunately, the formerly solid box (April 13th kernel) panic'd tonight with Unread portion of the kernel message buffer: spin lock 0xc0b1d200 (sched lock 1) held by 0xc5dac8a0 (tid 100107) too long panic: spin

panic: spin lock held too long 8.0-STABLE r207638

2010-05-06 Thread Peter
oad and no swap usage, and no panics for 1.5 months until yesterday] Yesterday I tried to install Windows 2008, and in the ~6 hours I was messing around, it panicked around 8 times [after random amounts of time] panic: spin lock held too long [cpuid either 2,3, or 4 as far as I remember] 4GB of

Re: 7.2-release/amd64: panic, spin lock held too long

2009-09-28 Thread Attilio Rao
2009/9/28 C. C. Tang : > C. C. Tang wrote: >> >> Attilio Rao wrote: >>> >>> 2009/9/22 C. C. Tang : > >>> I have patched the sched_ule.c and did a make buildkernel & make >>> installkernel (is buildworld and installworld necessary?), rebooted >>> and >>> the >>> machine i

Re: 7.2-release/amd64: panic, spin lock held too long

2009-09-27 Thread C. C. Tang
C. C. Tang wrote: Attilio Rao wrote: 2009/9/22 C. C. Tang : I have patched the sched_ule.c and did a make buildkernel & make installkernel (is buildworld and installworld necessary?), rebooted and the machine is running now. I will post here again if there is any update. My server is up fo

Re: 7.2-release/amd64: panic, spin lock held too long

2009-09-21 Thread C. C. Tang
Attilio Rao wrote: 2009/9/22 C. C. Tang : I have patched the sched_ule.c and did a make buildkernel & make installkernel (is buildworld and installworld necessary?), rebooted and the machine is running now. I will post here again if there is any update. My server is up for 3.5 days now with H

Re: 7.2-release/amd64: panic, spin lock held too long

2009-09-21 Thread Attilio Rao
2009/9/22 C. C. Tang : >> >> I have patched the sched_ule.c and did a make buildkernel & make installkernel (is buildworld and installworld necessary?), rebooted and the machine is running now. I will post here again if there is any update. > > My server is up for 3

Re: 7.2-release/amd64: panic, spin lock held too long

2009-09-21 Thread C. C. Tang
I have patched the sched_ule.c and did a make buildkernel & make installkernel (is buildworld and installworld necessary?), rebooted and the machine is running now. I will post here again if there is any update. My server is up for 3.5 days now with HyperThreading & powerd enabled. No panic o

Re: 7.2-release/amd64: panic, spin lock held too long

2009-09-19 Thread Attilio Rao
2009/9/19 Dan Naumov : > On Fri, Sep 18, 2009 at 2:25 PM, C. C. Tang wrote: >>> Attilio Rao wrote: 2009/9/17 C. C. Tang : >>> >>> Dan, is that machine equipped with Hyperthreading? >>> >>> Attilio >> >> Yes. It's an Intel Atom 330, which is a dualcore CPU with HT

Re: 7.2-release/amd64: panic, spin lock held too long

2009-09-19 Thread Dan Naumov
On Fri, Sep 18, 2009 at 2:25 PM, C. C. Tang wrote: >> Attilio Rao wrote: >>> >>> 2009/9/17 C. C. Tang : >> >> Dan, is that machine equipped with Hyperthreading? >> >> Attilio > > Yes. It's an Intel Atom 330, which is a dualcore CPU with HT (4 cores > visible in "top" as

Re: 7.2-release/amd64: panic, spin lock held too long

2009-09-18 Thread C. C. Tang
Attilio Rao wrote: 2009/9/17 C. C. Tang : Dan, is that machine equipped with Hyperthreading? Attilio Yes. It's an Intel Atom 330, which is a dualcore CPU with HT (4 cores visible in "top" as a result) Yes, mine is also Atom 330. I cannot test the patch because my machine is also in productio

Re: 7.2-release/amd64: panic, spin lock held too long

2009-09-17 Thread C. C. Tang
Attilio Rao wrote: 2009/9/17 C. C. Tang : Dan, is that machine equipped with Hyperthreading? Attilio Yes. It's an Intel Atom 330, which is a dualcore CPU with HT (4 cores visible in "top" as a result) Yes, mine is also Atom 330. I cannot test the patch because my machine is also in productio

Re: 7.2-release/amd64: panic, spin lock held too long

2009-09-17 Thread Attilio Rao
2009/9/17 C. C. Tang : >>> Dan, is that machine equipped with Hyperthreading? >>> >>> Attilio >> >> Yes. It's an Intel Atom 330, which is a dualcore CPU with HT (4 cores >> visible in "top" as a result) > > Yes, mine is also Atom 330. > > I cannot test the patch because my machine is also in produc

Re: 7.2-release/amd64: panic, spin lock held too long

2009-09-16 Thread C. C. Tang
Dan, is that machine equipped with Hyperthreading? Attilio Yes. It's an Intel Atom 330, which is a dualcore CPU with HT (4 cores visible in "top" as a result) Yes, mine is also Atom 330. I cannot test the patch because my machine is also in production now. But I have tested it with hyperthr

Re: 7.2-release/amd64: panic, spin lock held too long

2009-09-14 Thread Dan Naumov
On Mon, Sep 14, 2009 at 3:43 PM, Attilio Rao wrote: > 2009/7/23 C. C. Tang : >> Attilio Rao wrote: >>> >>> 2009/7/22 C. C. Tang : > > Could that one (on i386) be related? > http://www.freebsd.org/cgi/query-pr.cgi?pr=kern/134584 > I have no idea about it but I can tell the diff

Re: 7.2-release/amd64: panic, spin lock held too long

2009-09-14 Thread Attilio Rao
2009/7/23 C. C. Tang : > Attilio Rao wrote: >> >> 2009/7/22 C. C. Tang : Could that one (on i386) be related? http://www.freebsd.org/cgi/query-pr.cgi?pr=kern/134584 >>> I have no idea about it but I can tell the difference... >>> My machine panic randomly rather than on shutdown

Re: 7.2-release/amd64: panic, spin lock held too long

2009-09-12 Thread Dan Naumov
file is /boot/kernel/kernel >> Jul  7 03:49:38 atom kernel: spin lock 0x80b3edc0 (sched lock >> 1) held by 0xff00017d8370 (tid 100054) too long >> Jul  7 03:49:38 atom kernel: panic: spin lock held too long >> >> /var/crash looks empty. This is a system run

Re: 7.2-release/amd64: panic, spin lock held too long

2009-09-12 Thread Attilio Rao
f80b3edc0 (sched lock > 1) held by 0xff00017d8370 (tid 100054) too long > Jul 7 03:49:38 atom kernel: panic: spin lock held too long > > /var/crash looks empty. This is a system running official 7.2-p1 > binaries since I am using freebsd-update to keep up with the patches

Re: kern/134584: [panic] spin lock held too long

2009-07-30 Thread barbara
> The following reply was made to PR kern/134584; it has been noted by GNATS. > > From: Attilio Rao > To: barbara > Cc: bug-followup , FreeBSD-stable > > Subject: Re: kern/134584: [panic] spin lock held too long > Date: Sun, 26 Jul 2009 16:36:44 +0200 > > 2009/7

Re: kern/134584: [panic] spin lock held too long

2009-07-27 Thread barbara
> 2009/7/26 barbara : > > It happened again, on shutdown. > > As the previous time, it happened after a high (for a desktop) uptime and, > > if it could matter, after running net-p2p/transmission-gtk2 for several > > hours. > > I don't know if it's related, but often quitting transmission, doesn'

Re: kern/134584: [panic] spin lock held too long

2009-07-27 Thread Attilio Rao
2009/7/26 barbara : > It happened again, on shutdown. > As the previous time, it happened after a high (for a desktop) uptime and, if > it could matter, after running net-p2p/transmission-gtk2 for several hours. > I don't know if it's related, but often quitting transmission, doesn't > terminate

Re: kern/134584: [panic] spin lock held too long

2009-07-26 Thread Attilio Rao
2009/7/26 barbara : > It happened again, on shutdown. > As the previous time, it happened after a high (for a desktop) uptime and, if > it could matter, after running net-p2p/transmission-gtk2 for several hours. > I don't know if it's related, but often quitting transmission, doesn't > terminate

Re: kern/134584: [panic] spin lock held too long

2009-07-25 Thread barbara
ng (max 60 seconds) for system process `bufdaemon' to stop...done Waiting (max 60 seconds) for system process `syncer' to stop... Syncing disks, vnodes remaining...7 6 5 3 1 2 2 0 0 done All buffers synced. Uptime: 22h41m12s Rebooting... cpu_reset: Stopping other CPUs spin lock 0xc08ae540 (sc

Re: 7.2-release/amd64: panic, spin lock held too long

2009-07-23 Thread NAKAJI Hiroyuki
> In <3bbf2fe10907221748j7f317ce0yc13bc5d01b49...@mail.gmail.com> > Attilio Rao wrote: > 2009/7/23 NAKAJI Hiroyuki : > >> In <4a667469.1080...@gmail.com> > >> "C. C. Tang" wrote: > >> > Could that one (on i386) be related? > >> > http://www.freebsd.org/cgi/query-pr.cgi?pr=ker

Re: 7.2-release/amd64: panic, spin lock held too long

2009-07-23 Thread John Baldwin
On Wednesday 22 July 2009 12:30:47 pm Attilio Rao wrote: > 2009/7/22 C. C. Tang : > >> Could that one (on i386) be related? > >> http://www.freebsd.org/cgi/query-pr.cgi?pr=kern/134584 > >> > > > > I have no idea about it but I can tell the difference... > > My machine panic randomly rather than on

Re: 7.2-release/amd64: panic, spin lock held too long

2009-07-23 Thread Dan Naumov
On Thu, Jul 23, 2009 at 5:24 AM, C. C. Tang wrote: > Attilio Rao wrote: >> >> 2009/7/22 C. C. Tang : Could that one (on i386) be related? http://www.freebsd.org/cgi/query-pr.cgi?pr=kern/134584 >>> I have no idea about it but I can tell the difference... >>> My machine panic rand

Re: 7.2-release/amd64: panic, spin lock held too long

2009-07-22 Thread C. C. Tang
Attilio Rao wrote: 2009/7/22 C. C. Tang : Could that one (on i386) be related? http://www.freebsd.org/cgi/query-pr.cgi?pr=kern/134584 I have no idea about it but I can tell the difference... My machine panic randomly rather than on shutdown and I remembered that it failed to write core dump. I

Re: 7.2-release/amd64: panic, spin lock held too long

2009-07-22 Thread Attilio Rao
2009/7/23 NAKAJI Hiroyuki : >> In <4a667469.1080...@gmail.com> >> "C. C. Tang" wrote: >> > Could that one (on i386) be related? >> > http://www.freebsd.org/cgi/query-pr.cgi?pr=kern/134584 >> > > >> I have no idea about it but I can tell the difference... >> My machine panic randomly rath

Re: 7.2-release/amd64: panic, spin lock held too long

2009-07-22 Thread NAKAJI Hiroyuki
> In <4a667469.1080...@gmail.com> > "C. C. Tang" wrote: > > Could that one (on i386) be related? > > http://www.freebsd.org/cgi/query-pr.cgi?pr=kern/134584 > > > I have no idea about it but I can tell the difference... > My machine panic randomly rather than on shutdown and I remembere

Re: 7.2-release/amd64: panic, spin lock held too long

2009-07-22 Thread Attilio Rao
2009/7/22 C. C. Tang : >> Could that one (on i386) be related? >> http://www.freebsd.org/cgi/query-pr.cgi?pr=kern/134584 >> > > I have no idea about it but I can tell the difference... > My machine panic randomly rather than on shutdown and I remembered that it > failed to write core dump. It also

Re: 7.2-release/amd64: panic, spin lock held too long

2009-07-21 Thread C. C. Tang
> Could that one (on i386) be related? > http://www.freebsd.org/cgi/query-pr.cgi?pr=kern/134584 > I have no idea about it but I can tell the difference... My machine panic randomly rather than on shutdown and I remembered that it failed to write core dump. It also failed to reboot automatically.

Re: 7.2-release/amd64: panic, spin lock held too long

2009-07-20 Thread barbara
> > I hope you can get some crash dumps for the developers to look at, > > Attilio was trying to help me but sadly the machine had to be put into > > active use so I could no longer play with FreeBSD due to unsolved > > instability. > > I want to help investigate this problem also but I remembered

Re: 7.2-release/amd64: panic, spin lock held too long

2009-07-20 Thread
> I hope you can get some crash dumps for the developers to look at, > Attilio was trying to help me but sadly the machine had to be put into > active use so I could no longer play with FreeBSD due to unsolved > instability. I want to help investigate this problem also but I remembered that the /v

Re: 7.2-release/amd64: panic, spin lock held too long

2009-07-16 Thread Dan Naumov
>> But is that happening at reboot time? >> >> Thanks, >> Attilio >> > > I think I am also having similar problem on my Atom machine. > (FreeBSD-7.2-Release-p1) > It does not happen at boot/reboot but panic randomly. > And I found that it remains stable for more than a month now after I > disabled

Re: 7.2-release/amd64: panic, spin lock held too long

2009-07-16 Thread C. C. Tang
tom kernel: panic: spin lock held too long That's a known bug, affecting -CURRENT as well. The cpustop IPI is handled though an NMI, which means it could interrupt a CPU in any moment, even while holding a spinlock, violating one well known FreeBSD rule. That means that the cpu can stop itself while

Re: 7.2-release/amd64: panic, spin lock held too long

2009-07-08 Thread Attilio Rao
>>>> Jul 7 03:49:38 atom kernel: spin lock 0xffffffff80b3edc0 (sched lock >>>>>> 1) held by 0xff00017d8370 (tid 100054) too long >>>>>> Jul 7 03:49:38 atom kernel: panic: spin lock held too long >>>>> >>>>> That

Re: 7.2-release/amd64: panic, spin lock held too long

2009-07-08 Thread Dan Naumov
fff80b3edc0 (sched lock >>>>> 1) held by 0xff00017d8370 (tid 100054) too long >>>>> Jul  7 03:49:38 atom kernel: panic: spin lock held too long >>>> >>>> That's a known bug, affecting -CURRENT as well. >>>> The cpustop IPI

Re: 7.2-release/amd64: panic, spin lock held too long

2009-07-07 Thread Dan Naumov
update", /var/log/messages shows the following: >>>> >>>> Jul  7 03:49:38 atom syslogd: kernel boot file is /boot/kernel/kernel >>>> Jul  7 03:49:38 atom kernel: spin lock 0x80b3edc0 (sched lock >>>> 1) held by 0xffffff00017d8370 (tid 100054)

Re: 7.2-release/amd64: panic, spin lock held too long

2009-07-06 Thread Attilio Rao
Jul 7 03:49:38 atom syslogd: kernel boot file is /boot/kernel/kernel >>> Jul 7 03:49:38 atom kernel: spin lock 0x80b3edc0 (sched lock >>> 1) held by 0xff00017d8370 (tid 100054) too long >>> Jul 7 03:49:38 atom kernel: panic: spin lock held too long >>

Re: 7.2-release/amd64: panic, spin lock held too long

2009-07-06 Thread Dan Naumov
e is /boot/kernel/kernel >> Jul  7 03:49:38 atom kernel: spin lock 0x80b3edc0 (sched lock >> 1) held by 0xff00017d8370 (tid 100054) too long >> Jul  7 03:49:38 atom kernel: panic: spin lock held too long > > That's a known bug, affecting -CURRENT as well

Re: 7.2-release/amd64: panic, spin lock held too long

2009-07-06 Thread Attilio Rao
f80b3edc0 (sched lock > 1) held by 0xff00017d8370 (tid 100054) too long > Jul 7 03:49:38 atom kernel: panic: spin lock held too long That's a known bug, affecting -CURRENT as well. The cpustop IPI is handled though an NMI, which means it could interrupt a CPU in any moment, even

7.2-release/amd64: panic, spin lock held too long

2009-07-06 Thread Dan Naumov
00017d8370 (tid 100054) too long Jul 7 03:49:38 atom kernel: panic: spin lock held too long /var/crash looks empty. This is a system running official 7.2-p1 binaries since I am using freebsd-update to keep up with the patches (just updated to -p2 after this panic) running with very low load, most

Re: panic: spin lock held too long on 7.1-PRERELEASE (sio)

2008-11-30 Thread Kris Kennaway
Holger Kipp wrote: Hi, I currently encounter spin lock 0xc0c83ba8 (sio) held by 0xc6337460 (tid 15) too long panic: spin lock held too long cpuid=3 on hp server with two dual-core intel xeon (3.8GHz) and 3GB RAM. System is 7.1-PRELEASE last fetched 29.11.2008, 17:58 UTC. This happens

panic: spin lock held too long on 7.1-PRERELEASE (sio)

2008-11-30 Thread Holger Kipp
Hi, I currently encounter spin lock 0xc0c83ba8 (sio) held by 0xc6337460 (tid 15) too long panic: spin lock held too long cpuid=3 on hp server with two dual-core intel xeon (3.8GHz) and 3GB RAM. System is 7.1-PRELEASE last fetched 29.11.2008, 17:58 UTC. This happens with VSCom Card that was

[Fwd: Re: kern/118044: panic: spin lock held too long]

2007-11-14 Thread Per olof Ljungmark
Original Message Subject: Re: kern/118044: panic: spin lock held too long Date: Wed, 14 Nov 2007 15:20:01 GMT From: [EMAIL PROTECTED] Reply-To: [EMAIL PROTECTED], [EMAIL PROTECTED] To: Per olof Ljungmark <[EMAIL PROTECTED]> Thank you very much for your problem report.

Re: panic: spin lock held too long (w/ backtrace)

2007-05-11 Thread Kris Kennaway
On Fri, May 11, 2007 at 10:43:54AM -0600, Scott Swanson wrote: > >> [EMAIL PROTECTED] /usr/src/sys/i386/conf > kldstat > >> Id Refs AddressSize Name > >> 13 0xc040 65e308 kernel > >> 21 0xc0a5f000 59f20acpi.ko > > > > So, yes then :) Can you follow the steps for debuggi

Re: panic: spin lock held too long (w/ backtrace)

2007-05-11 Thread Scott Swanson
>> [EMAIL PROTECTED] /usr/src/sys/i386/conf > kldstat >> Id Refs AddressSize Name >> 13 0xc040 65e308 kernel >> 21 0xc0a5f000 59f20acpi.ko > > So, yes then :) Can you follow the steps for debugging modules and see > if it gives a better trace? > > Kris > Unfortunatel

Re: panic: spin lock held too long (w/ backtrace)

2007-05-10 Thread Kris Kennaway
On Thu, May 10, 2007 at 07:44:37PM -0600, Scott Swanson wrote: > >> (kgdb) proc 18303 > >> (kgdb) bt > >> #0 0xc0644f5b in sched_switch (td=0xc95a5900, newtd=0xc92aad80, > >> flags=0) at /usr/src/sys/kern/sched_4bsd.c:973 > >> #1 0xeaa6dcb4 in ?? () > >> #2 0x0001 in ?? () > >> #3 0x0ee2c00

Re: panic: spin lock held too long (w/ backtrace)

2007-05-10 Thread Scott Swanson
>> (kgdb) proc 18303 >> (kgdb) bt >> #0 0xc0644f5b in sched_switch (td=0xc95a5900, newtd=0xc92aad80, >> flags=0) at /usr/src/sys/kern/sched_4bsd.c:973 >> #1 0xeaa6dcb4 in ?? () >> #2 0x0001 in ?? () >> #3 0x0ee2c000 in ?? () >> #4 0x in ?? () >> #5 0x4000 in ?? () >> #6 0x000

Re: panic: spin lock held too long (w/ backtrace)

2007-05-10 Thread Kris Kennaway
On Thu, May 10, 2007 at 07:13:51PM -0600, Scott Swanson wrote: > Kris Kennaway wrote: > > On Thu, May 10, 2007 at 07:00:00PM -0600, Scott Swanson wrote: > > > Unread portion of the kernel message buffer: > spin lock smp rendezvous held by 0xc95a5900 for > 5 seconds > >>> What is thread 0

Re: panic: spin lock held too long (w/ backtrace)

2007-05-10 Thread Scott Swanson
Kris Kennaway wrote: > On Thu, May 10, 2007 at 07:00:00PM -0600, Scott Swanson wrote: > Unread portion of the kernel message buffer: spin lock smp rendezvous held by 0xc95a5900 for > 5 seconds >>> What is thread 0xc95a5900 doing? >>> >>> Kris >>> >> Is this the best way to determine the

Re: panic: spin lock held too long (w/ backtrace)

2007-05-10 Thread Kris Kennaway
On Thu, May 10, 2007 at 07:00:00PM -0600, Scott Swanson wrote: > >> Unread portion of the kernel message buffer: > >> spin lock smp rendezvous held by 0xc95a5900 for > 5 seconds > > > > What is thread 0xc95a5900 doing? > > > > Kris > > > > Is this the best way to determine the action of the th

Re: panic: spin lock held too long (w/ backtrace)

2007-05-10 Thread Scott Swanson
Kris Kennaway wrote: > On Thu, May 10, 2007 at 02:49:45PM -0600, Scott Swanson wrote: >> Hello all, >> >> I have a couple dozen SuperMicro servers in production and have had >> reoccurring crashing issues on a couple of them that are under higher >> load. >> >> After managing to pry one out of prod

Re: panic: spin lock held too long (w/ backtrace)

2007-05-10 Thread Scott Swanson
Kris Kennaway wrote: > On Thu, May 10, 2007 at 02:49:45PM -0600, Scott Swanson wrote: >> Hello all, >> >> I have a couple dozen SuperMicro servers in production and have had >> reoccurring crashing issues on a couple of them that are under higher >> load. >> >> After managing to pry one out of prod

Re: panic: spin lock held too long (w/ backtrace)

2007-05-10 Thread Kris Kennaway
tion of the kernel message buffer: > spin lock smp rendezvous held by 0xc95a5900 for > 5 seconds What is thread 0xc95a5900 doing? Kris > panic: spin lock held too long > cpuid = 0 > Uptime: 1d3h51m44s > Dumping 3967 MB (3 chunks) > chunk 0: 1MB (158 pages) ... ok > chunk 1:

panic: spin lock held too long (w/ backtrace)

2007-05-10 Thread Scott Swanson
symbol "ps_pglobal_lookup"] GNU gdb 6.1.1 [FreeBSD] Unread portion of the kernel message buffer: spin lock smp rendezvous held by 0xc95a5900 for > 5 seconds panic: spin lock held too long cpuid = 0 Uptime: 1d3h51m44s Dumping 3967 MB (3 chunks) chunk 0: 1MB (158 pages) ... ok chunk

Re: panic: spin lock held too long

2007-04-17 Thread Maxim Konovalov
appreciated. > > Unread portion of the kernel message buffer: > spin lock smp rendezvous held by 0xca78ed80 for > 5 seconds > panic: spin lock held too long > cpuid = 2 > Uptime: 14h25m47s > Dumping 3967 MB (3 chunks) > chunk 0: 1MB (159 pages) ... ok > chunk 1: 39

panic: spin lock held too long

2007-04-10 Thread Maxim Konovalov
Hi, A week old -stable panic trace inlined below. The kernel config file is SMP + KDB/DDB options. Any help to debug futher is appreciated. Unread portion of the kernel message buffer: spin lock smp rendezvous held by 0xca78ed80 for > 5 seconds panic: spin lock held too long cpuid = 2 Upt

4-way Xeon and "panic: spin lock held too long."

2005-09-21 Thread Krzysztof Kowalik
Hello. Today one of my boxes paniced with "panic: spin lock held too long." message -- it was online for 6 hours, under rather light load. Unfortunately, I wasn't able to get any more information, as the box got restarted by our staff. Is this, anyway, a known problem (I

Panic: spin lock held too long

2004-11-07 Thread Peter Holm
I just got a panic with GENERIC 5.3-RELEASE: Script started on Mon Nov 8 08:42:44 2004 $ cd /usr/src/sys/i386/compile/GENERIC $ kgdb kernel /var/crash/vmcore.1 [GDB will not be able to debug user-mode threads: /usr/lib/libthread_db.so: Undefined symbol "ps_pglobal_lookup"] GNU gdb 6.1.1 [FreeBSD