Re: beta2 panic: VAPPEND without VWRITE

2011-10-05 Thread Harald Schmalzbauer
schrieb Rick Macklem am 02.10.2011 00:39 (localtime):
 Harald Schmalzbauer wrote:
 schrieb Attilio Rao am 01.10.2011 16:49 (localtime):
 Can you please show the panic message?

 Sorry, I forgot to add it here:

 free indoe /var/123088 had 8 blocks
 panic: VAPPEND withour VWRITE
 cpuid = 0
 KDB: enter: panic
 [ thread pid 1445 tid 100126 ]
 Stopped at kbd_enter+0x2b: movq $0,0x918a52(%rip)
 db bt
 Tracing pid 1445 tid 100126 td 0xfe000510d460
 kdb_enter() at kbd_enter+0x3b
 panic() at panic+0x180
 vn_isdisk() at vn_isdisk
 ufs_accessx() at ufs_accessx+0x188
 vop_stdaccess() at vop_stdaccess+0x43
 unionfs_access() at unionfs_access+0x1c4
 vn_open_cred() at vn_open_cred+0x547
 kern_opneat() at kern_openat+0x1f9
 syscallenter() at syscallenter+0x1aa
 syscall() at syscall+0x4c
 Xfast_syscall() at Xfast_syscall+0xdd
 --- syscall (5, FreeBSD ELF64, open), rip = 0x801799f2c, rsp =
 0x7fffb388, rbp = 0x8 ---

 I'ts reproducable with exact the same hex-numbers with 'scp' when scp
 tries to alter knwon_hosts, which is on unionfs.

 You could try the attached one line patch. Since VAPPEND is a modifier
 for VWRITE, it makes sense to clear it along with VWRITE, I think?
 
 rick

Thanks for your help. Unfortunately I can neither comment on the patch,
nor reproduce the panic... I tried the patch and all seems fine, but
also without the patch (the exactly same kernel some days ago, but
machine was rebooted since). Of course, the machine has been rebooted
after the panic too, when I was able to reproduce the panic easily.
Any idea why the pnaic was once reproducable and now isn't anymore? Of
course, _nothing_ was changed since then, besides bootme flag on one
GPT partition. 9-beta2 never booted since then... No hardware has
changed either !?! *kopfkratz*

Thanks,

-Harry

 Here's some LORs, I havenÄt checked if they're already known. I don't
 have the known-LORs-URL handy...

 lock order reversal:
 1st 0xfe000519c278 unionfs (unionfs) @
 /usr/src/sys/modules/unionfs/../../fs/unionfs/union_subr.c:356
 2nd 0xfe000519c458 ufs (ufs) @ /usr/src/sys/kern/vfs_subr.c:2246
 KDB: stack backtrace:
 db_trace_self_wrapper() at db_trace_self_wrapper+0x2a
 kdb_backtrace() at kdb_backtrace+0x37
 _witness_debugger() at _witness_debugger+0x2e
 witness_checkorder() at witness_checkorder+0x807
 __lockmgr_args() at __lockmgr_args+0xdc6
 ffs_lock() at ffs_lock+0x8c
 VOP_LOCK1_APV() at VOP_LOCK1_APV+0x9b
 _vn_lock() at _vn_lock+0x47
 vputx() at vputx+0x328
 unionfs_noderem() at unionfs_noderem+0x1c4
 unionfs_reclaim() at unionfs_reclaim+0x11
 vgonel() at vgonel+0x105
 vrecycle() at vrecycle+0x4c
 unionfs_inactive() at unionfs_inactive+0x20
 vinactive() at vinactive+0x72
 vputx() at vputx+0x386
 kern_statat_vnhook() at kern_statat_vnhook+0x11d
 kern_statat() at kern_statat+0x15
 stat() at stat+0x2a
 syscallenter() at syscallenter+0x1aa
 syscall() at syscall+0x4c
 Xfast_syscall() at Xfast_syscall+0xdd
 --- syscall (188, FreeBSD ELF64, stat), rip = 0x800dc7ecc, rsp =
 0x7fffd6a8, rbp = 0x801441190 ---
 lock order reversal:
 1st 0xff80e9bf59f8 bufwait (bufwait) @
 /usr/src/sys/kern/vfs_bio.c:2658
 2nd 0xfe00051a7a00 dirhash (dirhash) @
 /usr/src/sys/ufs/ufs/ufs_dirhash.c:284
 KDB: stack backtrace:
 db_trace_self_wrapper() at db_trace_self_wrapper+0x2a
 kdb_backtrace() at kdb_backtrace+0x37
 _witness_debugger() at _witness_debugger+0x2e
 witness_checkorder() at witness_checkorder+0x807
 _sx_xlock() at _sx_xlock+0x55
 ufsdirhash_acquire() at ufsdirhash_acquire+0x33
 ufsdirhash_add() at ufsdirhash_add+0x19
 ufs_direnter() at ufs_direnter+0x909
 ufs_mkdir() at ufs_mkdir+0x44d
 VOP_MKDIR_APV() at VOP_MKDIR_APV+0x93
 kern_mkdirat() at kern_mkdirat+0x290
 syscallenter() at syscallenter+0x1aa
 syscall() at syscall+0x4c
 Xfast_syscall() at Xfast_syscall+0xdd
 --- syscall (136, FreeBSD ELF64, mkdir), rip = 0x800933eec, rsp =
 0x7fffd768, rbp = 0x800c07050 ---
 lock order reversal:
 1st 0xfe000514f818 ufs (ufs) @ /usr/src/sys/kern/vfs_subr.c:2134
 2nd 0xff80e9bf59f8 bufwait (bufwait) @
 /usr/src/sys/ufs/ffs/ffs_vnops.c:260
 3rd 0xfe0005706278 ufs (ufs) @ /usr/src/sys/kern/vfs_subr.c:2134
 KDB: stack backtrace:
 db_trace_self_wrapper() at db_trace_self_wrapper+0x2a
 kdb_backtrace() at kdb_backtrace+0x37
 _witness_debugger() at _witness_debugger+0x2e
 witness_checkorder() at witness_checkorder+0x807
 __lockmgr_args() at __lockmgr_args+0xdc6
 ffs_lock() at ffs_lock+0x8c
 VOP_LOCK1_APV() at VOP_LOCK1_APV+0x9b
 _vn_lock() at _vn_lock+0x47
 vget() at vget+0x7b
 vfs_hash_get() at vfs_hash_get+0xd5
 ffs_vgetf() at ffs_vgetf+0x48
 softdep_sync_buf() at softdep_sync_buf+0x393
 ffs_syncvnode() at ffs_syncvnode+0x2b3
 ffs_truncate() at ffs_truncate+0x477
 ufs_direnter() at ufs_direnter+0x73b
 ufs_mkdir() at ufs_mkdir+0x44d
 VOP_MKDIR_APV() at VOP_MKDIR_APV+0x93
 kern_mkdirat() at kern_mkdirat+0x290
 syscallenter() at syscallenter+0x1aa
 syscall() at syscall+0x4c
 Xfast_syscall() at Xfast_syscall+0xdd
 --- syscall (136, FreeBSD 

Re: BETA2 panic

2011-10-01 Thread Joel Dahl
On 30-09-2011 17:52, Adrian Chadd wrote:
 Hi,
 
 Would you please try this patch? Make sure that witness and the rest
 of the lock debugging is enabled.
 
 I've been working on this problem with someone else on the list (but I
 can't trigger it myself; I think I need to get some dual-CPU wireless
 test hardware.)

My laptop has been running with your patch for almost 8 hours now. No panics.

-- 
Joel
___
freebsd-current@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-current
To unsubscribe, send any mail to freebsd-current-unsubscr...@freebsd.org


beta2 panic: VAPPEND without VWRITE

2011-10-01 Thread Harald Schmalzbauer
Hello,

I got the following panic with 9.0-beta2:

cpuid = 0
KDB: enter: panic
[ thread pid 1445 tid 100126 ]
Stopped at  kbd_enter+0x2b: movq$0,0x918a52(%rip)
db bt
Tracing pid 1445 tid 100126 td 0xfe000510d460
kdb_enter() at kbd_enter+0x3b
panic() at panic+0x180
vn_isdisk() at vn_isdisk
ufs_accessx() at ufs_accessx+0x188
vop_stdaccess() at vop_stdaccess+0x43
unionfs_access() at unionfs_access+0x1c4
vn_open_cred() at vn_open_cred+0x547
kern_opneat() at kern_openat+0x1f9
syscallenter() at syscallenter+0x1aa
syscall() at syscall+0x4c
Xfast_syscall() at Xfast_syscall+0xdd
--- syscall (5, FreeBSD ELF64, open), rip = 0x801799f2c, rsp =
0x7fffb388, rbp = 0x8 ---

Please tell how to provide more information if needed.

Thanks,

-Harry



signature.asc
Description: OpenPGP digital signature


Re: beta2 panic: VAPPEND without VWRITE

2011-10-01 Thread Attilio Rao
Can you please show the panic message?

Attilio

2011/10/1 Harald Schmalzbauer h.schmalzba...@omnilan.de:
 Hello,

 I got the following panic with 9.0-beta2:

 cpuid = 0
 KDB: enter: panic
 [ thread pid 1445 tid 100126 ]
 Stopped at      kbd_enter+0x2b: movq    $0,0x918a52(%rip)
 db bt
 Tracing pid 1445 tid 100126 td 0xfe000510d460
 kdb_enter() at kbd_enter+0x3b
 panic() at panic+0x180
 vn_isdisk() at vn_isdisk
 ufs_accessx() at ufs_accessx+0x188
 vop_stdaccess() at vop_stdaccess+0x43
 unionfs_access() at unionfs_access+0x1c4
 vn_open_cred() at vn_open_cred+0x547
 kern_opneat() at kern_openat+0x1f9
 syscallenter() at syscallenter+0x1aa
 syscall() at syscall+0x4c
 Xfast_syscall() at Xfast_syscall+0xdd
 --- syscall (5, FreeBSD ELF64, open), rip = 0x801799f2c, rsp =
 0x7fffb388, rbp = 0x8 ---

 Please tell how to provide more information if needed.

 Thanks,

 -Harry





-- 
Peace can only be achieved by understanding - A. Einstein
___
freebsd-current@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-current
To unsubscribe, send any mail to freebsd-current-unsubscr...@freebsd.org


Re: beta2 panic: VAPPEND without VWRITE

2011-10-01 Thread Harald Schmalzbauer
schrieb Attilio Rao am 01.10.2011 16:49 (localtime):
 Can you please show the panic message?

Sorry, I forgot to add it here:

free indoe /var/123088 had 8 blocks
panic: VAPPEND withour VWRITE
 cpuid = 0
 KDB: enter: panic
 [ thread pid 1445 tid 100126 ]
 Stopped at  kbd_enter+0x2b: movq$0,0x918a52(%rip)
 db bt
 Tracing pid 1445 tid 100126 td 0xfe000510d460
 kdb_enter() at kbd_enter+0x3b
 panic() at panic+0x180
 vn_isdisk() at vn_isdisk
 ufs_accessx() at ufs_accessx+0x188
 vop_stdaccess() at vop_stdaccess+0x43
 unionfs_access() at unionfs_access+0x1c4
 vn_open_cred() at vn_open_cred+0x547
 kern_opneat() at kern_openat+0x1f9
 syscallenter() at syscallenter+0x1aa
 syscall() at syscall+0x4c
 Xfast_syscall() at Xfast_syscall+0xdd
 --- syscall (5, FreeBSD ELF64, open), rip = 0x801799f2c, rsp =
 0x7fffb388, rbp = 0x8 ---

I'ts reproducable with exact the same hex-numbers with 'scp' when scp
tries to alter knwon_hosts, which is on unionfs.

Here's some LORs, I havenÄt checked if they're already known. I don't
have the known-LORs-URL handy...

lock order reversal:
 1st 0xfe000519c278 unionfs (unionfs) @
/usr/src/sys/modules/unionfs/../../fs/unionfs/union_subr.c:356
 2nd 0xfe000519c458 ufs (ufs) @ /usr/src/sys/kern/vfs_subr.c:2246
KDB: stack backtrace:
db_trace_self_wrapper() at db_trace_self_wrapper+0x2a
kdb_backtrace() at kdb_backtrace+0x37
_witness_debugger() at _witness_debugger+0x2e
witness_checkorder() at witness_checkorder+0x807
__lockmgr_args() at __lockmgr_args+0xdc6
ffs_lock() at ffs_lock+0x8c
VOP_LOCK1_APV() at VOP_LOCK1_APV+0x9b
_vn_lock() at _vn_lock+0x47
vputx() at vputx+0x328
unionfs_noderem() at unionfs_noderem+0x1c4
unionfs_reclaim() at unionfs_reclaim+0x11
vgonel() at vgonel+0x105
vrecycle() at vrecycle+0x4c
unionfs_inactive() at unionfs_inactive+0x20
vinactive() at vinactive+0x72
vputx() at vputx+0x386
kern_statat_vnhook() at kern_statat_vnhook+0x11d
kern_statat() at kern_statat+0x15
stat() at stat+0x2a
syscallenter() at syscallenter+0x1aa
syscall() at syscall+0x4c
Xfast_syscall() at Xfast_syscall+0xdd
--- syscall (188, FreeBSD ELF64, stat), rip = 0x800dc7ecc, rsp =
0x7fffd6a8, rbp = 0x801441190 ---
lock order reversal:
 1st 0xff80e9bf59f8 bufwait (bufwait) @ /usr/src/sys/kern/vfs_bio.c:2658
 2nd 0xfe00051a7a00 dirhash (dirhash) @
/usr/src/sys/ufs/ufs/ufs_dirhash.c:284
KDB: stack backtrace:
db_trace_self_wrapper() at db_trace_self_wrapper+0x2a
kdb_backtrace() at kdb_backtrace+0x37
_witness_debugger() at _witness_debugger+0x2e
witness_checkorder() at witness_checkorder+0x807
_sx_xlock() at _sx_xlock+0x55
ufsdirhash_acquire() at ufsdirhash_acquire+0x33
ufsdirhash_add() at ufsdirhash_add+0x19
ufs_direnter() at ufs_direnter+0x909
ufs_mkdir() at ufs_mkdir+0x44d
VOP_MKDIR_APV() at VOP_MKDIR_APV+0x93
kern_mkdirat() at kern_mkdirat+0x290
syscallenter() at syscallenter+0x1aa
syscall() at syscall+0x4c
Xfast_syscall() at Xfast_syscall+0xdd
--- syscall (136, FreeBSD ELF64, mkdir), rip = 0x800933eec, rsp =
0x7fffd768, rbp = 0x800c07050 ---
lock order reversal:
 1st 0xfe000514f818 ufs (ufs) @ /usr/src/sys/kern/vfs_subr.c:2134
 2nd 0xff80e9bf59f8 bufwait (bufwait) @
/usr/src/sys/ufs/ffs/ffs_vnops.c:260
 3rd 0xfe0005706278 ufs (ufs) @ /usr/src/sys/kern/vfs_subr.c:2134
KDB: stack backtrace:
db_trace_self_wrapper() at db_trace_self_wrapper+0x2a
kdb_backtrace() at kdb_backtrace+0x37
_witness_debugger() at _witness_debugger+0x2e
witness_checkorder() at witness_checkorder+0x807
__lockmgr_args() at __lockmgr_args+0xdc6
ffs_lock() at ffs_lock+0x8c
VOP_LOCK1_APV() at VOP_LOCK1_APV+0x9b
_vn_lock() at _vn_lock+0x47
vget() at vget+0x7b
vfs_hash_get() at vfs_hash_get+0xd5
ffs_vgetf() at ffs_vgetf+0x48
softdep_sync_buf() at softdep_sync_buf+0x393
ffs_syncvnode() at ffs_syncvnode+0x2b3
ffs_truncate() at ffs_truncate+0x477
ufs_direnter() at ufs_direnter+0x73b
ufs_mkdir() at ufs_mkdir+0x44d
VOP_MKDIR_APV() at VOP_MKDIR_APV+0x93
kern_mkdirat() at kern_mkdirat+0x290
syscallenter() at syscallenter+0x1aa
syscall() at syscall+0x4c
Xfast_syscall() at Xfast_syscall+0xdd
--- syscall (136, FreeBSD ELF64, mkdir), rip = 0x800933eec, rsp =
0x7fffdbb8, rbp = 0x7fffdee6 ---

Thanks,

-Harry

 cpuid = 0
 KDB: enter: panic
 [ thread pid 1445 tid 100126 ]
 Stopped at  kbd_enter+0x2b: movq$0,0x918a52(%rip)
 db bt
 Tracing pid 1445 tid 100126 td 0xfe000510d460
 kdb_enter() at kbd_enter+0x3b
 panic() at panic+0x180
 vn_isdisk() at vn_isdisk
 ufs_accessx() at ufs_accessx+0x188
 vop_stdaccess() at vop_stdaccess+0x43
 unionfs_access() at unionfs_access+0x1c4
 vn_open_cred() at vn_open_cred+0x547
 kern_opneat() at kern_openat+0x1f9
 syscallenter() at syscallenter+0x1aa
 syscall() at syscall+0x4c
 Xfast_syscall() at Xfast_syscall+0xdd
 --- syscall (5, FreeBSD ELF64, open), rip = 0x801799f2c, rsp =
 0x7fffb388, rbp = 0x8 ---



signature.asc
Description: OpenPGP digital signature


Re: BETA2 panic

2011-10-01 Thread Adrian Chadd
On 1 October 2011 20:46, Joel Dahl j...@vnode.se wrote:

 My laptop has been running with your patch for almost 8 hours now. No panics.


Do you have all of the -current debugging enabled (witness, invariants, etc) ?


Adrian
___
freebsd-current@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-current
To unsubscribe, send any mail to freebsd-current-unsubscr...@freebsd.org


Re: beta2 panic: VAPPEND without VWRITE

2011-10-01 Thread Rick Macklem
Harald Schmalzbauer wrote:
 schrieb Attilio Rao am 01.10.2011 16:49 (localtime):
  Can you please show the panic message?
 
 Sorry, I forgot to add it here:
 
 free indoe /var/123088 had 8 blocks
 panic: VAPPEND withour VWRITE
  cpuid = 0
  KDB: enter: panic
  [ thread pid 1445 tid 100126 ]
  Stopped at kbd_enter+0x2b: movq $0,0x918a52(%rip)
  db bt
  Tracing pid 1445 tid 100126 td 0xfe000510d460
  kdb_enter() at kbd_enter+0x3b
  panic() at panic+0x180
  vn_isdisk() at vn_isdisk
  ufs_accessx() at ufs_accessx+0x188
  vop_stdaccess() at vop_stdaccess+0x43
  unionfs_access() at unionfs_access+0x1c4
  vn_open_cred() at vn_open_cred+0x547
  kern_opneat() at kern_openat+0x1f9
  syscallenter() at syscallenter+0x1aa
  syscall() at syscall+0x4c
  Xfast_syscall() at Xfast_syscall+0xdd
  --- syscall (5, FreeBSD ELF64, open), rip = 0x801799f2c, rsp =
  0x7fffb388, rbp = 0x8 ---
 
 I'ts reproducable with exact the same hex-numbers with 'scp' when scp
 tries to alter knwon_hosts, which is on unionfs.
 
You could try the attached one line patch. Since VAPPEND is a modifier
for VWRITE, it makes sense to clear it along with VWRITE, I think?

rick

 Here's some LORs, I havenÄt checked if they're already known. I don't
 have the known-LORs-URL handy...
 
 lock order reversal:
 1st 0xfe000519c278 unionfs (unionfs) @
 /usr/src/sys/modules/unionfs/../../fs/unionfs/union_subr.c:356
 2nd 0xfe000519c458 ufs (ufs) @ /usr/src/sys/kern/vfs_subr.c:2246
 KDB: stack backtrace:
 db_trace_self_wrapper() at db_trace_self_wrapper+0x2a
 kdb_backtrace() at kdb_backtrace+0x37
 _witness_debugger() at _witness_debugger+0x2e
 witness_checkorder() at witness_checkorder+0x807
 __lockmgr_args() at __lockmgr_args+0xdc6
 ffs_lock() at ffs_lock+0x8c
 VOP_LOCK1_APV() at VOP_LOCK1_APV+0x9b
 _vn_lock() at _vn_lock+0x47
 vputx() at vputx+0x328
 unionfs_noderem() at unionfs_noderem+0x1c4
 unionfs_reclaim() at unionfs_reclaim+0x11
 vgonel() at vgonel+0x105
 vrecycle() at vrecycle+0x4c
 unionfs_inactive() at unionfs_inactive+0x20
 vinactive() at vinactive+0x72
 vputx() at vputx+0x386
 kern_statat_vnhook() at kern_statat_vnhook+0x11d
 kern_statat() at kern_statat+0x15
 stat() at stat+0x2a
 syscallenter() at syscallenter+0x1aa
 syscall() at syscall+0x4c
 Xfast_syscall() at Xfast_syscall+0xdd
 --- syscall (188, FreeBSD ELF64, stat), rip = 0x800dc7ecc, rsp =
 0x7fffd6a8, rbp = 0x801441190 ---
 lock order reversal:
 1st 0xff80e9bf59f8 bufwait (bufwait) @
 /usr/src/sys/kern/vfs_bio.c:2658
 2nd 0xfe00051a7a00 dirhash (dirhash) @
 /usr/src/sys/ufs/ufs/ufs_dirhash.c:284
 KDB: stack backtrace:
 db_trace_self_wrapper() at db_trace_self_wrapper+0x2a
 kdb_backtrace() at kdb_backtrace+0x37
 _witness_debugger() at _witness_debugger+0x2e
 witness_checkorder() at witness_checkorder+0x807
 _sx_xlock() at _sx_xlock+0x55
 ufsdirhash_acquire() at ufsdirhash_acquire+0x33
 ufsdirhash_add() at ufsdirhash_add+0x19
 ufs_direnter() at ufs_direnter+0x909
 ufs_mkdir() at ufs_mkdir+0x44d
 VOP_MKDIR_APV() at VOP_MKDIR_APV+0x93
 kern_mkdirat() at kern_mkdirat+0x290
 syscallenter() at syscallenter+0x1aa
 syscall() at syscall+0x4c
 Xfast_syscall() at Xfast_syscall+0xdd
 --- syscall (136, FreeBSD ELF64, mkdir), rip = 0x800933eec, rsp =
 0x7fffd768, rbp = 0x800c07050 ---
 lock order reversal:
 1st 0xfe000514f818 ufs (ufs) @ /usr/src/sys/kern/vfs_subr.c:2134
 2nd 0xff80e9bf59f8 bufwait (bufwait) @
 /usr/src/sys/ufs/ffs/ffs_vnops.c:260
 3rd 0xfe0005706278 ufs (ufs) @ /usr/src/sys/kern/vfs_subr.c:2134
 KDB: stack backtrace:
 db_trace_self_wrapper() at db_trace_self_wrapper+0x2a
 kdb_backtrace() at kdb_backtrace+0x37
 _witness_debugger() at _witness_debugger+0x2e
 witness_checkorder() at witness_checkorder+0x807
 __lockmgr_args() at __lockmgr_args+0xdc6
 ffs_lock() at ffs_lock+0x8c
 VOP_LOCK1_APV() at VOP_LOCK1_APV+0x9b
 _vn_lock() at _vn_lock+0x47
 vget() at vget+0x7b
 vfs_hash_get() at vfs_hash_get+0xd5
 ffs_vgetf() at ffs_vgetf+0x48
 softdep_sync_buf() at softdep_sync_buf+0x393
 ffs_syncvnode() at ffs_syncvnode+0x2b3
 ffs_truncate() at ffs_truncate+0x477
 ufs_direnter() at ufs_direnter+0x73b
 ufs_mkdir() at ufs_mkdir+0x44d
 VOP_MKDIR_APV() at VOP_MKDIR_APV+0x93
 kern_mkdirat() at kern_mkdirat+0x290
 syscallenter() at syscallenter+0x1aa
 syscall() at syscall+0x4c
 Xfast_syscall() at Xfast_syscall+0xdd
 --- syscall (136, FreeBSD ELF64, mkdir), rip = 0x800933eec, rsp =
 0x7fffdbb8, rbp = 0x7fffdee6 ---
 
 Thanks,
 
 -Harry
 
  cpuid = 0
  KDB: enter: panic
  [ thread pid 1445 tid 100126 ]
  Stopped at kbd_enter+0x2b: movq $0,0x918a52(%rip)
  db bt
  Tracing pid 1445 tid 100126 td 0xfe000510d460
  kdb_enter() at kbd_enter+0x3b
  panic() at panic+0x180
  vn_isdisk() at vn_isdisk
  ufs_accessx() at ufs_accessx+0x188
  vop_stdaccess() at vop_stdaccess+0x43
  unionfs_access() at unionfs_access+0x1c4
  vn_open_cred() at vn_open_cred+0x547
  kern_opneat() at kern_openat+0x1f9
  syscallenter() at syscallenter+0x1aa
  syscall() at syscall+0x4c
  

Re: BETA2 panic

2011-10-01 Thread Joel Dahl
On 02-10-2011  0:54, Adrian Chadd wrote:
 On 1 October 2011 20:46, Joel Dahl j...@vnode.se wrote:
 
  My laptop has been running with your patch for almost 8 hours now. No 
  panics.
 
 
 Do you have all of the -current debugging enabled (witness, invariants, etc) ?

I'm running GENERIC, so yes.

-- 
Joel
___
freebsd-current@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-current
To unsubscribe, send any mail to freebsd-current-unsubscr...@freebsd.org


Re: BETA2 panic

2011-09-30 Thread Adrian Chadd
Hi,

Would you please try this patch? Make sure that witness and the rest
of the lock debugging is enabled.

I've been working on this problem with someone else on the list (but I
can't trigger it myself; I think I need to get some dual-CPU wireless
test hardware.)

Thanks,


Adrian


net80211-swbmiss-2.diff
Description: Binary data
___
freebsd-current@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-current
To unsubscribe, send any mail to freebsd-current-unsubscr...@freebsd.org

Re: BETA2 panic

2011-09-07 Thread Bernhard Schmidt
On Tue, Sep 6, 2011 at 22:42, Joel Dahl j...@vnode.se wrote:
 On 05-09-2011  9:02, Bernhard Schmidt wrote:
 On Mon, Sep 5, 2011 at 08:24, Joel Dahl j...@vnode.se wrote:
  On 05-09-2011  5:09, Adrian Chadd wrote:
  On 5 September 2011 01:04, Joel Dahl j...@vnode.se wrote:
   Hi,
  
   I upgraded my laptop from BETA1 to rev. 225367 today, and now my laptop 
   panics just a few minutes after booting. This is 100% reproducible, it 
   panics every time.
  
   I've got a pic with the backtrace here: 
   http://www.vnode.se/sc/panic_20110904.jpg
 
  Hi,
 
  There weren't many commits between BETA1 and -HEAD in the wireless
  area; would you please do a binary search of the kernel revisions (no
  need to do full buildworlds) to find which commit broke it?
 
  Yes, I'll do that tonight.

 While doing so, can you enable at least some debugging?
 wlandebug +state
 or even better
 wlandebug 0x

 It smells like that something is poking the SW bmiss handler while not
 in RUN state and therefore you're hitting a KASSERT(). Question is if
 the bmiss timer isn't stopped/drained somewhere or if there is too
 excessive call.

 Exactly what in the wlandebug output are you looking for?  I've posted a pic
 at http://www.vnode.se/sc/panic_20110906.jpg showing what it looks like right
 before the panic.  This is with wlandebug 0x.

Thanks, so, looks like it is scanning while the panic happens. At that
point it should actually never ever care about beacon misses.. this is
strange, really.

Can you try to comment out the call to ieee80211_beacon_miss() on line
2936 in if_iwn.c? If the panic no longer happens the issue is
somewhere in the conditions before that call.

 I've also installed kernels all the way back to revision 222980, but they all
 panic, which I think is somewhat odd.

I think this is an issue related to iwn(4), which had its last commit
prior to that.

--
Bernhard
___
freebsd-current@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-current
To unsubscribe, send any mail to freebsd-current-unsubscr...@freebsd.org


Re: BETA2 panic

2011-09-07 Thread Joel Dahl
On 07-09-2011 10:59, Bernhard Schmidt wrote:
 On Tue, Sep 6, 2011 at 22:42, Joel Dahl j...@vnode.se wrote:
  On 05-09-2011  9:02, Bernhard Schmidt wrote:
  On Mon, Sep 5, 2011 at 08:24, Joel Dahl j...@vnode.se wrote:
   On 05-09-2011  5:09, Adrian Chadd wrote:
   On 5 September 2011 01:04, Joel Dahl j...@vnode.se wrote:
Hi,
   
I upgraded my laptop from BETA1 to rev. 225367 today, and now my 
laptop panics just a few minutes after booting. This is 100% 
reproducible, it panics every time.
   
I've got a pic with the backtrace here: 
http://www.vnode.se/sc/panic_20110904.jpg
  
   Hi,
  
   There weren't many commits between BETA1 and -HEAD in the wireless
   area; would you please do a binary search of the kernel revisions (no
   need to do full buildworlds) to find which commit broke it?
  
   Yes, I'll do that tonight.
 
  While doing so, can you enable at least some debugging?
  wlandebug +state
  or even better
  wlandebug 0x
 
  It smells like that something is poking the SW bmiss handler while not
  in RUN state and therefore you're hitting a KASSERT(). Question is if
  the bmiss timer isn't stopped/drained somewhere or if there is too
  excessive call.
 
  Exactly what in the wlandebug output are you looking for?  I've posted a pic
  at http://www.vnode.se/sc/panic_20110906.jpg showing what it looks like 
  right
  before the panic.  This is with wlandebug 0x.
 
 Thanks, so, looks like it is scanning while the panic happens. At that
 point it should actually never ever care about beacon misses.. this is
 strange, really.
 
 Can you try to comment out the call to ieee80211_beacon_miss() on line
 2936 in if_iwn.c? If the panic no longer happens the issue is
 somewhere in the conditions before that call.

Hm, this is with bwn(4), not iwn(4). :)

-- 
Joel
___
freebsd-current@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-current
To unsubscribe, send any mail to freebsd-current-unsubscr...@freebsd.org


Re: BETA2 panic

2011-09-07 Thread Bernhard Schmidt
On Wed, Sep 7, 2011 at 12:53, Joel Dahl j...@vnode.se wrote:
 On 07-09-2011 10:59, Bernhard Schmidt wrote:
 On Tue, Sep 6, 2011 at 22:42, Joel Dahl j...@vnode.se wrote:
  On 05-09-2011  9:02, Bernhard Schmidt wrote:
  On Mon, Sep 5, 2011 at 08:24, Joel Dahl j...@vnode.se wrote:
   On 05-09-2011  5:09, Adrian Chadd wrote:
   On 5 September 2011 01:04, Joel Dahl j...@vnode.se wrote:
Hi,
   
I upgraded my laptop from BETA1 to rev. 225367 today, and now my 
laptop panics just a few minutes after booting. This is 100% 
reproducible, it panics every time.
   
I've got a pic with the backtrace here: 
http://www.vnode.se/sc/panic_20110904.jpg
  
   Hi,
  
   There weren't many commits between BETA1 and -HEAD in the wireless
   area; would you please do a binary search of the kernel revisions (no
   need to do full buildworlds) to find which commit broke it?
  
   Yes, I'll do that tonight.
 
  While doing so, can you enable at least some debugging?
  wlandebug +state
  or even better
  wlandebug 0x
 
  It smells like that something is poking the SW bmiss handler while not
  in RUN state and therefore you're hitting a KASSERT(). Question is if
  the bmiss timer isn't stopped/drained somewhere or if there is too
  excessive call.
 
  Exactly what in the wlandebug output are you looking for?  I've posted a 
  pic
  at http://www.vnode.se/sc/panic_20110906.jpg showing what it looks like 
  right
  before the panic.  This is with wlandebug 0x.

 Thanks, so, looks like it is scanning while the panic happens. At that
 point it should actually never ever care about beacon misses.. this is
 strange, really.

 Can you try to comment out the call to ieee80211_beacon_miss() on line
 2936 in if_iwn.c? If the panic no longer happens the issue is
 somewhere in the conditions before that call.

 Hm, this is with bwn(4), not iwn(4). :)

Uhm.. right, I'm sorry, missed that

Still.. two options, remove IEEE80211_FEXT_SWBMISS or try with
ifconfig wlan0 -bgscan

--
Bernhard
___
freebsd-current@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-current
To unsubscribe, send any mail to freebsd-current-unsubscr...@freebsd.org


Re: BETA2 panic

2011-09-06 Thread Joel Dahl
On 05-09-2011  9:02, Bernhard Schmidt wrote:
 On Mon, Sep 5, 2011 at 08:24, Joel Dahl j...@vnode.se wrote:
  On 05-09-2011  5:09, Adrian Chadd wrote:
  On 5 September 2011 01:04, Joel Dahl j...@vnode.se wrote:
   Hi,
  
   I upgraded my laptop from BETA1 to rev. 225367 today, and now my laptop 
   panics just a few minutes after booting. This is 100% reproducible, it 
   panics every time.
  
   I've got a pic with the backtrace here: 
   http://www.vnode.se/sc/panic_20110904.jpg
 
  Hi,
 
  There weren't many commits between BETA1 and -HEAD in the wireless
  area; would you please do a binary search of the kernel revisions (no
  need to do full buildworlds) to find which commit broke it?
 
  Yes, I'll do that tonight.
 
 While doing so, can you enable at least some debugging?
 wlandebug +state
 or even better
 wlandebug 0x
 
 It smells like that something is poking the SW bmiss handler while not
 in RUN state and therefore you're hitting a KASSERT(). Question is if
 the bmiss timer isn't stopped/drained somewhere or if there is too
 excessive call.

Exactly what in the wlandebug output are you looking for?  I've posted a pic
at http://www.vnode.se/sc/panic_20110906.jpg showing what it looks like right
before the panic.  This is with wlandebug 0x.

I've also installed kernels all the way back to revision 222980, but they all
panic, which I think is somewhat odd.

-- 
Joel
___
freebsd-current@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-current
To unsubscribe, send any mail to freebsd-current-unsubscr...@freebsd.org


Re: BETA2 panic

2011-09-05 Thread Joel Dahl
On 04-09-2011 14:45, Garrett Cooper wrote:
 On Sun, Sep 4, 2011 at 1:22 PM, Joel Dahl j...@vnode.se wrote:
  On 04-09-2011 10:56, Garrett Cooper wrote:
  On Sun, Sep 4, 2011 at 10:04 AM, Joel Dahl j...@vnode.se wrote:
   Hi,
  
   I upgraded my laptop from BETA1 to rev. 225367 today, and now my
   laptop panics just a few minutes after booting. This is 100%
   reproducible, it panics every time.
  
   I've got a pic with the backtrace here: 
   http://www.vnode.se/sc/panic_20110904.jpg
 
      Is your wireless turned off? If so, it looks like the state
  transition code isn't being handled properly between bwn(4) and
  net80211(4), in particular because your wireless card was still in
  'scan' mode.
 
  No, it's always on.  I can ping over the wireless for a few minutes, until
  it panics.
 
Hmm.. and the wireless card always appears on? Assuming the
 firmware isn't buggy (did you update it recently? have you booted
 other OSes, i.e. Windows or Linux that could be updating the firmware
 automatically on load?), it should keep the light for the wireless NIC
 properly lit.

Sure, it always appears on. This laptop has been running freebsd exclusively
for at least a year.  FWIW, the same RF switch is changed on/off message
from bwn has always been there on this machine.  I also noticed them on my
old laptop, an HP 6715b.

-- 
Joel
___
freebsd-current@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-current
To unsubscribe, send any mail to freebsd-current-unsubscr...@freebsd.org


Re: BETA2 panic

2011-09-05 Thread Joel Dahl
On 05-09-2011  5:09, Adrian Chadd wrote:
 On 5 September 2011 01:04, Joel Dahl j...@vnode.se wrote:
  Hi,
 
  I upgraded my laptop from BETA1 to rev. 225367 today, and now my laptop 
  panics just a few minutes after booting. This is 100% reproducible, it 
  panics every time.
 
  I've got a pic with the backtrace here: 
  http://www.vnode.se/sc/panic_20110904.jpg
 
 Hi,
 
 There weren't many commits between BETA1 and -HEAD in the wireless
 area; would you please do a binary search of the kernel revisions (no
 need to do full buildworlds) to find which commit broke it?

Yes, I'll do that tonight.

-- 
Joel
___
freebsd-current@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-current
To unsubscribe, send any mail to freebsd-current-unsubscr...@freebsd.org


Re: BETA2 panic

2011-09-05 Thread Adrian Chadd
On 5 September 2011 14:24, Joel Dahl j...@vnode.se wrote:

 There weren't many commits between BETA1 and -HEAD in the wireless
 area; would you please do a binary search of the kernel revisions (no
 need to do full buildworlds) to find which commit broke it?

 Yes, I'll do that tonight.

Thanks, that'd be great!


adrian
___
freebsd-current@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-current
To unsubscribe, send any mail to freebsd-current-unsubscr...@freebsd.org


Re: BETA2 panic

2011-09-05 Thread Bernhard Schmidt
On Mon, Sep 5, 2011 at 08:24, Joel Dahl j...@vnode.se wrote:
 On 05-09-2011  5:09, Adrian Chadd wrote:
 On 5 September 2011 01:04, Joel Dahl j...@vnode.se wrote:
  Hi,
 
  I upgraded my laptop from BETA1 to rev. 225367 today, and now my laptop 
  panics just a few minutes after booting. This is 100% reproducible, it 
  panics every time.
 
  I've got a pic with the backtrace here: 
  http://www.vnode.se/sc/panic_20110904.jpg

 Hi,

 There weren't many commits between BETA1 and -HEAD in the wireless
 area; would you please do a binary search of the kernel revisions (no
 need to do full buildworlds) to find which commit broke it?

 Yes, I'll do that tonight.

While doing so, can you enable at least some debugging?
wlandebug +state
or even better
wlandebug 0x

It smells like that something is poking the SW bmiss handler while not
in RUN state and therefore you're hitting a KASSERT(). Question is if
the bmiss timer isn't stopped/drained somewhere or if there is too
excessive call.

--
Bernhard
___
freebsd-current@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-current
To unsubscribe, send any mail to freebsd-current-unsubscr...@freebsd.org


BETA2 panic

2011-09-04 Thread Joel Dahl
Hi,

I upgraded my laptop from BETA1 to rev. 225367 today, and now my laptop panics 
just a few minutes after booting. This is 100% reproducible, it panics every 
time.

I've got a pic with the backtrace here: 
http://www.vnode.se/sc/panic_20110904.jpg

--
Joel

 ___
freebsd-current@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-current
To unsubscribe, send any mail to freebsd-current-unsubscr...@freebsd.org


Re: BETA2 panic

2011-09-04 Thread Garrett Cooper
On Sun, Sep 4, 2011 at 10:04 AM, Joel Dahl j...@vnode.se wrote:
 Hi,

 I upgraded my laptop from BETA1 to rev. 225367 today, and now my laptop 
 panics just a few minutes after booting. This is 100% reproducible, it panics 
 every time.

 I've got a pic with the backtrace here: 
 http://www.vnode.se/sc/panic_20110904.jpg

Is your wireless turned off? If so, it looks like the state
transition code isn't being handled properly between bwn(4) and
net80211(4), in particular because your wireless card was still in
'scan' mode.
Thanks,
-Garrett
___
freebsd-current@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-current
To unsubscribe, send any mail to freebsd-current-unsubscr...@freebsd.org


Re: BETA2 panic

2011-09-04 Thread Joel Dahl
On 04-09-2011 10:56, Garrett Cooper wrote:
 On Sun, Sep 4, 2011 at 10:04 AM, Joel Dahl j...@vnode.se wrote:
  Hi,
 
  I upgraded my laptop from BETA1 to rev. 225367 today, and now my laptop 
  panics just a few minutes after booting. This is 100% reproducible, it 
  panics every time.
 
  I've got a pic with the backtrace here: 
  http://www.vnode.se/sc/panic_20110904.jpg
 
 Is your wireless turned off? If so, it looks like the state
 transition code isn't being handled properly between bwn(4) and
 net80211(4), in particular because your wireless card was still in
 'scan' mode.

No, it's always on.  I can ping over the wireless for a few minutes, until
it panics.

Any particular commit you think I should try to back out?

-- 
Joel
___
freebsd-current@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-current
To unsubscribe, send any mail to freebsd-current-unsubscr...@freebsd.org


Re: BETA2 panic

2011-09-04 Thread Adrian Chadd
On 5 September 2011 01:04, Joel Dahl j...@vnode.se wrote:
 Hi,

 I upgraded my laptop from BETA1 to rev. 225367 today, and now my laptop 
 panics just a few minutes after booting. This is 100% reproducible, it panics 
 every time.

 I've got a pic with the backtrace here: 
 http://www.vnode.se/sc/panic_20110904.jpg

Hi,

There weren't many commits between BETA1 and -HEAD in the wireless
area; would you please do a binary search of the kernel revisions (no
need to do full buildworlds) to find which commit broke it?

Thanks,



Adrian
___
freebsd-current@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-current
To unsubscribe, send any mail to freebsd-current-unsubscr...@freebsd.org


Re: BETA2 panic

2011-09-04 Thread Garrett Cooper
On Sun, Sep 4, 2011 at 1:22 PM, Joel Dahl j...@vnode.se wrote:
 On 04-09-2011 10:56, Garrett Cooper wrote:
 On Sun, Sep 4, 2011 at 10:04 AM, Joel Dahl j...@vnode.se wrote:
  Hi,
 
  I upgraded my laptop from BETA1 to rev. 225367 today, and now my laptop 
  panics just a few minutes after booting. This is 100% reproducible, it 
  panics every time.
 
  I've got a pic with the backtrace here: 
  http://www.vnode.se/sc/panic_20110904.jpg

     Is your wireless turned off? If so, it looks like the state
 transition code isn't being handled properly between bwn(4) and
 net80211(4), in particular because your wireless card was still in
 'scan' mode.

 No, it's always on.  I can ping over the wireless for a few minutes, until
 it panics.

   Hmm.. and the wireless card always appears on? Assuming the
firmware isn't buggy (did you update it recently? have you booted
other OSes, i.e. Windows or Linux that could be updating the firmware
automatically on load?), it should keep the light for the wireless NIC
properly lit.

 Any particular commit you think I should try to back out?

Not in particular (in fact if_bwn has been untouched after
9.0-BETA1 was cut), but as Adrian suggested you should bisect the
source of failure.. it would be a wise idea to load if_bwn as a module
after boot to see if the behavior persists; next I would try out a
revision sometime between r18 (9.0 Beta1) and your head revision.
Some commits of interest that you might want to try backing out and
testing are:

- 224717
- 225139

HTH,
-Garrett
___
freebsd-current@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-current
To unsubscribe, send any mail to freebsd-current-unsubscr...@freebsd.org