Re: [zfs-discuss] Re: zfs panic installing a brandz zone

2006-09-15 Thread Mark Maybee

Yup, its almost certain that this is the bug you are hitting.

-Mark

Alan Hargreaves wrote:
I know, bad form replying to myself, but I am wondering if it might be 
related to


 6438702 error handling in zfs_getpage() can trigger page not 
locked


Which is marked fix in progress with a target of the current build.

alan.

Alan Hargreaves wrote:

Folks, before I start delving too deeply into this crashdump, has 
anyone seen anything like it?


The background is that I'm running a non-debug open build of b49 and 
was in the process of running the zoneadm -z redlx install 


After a bit, the machine panics, initially looking at the crashdump, 
I'm down to 88mb free (out of a gig) and see the following stack.


fe8000de7800 page_unlock+0x3b(180218720)
fe8000de78d0 zfs_getpage+0x236(89b84d80, 12000, 2000, 
fe8000de7a1c, fe8000de79b8, 2000, fbc29b20, 
fe808180a000, 1,

80826dc8)
fe8000de7950 fop_getpage+0x52(89b84d80, 12000, 2000, 
fe8000de7a1c, fe8000de79b8, 2000, fbc29b20, 
fe8081818000, 1,

80826dc8)
fe8000de7a50 segmap_fault+0x1d6(801a6f38, 
fbc29b20, fe8081818000, 2000, 0, 1)
fe8000de7b30 segmap_getmapflt+0x67a(fbc29b20, 
89b84d80, 12000, 2000, 1, 1)

fe8000de7bd0 lofi_strategy_task+0x14b(959d2400)
fe8000de7c60 taskq_thread+0x1a7(84453da8)
fe8000de7c70 thread_start+8()

%rax = 0x %r9  = 0x0300430e
%rbx = 0x000e %r10 = 0x1000
%rcx = 0xfe8081819000 %r11 = 0x113709b0
%rdx = 0xfe8000de7c80 %r12 = 0x000180218720
%rsi = 0x00013000 %r13 = 0xfbc52160 
pse_mutex+0x200

%rdi = 0xfbc52160 pse_mutex+0x200 %r14 = 0x4000
%r8  = 0x0200 %r15 = 0xfe8000de79d8

%rip = 0xfb8474fb page_unlock+0x3b
%rbp = 0xfe8000de7800
%rsp = 0xfe8000de77e0
%rflags = 0x00010246
  id=0 vip=0 vif=0 ac=0 vm=0 rf=1 nt=0 iopl=0x0
  status=of,df,IF,tf,sf,ZF,af,PF,cf

%cs = 0x0028%ds = 0x0043%es = 0x0043
%trapno = 0xe   %fs = 0xfsbase = 0x8000
   %err = 0x0   %gs = 0x01c3gsbase = 0xfbc27b70

While the panic string says NULL pointer dereference, it appears that 
0x180218720 is not mapped. The dereference looks like the first 
dereference in page_unlock(), which looks at pp-p_selock.


I can spend a little time looking at it, but was wondering if anyone 
had seen this kind of panic previously?


I have two identical crashdumps created in exactly the same way.

alan.





___
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss


[zfs-discuss] Re: zfs panic installing a brandz zone

2006-09-14 Thread Alan Hargreaves
I know, bad form replying to myself, but I am wondering if it might be 
related to


6438702 error handling in zfs_getpage() can trigger page not 
locked


Which is marked fix in progress with a target of the current build.

alan.

Alan Hargreaves wrote:
Folks, before I start delving too deeply into this crashdump, has anyone 
seen anything like it?


The background is that I'm running a non-debug open build of b49 and was 
in the process of running the zoneadm -z redlx install 


After a bit, the machine panics, initially looking at the crashdump, I'm 
down to 88mb free (out of a gig) and see the following stack.


fe8000de7800 page_unlock+0x3b(180218720)
fe8000de78d0 zfs_getpage+0x236(89b84d80, 12000, 2000, 
fe8000de7a1c, fe8000de79b8, 2000, fbc29b20, 
fe808180a000, 1,

80826dc8)
fe8000de7950 fop_getpage+0x52(89b84d80, 12000, 2000, 
fe8000de7a1c, fe8000de79b8, 2000, fbc29b20, 
fe8081818000, 1,

80826dc8)
fe8000de7a50 segmap_fault+0x1d6(801a6f38, fbc29b20, 
fe8081818000, 2000, 0, 1)
fe8000de7b30 segmap_getmapflt+0x67a(fbc29b20, 
89b84d80, 12000, 2000, 1, 1)

fe8000de7bd0 lofi_strategy_task+0x14b(959d2400)
fe8000de7c60 taskq_thread+0x1a7(84453da8)
fe8000de7c70 thread_start+8()

%rax = 0x %r9  = 0x0300430e
%rbx = 0x000e %r10 = 0x1000
%rcx = 0xfe8081819000 %r11 = 0x113709b0
%rdx = 0xfe8000de7c80 %r12 = 0x000180218720
%rsi = 0x00013000 %r13 = 0xfbc52160 
pse_mutex+0x200

%rdi = 0xfbc52160 pse_mutex+0x200 %r14 = 0x4000
%r8  = 0x0200 %r15 = 0xfe8000de79d8

%rip = 0xfb8474fb page_unlock+0x3b
%rbp = 0xfe8000de7800
%rsp = 0xfe8000de77e0
%rflags = 0x00010246
  id=0 vip=0 vif=0 ac=0 vm=0 rf=1 nt=0 iopl=0x0
  status=of,df,IF,tf,sf,ZF,af,PF,cf

%cs = 0x0028%ds = 0x0043%es = 0x0043
%trapno = 0xe   %fs = 0xfsbase = 0x8000
   %err = 0x0   %gs = 0x01c3gsbase = 0xfbc27b70

While the panic string says NULL pointer dereference, it appears that 
0x180218720 is not mapped. The dereference looks like the first 
dereference in page_unlock(), which looks at pp-p_selock.


I can spend a little time looking at it, but was wondering if anyone had 
seen this kind of panic previously?


I have two identical crashdumps created in exactly the same way.

alan.



--
Alan Hargreaves - http://blogs.sun.com/tpenta
Staff Engineer (Kernel/VOSJEC/Performance)
Systems Technical Service Center
Sun Microsystems

I went in the World's Greatest shave for Leukaemia. See
http://blogs.sun.com/roller/page/tpenta?entry=hair_yesterday_gone_today
___
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss