Hi there,

I have a pseudo file system layered on top of UFS. I am seeing kernel panics consistently on Sun Netra CP3060 blades. Here is the scat extract:

SolarisCAT(vmcore.3)> analyze

PANIC: BAD TRAP: type=%x rp=%p addr=%p mmu_fsr=%x occurred in module "%s" due to %s

core file: /mydata/vmcore.3
user: Super-User (root:0)
release: 5.10 (64-bit)
version: Generic_127111-02
machine: sun4v
node name: zrc2s0u5
hw_provider: Sun_Microsystems
system type: SUNW,Netra-CP3060
hostid: 849d3ae6
time of crash: Tue Jan 8 15:49:21 GMT+5 2008
age of system: 44 minutes 29.23 seconds
panic cpu: 20 (ncpus: 32)
panic string: BAD TRAP: type=31 rp=2a1020c73a0 addr=349 mmu_fsr=0 occurred in module "unix" due to a NULL pointer dereference

32 cpus

==== checking for trap information ====

cpu 20 had the panic

==== panic thread: 0x2a1020c7cc0 ==== cpu: 20 ====
address translation failed for proc: 2624 bytes @ 0x0
==== panic kernel thread: 0x2a1020c7cc0 on cpu: 20 ====
cmd: (unknown)
t_stk: 0x2a1020c7ad0 sp: 0x184c251 t_stkbase: 0x0
t_pri: 99(SYS) pctcpu: 0.000000
t_lwp: 0x0 bound psrset: 0 last cpuid: 20
idle: 266923 ticks (44 minutes 29.23 seconds)
start: Sun May 3 07:14:24 6511
age: -3297806061903 seconds (38169051 days 15 hours 25 minutes 3 seconds)
interrupted (pinned) thread: 0x300005b1a80
tstate: TS_ONPROC - thread is being run on a processor
tflg: T_TALLOCSTK - thread structure allocated from stk
      T_PANIC - thread initiated a system panic
tpflg: none set
tsched: TS_LOAD - thread is in memory
        TS_DONT_SWAP - thread/LWP should not be swapped

pc: 0x1065ea8 unix:panicsys+0x48: call unix:setjmp
startpc: 0x7b71f640 ipfs:ipfs_active_in+0x0: save %sp, -0x140, %sp

unix:panicsys+0x48 (0x2a1020c7cc0, 0x60015b9e2c8, 0x1869800, 0x181ac00, 0x1085800, 0x0)
unix:vpanic_common+0x78 (0x10810a0, 0x2a1020c7148, 0x349, 0x0, 0x2a1020c7180, 0x1081000)
unix:panic+0x1c (0x10810a0, 0x31, 0x2a1020c73a0, 0x349, 0x0, 0x186a447)
unix:die+0x78 (0x31, 0x2a1020c73a0, 0x349, 0x0, 0x2a1020c7180, 0x1081000)
unix:trap+0x9d8 (0x2a1020c73a0, 0x0, 0x5, 0x1c00, 0x0, 0x1)
unix:ktl0+0x64 (0x60006d720a4, 0x42f98, 0x1, 0xfffffffffffffff8, 0x4594c, 0x1)
-- trap data type: 0x31 (data access MMU miss) rp: 0x2a1020c73a0 --
  pc: 0x103311c unix:sfmmu_tsbmiss_exception+0x54: ldsb [%l2 + 0x349], %l5
  npc: 0x1033120 unix:sfmmu_tsbmiss_exception+0x58: ldx [%i5 + 0x10], %l7
  global: %g1 0x3000089f0f0 %g2 0x2a1020c7cc0 %g3 0x42f90 %g4 0x8 %g5 0x120e8f4 %g6 0 %g7 0x2a1020c7cc0
  out: %o0 0x60006d720a4 %o1 0x42f98 %o2 0x1 %o3 0xfffffffffffffff8 %o4 0x4594c %o5 0x1 %sp 0x2a1020c6c41 %o7 0x11247b8
  loc: %l0 0x3000178e000 %l1 0x1 %l2 0 %l3 0x1c00 %l4 0x300005b7f00 %l5 0x4fed4 %l6 0x2 %l7 0x3000178e000
  in: %i0 0x2a1020c75a0 %i1 0x42001 %i2 0x31 %i3 0 %i4 0x4fe87 %i5 0x60012c81dd0 %fp 0x2a1020c6cf1 %i7 0x100991c
<trap>unix:sfmmu_tsbmiss_exception+0x54 (0x2a1020c75a0, 0x42001, 0x31, 0x0, 0x4fe87, 0x60012c81dd0)
unix:ktl0+0x64 (0x60006d720a4, 0x42f98, 0x1, 0xfffffffffffffff8, 0x4594c, 0x1)
SUNW,UltraSPARC-T1:xcopyout - frame recycled
genunix:uiomove+0x90 (0x60006d7209c, 0x8, 0x0, 0x2a1020c7950, 0x0, 0x8)
genunix:struiocopyout+0x38 (0x600031f9dc0, 0x2a1020c7950, 0x2a1020c7864, 0x0, 0x60006d720a4, 0x1)
genunix:strread+0x4b4 (0x0, 0x2a1020c7950, 0x0, 0x3001bf947c0, 0x0, 0x0)
ipfs:ipfs_in+0x1e8 (0x0, 0x3001c0444c0, 0x3001bf947c0, 0x8, 0x2a1020c7ab0, 0x0)
ipfs:ipfs_active_in+0xec (0x3001c0ca000, 0x3001c0444c0, 0x3001cd5cb00, 0x60011bc6000, 0x0, 0x3001c0ca0d8)
unix:thread_start+0x4 (0x3001c0ca000, 0x0, 0x0, 0x0, 0x0, 0x0)
-- end of kernel thread's stack --

==== analyzing panic thread stack for trap frames ====

==== using trap() frame 1 @ 0x2a1020c71a0, rp(%i0): 0x2a1020c73a0 ====
type(%l2): 0x31 (data access MMU miss)
  pc: 0x103311c unix:sfmmu_tsbmiss_exception+0x54: ldsb [%l2 + 0x349], %l5
  npc: 0x1033120 unix:sfmmu_tsbmiss_exception+0x58: ldx [%i5 + 0x10], %l7
  global: %g1 0x3000089f0f0 %g2 0x2a1020c7cc0 %g3 0x42f90 %g4 0x8 %g5 SUNW,UltraSPARC-T1:xcopyout+0xc %g6 0 %g7 0x2a1020c7cc0
  out: %o0 0x60006d720a4 %o1 0x42f98 %o2 0x1 %o3 0xfffffffffffffff8 %o4 0x4594c %o5 0x1 %sp 0x2a1020c6c41 %o7 genunix:uiomove+0x90
  loc: %l0 0x3000178e000 %l1 0x1 %l2 0 %l3 0x1c00 %l4 0x300005b7f00 %l5 0x4fed4 %l6 0x2 %l7 0x3000178e000
  in: %i0 0x2a1020c75a0 %i1 0x42001 %i2 0x31 %i3 0 %i4 0x4fe87 %i5 0x60012c81dd0 %fp 0x2a1020c6cf1 %i7 unix:ktl0+0x64

unix:sfmmu_tsbmiss_exception+0x2c: ldx [%l7 + 0xb10], %l4
unix:sfmmu_tsbmiss_exception+0x30: ldx [%l4 + 0x50], %l6
unix:sfmmu_tsbmiss_exception+0x34: add %l6, 0x1, %l5
unix:sfmmu_tsbmiss_exception+0x38: bne,pn %icc, unix:sfmmu_tsbmiss_exception+0x190 (2f)
unix:sfmmu_tsbmiss_exception+0x3c: stx %l5, [%l4 + 0x50]
unix:sfmmu_tsbmiss_exception+0x40: or %g0, 0x2, %l6 ( mov 0x2, %l6 )
unix:sfmmu_tsbmiss_exception+0x44: or %g0, %g7, %g2 ( mov %g7, %g2 )
unix:sfmmu_tsbmiss_exception+0x48: ldx [%g7 + 0x148], %g1
unix:sfmmu_tsbmiss_exception+0x4c: ldx [%g7 + 0x140], %l2
unix:sfmmu_tsbmiss_exception+0x50: ldx [%g1 + 0x8], %i5
unix:sfmmu_tsbmiss_exception+0x54: ldsb [%l2 + 0x349], %l5
unix:sfmmu_tsbmiss_exception+0x58: ldx [%i5 + 0x10], %l7
unix:sfmmu_tsbmiss_exception+0x5c: stb %l6, [%l2 + 0x349]
unix:sfmmu_tsbmiss_exception+0x60: call unix:sfmmu_hat_enter
unix:sfmmu_tsbmiss_exception+0x64: or %g0, %l7, %o0 ( mov %l7, %o0 )
unix:sfmmu_tsbmiss_exception+0x68: ldx [%l7 + 0x88], %g3
unix:sfmmu_tsbmiss_exception+0x6c: or %g0, %o0, %l4 ( mov %o0, %l4 )
unix:sfmmu_tsbmiss_exception+0x70: add %l7, 0x9a, %l3
unix:sfmmu_tsbmiss_exception+0x74: subcc %g3, %i3, %g0 ( cmp %g3, %i3 )
unix:sfmmu_tsbmiss_exception+0x78: be,pn %xcc, unix:sfmmu_tsbmiss_exception+0xb8 (1f)
unix:sfmmu_tsbmiss_exception+0x7c: nop
SolarisCAT(vmcore.3)>

The panic always occurs at this same place, and it is not seen on any other platform. Why is it happening in sfmmu_tsbmiss_exception? Any help would be appreciated.

Thanks,
Ramana
--
This message posted from opensolaris.org