Re: Today's panic on boot problem

2002-02-27 Thread Michael Nottebrock

Mike Silbersack wrote:
 On Wed, 27 Feb 2002, Mike Silbersack wrote:
 
 
Disabling PG_G allows it to work here again as well.  Given the problems
we're experiencing, backing out the pmap changes of the last two days
seems like a good idea.

Mike Silby Silbersack

 
 Well, I sorta take that back.  The box has been up for ~5 minutes now, and
 the buildworld I started hasn't paniced it yet.  I guess this is workable.

FWIW: I'm typing this on a kernel with that bandaid right now, but I 
still manage to panic it immediately by preloading smbfs.ko from the 
loader. kldload'ing it later works fine, though.
-- 
Michael Nottebrock


To Unsubscribe: send mail to [EMAIL PROTECTED]
with unsubscribe freebsd-current in the body of the message



Re: Today's panic on boot problem

2002-02-27 Thread Philipp Mergenthaler

On Tue, Feb 26, 2002 at 09:29:51PM -0800, Peter Wemm wrote:
 FWIW, turning off PG_G see_ms to help.  Change in pmap.c:
 #if !defined(SMP) || defined(ENABLE_PG_G)
 to:
 #if /*!defined(SMP) ||*/ defined(ENABLE_PG_G)
 and see how you go.  This got me past atkbd0, but it is a very worrying
 sign.  I now get a vnode related panic. :-(

On my (single processor, non-acpi) system the kernel now completes
booting, but savecore receives a signal 11 after having saved about
235 MB, and all(?) processes after that get a signal 11 or 10 until
finally there's a panic: bad peter.  This is absolutely reproducible.

Is there any additional information I should provide?
Bye, Philipp


savecore: writing core to /var/crash/vmcore.22
Feb 27 13:15:40 i609a savecore: reboot after panic: from debugger
pid 175 (savecore), uid 0: exited on signal 11 (core dumped)
Segmentation fault - core dumped
Doing additional network setup: ntpdpid 176 (sh), uid 0: exited on signal 10 (co
re dumped)
Bus error - core dumped
.
Starting final network daemons:.
pid 177 (sh), uid 0: exited on signal 11 (core dumped)
Segmentation fault - core dumped
pid 178 (sh), uid 0: exited on signal 11 (core dumped)
Segmentation fault - core dumped
Starting standard daemons: inetdpid 179 (sh), uid 0: exited on signal 11 (core d
umped)

[...]

Additional TCP options:pid 213 (sh), uid 0: exited on signal 10 (core dumped)
Bus error - core dumped
pid 214 (sh), uid 0: exited on signal 11 (core dumped)
Segmentation fault - core dumped
.
Starting background filesystem checks

Wed Feb 27 13:16T:44 CET 2002
PTE at 0xbfc20404  IS ZERO @ VA 08101000
panic: bad peter
Debugger(panic)
Stopped at  Debugger+0x40:  xorl%eax,%eax
db


save a crash dump and reboot into old kernel


This GDB was configured as i386-unknown-freebsd...Deprecated bfd_read called at 
/usr/src/gnu/usr.bin/binutils/gdb/../../../../contrib/gdb.291/gdb/dbxread.c line 2629 
in elfstab_build_psymtabs
Deprecated bfd_read called at 
/usr/src/gnu/usr.bin/binutils/gdb/../../../../contrib/gdb.291/gdb/dbxread.c line 935 
in fill_symbuf

IdlePTD at phsyical address 0x003ff000
initial pcb at physical address 0x0032f8c0
panicstr: from debugger
panic messages:
---
panic: bad peter
panic: from debugger
Uptime: 1m31s
pfs_vncache_unload(): 1 entries remaining

dumping to dev ad0s1b, offset 589952
dump ata0: resetting devices .. ata0: mask=03 ostat0=50 ostat2=00
ad0: ATAPI 00 00
ata0-slave: ATAPI 00 00
ata0: mask=03 stat0=50 stat1=00
ad0: ATA 01 a5
ata0: devices=01
ad0: invalidating queued requests
ad0: success setting PIO4 on generic chip
done
255 254 253 252 251 250 249 248 247 246 245 244 243 242 241 240 239 238 237 236 235 
234 233 232 231 230 229 228 227 226 225 224 223 222 221 220 219 218 217 216 215 214 
213 212 211 210 209 208 207 206 205 204 203 202 201 200 199 198 197 196 195 194 193 
192 191 190 189 188 187 186 185 184 183 182 181 180 179 178 177 176 175 174 173 172 
171 170 169 168 167 166 165 164 163 162 161 160 159 158 157 156 155 154 153 152 151 
150 149 148 147 146 145 144 143 142 141 140 139 138 137 136 135 134 133 132 131 130 
129 128 127 126 125 124 123 122 121 120 119 118 117 116 115 114 113 112 111 110 109 
108 107 106 105 104 103 102 101 100 99 98 97 96 95 94 93 92 91 90 89 88 87 86 85 84 83 
82 81 80 79 78 77 76 75 74 73 72 71 70 69 68 67 66 65 64 63 62 61 60 59 58 57 56 55 54 
53 52 51 50 49 48 47 46 45 44 43 42 41 40 39 38 37 36 35 34 33 32 31 30 29 28 27 26 25 
24 23 22 21 20 19 18 17 16 15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 0 
---
#0  dumpsys () at ../../../kern/kern_shutdown.c:504
504 if (!dodump)
(kgdb) bt
#0  dumpsys () at ../../../kern/kern_shutdown.c:504
#1  0xc01a2117 in boot (howto=260) at ../../../kern/kern_shutdown.c:336
#2  0xc01a25a1 in panic (fmt=0xc02a772a from debugger)
at ../../../kern/kern_shutdown.c:646
#3  0xc013784d in db_panic (addr=-1071127031, have_addr=0, count=-1, 
modif=0xcf3adb10 ) at ../../../ddb/db_command.c:452
#4  0xc01377eb in db_command (last_cmdp=0xc02ed424, cmd_table=0xc02ed244, 
aux_cmd_tablep=0xc02e4020, aux_cmd_tablep_end=0xc02e4024)
at ../../../ddb/db_command.c:348
#5  0xc01378b7 in db_command_loop () at ../../../ddb/db_command.c:474
#6  0xc0139c4b in db_trap (type=3, code=0) at ../../../ddb/db_trap.c:72
#7  0xc027e326 in kdb_trap (type=3, code=0, regs=0xcf3adc0c)
at ../../../i386/i386/db_interface.c:161
#8  0xc028ac18 in trap (frame={tf_fs = 24, tf_es = 16, tf_ds = 16, 
  tf_edi = -1077804028, tf_esi = 256, tf_ebp = -818226092, 
  tf_isp = -818226120, tf_ebx = -1070734913, tf_edx = 1017, tf_ecx = 1, 
  tf_eax = 18, tf_trapno = 3, tf_err = 0, tf_eip = -1071127148, tf_cs = 8, 
  tf_eflags = 582, tf_esp = -1070742881, tf_ss = -1070895941})
at ../../../i386/i386/trap.c:589
#9  0xc027e594 in Debugger (msg=0xc02b6cbb panic) at machine/cpufunc.h:66
#10 0xc01a258c in panic (fmt=0xc02de1bf bad peter)
at ../../../kern/kern_shutdown.c:633
#11 0xc0289019 in pmap_remove_pages (pmap=0xcd2fadec, sva=0, 

Re: Today's panic on boot problem

2002-02-27 Thread Terry Lambert

Peter Wemm wrote:
 Mike Silbersack wrote:
  On Tue, 26 Feb 2002, Peter Wemm wrote:
   Mike Silbersack wrote:
Hm, sounds like UP got optimized out.
   Gah!  That would be a first. :(
  Well, until I can build a working kernel, I'll just assume that it's a
  feature.
 FWIW, turning off PG_G see_ms to help.  Change in pmap.c:
 #if !defined(SMP) || defined(ENABLE_PG_G)
 to:
 #if /*!defined(SMP) ||*/ defined(ENABLE_PG_G)
 and see how you go.  This got me past atkbd0, but it is a very worrying
 sign.  I now get a vnode related panic. :-(
 
 I've reread the changes about 50 times now and cannot for the life of me
 see why it works for SMP but not UP.  I'm about ready to back it all out.

I believe I know what the problem is.  Turn off PG_PSE
using DISABLE_PSE in your config.

If that fixes it, back the code out.

-- Terry

To Unsubscribe: send mail to [EMAIL PROTECTED]
with unsubscribe freebsd-current in the body of the message



Today's panic on boot problem

2002-02-26 Thread Mike Silbersack

I'm experiencing the same double panic on boot that PHK is now; are we the
only ones, or is it just that nobody else has updated recently?

Mike Silby Silbersack


To Unsubscribe: send mail to [EMAIL PROTECTED]
with unsubscribe freebsd-current in the body of the message



Re: Today's panic on boot problem

2002-02-26 Thread Peter Wemm

Mike Silbersack wrote:
 I'm experiencing the same double panic on boot that PHK is now; are we the
 only ones, or is it just that nobody else has updated recently?

If you are not using acpica, then you're probably using vm86 for pcibios
calls.  I've been told that I've broken bios.c..

You may like to try reverting this change:

peter   2002/02/25 13:42:23 PST
  
  Modified files:
sys/i386/i386bios.c
sys/i386/include pmap.h
  Log:
  Tidy up some warnings
 
  Revision  ChangesPath
  1.47  +7 -6  src/sys/i386/i386/bios.c
  1.74  +1 -1  src/sys/i386/include/pmap.h

Cheers,
-Peter
--
Peter Wemm - [EMAIL PROTECTED]; [EMAIL PROTECTED]; [EMAIL PROTECTED]
All of this is for nothing if we don't go to the stars - JMS/B5


To Unsubscribe: send mail to [EMAIL PROTECTED]
with unsubscribe freebsd-current in the body of the message



RE: Today's panic on boot problem

2002-02-26 Thread Riccardo Torrini

On 26-Feb-2002 (17:27:19/GMT) Mike Silbersack wrote:

 I'm experiencing the same double panic on boot that PHK is
 now; are we the only ones, or is it just that nobody else
 has updated recently?

Mee too, just survied to 4 auto-reboot without messages...

Trying with a boot -v I see a keyboard check as last message,
I forgot to write down it.  Sorry.  I can reboot again with
that explosive kernel (if _really_ necessary).
Now reverted to kernel and modules of 24 Feb (but only kernel
and modules, _not_ world) and it is stable (but no speaker).

cvsup-ed 3 times on 26.2.2002 (compile take 3/4 hours here).

BTW: On built of 25 (to recover /dev/speaker) I noticed some
stranges: doing man, ps, even ls make a core dump (sometimes).
I'm sure that all is in sync: kern, modules, world and etc.
Tryed with and without X, with and without ACPI, normal boot
or single user.  No differences.
As mentioned on my previous mail having both PCA and SPEAKER
compiled into kernel kill /dev/speaker.  With only speaker or
none of them (but loading atspeaker after boot) lead me to a
working /dev/speaker.  All other stuff doesn't no matter.


Riccardo.

To Unsubscribe: send mail to [EMAIL PROTECTED]
with unsubscribe freebsd-current in the body of the message



Re: Today's panic on boot problem

2002-02-26 Thread Mike Silbersack



On Tue, 26 Feb 2002, Peter Wemm wrote:

 Mike Silbersack wrote:
  I'm experiencing the same double panic on boot that PHK is now; are we the
  only ones, or is it just that nobody else has updated recently?

 If you are not using acpica, then you're probably using vm86 for pcibios
 calls.  I've been told that I've broken bios.c..

 You may like to try reverting this change:

 peter   2002/02/25 13:42:23 PST

   Modified files:
 sys/i386/i386bios.c
 sys/i386/include pmap.h
   Log:
   Tidy up some warnings

   Revision  ChangesPath
   1.47  +7 -6  src/sys/i386/i386/bios.c
   1.74  +1 -1  src/sys/i386/include/pmap.h

 Cheers,
 -Peter

I reverted that change, and the double panic still occured.  :|

FWIW, you're correct in that I'm not using the acpi module.

Mike Silby Silbersack


To Unsubscribe: send mail to [EMAIL PROTECTED]
with unsubscribe freebsd-current in the body of the message



Re: Today's panic on boot problem

2002-02-26 Thread Michael D. Harnois

On Tue, 2002-02-26 at 17:38, Peter Wemm wrote:

 You may like to try reverting this change:

A great idea, but unfortunately, incorrect ...
 
-- 
Michael D. Harnois   bilocational bivocational
Pastor, Redeemer Lutheran ChurchWashburn, Iowa
1L, UST School of Law   Minneapolis, Minnesota
 Censorship is the strongest drive in human nature; 
 sex is a weak second. -- Phil Kerby


To Unsubscribe: send mail to [EMAIL PROTECTED]
with unsubscribe freebsd-current in the body of the message



Re: Today's panic on boot problem

2002-02-26 Thread Mike Silbersack


On Tue, 26 Feb 2002, Mike Silbersack wrote:

 I reverted that change, and the double panic still occured.  :|

 FWIW, you're correct in that I'm not using the acpi module.

 Mike Silby Silbersack

Using ACPI doesn't help here either.  Hmph.  Can I get a kernel dump that
early in the boot process?  The dumpon manpage doesn't suggest a way as
far as I can tell.

Mike Silby Silbersack


To Unsubscribe: send mail to [EMAIL PROTECTED]
with unsubscribe freebsd-current in the body of the message



Re: Today's panic on boot problem

2002-02-26 Thread Mike Silbersack


On Tue, 26 Feb 2002, David Wolfskill wrote:

 Date: Tue, 26 Feb 2002 19:46:59 + (GMT)
 From: Mike Silbersack [EMAIL PROTECTED]

 Using ACPI doesn't help here either.  Hmph.  Can I get a kernel dump that
 early in the boot process?  The dumpon manpage doesn't suggest a way as
 far as I can tell.

 I was unable to get a dump, even though the panic dropped me into the
 debugger.

 Also, I only had the panic on my laptop -- the (SMP) build machine ran
 fine.

 Cheers,
 david   (links to my resume at http://www.catwhisker.org/~david)

Hm, sounds like UP got optimized out.

Ok, I guess we wait...

Mike Silby Silbersack



To Unsubscribe: send mail to [EMAIL PROTECTED]
with unsubscribe freebsd-current in the body of the message



Re: Today's panic on boot problem

2002-02-26 Thread Peter Wemm

Mike Silbersack wrote:
 
 On Tue, 26 Feb 2002, David Wolfskill wrote:
 
  Date: Tue, 26 Feb 2002 19:46:59 + (GMT)
  From: Mike Silbersack [EMAIL PROTECTED]
 
  Using ACPI doesn't help here either.  Hmph.  Can I get a kernel dump that
  early in the boot process?  The dumpon manpage doesn't suggest a way as
  far as I can tell.
 
  I was unable to get a dump, even though the panic dropped me into the
  debugger.
 
  Also, I only had the panic on my laptop -- the (SMP) build machine ran
  fine.
 
  Cheers,
  david   (links to my resume at http://www.catwhisker.org/~david)
 
 Hm, sounds like UP got optimized out.

Gah!  That would be a first. :(

 Ok, I guess we wait...
 
 Mike Silby Silbersack
 
 
 
 To Unsubscribe: send mail to [EMAIL PROTECTED]
 with unsubscribe freebsd-current in the body of the message
 

Cheers,
-Peter
--
Peter Wemm - [EMAIL PROTECTED]; [EMAIL PROTECTED]; [EMAIL PROTECTED]
All of this is for nothing if we don't go to the stars - JMS/B5


To Unsubscribe: send mail to [EMAIL PROTECTED]
with unsubscribe freebsd-current in the body of the message



Re: Today's panic on boot problem

2002-02-26 Thread Mike Silbersack


On Tue, 26 Feb 2002, Peter Wemm wrote:

 Mike Silbersack wrote:
 
  Hm, sounds like UP got optimized out.

 Gah!  That would be a first. :(

Well, until I can build a working kernel, I'll just assume that it's a
feature.

Mike Silby Silbersack


To Unsubscribe: send mail to [EMAIL PROTECTED]
with unsubscribe freebsd-current in the body of the message



Re: Today's panic on boot problem

2002-02-26 Thread Peter Wemm

Mike Silbersack wrote:
 
 On Tue, 26 Feb 2002, Peter Wemm wrote:
 
  Mike Silbersack wrote:
  _
   Hm, sounds like UP got optimized out.
 
  Gah!  That would be a first. :(
 
 Well, until I can build a working kernel, I'll just assume that it's a
 feature.

FWIW, turning off PG_G see_ms to help.  Change in pmap.c:
#if !defined(SMP) || defined(ENABLE_PG_G)
to:
#if /*!defined(SMP) ||*/ defined(ENABLE_PG_G)
and see how you go.  This got me past atkbd0, but it is a very worrying
sign.  I now get a vnode related panic. :-(

I've reread the changes about 50 times now and cannot for the life of me
see why it works for SMP but not UP.  I'm about ready to back it all out.

Cheers,
-Peter
--
Peter Wemm - [EMAIL PROTECTED]; [EMAIL PROTECTED]; [EMAIL PROTECTED]
All of this is for nothing if we don't go to the stars - JMS/B5


To Unsubscribe: send mail to [EMAIL PROTECTED]
with unsubscribe freebsd-current in the body of the message



Re: Today's panic on boot problem

2002-02-26 Thread Mike Silbersack


On Tue, 26 Feb 2002, Peter Wemm wrote:

 FWIW, turning off PG_G see_ms to help.  Change in pmap.c:
 #if !defined(SMP) || defined(ENABLE_PG_G)
 to:
 #if /*!defined(SMP) ||*/ defined(ENABLE_PG_G)
 and see how you go.  This got me past atkbd0, but it is a very worrying
 sign.  I now get a vnode related panic. :-(

 I've reread the changes about 50 times now and cannot for the life of me
 see why it works for SMP but not UP.  I'm about ready to back it all out.

 Cheers,
 -Peter

Disabling PG_G allows it to work here again as well.  Given the problems
we're experiencing, backing out the pmap changes of the last two days
seems like a good idea.

Mike Silby Silbersack


To Unsubscribe: send mail to [EMAIL PROTECTED]
with unsubscribe freebsd-current in the body of the message



Re: Today's panic on boot problem

2002-02-26 Thread Mike Silbersack


On Wed, 27 Feb 2002, Mike Silbersack wrote:

 Disabling PG_G allows it to work here again as well.  Given the problems
 we're experiencing, backing out the pmap changes of the last two days
 seems like a good idea.

 Mike Silby Silbersack

Well, I sorta take that back.  The box has been up for ~5 minutes now, and
the buildworld I started hasn't paniced it yet.  I guess this is workable.

Mike Silby Silbersack


To Unsubscribe: send mail to [EMAIL PROTECTED]
with unsubscribe freebsd-current in the body of the message



Re: Today's panic :-)

2001-02-24 Thread Julian Elischer

Warner Losh wrote:
 
 I've added INVARIANTS and WITNESS to my kernel.  Today I get a random
 panic on boot sometimes:
 
 lock order reseral  (this doesn't cause the panic, but
 does seem to happen all the time)
  1st vnode interlock last acquired @ ../../usr/ffs/ffs_fsops.c:396
  2nd 0xc04837a0 mntvnode @ ../../ufs/ffs/ffs_vfsops.c:457
  3rd 0xc80b9e8c vnode interlock @ ../../kern/vfs_subr.c:1872

I keep dropping into ddb with:
ad1: 99MB VMware Inc. Virtual Hard Drive [216/15/63] at ata0-slave WDMA2
Mounting root from ufs:/dev/ad0s1a
lock order reversal
 1st lockmgr last acquired @ ../../kern/kern_lock.c:239
 2nd 0xc085d654 ucred @ ../../kern/kern_prot.c:1177
 3rd 0xc02b65e0 lockmgr @ ../../kern/kern_lock.c:239
lock order reversal
 1st vnode interlock last acquired @ ../../kern/vfs_vnops.c:625
 2nd 0xc02d0a60 mntvnode @ ../../ufs/ffs/ffs_vfsops.c:940
 3rd 0xc3d7186c vnode interlock @ ../../ufs/ffs/ffs_vfsops.c:949
lock order reversal
 1st lockmgr last acquired @ ../../kern/kern_lock.c:239
 2nd 0xc02b8620 uidinfo hash @ ../../kern/kern_resource.c:765
 3rd 0xc02b65e0 lockmgr @ ../../kern/kern_lock.c:239
lock order reversal
 1st lockmgr last acquired @ ../../kern/kern_lock.c:239
 2nd 0xc08d321c uidinfo struct @ ../../kern/kern_resource.c:819
 3rd 0xc02b65e0 lockmgr @ ../../kern/kern_lock.c:239

hitting 'c' (for continue) allows me to go on


This with a kernel CVSUP'd in the last day.
I'll CVsup again as I vaguely remember a commit message that may affect this.




-- 
  __--_|\  Julian Elischer
 /   \ [EMAIL PROTECTED]
(   OZ) World tour 2000-2001
--- X_.---._/  
v

To Unsubscribe: send mail to [EMAIL PROTECTED]
with "unsubscribe freebsd-current" in the body of the message



Re: Today's panic :-)

2001-02-24 Thread Warner Losh

In message [EMAIL PROTECTED] Warner Losh writes:
: I note that this doesn't happen when the disks are clean on boot, but
: does happen when they are dirty.  The kernel is as of a cvsup 3pm MST
: today.  The kernel from 1am last night doesn't seem to have this
: problem.

Doesn't seem to have this problem that badly.  I just got a very
similar crash from it.  Maybe I'm just lucky with the older one.

Warner

To Unsubscribe: send mail to [EMAIL PROTECTED]
with "unsubscribe freebsd-current" in the body of the message



Re: Today's panic :-)

2001-02-24 Thread John Baldwin


On 24-Feb-01 Julian Elischer wrote:
 Warner Losh wrote:
 
 I've added INVARIANTS and WITNESS to my kernel.  Today I get a random
 panic on boot sometimes:
 
 lock order reseral  (this doesn't cause the panic, but
 does seem to happen all the time)
  1st vnode interlock last acquired @ ../../usr/ffs/ffs_fsops.c:396
  2nd 0xc04837a0 mntvnode @ ../../ufs/ffs/ffs_vfsops.c:457
  3rd 0xc80b9e8c vnode interlock @ ../../kern/vfs_subr.c:1872
 
 I keep dropping into ddb with:

Don't use WITNESS_DDB and you won't drop into ddb.  We're not far enough along
to make that very livable yet. :(

-- 

John Baldwin [EMAIL PROTECTED] -- http://www.FreeBSD.org/~jhb/
PGP Key: http://www.baldwin.cx/~john/pgpkey.asc
"Power Users Use the Power to Serve!"  -  http://www.FreeBSD.org/

To Unsubscribe: send mail to [EMAIL PROTECTED]
with "unsubscribe freebsd-current" in the body of the message



Re: Today's panic :-)

2001-02-24 Thread John Baldwin


On 24-Feb-01 Warner Losh wrote:
 In message [EMAIL PROTECTED] Bruce
 Evans writes:
: It seems to be another trap while holding sched_lock.  This should be
: fatal, but the problem is only detected because trap() enables
: interrupts.  Then an interrupt causes bad things to happen.  Unfortunately,
: the above omits the critical information: the instruction at sw1b+0x6b.
: There is no instruction at that address here.  It is apparently just an
: access to a swapped-out page for the new process.  I can't see how this
: ever worked.  The page must be faulted in, but this can't be done while
: sched_lock is held (not to mention after we have committed to switching
: contexts).
 
 sw1b+0x6b is ltr %si
 
 I note that this doesn't happen when the disks are clean on boot, but
 does happen when they are dirty.  The kernel is as of a cvsup 3pm MST
 today.  The kernel from 1am last night doesn't seem to have this
 problem.

Other people have reported this and I can't reproduce this.  The one case I
managed to track down so far involved proc0 having pcb_ext bogusly set,
resulting in cpu_switch() setting up a bogus GDT entry for a TSS and thus
generating a GPF which is the trap you see.  The enable_intr() in trap() then
sends things downhill fast.  I'm not sure yet why processes are having pcb_ext
bogusly set.  Hmm.  Make sure you have rev 1.35 or later of pcb.h.  Also,
try build a kernel from scratch from fresh sources..

 Warner

-- 

John Baldwin [EMAIL PROTECTED] -- http://www.FreeBSD.org/~jhb/
PGP Key: http://www.baldwin.cx/~john/pgpkey.asc
"Power Users Use the Power to Serve!"  -  http://www.FreeBSD.org/

To Unsubscribe: send mail to [EMAIL PROTECTED]
with "unsubscribe freebsd-current" in the body of the message



Today's panic :-)

2001-02-23 Thread Warner Losh


I've added INVARIANTS and WITNESS to my kernel.  Today I get a random
panic on boot sometimes:

lock order reseral  (this doesn't cause the panic, but
does seem to happen all the time)
 1st vnode interlock last acquired @ ../../usr/ffs/ffs_fsops.c:396
 2nd 0xc04837a0 mntvnode @ ../../ufs/ffs/ffs_vfsops.c:457
 3rd 0xc80b9e8c vnode interlock @ ../../kern/vfs_subr.c:1872
kernel trap 12 with interrupts disabled
panic: runq-add: proc 0xc7b28ee0 (fsck_ufs) not SRUN
Debugger("panic")
Stopeed at Debugger+0x44: pushl %ebx
db trace
Debugger(c03d3c03) at Debugger+0x44
panic(c03d4040,c7b28ee0,c7b290a5,282,c7b2b960) at panic+0x70
runq_add(c046ae20,c7b28ee0,c8a4bcra4,c0221ee5,c7b28ee0) at runq_add+0x40
setrunqueue(c7b28ee0) at setrunqueue+0x10
ithread_schedule(c0f0a00,1) at ithread_schedule+0x129
sched_ithd(e) at sched_ithrd+0x3f
Xresume14() at Xresume14+0x8
--- interrupt, eip = 0xc03830fb, esp = 0x286, ebp = 0xc8a4bd34 ---
trap(18,10,10,73b152,0) at trap+0x9b
calltrap() at calltrap+0x5
--- trap 0xc, eip = 0xc03822e9, esp = 0xc8a4bd7c, ebp = 0xc8a4bd90 ---
sw1b(0,...) at sw1b+0x6b
msleep(...) at msleep+0x588
physio(...) at physio+0x30d
spec_read(...) at spec_read+0x71
ufsspec_read(...) at ufsspec_read+0x20
ufs_noperatespec(...) at ufs_noperatespec+0x15
vn_read(...) at vn_read+0x128
dofileread(...) at dofileread+0xb0
read(...) at read+0x36
syscall(...) at syscall+0x551
Xint0x80_syscall() at Xint0x80_syscall+0x23
--- syscall 0x3, eip = 0x8054770, esp = 0xbfbfef60, ebp = 0xbfbfef9c ---
db

Anything that I can do to help?  I don't have a core dump of this, but
it is happening often enough to be a pain.

Warner

To Unsubscribe: send mail to [EMAIL PROTECTED]
with "unsubscribe freebsd-current" in the body of the message



Re: Today's panic :-)

2001-02-23 Thread Bruce Evans

On Fri, 23 Feb 2001, Warner Losh wrote:

 I've added INVARIANTS and WITNESS to my kernel.  Today I get a random
 panic on boot sometimes:
 
 lock order reseral(this doesn't cause the panic, but
   does seem to happen all the time)
  1st vnode interlock last acquired @ ../../usr/ffs/ffs_fsops.c:396
  2nd 0xc04837a0 mntvnode @ ../../ufs/ffs/ffs_vfsops.c:457
  3rd 0xc80b9e8c vnode interlock @ ../../kern/vfs_subr.c:1872
 kernel trap 12 with interrupts disabled
 panic: runq-add: proc 0xc7b28ee0 (fsck_ufs) not SRUN
 Debugger("panic")
 Stopeed at Debugger+0x44: pushl %ebx
 db trace
 Debugger(c03d3c03) at Debugger+0x44
 panic(c03d4040,c7b28ee0,c7b290a5,282,c7b2b960) at panic+0x70
 runq_add(c046ae20,c7b28ee0,c8a4bcra4,c0221ee5,c7b28ee0) at runq_add+0x40
 setrunqueue(c7b28ee0) at setrunqueue+0x10
 ithread_schedule(c0f0a00,1) at ithread_schedule+0x129
 sched_ithd(e) at sched_ithrd+0x3f
 Xresume14() at Xresume14+0x8
 --- interrupt, eip = 0xc03830fb, esp = 0x286, ebp = 0xc8a4bd34 ---
 trap(18,10,10,73b152,0) at trap+0x9b
 calltrap() at calltrap+0x5
 --- trap 0xc, eip = 0xc03822e9, esp = 0xc8a4bd7c, ebp = 0xc8a4bd90 ---
 sw1b(0,...) at sw1b+0x6b
 msleep(...) at msleep+0x588
 physio(...) at physio+0x30d
 spec_read(...) at spec_read+0x71
 ufsspec_read(...) at ufsspec_read+0x20
 ufs_noperatespec(...) at ufs_noperatespec+0x15
 vn_read(...) at vn_read+0x128
 dofileread(...) at dofileread+0xb0
 read(...) at read+0x36
 syscall(...) at syscall+0x551
 Xint0x80_syscall() at Xint0x80_syscall+0x23
 --- syscall 0x3, eip = 0x8054770, esp = 0xbfbfef60, ebp = 0xbfbfef9c ---
 db
 
 Anything that I can do to help?  I don't have a core dump of this, but
 it is happening often enough to be a pain.

It seems to be another trap while holding sched_lock.  This should be
fatal, but the problem is only detected because trap() enables
interrupts.  Then an interrupt causes bad things to happen.  Unfortunately,
the above omits the critical information: the instruction at sw1b+0x6b.
There is no instruction at that address here.  It is apparently just an
access to a swapped-out page for the new process.  I can't see how this
ever worked.  The page must be faulted in, but this can't be done while
sched_lock is held (not to mention after we have committed to switching
contexts).

Bruce


To Unsubscribe: send mail to [EMAIL PROTECTED]
with "unsubscribe freebsd-current" in the body of the message



Re: Today's panic :-)

2001-02-23 Thread Warner Losh

In message [EMAIL PROTECTED] Bruce Evans 
writes:
: It seems to be another trap while holding sched_lock.  This should be
: fatal, but the problem is only detected because trap() enables
: interrupts.  Then an interrupt causes bad things to happen.  Unfortunately,
: the above omits the critical information: the instruction at sw1b+0x6b.
: There is no instruction at that address here.  It is apparently just an
: access to a swapped-out page for the new process.  I can't see how this
: ever worked.  The page must be faulted in, but this can't be done while
: sched_lock is held (not to mention after we have committed to switching
: contexts).

sw1b+0x6b is ltr %si

I note that this doesn't happen when the disks are clean on boot, but
does happen when they are dirty.  The kernel is as of a cvsup 3pm MST
today.  The kernel from 1am last night doesn't seem to have this
problem.

Warner

To Unsubscribe: send mail to [EMAIL PROTECTED]
with "unsubscribe freebsd-current" in the body of the message



today's panic

2000-10-09 Thread Igor Timkin

Today's current (buildworld+build kernel), check out at ~10.00 GMT.
I have this problem during 6-7 days, the stable version for me:
FreeBSD newsfeed.gamma.ru 5.0-CURRENT FreeBSD 5.0-CURRENT #0: Mon Oct  2 21:56:00 MSD 
2000 [EMAIL PROTECTED]:/usr/obj/usr/src/sys/NEWSFEED  i386
I haven't any problems till this date.

Server: P2B-DS, BIOS rev. 1012.

kgdb:
(kgdb) symbol-file kernel.debug
Reading symbols from kernel.debug...done.
(kgdb) exec-file /news/crash/kernel.0
(kgdb) core-file /news/crash/vmcore.0
SMP 2 cpus
IdlePTD 2813952
initial pcb at 23e720
panicstr: page fault
panic messages:
---
Fatal trap 12: page fault while in kernel mode
cpuid = 1; lapic.id = 
fault virtual address   = 0xc
fault code  = supervisor read, page not present
instruction pointer = 0x8:0xc015d616
stack pointer   = 0x10:0xe2312da4
frame pointer   = 0x10:0xe2312dd4
code segment= base 0x0, limit 0xf, type 0x1b
= DPL 0, pres 1, def32 1, gran 1
processor eflags= interrupt enabled, resume, IOPL = 0
current process = 388 (innd)
trap number = 12
panic: page fault
cpuid = 1; lapic.id = 
boot() called on cpu#1

syncing disks... 20 20 20 20 20 20 20 20 20 20 20 20 20 20 20 20 20 20 20 20 
giving up on 20 buffers
Uptime: 1m46s

dumping to dev #da/25, offset 80572
dump 1023 1022 1021 1020 1019 1018 1017 1016 1015 1014 1013 1012 1011 1010 1009 [...]
 6 5 4 3 2 1 0 
---
#0  dumpsys () at /usr/src/sys/kern/kern_shutdown.c:476
476 dumppcb.pcb_cr3 = rcr3();
(kgdb) bt
#0  dumpsys () at /usr/src/sys/kern/kern_shutdown.c:476
#1  0xc014a52f in boot (howto=256) at /usr/src/sys/kern/kern_shutdown.c:317
#2  0xc014a954 in poweroff_wait (junk=0xc0217b6f, howto=-611008192)
at /usr/src/sys/kern/kern_shutdown.c:569
#3  0xc01eb544 in trap_fatal (frame=0xe2312d64, eva=12)
at /usr/src/sys/i386/i386/trap.c:939
#4  0xc01eb2a5 in trap_pfault (frame=0xe2312d64, usermode=0, eva=12)
at /usr/src/sys/i386/i386/trap.c:853
#5  0xc01eae03 in trap (frame={tf_fs = -500105192, tf_es = -500105200, 
  tf_ds = -1072300016, tf_edi = 49, tf_esi = 64, tf_ebp = -500093484, 
  tf_isp = -500093552, tf_ebx = -1015227740, tf_edx = 0, 
  tf_ecx = -611008192, tf_eax = 0, tf_trapno = 12, tf_err = 0, 
  tf_eip = -1072310762, tf_cs = 8, tf_eflags = 66050, 
  tf_esp = -1015227740, tf_ss = 64}) at /usr/src/sys/i386/i386/trap.c:436
#6  0xc015d616 in selscan (p=0xdb94c140, ibits=0xe2312e28, obits=0xe2312e1c, 
nfd=62) at /usr/src/sys/sys/file.h:187
182 struct proc *p;
183 {
184 int error;
185
186 fhold(fp);
187 error = (*fp-f_ops-fo_poll)(fp, events, cred, p);
188 fdrop(fp, p);
189 return (error);
190 }
191
(kgdb) print *fp   
$1 = {f_list = {le_next = 0x0, le_prev = 0x0}, f_flag = 0, f_type = 0, 
  f_count = 0, f_msgcount = 0, f_cred = 0x0, f_ops = 0x0, f_seqcount = 0, 
 ^^^
  f_nextoff = 0, f_offset = 0, f_data = 0x0}
#7  0xc015d371 in select (p=0xdb94c140, uap=0xe2312f80)
at /usr/src/sys/kern/sys_generic.c:709
#8  0xc01eb960 in syscall2 (frame={tf_fs = 47, tf_es = 47, 
  tf_ds = -1078001617, tf_edi = 300, tf_esi = 134660576, 
  tf_ebp = -1077937204, tf_isp = -500092972, tf_ebx = -1077937332, 
  tf_edx = 246, tf_ecx = -1, tf_eax = 93, tf_trapno = 7, tf_err = 2, 
  tf_eip = 672214604, tf_cs = 31, tf_eflags = 515, tf_esp = -1077937568, 
  tf_ss = 47}) at /usr/src/sys/i386/i386/trap.c:1139
---Type return to continue, or q return to quit---
#9  0xc01dc294 in Xint0x80_syscall ()
#10 0x805707d in ?? ()
#11 0x804a671 in ?? ()

dmesg -v:
Copyright (c) 1992-2000 The FreeBSD Project.
Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994
The Regents of the University of California. All rights reserved.
FreeBSD 5.0-CURRENT #0: Mon Oct  9 16:11:26 MSD 2000
[EMAIL PROTECTED]:/usr/obj/usr/src/sys/NEWSFEED
Timecounter "i8254"  frequency 1193182 Hz
CPU: Pentium III/Pentium III Xeon/Celeron (451.03-MHz 686-class CPU)
Origin = "GenuineIntel"  Id = 0x673  Stepping = 3
Features=0x387fbffFPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,PN,MMX,FXSR,SSE
real memory  = 1073729536 (1048564K bytes)
avail memory = 1042202624 (1017776K bytes)
Programming 24 pins in IOAPIC #0
IOAPIC #0 intpin 2 - irq 0
FreeBSD/SMP: Multiprocessor motherboard
cpu0 (BSP): apic id:  1, version: 0x00040011, at 0xfee0
cpu1 (AP):  apic id:  0, version: 0x00040011, at 0xfee0
io0 (APIC): apic id:  2, version: 0x00170011, at 0xfec0
Preloaded elf kernel "kernel" at 0xc029d000.
Pentium Pro MTRR support enabled
npx0: math processor on motherboard
npx0: INT 16 interface
pcib0: Intel 82443BX (440 BX) host to PCI bridge at pcibus 0 on motherboard
pci0: PCI bus on pcib0 pcib1: Intel 82443BX (440 BX) PCI-PCI (AGP) bridge at 
device 1.0 on pci0
pci1: PCI bus on pcib1
isab0: Intel 

Re: today's panic

2000-10-09 Thread Igor Timkin

 Today's current (buildworld+build kernel), check out at ~10.00 GMT.
 I have this problem during 6-7 days, the stable version for me:
 FreeBSD newsfeed.gamma.ru 5.0-CURRENT FreeBSD 5.0-CURRENT #0: Mon Oct  2 21:56:00 
MSD 2000 [EMAIL PROTECTED]:/usr/obj/usr/src/sys/NEWSFEED  i386
 I haven't any problems till this date.

The same proble on another server (no crash dump).
IP: c01502d8
nm /boot/kernel/kernel | grep c0150 | sort
c015025c t pollscan
c015031c T openbsd_poll
c015032c T seltrue
c0150338 T selrecord
c0150378 T selwakeup
c0150400 T pipe
c0150588 t pipespace
c01505f0 t pipeinit
c0150690 t pipe_read
c015098c t pipe_build_write_buffer
c0150b44 t pipe_destroy_write_buffer
c0150bc8 t pipe_clone_write_buffer
c0150c04 t pipe_direct_write
c0150edc t pipe_write

dmesg:
Copyright (c) 1992-2000 The FreeBSD Project.
Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994
The Regents of the University of California. All rights reserved.
FreeBSD 5.0-CURRENT #0: Mon Oct  9 14:41:38 MSD 2000
[EMAIL PROTECTED]:/usr/obj/usr/src/sys/MAIL
Timecounter "i8254"  frequency 1193182 Hz
Timecounter "TSC"  frequency 300682964 Hz
CPU: Pentium II/Pentium II Xeon/Celeron (300.68-MHz 686-class CPU)
Origin = "GenuineIntel"  Id = 0x633  Stepping = 3
Features=0x80f9ffFPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,SEP,MTRR,PGE,MCA,CMOV,MMX
real memory  = 134217728 (131072K bytes)
avail memory = 127950848 (124952K bytes)
Preloaded elf kernel "kernel" at 0xc0292000.
Pentium Pro MTRR support enabled
npx0: math processor on motherboard
npx0: INT 16 interface
pcib0: Intel 82443LX (440 LX) host to PCI bridge at pcibus 0 on motherboard
pci0: PCI bus on pcib0
pcib1: Intel 82443LX (440 LX) PCI-PCI (AGP) bridge at device 1.0 on pci0
pci1: PCI bus on pcib1
isab0: Intel 82371AB PCI to ISA bridge at device 4.0 on pci0
isa0: ISA bus on isab0
atapci0: Intel PIIX4 ATA33 controller port 0xd800-0xd80f at device 4.1 on pci0
atapci0: Busmastering DMA enabled
ata0: at 0x1f0 irq 14 on atapci0
ata1: at 0x170 irq 15 on atapci0
pci0: Intel 82371AB/EB (PIIX4) USB controller at 4.2
pci0: Intel 82371AB Power management
controller at 4.3
pci0: unknown card (vendor=0x9004, dev=0x8078) at 6.0 irq 9
de0: Digital 21140A Fast Ethernet port 0xb800-0xb87f mem 0xe280-0xe280007f irq 9 
at device 9.0 on pci0
de0: 21140A [10-100Mb/s] pass 2.2
de0: address 00:40:33:a2:ae:fe
de1: Digital 21140A Fast Ethernet port 0xb400-0xb47f mem 0xe200-0xe27f irq 
12 at device 10.0 on pci0
de1: 21140A [10-100Mb/s] pass 2.2
de1: address 00:40:33:a2:af:01
ed0: NE2000 PCI Ethernet (RealTek 8029) port 0xb000-0xb01f irq 10 at device 11.0 on 
pci0
ed0: address 00:00:1c:3a:3a:39, type NE2000 (16 bit)
pci0: S3 Trio graphics accelerator at 12.0 irq 11
atkbdc0: Keyboard controller (i8042) at port 0x60,0x64 on isa0
atkbd0: AT Keyboard flags 0x1 irq 1 on atkbdc0
kbd0 at atkbd0
ed1 at port 0x240-0x25f iomem 0xd8000 irq 5 on isa0
ed1: address 00:c0:6c:54:12:37, type NE2000 (16 bit)
fdc0: NEC 72065B or clone at port 0x3f0-0x3f5,0x3f7 irq 6 drq 2 on isa0
fdc0: FIFO enabled, 8 bytes threshold 
fd0: 1440-KB 3.5" drive on fdc0 drive 0
sc0: System console at flags 0x100 on isa0
sc0: VGA 16 virtual consoles, flags=0x300
sio0 at port 0x3f8-0x3ff irq 4 flags 0x10 on isa0
sio0: type 16550A
sio1 at port 0x2f8-0x2ff irq 3 on isa0
sio1: type 16550A
vga0: Generic ISA VGA at port 0x3c0-0x3df iomem 0xa-0xb on isa0
unknown: PNP0501 can't assign resources
unknown: PNP0501 can't assign resources
unknown: PNP0700 can't assign resources
unknown: PNP0303 can't assign resources
DUMMYNET initialized (000608)
IP packet filtering initialized, divert disabled, rule-based forwarding enabled, 
default to deny, logging disabled
ad0: 7339MB QUANTUM FIREBALL EL7.6A [15907/15/63] at ata0-master UDMA33
ad2: 7339MB QUANTUM FIREBALL EL7.6A [15907/15/63] at ata1-master UDMA33
Mounting root from ufs:/dev/ad0a
WARNING: / was not properly dismounted
de0: enabling 100baseTX port
de1: enabling 100baseTX port
vinum: loaded
vinum: reading configuration from /dev/ad2s1d
vinum: updating configuration from /dev/ad0s1d
de1: link down: cable problem?
de1: enabling 10baseT port
de1: enabling Full Duplex 10baseT port
link_elf: symbol card_set_res_flags_desc undefined


To Unsubscribe: send mail to [EMAIL PROTECTED]
with "unsubscribe freebsd-current" in the body of the message