Re: kernel bug (ufs2?) on a dell 2600

2005-06-14 Thread Geoffroy Desvernay
It's not related to more-than-full fs: it occured one more time without
it :(

Do someone have an idea ?

Geoffroy Desvernay a écrit :
 This server (FreeBSD 5.4 RELENG) is crashing once a week or more since
 5.4 (maybe before).
 
 It may be related with a full filesystem:
 I'm using snapshots on this server (using
 http://people.freebsd.org/~rse/snapshot/), and crash has occured after
 (~30mins) a snapshot that fills up to 100% the filesystem.
 
 Attached the dmesg and kgdb logs.
 
 I'm not so hacker, but hope that it can help to resolve this bug.
 
 
 
 
 
 
 [GDB will not be able to debug user-mode threads: /usr/lib/libthread_db.so: 
 Undefined symbol ps_pglobal_lookup]
 GNU gdb 6.1.1 [FreeBSD]
 Copyright 2004 Free Software Foundation, Inc.
 GDB is free software, covered by the GNU General Public License, and you are
 welcome to change it and/or distribute copies of it under certain conditions.
 Type show copying to see the conditions.
 There is absolutely no warranty for GDB.  Type show warranty for details.
 This GDB was configured as i386-marcel-freebsd.
 #0  doadump () at pcpu.h:160
 160   __asm __volatile(movl %%fs:0,%0 : =r (td));
 (kgdb) bt full
 #0  doadump () at pcpu.h:160
 No locals.
 #1  0xc06878d6 in boot (howto=260) at /usr/src/sys/kern/kern_shutdown.c:410
   first_buf_printf = 1
 #2  0xc0687cc4 in panic (fmt=0xc091a1ae initiate_write_inodeblock_ufs2: 
 already started)
 at /usr/src/sys/kern/kern_shutdown.c:566
   td = (struct thread *) 0xc3641c00
   bootopt = 260
   newpanic = 0
   ap = 0xc3641c00 \\\214\220Ã -XÃ
   buf = initiate_write_inodeblock_ufs2: already started, '\0' repeats 
 208 times
 #3  0xc080ef5f in initiate_write_inodeblock_ufs2 (inodedep=0xc5be0280, bp=0x0)
 at /usr/src/sys/ufs/ffs/ffs_softdep.c:3781
   adp = (struct allocdirect *) 0xd75bd13c
   lastadp = (struct allocdirect *) 0x1000
   dp = (struct ufs2_dinode *) 0x0
   fs = (struct fs *) 0xc21f6730
   i = Unhandled dwarf expression opcode 0x93
 (kgdb) quit
 
 
 
 
 Copyright (c) 1992-2005 The FreeBSD Project.
 Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994
   The Regents of the University of California. All rights reserved.
 FreeBSD 5.4-STABLE #0: Mon Jun  6 18:51:49 CEST 2005
 [EMAIL PROTECTED]:/usr/obj/usr/src/sys/ZLIP
 Timecounter i8254 frequency 1193182 Hz quality 0
 CPU: Intel(R) Xeon(TM) CPU 2.40GHz (2392.29-MHz 686-class CPU)
   Origin = GenuineIntel  Id = 0xf29  Stepping = 9
   
 Features=0xbfebfbffFPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,CLFLUSH,DTS,ACPI,MMX,FXSR,SSE,SSE2,SS,HTT,TM,PBE
   Hyperthreading: 2 logical CPUs
 real memory  = 2147287040 (2047 MB)
 avail memory = 2095828992 (1998 MB)
 ACPI APIC Table: DELL   PE2600  
 FreeBSD/SMP: Multiprocessor System Detected: 4 CPUs
  cpu0 (BSP): APIC ID:  0
  cpu1 (AP): APIC ID:  1
  cpu2 (AP): APIC ID:  6
  cpu3 (AP): APIC ID:  7
 ioapic0: Changing APIC ID to 8
 ioapic1: Changing APIC ID to 9
 ioapic2: Changing APIC ID to 10
 ioapic2: WARNING: intbase 72 != expected base 48
 ioapic3: Changing APIC ID to 11
 ioapic3: WARNING: intbase 120 != expected base 96
 ioapic4: Changing APIC ID to 12
 ioapic0 Version 2.0 irqs 0-23 on motherboard
 ioapic1 Version 2.0 irqs 24-47 on motherboard
 ioapic2 Version 2.0 irqs 72-95 on motherboard
 ioapic3 Version 2.0 irqs 120-143 on motherboard
 ioapic4 Version 2.0 irqs 144-167 on motherboard
 npx0: math processor on motherboard
 npx0: INT 16 interface
 acpi0: DELL PE2600 on motherboard
 acpi0: Power Button (fixed)
 Timecounter ACPI-safe frequency 3579545 Hz quality 1000
 acpi_timer0: 24-bit timer at 3.579545MHz port 0x808-0x80b on acpi0
 cpu0: ACPI CPU on acpi0
 cpu1: ACPI CPU on acpi0
 cpu2: ACPI CPU on acpi0
 cpu3: ACPI CPU on acpi0
 pcib0: ACPI Host-PCI bridge port 0xcf8-0xcff on acpi0
 pci0: ACPI PCI bus on pcib0
 pcib1: ACPI PCI-PCI bridge at device 2.0 on pci0
 pci1: ACPI PCI bus on pcib1
 pci1: base peripheral, interrupt controller at device 28.0 (no driver 
 attached)
 pcib2: ACPI PCI-PCI bridge at device 29.0 on pci1
 pci2: ACPI PCI bus on pcib2
 em0: Intel(R) PRO/1000 Network Connection, Version - 1.7.35 port 
 0xece0-0xecff mem 0xfdec-0xfded,0xfdee-0xfdef irq 24 at 
 device 2.0 on pci2
 em0: Ethernet address: 00:02:b3:d4:d3:a2
 em0:  Speed:N/A  Duplex:N/A
 pci1: base peripheral, interrupt controller at device 30.0 (no driver 
 attached)
 pcib3: ACPI PCI-PCI bridge at device 31.0 on pci1
 pci3: ACPI PCI bus on pcib3
 em1: Intel(R) PRO/1000 Network Connection, Version - 1.7.35 port 
 0xdce0-0xdcff mem 0xfdcc-0xfdcd,0xfdce-0xfdcf irq 28 at 
 device 1.0 on pci3
 em1: Ethernet address: 00:0b:db:92:0a:e4
 em1:  Speed:N/A  Duplex:N/A
 pcib4: ACPI PCI-PCI bridge at device 3.0 on pci0
 pci4: ACPI PCI bus on pcib4
 pci4: base peripheral, 

kernel bug (ufs2?) on a dell 2600

2005-06-13 Thread Geoffroy Desvernay
This server (FreeBSD 5.4 RELENG) is crashing once a week or more since
5.4 (maybe before).

It may be related with a full filesystem:
I'm using snapshots on this server (using
http://people.freebsd.org/~rse/snapshot/), and crack has occured after
(~30mins) a snapshot that fills up to 100% the filesystem.

Attached the dmesg and kgdb logs.

I'm not so hacker, but hope that it can help to resolve this bug.


[GDB will not be able to debug user-mode threads: /usr/lib/libthread_db.so: 
Undefined symbol ps_pglobal_lookup]
GNU gdb 6.1.1 [FreeBSD]
Copyright 2004 Free Software Foundation, Inc.
GDB is free software, covered by the GNU General Public License, and you are
welcome to change it and/or distribute copies of it under certain conditions.
Type show copying to see the conditions.
There is absolutely no warranty for GDB.  Type show warranty for details.
This GDB was configured as i386-marcel-freebsd.
#0  doadump () at pcpu.h:160
160 __asm __volatile(movl %%fs:0,%0 : =r (td));
(kgdb) bt full
#0  doadump () at pcpu.h:160
No locals.
#1  0xc06878d6 in boot (howto=260) at /usr/src/sys/kern/kern_shutdown.c:410
first_buf_printf = 1
#2  0xc0687cc4 in panic (fmt=0xc091a1ae initiate_write_inodeblock_ufs2: 
already started)
at /usr/src/sys/kern/kern_shutdown.c:566
td = (struct thread *) 0xc3641c00
bootopt = 260
newpanic = 0
ap = 0xc3641c00 \\\214\220Ã -XÃ
buf = initiate_write_inodeblock_ufs2: already started, '\0' repeats 
208 times
#3  0xc080ef5f in initiate_write_inodeblock_ufs2 (inodedep=0xc5be0280, bp=0x0)
at /usr/src/sys/ufs/ffs/ffs_softdep.c:3781
adp = (struct allocdirect *) 0xd75bd13c
lastadp = (struct allocdirect *) 0x1000
dp = (struct ufs2_dinode *) 0x0
fs = (struct fs *) 0xc21f6730
i = Unhandled dwarf expression opcode 0x93
(kgdb) quit
Copyright (c) 1992-2005 The FreeBSD Project.
Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994
The Regents of the University of California. All rights reserved.
FreeBSD 5.4-STABLE #0: Mon Jun  6 18:51:49 CEST 2005
[EMAIL PROTECTED]:/usr/obj/usr/src/sys/ZLIP
Timecounter i8254 frequency 1193182 Hz quality 0
CPU: Intel(R) Xeon(TM) CPU 2.40GHz (2392.29-MHz 686-class CPU)
  Origin = GenuineIntel  Id = 0xf29  Stepping = 9
  
Features=0xbfebfbffFPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,CLFLUSH,DTS,ACPI,MMX,FXSR,SSE,SSE2,SS,HTT,TM,PBE
  Hyperthreading: 2 logical CPUs
real memory  = 2147287040 (2047 MB)
avail memory = 2095828992 (1998 MB)
ACPI APIC Table: DELL   PE2600  
FreeBSD/SMP: Multiprocessor System Detected: 4 CPUs
 cpu0 (BSP): APIC ID:  0
 cpu1 (AP): APIC ID:  1
 cpu2 (AP): APIC ID:  6
 cpu3 (AP): APIC ID:  7
ioapic0: Changing APIC ID to 8
ioapic1: Changing APIC ID to 9
ioapic2: Changing APIC ID to 10
ioapic2: WARNING: intbase 72 != expected base 48
ioapic3: Changing APIC ID to 11
ioapic3: WARNING: intbase 120 != expected base 96
ioapic4: Changing APIC ID to 12
ioapic0 Version 2.0 irqs 0-23 on motherboard
ioapic1 Version 2.0 irqs 24-47 on motherboard
ioapic2 Version 2.0 irqs 72-95 on motherboard
ioapic3 Version 2.0 irqs 120-143 on motherboard
ioapic4 Version 2.0 irqs 144-167 on motherboard
npx0: math processor on motherboard
npx0: INT 16 interface
acpi0: DELL PE2600 on motherboard
acpi0: Power Button (fixed)
Timecounter ACPI-safe frequency 3579545 Hz quality 1000
acpi_timer0: 24-bit timer at 3.579545MHz port 0x808-0x80b on acpi0
cpu0: ACPI CPU on acpi0
cpu1: ACPI CPU on acpi0
cpu2: ACPI CPU on acpi0
cpu3: ACPI CPU on acpi0
pcib0: ACPI Host-PCI bridge port 0xcf8-0xcff on acpi0
pci0: ACPI PCI bus on pcib0
pcib1: ACPI PCI-PCI bridge at device 2.0 on pci0
pci1: ACPI PCI bus on pcib1
pci1: base peripheral, interrupt controller at device 28.0 (no driver 
attached)
pcib2: ACPI PCI-PCI bridge at device 29.0 on pci1
pci2: ACPI PCI bus on pcib2
em0: Intel(R) PRO/1000 Network Connection, Version - 1.7.35 port 
0xece0-0xecff mem 0xfdec-0xfded,0xfdee-0xfdef irq 24 at device 
2.0 on pci2
em0: Ethernet address: 00:02:b3:d4:d3:a2
em0:  Speed:N/A  Duplex:N/A
pci1: base peripheral, interrupt controller at device 30.0 (no driver 
attached)
pcib3: ACPI PCI-PCI bridge at device 31.0 on pci1
pci3: ACPI PCI bus on pcib3
em1: Intel(R) PRO/1000 Network Connection, Version - 1.7.35 port 
0xdce0-0xdcff mem 0xfdcc-0xfdcd,0xfdce-0xfdcf irq 28 at device 
1.0 on pci3
em1: Ethernet address: 00:0b:db:92:0a:e4
em1:  Speed:N/A  Duplex:N/A
pcib4: ACPI PCI-PCI bridge at device 3.0 on pci0
pci4: ACPI PCI bus on pcib4
pci4: base peripheral, interrupt controller at device 28.0 (no driver 
attached)
pcib5: ACPI PCI-PCI bridge at device 29.0 on pci4
pci5: ACPI PCI bus on pcib5
pci4: base peripheral, interrupt controller at device 30.0 (no driver 
attached)
pcib6: ACPI PCI-PCI bridge at device 31.0 on pci4
pci6: ACPI PCI bus on pcib6
bge0: Broadcom BCM5703 Gigabit Ethernet, ASIC rev. 0x1002 mem 
0xfd8f-0xfd8f