PowerEdge 2850 AMD64 panics

2005-12-07 Thread bsd
This is my first post to the FreeBSD mailing list, and unfortunately it's
because I've been having a vexing problem with one of my servers.

The machine in question is a Dell PowerEdge 2850 with two 3 GHz EM64T Xeons,
2 GB of RAM, and a PERC 4e/Di with six 300 GB drives.

This machine's primary function is as a mail server running Exim.

The dmesg output from this machine is below.

This machine is running the AMD64 port of FreeBSD. Often, when the volume
of incoming mail gets high, the machine panics and reboots. I have tried to
capture a crash dump, but the dump always fails before it completes, usually
before it has written more than 24 MB. The few times I have seen the console
when this happened, the panic had something to do with UFS. I have upgraded
the BIOS and firmware on everything, which increased the stability of the
machine. I also turned off soft updates on all the file systems, which
increased stability further, but there is still a problem. Any
pointers, hints, tips, or solutions would be greatly appreciated. Also, if
more information is needed, let me know and I will be happy to provide it.

Thanks in advance for any help you may be able to give.

Stacy Anable
Rio Communications




The Regents of the University of California. All rights reserved.
FreeBSD 5.4-RELEASE-p7 #0: Thu Oct  6 11:13:54 PDT 2005
[EMAIL PROTECTED]:/usr/obj/usr/src/sys/MAIL
Timecounter i8254 frequency 1193182 Hz quality 0
CPU: Intel(R) Xeon(TM) CPU 3.00GHz (2992.71-MHz K8-class CPU)
  Origin = GenuineIntel  Id = 0xf43  Stepping = 3
  Features=0xbfebfbff<FPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,CLFLUSH,DTS,ACPI,MMX,FXSR,SSE,SSE2,SS,HTT,TM,PBE>
  Features2=0x641d<SSE3,RSVD2,MON,DS_CPL,CNTX-ID,CX16,b14>
  AMD Features=0x20100800<SYSCALL,NX,LM>
  Hyperthreading: 2 logical CPUs
real memory  = 2147221504 (2047 MB)
avail memory = 2064646144 (1969 MB)
ACPI APIC Table: DELL   PE BKC  
FreeBSD/SMP: Multiprocessor System Detected: 4 CPUs
 cpu0 (BSP): APIC ID:  0
 cpu1 (AP): APIC ID:  1
 cpu2 (AP): APIC ID:  6
 cpu3 (AP): APIC ID:  7
ioapic0: Changing APIC ID to 8
ioapic1: Changing APIC ID to 9
ioapic1: WARNING: intbase 32 != expected base 24
ioapic2: Changing APIC ID to 10
ioapic2: WARNING: intbase 64 != expected base 56
ioapic3: Changing APIC ID to 11
ioapic3: WARNING: intbase 96 != expected base 88
ioapic0 Version 2.0 irqs 0-23 on motherboard
ioapic1 Version 2.0 irqs 32-55 on motherboard
ioapic2 Version 2.0 irqs 64-87 on motherboard
ioapic3 Version 2.0 irqs 96-119 on motherboard
acpi0: DELL PE BKC on motherboard
acpi0: Power Button (fixed)
Timecounter ACPI-fast frequency 3579545 Hz quality 1000
acpi_timer0: 24-bit timer at 3.579545MHz port 0x808-0x80b on acpi0
cpu0: ACPI CPU on acpi0
cpu1: ACPI CPU on acpi0
cpu2: ACPI CPU on acpi0
cpu3: ACPI CPU on acpi0
pcib0: ACPI Host-PCI bridge port 0xcf8-0xcff on acpi0
pci0: ACPI PCI bus on pcib0
pcib1: ACPI PCI-PCI bridge at device 2.0 on pci0
pci1: ACPI PCI bus on pcib1
pcib2: ACPI PCI-PCI bridge at device 0.0 on pci1
pci2: ACPI PCI bus on pcib2
amr0: LSILogic MegaRAID 1.51 mem 0xdfec-0xdfef,0xd80f-0xd80f irq 46 at device 14.0 on pci2
amr0: LSILogic PERC 4e/Di Firmware 521S, BIOS H430, 256MB RAM
pcib3: ACPI PCI-PCI bridge at device 0.2 on pci1
pci3: ACPI PCI bus on pcib3
pcib4: ACPI PCI-PCI bridge at device 4.0 on pci0
pci4: ACPI PCI bus on pcib4
pcib5: ACPI PCI-PCI bridge at device 5.0 on pci0
pci5: ACPI PCI bus on pcib5
pcib6: ACPI PCI-PCI bridge at device 0.0 on pci5
pci6: ACPI PCI bus on pcib6
em0: Intel(R) PRO/1000 Network Connection, Version - 1.7.35 port 0xecc0-0xecff mem 0xdfbe-0xdfbf irq 64 at device 7.0 on pci6
em0: Ethernet address: 00:14:22:10:4f:c2
em0:  Speed:N/A  Duplex:N/A
pcib7: ACPI PCI-PCI bridge at device 0.2 on pci5
pci7: ACPI PCI bus on pcib7
em1: Intel(R) PRO/1000 Network Connection, Version - 1.7.35 port 0xdcc0-0xdcff mem 0xdf9e-0xdf9f irq 65 at device 8.0 on pci7
em1: Ethernet address:
em1:  Speed:N/A  Duplex:N/A
pcib8: ACPI PCI-PCI bridge at device 6.0 on pci0
pci8: ACPI PCI bus on pcib8
pcib9: ACPI PCI-PCI bridge at device 0.0 on pci8
pci9: ACPI PCI bus on pcib9
pcib10: ACPI PCI-PCI bridge at device 0.2 on pci8
pci10: ACPI PCI bus on pcib10
pci0: serial bus, USB at device 29.0 (no driver attached)
pcib11: ACPI PCI-PCI bridge at device 30.0 on pci0
pci11: ACPI PCI bus on pcib11
pci11: unknown at device 5.0 (no driver attached)
pci11: unknown at device 5.1 (no driver attached)
pci11: unknown at device 5.2 (no driver attached)
atapci0: SiI 0680 UDMA133 controller port 0xcc70-0xcc7f,0xccd0-0xccd3,0xccd8-0xccdf,0xcce4-0xcce7,0xccf0-0xccf7 irq 23 at device 6.0 on pci11
ata2: channel #0 on atapci0
ata3: channel #1 on atapci0
pci11: display, VGA at device 13.0 (no driver attached)
isab0: PCI-ISA bridge at device 31.0 on pci0
isa0: ISA bus on isab0
atapci1: Intel ICH5 UDMA100 controller port 0xfc00-0xfc0f,0x376,0x170-0x177,0x3f6,0x1f0-0x1f7 at device 31.1 on pci0
ata0: channel #0 on atapci1

Re: PowerEdge 2850 AMD64 panics

2005-12-07 Thread Martin Hepworth
Hi

This is a 'known' issue with the 5.x versions: the kernel panics under high
file I/O. I've had this myself while testing a new email server.

I've no idea whether 6.0 fixes this issue, but it can't be any worse, so it's
probably worth upgrading to the 6.0 release.

--
Martin
