Here's a weird one.

I have a server machine running debian testing.  It is presently sitting
idle but on.   It does a cold restart now and then, but I can't find any
reason why.


citalopram:~# last
root     pts/0        10.28.1.2        Thu May 18 12:22   still logged in   
reboot   system boot  2.6.16.16        Wed May 17 23:10 - 12:22  (13:12)    
root     pts/0        10.28.1.2        Tue May 16 14:29 - crash (1+08:40)   
root     pts/0        10.28.1.2        Tue May 16 14:08 - 14:29  (00:20)    
root     pts/0        thionite-wl.crig Mon May 15 18:47 - 08:18  (13:31)    
reboot   system boot  2.6.16.16        Mon May 15 18:46 - 12:22 (2+17:36)   
root     pts/0        thionite-wl.crig Mon May 15 18:30 - down   (00:14)    
reboot   system boot  2.6.8-2-386      Mon May 15 17:42 - 18:45  (01:03)    
reboot   system boot  2.6.8-2-386      Mon May 15 14:45 - 18:45  (03:59)    
reboot   system boot  2.6.8-2-386      Mon May 15 13:10 - 18:45  (05:35)    
root     pts/0        10.28.1.2        Mon May 15 12:42 - crash  (00:27)    
reboot   system boot  2.6.8-2-386      Mon May 15 12:09 - 18:45  (06:36)    
root     pts/0        10.28.1.2        Mon May 15 11:31 - crash  (00:37)    
reboot   system boot  2.6.8-2-386      Mon May 15 10:59 - 18:45  (07:46)    
root     pts/0        10.28.1.2        Mon May 15 10:30 - crash  (00:29)    
root     pts/0        10.28.1.2        Mon May 15 10:27 - 10:30  (00:02)    
root     pts/0        10.28.1.2        Mon May 15 10:27 - 10:27  (00:00)    
reboot   system boot  2.6.8-2-386      Mon May 15 09:51 - 18:45  (08:54)    
root     pts/0        10.28.1.2        Mon May 15 09:14 - crash  (00:36)    
reboot   system boot  2.6.8-2-386      Mon May 15 08:42 - 18:45  (10:03)    
reboot   system boot  2.6.8-2-386      Mon May 15 07:34 - 18:45  (11:11)    
reboot   system boot  2.6.8-2-386      Mon May 15 05:59 - 18:45  (12:46)    
reboot   system boot  2.6.8-2-386      Mon May 15 04:35 - 18:45  (14:10)    
reboot   system boot  2.6.8-2-386      Mon May 15 04:13 - 18:45  (14:31)    
reboot   system boot  2.6.8-2-386      Mon May 15 00:32 - 18:45  (18:12)    
root     pts/1        thionite-wl.crig Sun May 14 21:28 - crash  (03:04)    
root     pts/0        thionite-wl.crig Sun May 14 21:18 - 21:41  (00:23)    
root     pts/0        thionite-wl.crig Sun May 14 21:16 - 21:18  (00:01)    
root     tty1                          Sun May 14 20:52 - 20:55  (00:02)    
reboot   system boot  2.6.8-2-386      Mon May 15 08:43 - 18:45  (10:01)    
wtmp begins Mon May 15 08:43:58 2006

Since it was installed, that's one proper shutdown and six crash restarts.

Here's syslog
Lots of --MARK-- entries
May 17 23:06:43 citalopram -- MARK --
May 17 23:10:04 citalopram syslogd 1.4.1#17.2: restart.
May 17 23:10:04 citalopram kernel: klogd 1.4.1#17.2, log source = /proc/kmsg
started.
May 17 23:10:04 citalopram kernel: Inspecting /boot/System.map-2.6.16.16
May 17 23:10:04 citalopram kernel: Loaded 21495 symbols from
/boot/System.map-2.6.16.16.
May 17 23:10:04 citalopram kernel: Symbols match kernel version 2.6.16.
May 17 23:10:04 citalopram kernel: No module symbols loaded - kernel modules
not enabled. 
May 17 23:10:04 citalopram kernel: Linux version 2.6.16.16 ([EMAIL PROTECTED])
(gcc version 4.0.3 (Debian 4.0.3-1)) #2 Mon May 15 18:33:55 NZST 2006
May 17 23:10:04 citalopram kernel: BIOS-provided physical RAM map:
May 17 23:10:04 citalopram kernel:  BIOS-e820: 0000000000000000 -
000000000009fc00 (usable)
May 17 23:10:04 citalopram kernel:  BIOS-e820: 000000000009fc00 -
00000000000a0000 (reserved)
May 17 23:10:04 citalopram kernel:  BIOS-e820: 00000000000f0000 -
0000000000100000 (reserved)
May 17 23:10:04 citalopram kernel:  BIOS-e820: 0000000000100000 -
000000000f7f0000 (usable)
May 17 23:10:04 citalopram kernel:  BIOS-e820: 000000000f7f0000 -
000000000f7f3000 (ACPI NVS)
May 17 23:10:04 citalopram kernel:  BIOS-e820: 000000000f7f3000 -
000000000f800000 (ACPI data)
May 17 23:10:04 citalopram kernel:  BIOS-e820: 000000000f800000 -
0000000010000000 (reserved)
May 17 23:10:04 citalopram kernel:  BIOS-e820: 00000000fec00000 -
0000000100000000 (reserved)
May 17 23:10:04 citalopram kernel: 247MB LOWMEM available.
May 17 23:10:04 citalopram kernel: found SMP MP-table at 000f58c0
May 17 23:10:04 citalopram kernel: DMI 2.3 present.
May 17 23:10:04 citalopram kernel: ACPI: PM-Timer IO Port: 0x4008
May 17 23:10:04 citalopram kernel: ACPI: LAPIC (acpi_id[0x00] lapic_id[0x00]
enabled)
May 17 23:10:04 citalopram kernel: Processor #0 6:11 APIC version 17
May 17 23:10:04 citalopram kernel: ACPI: LAPIC_NMI (acpi_id[0xff] dfl dfl
lint[0x1])
May 17 23:10:04 citalopram kernel: ACPI: IOAPIC (id[0x02]
address[0xfec00000] gsi_base[0])
May 17 23:10:04 citalopram kernel: IOAPIC[0]: apic_id 2, version 17, address
0xfec00000, GSI 0-23
May 17 23:10:04 citalopram kernel: ACPI: INT_SRC_OVR (bus 0 bus_irq 0
global_irq 2 dfl dfl)
May 17 23:10:04 citalopram kernel: ACPI: INT_SRC_OVR (bus 0 bus_irq 9
global_irq 9 low level)
May 17 23:10:04 citalopram kernel: Enabling APIC mode:  Flat.  Using 1 I/O
APICs
May 17 23:10:04 citalopram kernel: Using ACPI (MADT) for SMP configuration
information
May 17 23:10:04 citalopram kernel: Allocating PCI resources starting at
20000000 (gap: 10000000:eec00000)
May 17 23:10:04 citalopram kernel: Built 1 zonelists
May 17 23:10:04 citalopram kernel: Kernel command line: auto
BOOT_IMAGE=2.6.16.16 ro root=303
May 17 23:10:04 citalopram kernel: Enabling fast FPU save and restore...
done.
May 17 23:10:04 citalopram kernel: Enabling unmasked SIMD FPU exception
support... done.
May 17 23:10:04 citalopram kernel: Initializing CPU#0
May 17 23:10:04 citalopram kernel: PID hash table entries: 1024 (order: 10,
16384 bytes)
May 17 23:10:04 citalopram kernel: Detected 1203.007 MHz processor.
May 17 23:10:04 citalopram kernel: Using pmtmr for high-res timesource
May 17 23:10:04 citalopram kernel: Console: colour VGA+ 80x50
May 17 23:10:04 citalopram kernel: Dentry cache hash table entries: 32768
(order: 5, 131072 bytes)
May 17 23:10:04 citalopram kernel: Inode-cache hash table entries: 16384
(order: 4, 65536 bytes)
May 17 23:10:04 citalopram kernel: Memory: 248480k/253888k available (1757k
kernel code, 4864k reserved, 573k data, 132k init, 0k highmem)
May 17 23:10:04 citalopram kernel: Checking if this processor honours the WP
bit even in supervisor mode... Ok.
May 17 23:10:04 citalopram kernel: Calibrating delay using timer specific
routine.. 2409.00 BogoMIPS (lpj=4818009)
May 17 23:10:04 citalopram kernel: Security Framework v1.0.0 initialized
May 17 23:10:04 citalopram kernel: Mount-cache hash table entries: 512
May 17 23:10:04 citalopram kernel: CPU: L1 I cache: 16K, L1 D cache: 16K
May 17 23:10:04 citalopram kernel: CPU: L2 cache: 256K
May 17 23:10:04 citalopram kernel: Intel machine check architecture
supported.
May 17 23:10:04 citalopram kernel: Intel machine check reporting enabled on
CPU#0.
May 17 23:10:04 citalopram kernel: CPU: Intel(R) Celeron(TM) CPU
1200MHz stepping 04
May 17 23:10:04 citalopram kernel: Checking 'hlt' instruction... OK.
May 17 23:10:04 citalopram kernel: ENABLING IO-APIC IRQs
May 17 23:10:04 citalopram kernel: ..TIMER: vector=0x31 apic1=0 pin1=2
apic2=-1 pin2=-1
May 17 23:10:04 citalopram kernel: NET: Registered protocol family 16
May 17 23:10:04 citalopram kernel: ACPI: bus type pci registered
May 17 23:10:04 citalopram kernel: PCI: PCI BIOS revision 2.10 entry at
0xfaa80, last bus=1
May 17 23:10:04 citalopram kernel: PCI: Using configuration type 1
May 17 23:10:04 citalopram kernel: ACPI: Subsystem revision 20060127
May 17 23:10:04 citalopram kernel: ACPI: Interpreter enabled
May 17 23:10:04 citalopram kernel: ACPI: Using IOAPIC for interrupt routing
May 17 23:10:04 citalopram kernel: ACPI: PCI Root Bridge [PCI0] (0000:00)
May 17 23:10:04 citalopram kernel: ACPI: Assume root bridge [\_SB_.PCI0] bus
is 0
May 17 23:10:04 citalopram kernel: PCI quirk: region 6000-607f claimed by
vt82c686 HW-mon
May 17 23:10:04 citalopram kernel: PCI quirk: region 5000-500f claimed by
vt82c686 SMB
May 17 23:10:04 citalopram kernel: ACPI: PCI Interrupt Link [LNKA] (IRQs 1 3
4 5 6 7 *10 11 12 14 15)
May 17 23:10:04 citalopram kernel: ACPI: PCI Interrupt Link [LNKB] (IRQs 1 3
4 5 6 7 10 *11 12 14 15)
May 17 23:10:04 citalopram kernel: ACPI: PCI Interrupt Link [LNKC] (IRQs 1 3
4 5 6 7 10 11 12 14 15) *9
May 17 23:10:04 citalopram kernel: ACPI: PCI Interrupt Link [LNKD] (IRQs 1 3
4 5 6 7 10 *11 12 14 15)
May 17 23:10:04 citalopram kernel: Linux Plug and Play Support v0.97 (c)
Adam Belay
May 17 23:10:04 citalopram kernel: pnp: PnP ACPI init
May 17 23:10:04 citalopram kernel: pnp: PnP ACPI: found 12 devices
May 17 23:10:04 citalopram kernel: PCI: Using ACPI for IRQ routing
May 17 23:10:04 citalopram kernel: PCI: If a device doesn't work, try
"pci=routeirq".  If it helps, post a report
May 17 23:10:04 citalopram kernel: PCI: Bridge: 0000:00:01.0
May 17 23:10:04 citalopram kernel:   IO window: disabled.
May 17 23:10:04 citalopram kernel:   MEM window: e0000000-e2ffffff
May 17 23:10:04 citalopram kernel:   PREFETCH window: 20000000-200fffff
May 17 23:10:04 citalopram kernel: Machine check exception polling timer
started.
May 17 23:10:04 citalopram kernel: IA-32 Microcode Update Driver: v1.14
<[EMAIL PROTECTED]>
May 17 23:10:04 citalopram kernel: SGI XFS with ACLs, security attributes,
realtime, no debug enabled
May 17 23:10:04 citalopram kernel: SGI XFS Quota Management subsystem
May 17 23:10:04 citalopram kernel: Initializing Cryptographic API
May 17 23:10:04 citalopram kernel: io scheduler noop registered
May 17 23:10:04 citalopram kernel: io scheduler anticipatory registered
(default)
May 17 23:10:04 citalopram kernel: io scheduler deadline registered
May 17 23:10:04 citalopram kernel: io scheduler cfq registered
May 17 23:10:04 citalopram kernel: PCI: Enabling Via external APIC routing
May 17 23:10:04 citalopram kernel: ACPI: Power Button (FF) [PWRF]
May 17 23:10:04 citalopram kernel: ACPI: Power Button (CM) [PWRB]
May 17 23:10:04 citalopram kernel: ACPI: Sleep Button (CM) [SLPB]
May 17 23:10:04 citalopram kernel: Real Time Clock Driver v1.12ac
May 17 23:10:04 citalopram kernel: Non-volatile memory driver v1.2
May 17 23:10:04 citalopram kernel: Linux agpgart interface v0.101 (c) Dave
Jones
May 17 23:10:04 citalopram kernel: PNP: PS/2 Controller [PNP0303:PS2K] at
0x60,0x64 irq 1
May 17 23:10:04 citalopram kernel: PNP: PS/2 controller doesn't have AUX
irq; using default 12
May 17 23:10:04 citalopram kernel: serio: i8042 AUX port at 0x60,0x64 irq 12
May 17 23:10:04 citalopram kernel: serio: i8042 KBD port at 0x60,0x64 irq 1
May 17 23:10:04 citalopram kernel: Serial: 8250/16550 driver $Revision: 1.90
$ 4 ports, IRQ sharing enabled
May 17 23:10:04 citalopram kernel: serial8250: ttyS0 at I/O 0x3f8 (irq = 4)
is a 16550A
May 17 23:10:04 citalopram kernel: serial8250: ttyS1 at I/O 0x2f8 (irq = 3)
is a 16550A
May 17 23:10:04 citalopram kernel: serial8250: ttyS0 at I/O 0x3f8 (irq = 0)
is a 16550A
May 17 23:10:04 citalopram kernel: serial8250: ttyS1 at I/O 0x2f8 (irq = 0)
is a 16550A
May 17 23:10:04 citalopram kernel: 00:08: ttyS0 at I/O 0x3f8 (irq = 4) is a
16550A
May 17 23:10:04 citalopram kernel: 00:09: ttyS1 at I/O 0x2f8 (irq = 3) is a
16550A
May 17 23:10:04 citalopram kernel: Uniform Multi-Platform E-IDE driver
Revision: 7.00alpha2
May 17 23:10:04 citalopram kernel: ide: Assuming 33MHz system bus speed for
PIO modes; override with idebus=xx
May 17 23:10:04 citalopram kernel: VP_IDE: IDE controller at PCI slot
0000:00:07.1
May 17 23:10:04 citalopram kernel: PCI: Via IRQ fixup for 0000:00:07.1, from
255 to 0
May 17 23:10:04 citalopram kernel: VP_IDE: chipset revision 6
May 17 23:10:04 citalopram kernel: VP_IDE: not 100%% native mode: will probe
irqs later
May 17 23:10:04 citalopram kernel: VP_IDE: VIA vt82c686b (rev 40) IDE
UDMA100 controller on pci0000:00:07.1
May 17 23:10:04 citalopram kernel:     ide0: BM-DMA at 0xd000-0xd007, BIOS
settings: hda:DMA, hdb:pio
May 17 23:10:04 citalopram kernel:     ide1: BM-DMA at 0xd008-0xd00f, BIOS
settings: hdc:DMA, hdd:pio
May 17 23:10:04 citalopram kernel: hda: MDT MD200BB-00DEA0, ATA DISK drive
May 17 23:10:04 citalopram kernel: ide0 at 0x1f0-0x1f7,0x3f6 on irq 14
May 17 23:10:04 citalopram kernel: hdc: CD-S520/A, ATAPI CD/DVD-ROM drive
May 17 23:10:04 citalopram kernel: ide1 at 0x170-0x177,0x376 on irq 15
May 17 23:10:04 citalopram kernel: hda: max request size: 128KiB
May 17 23:10:04 citalopram kernel: hda: 39102336 sectors (20020 MB)
w/2048KiB Cache, CHS=38792/16/63, UDMA(100)
May 17 23:10:04 citalopram kernel: hda: cache flushes not supported
May 17 23:10:04 citalopram kernel:  hda: hda1 hda2 hda3
May 17 23:10:04 citalopram kernel: mice: PS/2 mouse device common for all
mice
May 17 23:10:04 citalopram kernel: input: AT Translated Set 2 keyboard as
/class/input/input0
May 17 23:10:04 citalopram kernel: input: PC Speaker as /class/input/input1
May 17 23:10:04 citalopram kernel: NET: Registered protocol family 2
May 17 23:10:04 citalopram kernel: IP route cache hash table entries: 2048
(order: 1, 8192 bytes)
May 17 23:10:04 citalopram kernel: TCP established hash table entries: 8192
(order: 3, 32768 bytes)
May 17 23:10:04 citalopram kernel: TCP bind hash table entries: 8192 (order:
3, 32768 bytes)
May 17 23:10:04 citalopram kernel: TCP: Hash tables configured (established
8192 bind 8192)
May 17 23:10:04 citalopram kernel: TCP reno registered
May 17 23:10:04 citalopram kernel: TCP bic registered
May 17 23:10:04 citalopram kernel: 802.1Q VLAN Support v1.8 Ben Greear
<[EMAIL PROTECTED]>
May 17 23:10:04 citalopram kernel: All bugs added by David S. Miller
<[email protected]>
May 17 23:10:04 citalopram kernel: Using IPI Shortcut mode
May 17 23:10:04 citalopram kernel: ACPI wakeup devices: 
May 17 23:10:04 citalopram kernel: PCI0 USB0 USB1 MODM 
May 17 23:10:04 citalopram kernel: ACPI: (supports S0 S1 S4 S5)
May 17 23:10:04 citalopram kernel: XFS mounting filesystem hda3
May 17 23:10:04 citalopram kernel: Starting XFS recovery on filesystem: hda3
(logdev: internal)
May 17 23:10:04 citalopram kernel: Ending XFS recovery on filesystem: hda3
(logdev: internal)
May 17 23:10:04 citalopram kernel: VFS: Mounted root (xfs filesystem)
readonly.
May 17 23:10:04 citalopram kernel: Freeing unused kernel memory: 132k freed
May 17 23:10:04 citalopram kernel: NET: Registered protocol family 1
May 17 23:10:04 citalopram kernel: agpgart: Detected VIA Apollo
ProMedia/PLE133Ta chipset
May 17 23:10:04 citalopram kernel: agpgart: AGP aperture is 4M @ 0xe3000000
May 17 23:10:04 citalopram kernel: usbcore: registered new driver usbfs
May 17 23:10:04 citalopram kernel: usbcore: registered new driver hub
May 17 23:10:04 citalopram kernel: hdc: ATAPI 52X CD-ROM drive, 128kB Cache,
UDMA(33)
May 17 23:10:04 citalopram kernel: Uniform CD-ROM driver Revision: 3.20
May 17 23:10:04 citalopram kernel: 8139too Fast Ethernet driver 0.9.27
May 17 23:10:04 citalopram kernel: ACPI: PCI Interrupt 0000:00:0d.0[A] ->
GSI 17 (level, low) -> IRQ 16
May 17 23:10:04 citalopram kernel: eth0: RealTek RTL8139 at 0xd001c000,
00:20:ed:77:36:97, IRQ 16
May 17 23:10:04 citalopram kernel: USB Universal Host Controller Interface
driver v2.3
May 17 23:10:04 citalopram kernel: ACPI: PCI Interrupt Link [LNKD] enabled
at IRQ 11
May 17 23:10:04 citalopram kernel: ACPI: PCI Interrupt 0000:00:07.2[D] ->
Link [LNKD] -> GSI 11 (level, low) -> IRQ 11
May 17 23:10:04 citalopram kernel: uhci_hcd 0000:00:07.2: UHCI Host
Controller
May 17 23:10:04 citalopram kernel: uhci_hcd 0000:00:07.2: new USB bus
registered, assigned bus number 1
May 17 23:10:04 citalopram kernel: uhci_hcd 0000:00:07.2: irq 11, io base
0x0000d400
May 17 23:10:04 citalopram kernel: usb usb1: configuration #1 chosen from 1
choice
May 17 23:10:04 citalopram kernel: hub 1-0:1.0: USB hub found
May 17 23:10:04 citalopram kernel: hub 1-0:1.0: 2 ports detected
May 17 23:10:04 citalopram kernel: 8139cp: 10/100 PCI Ethernet driver v1.2
(Mar 22, 2004)
May 17 23:10:04 citalopram kernel: ACPI: PCI Interrupt 0000:00:07.3[D] ->
Link [LNKD] -> GSI 11 (level, low) -> IRQ 11
May 17 23:10:04 citalopram kernel: uhci_hcd 0000:00:07.3: UHCI Host
Controller
May 17 23:10:04 citalopram kernel: uhci_hcd 0000:00:07.3: new USB bus
registered, assigned bus number 2
May 17 23:10:04 citalopram kernel: uhci_hcd 0000:00:07.3: irq 11, io base
0x0000d800
May 17 23:10:04 citalopram kernel: usb usb2: configuration #1 chosen from 1
choice
May 17 23:10:04 citalopram kernel: hub 2-0:1.0: USB hub found
May 17 23:10:04 citalopram kernel: hub 2-0:1.0: 2 ports detected
May 17 23:10:04 citalopram kernel: ACPI: PCI Interrupt Link [LNKC] enabled
at IRQ 10
May 17 23:10:04 citalopram kernel: ACPI: PCI Interrupt 0000:00:07.5[C] ->
Link [LNKC] -> GSI 10 (level, low) -> IRQ 10
May 17 23:10:04 citalopram kernel: PCI: Via IRQ fixup for 0000:00:07.5, from
9 to 10
May 17 23:10:04 citalopram kernel: Floppy drive(s): fd0 is 1.44M
May 17 23:10:04 citalopram kernel: FDC 0 is a post-1991 82077
May 17 23:10:04 citalopram kernel: Adding 498004k swap on /dev/hda2.
Priority:-1 extents:1 across:498004k
May 17 23:10:04 citalopram kernel: XFS mounting filesystem hda1
May 17 23:10:04 citalopram kernel: eth0: link up, 100Mbps, full-duplex, lpa
0x45E1
May 17 23:10:06 citalopram lpd[3057]: restarted
May 17 23:30:04 citalopram -- MARK --
May 17 23:50:04 citalopram -- MARK --
..... Lots more --mark-- lines


That's all that syslog logs show about the crash/reboot.

The hardware is a Gigabyte VEML board with on-board everything, a 1200 MHz
Tualitin Celeron CPU and 2x128 Mb SDRAM in a 1U rack case.  There's a 20 GB
WD drive and an Asus CDROM and a FDD.


Where else can I look for more info?  There is nothing on screen after the
reboot of course.
I could disable ACPI and see if that's the problem, but the board had been
running fine for months in another case/PSU.
PSU faults?  How best to determine that?




-- 
C. Falconer
http://www.avonside.school.nz/
http://criggie.dyndns.org/

Reply via email to