Bug#407112: kernel: network and system crashes.

2008-06-13 Thread Staff
This is a mission critical node, I can't install an unstable kernel. 
I've compiled module sk98lin downloaded from Marvell site on 2.6.18, now 
will test this configuration.


Thanks
--Sergio Tosti



--
To UNSUBSCRIBE, email to [EMAIL PROTECTED]
with a subject of unsubscribe. Trouble? Contact [EMAIL PROTECTED]



Bug#407112: kernel: network and system crashes.

2008-06-13 Thread maximilian attems
On Fri, Jun 13, 2008 at 02:08:44PM +0200, Staff wrote:
 This is a mission critical node, I can't install an unstable kernel. 
 I've compiled module sk98lin downloaded from Marvell site on 2.6.18, now 
 will test this configuration.
 
 Thanks
 --Sergio Tosti

right let's better ride it with ugly unsupported vendor code.

2.6.25 is the current upstream supported kernel.
you gain by testing it.

-- 
maks



-- 
To UNSUBSCRIBE, email to [EMAIL PROTECTED]
with a subject of unsubscribe. Trouble? Contact [EMAIL PROTECTED]



Bug#407112: kernel: network and system crashes.

2008-06-12 Thread Servizio Reti

Version: 2.6.18-6

# cat /proc/sys/kernel/version
#1 SMP Fri Jun 6 22:22:11 UTC 2008

Bug present, relevant syslog:
[..]
Jun 11 07:31:08 hera kernel: NETDEV WATCHDOG: eth1: transmit timed out
Jun 11 07:31:08 hera kernel: sky2 eth1: tx timeout
Jun 11 07:31:08 hera kernel: sky2 eth1: transmit ring 205 .. 182  
report=205 done

=205
Jun 11 07:31:08 hera kernel: sky2 hardware hung? flushing
[..]
after that system freezes.

With 2.6.24-1 from backports system freezes without any message in  
logs and console!


the node is an intel server sr1435vp2

additional info:
# cat /proc/cpuinfo
processor   : 0
vendor_id   : GenuineIntel
cpu family  : 15
model   : 4
model name  : Intel(R) Xeon(TM) CPU 2.80GHz
stepping: 3
cpu MHz : 2793.334
cache size  : 2048 KB
physical id : 0
siblings: 2
core id : 0
cpu cores   : 1
fdiv_bug: no
hlt_bug : no
f00f_bug: no
coma_bug: no
fpu : yes
fpu_exception   : yes
cpuid level : 5
wp  : yes
flags   : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge  
mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss ht tm pbe lm  
constant_tsc pni monitor ds_cpl cid cx16 xtpr

bogomips: 5590.47

processor   : 1
vendor_id   : GenuineIntel
cpu family  : 15
model   : 4
model name  : Intel(R) Xeon(TM) CPU 2.80GHz
stepping: 3
cpu MHz : 2793.334
cache size  : 2048 KB
physical id : 0
siblings: 2
core id : 0
cpu cores   : 1
fdiv_bug: no
hlt_bug : no
f00f_bug: no
coma_bug: no
fpu : yes
fpu_exception   : yes
cpuid level : 5
wp  : yes
flags   : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge  
mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss ht tm pbe lm  
constant_tsc pni monitor ds_cpl cid cx16 xtpr

bogomips: 5586.39

processor   : 2
vendor_id   : GenuineIntel
cpu family  : 15
model   : 4
model name  : Intel(R) Xeon(TM) CPU 2.80GHz
stepping: 3
cpu MHz : 2793.334
cache size  : 2048 KB
physical id : 3
siblings: 2
core id : 0
cpu cores   : 1
fdiv_bug: no
hlt_bug : no
f00f_bug: no
coma_bug: no
fpu : yes
fpu_exception   : yes
cpuid level : 5
wp  : yes
flags   : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge  
mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss ht tm pbe lm  
constant_tsc pni monitor ds_cpl cid cx16 xtpr

bogomips: 5586.49

processor   : 3
vendor_id   : GenuineIntel
cpu family  : 15
model   : 4
model name  : Intel(R) Xeon(TM) CPU 2.80GHz
stepping: 3
cpu MHz : 2793.334
cache size  : 2048 KB
physical id : 3
siblings: 2
core id : 0
cpu cores   : 1
fdiv_bug: no
hlt_bug : no
f00f_bug: no
coma_bug: no
fpu : yes
fpu_exception   : yes
cpuid level : 5
wp  : yes
flags   : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge  
mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss ht tm pbe lm  
constant_tsc pni monitor ds_cpl cid cx16 xtpr

bogomips: 5586.52

# lspci

00:00.0 Host bridge: Intel Corporation E7320 Memory Controller Hub (rev 0c)
00:00.1 Class ff00: Intel Corporation E7320 Error Reporting Registers (rev 0c)
00:02.0 PCI bridge: Intel Corporation E7525/E7520/E7320 PCI Express  
Port A (rev 0c)
00:03.0 PCI bridge: Intel Corporation E7525/E7520/E7320 PCI Express  
Port A1 (rev 0c)

00:1c.0 PCI bridge: Intel Corporation 6300ESB 64-bit PCI-X Bridge (rev 02)
00:1d.0 USB Controller: Intel Corporation 6300ESB USB Universal Host  
Controller (rev 02)
00:1d.1 USB Controller: Intel Corporation 6300ESB USB Universal Host  
Controller (rev 02)

00:1d.4 System peripheral: Intel Corporation 6300ESB Watchdog Timer (rev 02)
00:1d.5 PIC: Intel Corporation 6300ESB I/O Advanced Programmable  
Interrupt Controller (rev 02)
00:1d.7 USB Controller: Intel Corporation 6300ESB USB2 Enhanced Host  
Controller (rev 02)

00:1e.0 PCI bridge: Intel Corporation 82801 PCI Bridge (rev 0a)
00:1f.0 ISA bridge: Intel Corporation 6300ESB LPC Interface Controller  
(rev 02)
00:1f.1 IDE interface: Intel Corporation 6300ESB PATA Storage  
Controller (rev 02)
00:1f.2 IDE interface: Intel Corporation 6300ESB SATA Storage  
Controller (rev 02)

00:1f.3 SMBus: Intel Corporation 6300ESB SMBus Controller (rev 02)
02:00.0 Ethernet controller: Marvell Technology Group Ltd. 88E8050  
PCI-E ASF Gigabit Ethernet Controller (rev 17)

03:03.0 SCSI storage controller: Adaptec AHA-3960D / AIC-7899A U160/m (rev 01)
03:03.1 SCSI storage controller: Adaptec AHA-3960D / AIC-7899A U160/m (rev 01)
04:02.0 VGA compatible controller: ATI Technologies Inc Rage XL (rev 27)
04:03.0 Ethernet controller: Intel Corporation 82541GI/PI Gigabit  
Ethernet Controller 

Bug#407112: kernel: network and system crashes.

2008-06-12 Thread maximilian attems
On Thu, Jun 12, 2008 at 08:59:40PM +0200, Servizio Reti wrote:
 Version: 2.6.18-6
 
 # cat /proc/sys/kernel/version
 #1 SMP Fri Jun 6 22:22:11 UTC 2008
 
 Bug present, relevant syslog:
 [..]
 Jun 11 07:31:08 hera kernel: NETDEV WATCHDOG: eth1: transmit timed out
 Jun 11 07:31:08 hera kernel: sky2 eth1: tx timeout
 Jun 11 07:31:08 hera kernel: sky2 eth1: transmit ring 205 .. 182  
 report=205 done
 =205
 Jun 11 07:31:08 hera kernel: sky2 hardware hung? flushing
 [..]
 after that system freezes.
 
 With 2.6.24-1 from backports system freezes without any message in  
 logs and console!
 
 the node is an intel server sr1435vp2

please try out 2.6.25 from unstable should install fine

thanks for feedback

-- 
maks



-- 
To UNSUBSCRIBE, email to [EMAIL PROTECTED]
with a subject of unsubscribe. Trouble? Contact [EMAIL PROTECTED]



Bug#407112: kernel: network and system crashes.

2007-01-16 Thread jeanmichel
Package: kernel
Version: NetworkManager
Severity: normal


During the night, while there is no activity on computer but samba
traffic, network crashed, and computer too.

At the morning, computer was unavailable by ssh, nor by X local session.

Here after, logs:

/var/log/debug:
Jan 15 23:32:21 computername last message repeated 4 times
Jan 16 00:31:50 computername kernel: usb 1-2: usbfs: USBDEVFS_CONTROL failed cmd
newhidups rqt 128 rq 6 len 255 ret -110
Jan 16 00:31:51 computername kernel: usb 1-2: usbfs: USBDEVFS_CONTROL failed cmd
newhidups rqt 128 rq 6 len 255 ret -110
Jan 16 00:41:11 computername NetworkManager: debug info^I[1168904471.056781]
nm_hal_device_removed (): Device removed (hal udi
is '/org/freedesktop/Hal/devices/volume_empty_unknown').
Jan 16 00:53:44 computername kernel: sky2 eth2: transmit ring 314 .. 273
report=314 done=314
Jan 16 01:07:59 computername kernel: sky2 eth2: transmit ring 273 .. 232
report=314 done=314
Jan 16 01:09:34 computername kernel: sky2 eth2: transmit ring 314 .. 273
report=314 done=314
(manual reboot)
Jan 16 08:18:05 computername kernel: ACPI: RSDP (v000 ACPIAM 

/var/log/syslog:

Jan 16 00:41:15 computername postfix/smtp[12742]: C07976BE4F:
to=[EMAIL PROTECTED],
relay=192.168.236.22[192.168.236.22]:25, delay=11,
delays=7.6/0.41/1.3/1.5, dsn=2.0.0, status=sent (250 Ok: queued as
3E7A06032)
Jan 16 00:41:15 computername postfix/qmgr[3471]: C07976BE4F: removed
Jan 16 00:41:15 computername postfix/smtp[12741]: 260E06BE4E:
to=[EMAIL PROTECTED],
relay=192.168.236.22[192.168.236.22]:25, delay=4.4,
delays=1.1/0.33/1.4/1.6, dsn=2.0.0, status=sent (250 Ok: queued as
DC7076030)
Jan 16 00:41:15 computername postfix/qmgr[3471]: 260E06BE4E: removed
Jan 16 00:45:02 computername ypbind[2992]: broadcast: RPC: Timed out.
Jan 16 00:46:16 computername ypbind[2992]: broadcast: RPC: Timed out.
Jan 16 00:47:30 computername ypbind[2992]: broadcast: RPC: Timed out.
Jan 16 00:48:44 computername ypbind[2992]: broadcast: RPC: Timed out.
Jan 16 00:49:58 computername ypbind[2992]: broadcast: RPC: Timed out.
Jan 16 00:51:12 computername ypbind[2992]: broadcast: RPC: Timed out.
Jan 16 00:52:26 computername ypbind[2992]: broadcast: RPC: Timed out.
Jan 16 00:53:40 computername ypbind[2992]: broadcast: RPC: Timed out.
Jan 16 00:53:44 computername kernel: NETDEV WATCHDOG: eth2: transmit timed out
Jan 16 00:53:44 computername kernel: sky2 eth2: tx timeout
Jan 16 00:53:44 computername kernel: sky2 eth2: transmit ring 314 .. 273
report=314 done=314
Jan 16 00:53:44 computername kernel: sky2 hardware hung? flushing
Jan 16 00:54:54 computername ypbind[2992]: broadcast: RPC: Timed out.
Jan 16 00:56:08 computername ypbind[2992]: broadcast: RPC: Timed out.
Jan 16 00:57:22 computername ypbind[2992]: broadcast: RPC: Timed out.
Jan 16 00:58:36 computername ypbind[2992]: broadcast: RPC: Timed out.
Jan 16 00:59:30 computername ypbind[2992]: broadcast: RPC: Timed out.
Jan 16 01:00:24 computername ypbind[2992]: broadcast: RPC: Timed out.
Jan 16 01:01:18 computername ypbind[2992]: broadcast: RPC: Timed out.
Jan 16 01:02:12 computername ypbind[2992]: broadcast: RPC: Timed out.
Jan 16 01:04:00 computername last message repeated 2 times
Jan 16 01:04:54 computername ypbind[2992]: broadcast: RPC: Timed out.
Jan 16 01:05:48 computername ypbind[2992]: broadcast: RPC: Timed out.
Jan 16 01:06:42 computername ypbind[2992]: broadcast: RPC: Timed out.
Jan 16 01:07:36 computername ypbind[2992]: broadcast: RPC: Timed out.
Jan 16 01:07:59 computername kernel: NETDEV WATCHDOG: eth2: transmit timed out
Jan 16 01:07:59 computername kernel: sky2 eth2: tx timeout
Jan 16 01:07:59 computername kernel: sky2 eth2: transmit ring 273 .. 232
report=314 done=314
Jan 16 01:07:59 computername kernel: sky2 status report lost?
Jan 16 01:08:30 computername ypbind[2992]: broadcast: RPC: Timed out.
Jan 16 01:09:02 computername /USR/SBIN/CRON[14089]: (root) CMD (  [ -d
/var/lib/php4 ]  find /var/lib/php4/ -type f -cmin
+$(/usr/lib/php4/maxlifetime) -print0 | xargs -r -0 rm)
Jan 16 01:09:02 computername /USR/SBIN/CRON[14090]: (root) CMD (  [ -d
/var/lib/php5 ]  find /var/lib/php5/ -type f -cmin
+$(/usr/lib/php5/maxlifetime) -print0 | xargs -r -0 rm)
Jan 16 01:09:24 computername ypbind[2992]: broadcast: RPC: Timed out.
Jan 16 01:09:34 computername kernel: NETDEV WATCHDOG: eth2: transmit timed out
Jan 16 01:09:34 computername kernel: sky2 eth2: tx timeout
Jan 16 01:09:34 computername kernel: sky2 eth2: transmit ring 314 .. 273
report=314 done=314
Jan 16 01:09:34 computername kernel: sky2 hardware hung? flushing
Jan 16 01:10:18 computername ypbind[2992]: broadcast: RPC: Timed out.
Jan 16 01:11:12 computername ypbind[2992]: broadcast: RPC: Timed out.
Jan 16 01:12:06 computername ypbind[2992]: broadcast: RPC: Timed out.
Jan 16 01:13:00 computername ypbind[2992]: broadcast: RPC: Timed out.
Jan 16 01:13:54 computername ypbind[2992]: broadcast: RPC: Timed out.
Jan 16 01:14:48 computername ypbind[2992]: broadcast: RPC: Timed out.
Jan 16 01:15:42