Bug#688882: linux-image-2.6.32-5-xen-amd64: Dom0 crashes when starting third DomU

2012-10-02 Thread Ian Campbell
On Wed, 2012-09-26 at 11:08 -0400, Alan P. Laudicina wrote:
 Package: linux-2.6
 Version: 2.6.32-45
 Severity: important
 
 
 Whenever I start a third domU, my dom0 immediately crashes and reboots. 
 Because it is in colo, I cannot pull a capture of why it's happening. If 
 there is a way to do this without going onsite, I am willing to do that.

It's likely a kernel oops of some sort. If you aren't able to get a
serial console or IP KVM via your colo provider then another option
(other than changing providers ;-)) might be to try netconsole to
another suitable machine in the colo, or to set up a local repro scenario
somewhere more conveniently located.
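
For reference, a rough sketch of the netconsole route, in case it helps. netconsole sends kernel messages as plain UDP datagrams, so the receiving machine only needs something listening on a UDP port. The helper below builds the value for the netconsole= module parameter in the form described in the kernel's netconsole documentation ([src-port]@[src-ip]/[dev],[tgt-port]@<tgt-ip>/[tgt-macaddr]) and provides a small listener for the receiving side. The function names, addresses, interface name and MAC are all placeholders, not values from this report, and which dom0 interface netconsole will accept depends on the bridge setup.

```python
import socket


def netconsole_param(src_ip, dev, tgt_ip, tgt_mac="", src_port=6665, tgt_port=6666):
    """Build the value for the netconsole= module parameter.

    6665 and 6666 are the documented default source and target ports.
    All arguments are placeholders to be replaced with real values.
    """
    return f"{src_port}@{src_ip}/{dev},{tgt_port}@{tgt_ip}/{tgt_mac}"


def open_listener(host="0.0.0.0", port=6666):
    """Open a UDP socket on the receiving machine."""
    sock = socket.socket(socket.AF_INET, socket.SOCK_DGRAM)
    sock.bind((host, port))
    return sock


def capture(sock, out, max_datagrams=None):
    """Write each received datagram to `out`, flushing after every message
    so nothing is lost if the sender dies mid-oops.

    Runs until interrupted unless max_datagrams is given.
    Returns the number of datagrams written.
    """
    count = 0
    while max_datagrams is None or count < max_datagrams:
        data, _addr = sock.recvfrom(65535)
        out.write(data.decode("utf-8", errors="replace"))
        out.flush()
        count += 1
    return count
```

On the receiving machine this would be run as something like capture(open_listener(), open("dom0-crash.log", "a")), and on the dom0 the string from netconsole_param() would be passed when loading the module, e.g. modprobe netconsole netconsole=<that string>. Flushing after every datagram is deliberate: the last few lines before the reboot are the ones that matter.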

I'm afraid that without some data about the actual crash which is
happening there is not likely to be much which can be done about this,
sorry.

Ian.

 
 All of the DomU's are using a simple Xen config that looks like this:
 
 bootloader = '/usr/lib/xen-default/bin/pygrub'
 vcpus   = '4'
 memory  = '512'
 root= '/dev/xvda2 ro'
 disk= [
   'phy:/dev/mirrorset/test-disk,xvda2,w',
   'phy:/dev/mirrorset/test-swap,xvda1,w',
   ]
 name= 'test'
 vif = [ 'ip=x.x.x.x,mac=00:16:3E:79:A1:CA' ]
 on_poweroff = 'destroy'
 on_reboot   = 'restart'
 on_crash= 'restart'
 
 -- Package-specific info:
 ** Version:
 Linux version 2.6.32-5-xen-amd64 (Debian 2.6.32-45) (da...@debian.org) 
 (gcc version 4.3.5 (Debian 4.3.5-4) ) #1 SMP Sun May 6 08:57:29 UTC 2012
 
 ** Command line:
 placeholder root=UUID=30110ed5-e675-4d0f-bf21-70dc8dbf56f2 ro quiet
 
 ** Not tainted
 
 ** Kernel log:
 [   11.105098] ADDRCONF(NETDEV_UP): peth0: link is not ready
 [   11.331582] block drbd0: recounting of set bits took additional 1 jiffies
 [   11.331588] block drbd0: 1608 MB (411738 bits) marked out-of-sync by on disk bit-map.
 [   11.331619] block drbd0: Marked additional 388 MB as out-of-sync based on AL.
 [   11.723118] block drbd0: disk( Attaching -> UpToDate )
 [   11.758719] block drbd0: conn( StandAlone -> Unconnected )
 [   11.758760] block drbd0: Starting receiver thread (from drbd0_worker [1074])
 [   11.758841] block drbd0: receiver (re)started
 [   11.758848] block drbd0: conn( Unconnected -> WFConnection )
 [   11.758905] block drbd0: bind before connect failed, err = -99
 [   11.758979] block drbd0: conn( WFConnection -> Disconnecting )
 [   11.956022] block drbd0: Discarding network configuration.
 [   11.956081] block drbd0: Connection closed
 [   11.956089] block drbd0: conn( Disconnecting -> StandAlone )
 [   11.956134] block drbd0: receiver terminated
 [   11.956138] block drbd0: Terminating drbd0_receiver
 [   12.791332] e1000e: peth0 NIC Link is Up 100 Mbps Full Duplex, Flow Control: None
 [   12.791337] e1000e :04:00.0: peth0: 10/100 speed: disabling TSO
 [   12.792606] ADDRCONF(NETDEV_CHANGE): peth0: link becomes ready
 [   13.238785] device peth0 entered promiscuous mode
 [   13.251960] eth0: port 1(peth0) entering forwarding state
 [   13.428648] ip_tables: (C) 2000-2006 Netfilter Core Team
 [   23.320012] eth0: no IPv6 routers present
 [   23.420007] peth0: no IPv6 routers present
 [   68.750960] block drbd0: role( Secondary -> Primary )
 [   78.075419]   alloc irq_desc for 2251 on node -1
 [   78.075424]   alloc kstat_irqs on node -1
 [   78.221963]   alloc irq_desc for 2250 on node -1
 [   78.221967]   alloc kstat_irqs on node -1
 [   78.255923] device vif4.0 entered promiscuous mode
 [   78.259688] eth0: port 2(vif4.0) entering forwarding state
 [   78.330986] nf_conntrack version 0.5.0 (16384 buckets, 65536 max)
 [   78.331523] CONFIG_NF_CT_ACCT is deprecated and will be removed soon. Please use
 [   78.331526] nf_conntrack.acct=1 kernel parameter, acct=1 nf_conntrack module option or
 [   78.331529] sysctl net.netfilter.nf_conntrack_acct=1 to enable it.
 [   78.368947] physdev match: using --physdev-out in the OUTPUT, FORWARD and POSTROUTING chains for non-bridged traffic is not supported anymore.
 [   80.539376] blkback: ring-ref 8, event-channel 23, protocol 1 (x86_64-abi)
 [   80.539461]   alloc irq_desc for 2249 on node -1
 [   80.539465]   alloc kstat_irqs on node -1
 [   80.548719] blkback: ring-ref 768, event-channel 24, protocol 1 (x86_64-abi)
 [   80.548738]   alloc irq_desc for 2248 on node -1
 [   80.548741]   alloc kstat_irqs on node -1
 [   80.562789]   alloc irq_desc for 2247 on node -1
 [   80.562798]   alloc kstat_irqs on node -1
 [   89.060028] vif4.0: no IPv6 routers present
 [   90.548501]   alloc irq_desc for 2246 on node -1
 [   90.548507]   alloc kstat_irqs on node -1
 [   90.672853]   alloc irq_desc for 2245 on node -1
 [   90.672859]   alloc kstat_irqs on node -1
 [   90.711170] device vif5.0 entered promiscuous mode
 [   90.714890] eth0: port 3(vif5.0) entering forwarding state
 [   90.732431] physdev match: using --physdev-out in the OUTPUT, FORWARD and POSTROUTING chains for non-bridged traffic is not supported anymore.
 [   

Bug#688882: linux-image-2.6.32-5-xen-amd64: Dom0 crashes when starting third DomU

2012-09-26 Thread Alan P. Laudicina

Package: linux-2.6
Version: 2.6.32-45
Severity: important


Whenever I start a third domU, my dom0 immediately crashes and reboots. 
Because it is in colo, I cannot pull a capture of why it's happening. If 
there is a way to do this without going onsite, I am willing to do that.


All of the DomU's are using a simple Xen config that looks like this:

bootloader = '/usr/lib/xen-default/bin/pygrub'
vcpus   = '4'
memory  = '512'
root= '/dev/xvda2 ro'
disk= [
  'phy:/dev/mirrorset/test-disk,xvda2,w',
  'phy:/dev/mirrorset/test-swap,xvda1,w',
  ]
name= 'test'
vif = [ 'ip=x.x.x.x,mac=00:16:3E:79:A1:CA' ]
on_poweroff = 'destroy'
on_reboot   = 'restart'
on_crash= 'restart'

-- Package-specific info:
** Version:
Linux version 2.6.32-5-xen-amd64 (Debian 2.6.32-45) (da...@debian.org) 
(gcc version 4.3.5 (Debian 4.3.5-4) ) #1 SMP Sun May 6 08:57:29 UTC 2012


** Command line:
placeholder root=UUID=30110ed5-e675-4d0f-bf21-70dc8dbf56f2 ro quiet

** Not tainted

** Kernel log:
[   11.105098] ADDRCONF(NETDEV_UP): peth0: link is not ready
[   11.331582] block drbd0: recounting of set bits took additional 1 jiffies
[   11.331588] block drbd0: 1608 MB (411738 bits) marked out-of-sync by on disk bit-map.
[   11.331619] block drbd0: Marked additional 388 MB as out-of-sync based on AL.
[   11.723118] block drbd0: disk( Attaching -> UpToDate )
[   11.758719] block drbd0: conn( StandAlone -> Unconnected )
[   11.758760] block drbd0: Starting receiver thread (from drbd0_worker [1074])
[   11.758841] block drbd0: receiver (re)started
[   11.758848] block drbd0: conn( Unconnected -> WFConnection )
[   11.758905] block drbd0: bind before connect failed, err = -99
[   11.758979] block drbd0: conn( WFConnection -> Disconnecting )
[   11.956022] block drbd0: Discarding network configuration.
[   11.956081] block drbd0: Connection closed
[   11.956089] block drbd0: conn( Disconnecting -> StandAlone )
[   11.956134] block drbd0: receiver terminated
[   11.956138] block drbd0: Terminating drbd0_receiver
[   12.791332] e1000e: peth0 NIC Link is Up 100 Mbps Full Duplex, Flow Control: None
[   12.791337] e1000e :04:00.0: peth0: 10/100 speed: disabling TSO
[   12.792606] ADDRCONF(NETDEV_CHANGE): peth0: link becomes ready
[   13.238785] device peth0 entered promiscuous mode
[   13.251960] eth0: port 1(peth0) entering forwarding state
[   13.428648] ip_tables: (C) 2000-2006 Netfilter Core Team
[   23.320012] eth0: no IPv6 routers present
[   23.420007] peth0: no IPv6 routers present
[   68.750960] block drbd0: role( Secondary -> Primary )
[   78.075419]   alloc irq_desc for 2251 on node -1
[   78.075424]   alloc kstat_irqs on node -1
[   78.221963]   alloc irq_desc for 2250 on node -1
[   78.221967]   alloc kstat_irqs on node -1
[   78.255923] device vif4.0 entered promiscuous mode
[   78.259688] eth0: port 2(vif4.0) entering forwarding state
[   78.330986] nf_conntrack version 0.5.0 (16384 buckets, 65536 max)
[   78.331523] CONFIG_NF_CT_ACCT is deprecated and will be removed soon. Please use
[   78.331526] nf_conntrack.acct=1 kernel parameter, acct=1 nf_conntrack module option or
[   78.331529] sysctl net.netfilter.nf_conntrack_acct=1 to enable it.
[   78.368947] physdev match: using --physdev-out in the OUTPUT, FORWARD and POSTROUTING chains for non-bridged traffic is not supported anymore.
[   80.539376] blkback: ring-ref 8, event-channel 23, protocol 1 (x86_64-abi)
[   80.539461]   alloc irq_desc for 2249 on node -1
[   80.539465]   alloc kstat_irqs on node -1
[   80.548719] blkback: ring-ref 768, event-channel 24, protocol 1 (x86_64-abi)
[   80.548738]   alloc irq_desc for 2248 on node -1
[   80.548741]   alloc kstat_irqs on node -1
[   80.562789]   alloc irq_desc for 2247 on node -1
[   80.562798]   alloc kstat_irqs on node -1
[   89.060028] vif4.0: no IPv6 routers present
[   90.548501]   alloc irq_desc for 2246 on node -1
[   90.548507]   alloc kstat_irqs on node -1
[   90.672853]   alloc irq_desc for 2245 on node -1
[   90.672859]   alloc kstat_irqs on node -1
[   90.711170] device vif5.0 entered promiscuous mode
[   90.714890] eth0: port 3(vif5.0) entering forwarding state
[   90.732431] physdev match: using --physdev-out in the OUTPUT, FORWARD and POSTROUTING chains for non-bridged traffic is not supported anymore.
[   90.735680] physdev match: using --physdev-out in the OUTPUT, FORWARD and POSTROUTING chains for non-bridged traffic is not supported anymore.
[   90.735687] physdev match: using --physdev-out in the OUTPUT, FORWARD and POSTROUTING chains for non-bridged traffic is not supported anymore.
[   90.768156] physdev match: using --physdev-out in the OUTPUT, FORWARD and POSTROUTING chains for non-bridged traffic is not supported anymore.
[   90.768163] physdev match: using --physdev-out in the OUTPUT, FORWARD and POSTROUTING chains for non-bridged traffic is not supported anymore.
[   90.773017] physdev match: using